Policy inference for humanoid robots.
| Robot | Locomotion | WBT |
|---|---|---|
| Unitree G1 | ✅ | ✅ |
| Booster T1 | ✅ | ❌ |

| Simulator | Locomotion | WBT |
|---|---|---|
| IsaacGym | ✅ | ❌ |
| IsaacSim | ✅ | ✅ |
| MuJoCo (inference only) | ✅ | ✅ |

Legend:
- ✅ (full support)
- 🚧 (in progress/partial support)
- ❌ (no support)
```bash
cd ~/holosoma
bash scripts/setup_inference.sh           # Create a virtual environment with all dependencies
source scripts/source_inference_setup.sh  # Activate the virtual environment
```

Select the appropriate workflow guide based on your setup:
- Real Robot Locomotion - Run locomotion policies on physical Unitree G1 or Booster T1 robots
- Real Robot Whole Body Tracking - Run WBT policies on physical Unitree G1 robots
- Sim-to-Sim Locomotion - Test locomotion policies in MuJoCo simulation
- Sim-to-Sim Whole Body Tracking - Test WBT policies in MuJoCo simulation
Each workflow guide includes:
- Hardware/environment setup instructions
- Step-by-step commands
- Control references
- Deployment options (offboard/onboard/Docker)
- Troubleshooting tips
Commands for controlling policies during execution.
Important: All keyboard-based policy controls should be entered in the policy terminal (the one where you ran run_policy.py), not in the MuJoCo window. MuJoCo has its own simulation controls (see the workflow docs).
| Action | Keyboard | Joystick |
|---|---|---|
| Start the policy | ] | A button |
| Stop the policy | o | B button |
| Set robot to default pose | i | Y button |
| Kill controller program | - | L1 (LB) + R1 (RB) |
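As a rough illustration, keyboard controls like the ones above can be routed through a dispatch table. The names below are hypothetical; the actual run_policy.py implementation may be structured differently:

```python
# Hypothetical dispatch table mirroring the common keyboard controls above.
# Action names are illustrative, not holosoma's real API.
KEY_ACTIONS = {
    "]": "start_policy",      # Start the policy
    "o": "stop_policy",       # Stop the policy
    "i": "set_default_pose",  # Move robot to default pose
}

def handle_key(key):
    """Map a key press from the policy terminal to an action name (or None)."""
    return KEY_ACTIONS.get(key)
```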
| Action | Keyboard | Joystick |
|---|---|---|
| Switch walking/standing | = | Start button |
| Adjust linear velocity | w a s d | Left stick |
| Adjust angular velocity | q e | Right stick |
Default pose: Standing pose
| Action | Keyboard | Joystick |
|---|---|---|
| Start the policy | ] | A button |
| Start motion clip | s | Start button |
Default pose: Standing with raised arms
- Select button: Switch between policies (when multiple policies are loaded)
You can load ONNX checkpoints directly from Wandb without manually downloading them first. This is useful for quickly testing models from training runs.
Syntax:

```
--task.model-path wandb://entity/project_name/run_id/model.onnx
```

Example with G1 locomotion:
```bash
source scripts/source_inference_setup.sh
python3 src/holosoma_inference/holosoma_inference/run_policy.py inference:g1-29dof-loco \
  --task.model-path wandb://my-username/my-project/run-abc123/fastsac_g1_29dof.onnx \
  --task.use-joystick \
  --task.interface eth0
```

Example with a Wandb HTTPS URL:
```bash
python3 src/holosoma_inference/holosoma_inference/run_policy.py inference:g1-29dof-loco \
  --task.model-path https://wandb.ai/username/project/runs/abc123/files/model.onnx \
  --task.use-joystick \
  --task.interface eth0
```

The model will be automatically downloaded and cached locally. The entity is your Wandb username or organization name.
The --task.interface parameter specifies which network interface to use for communicating with the robot. The correct interface name varies by computer and network card.
Common interface names:
- eth0 - Common Ethernet interface name
- enp0s31f6 - Modern Linux predictable Ethernet naming
- lo - Loopback interface (for sim2sim)
To find your interface name:

```bash
ifconfig
```

Look for the interface connected to your robot's network. For real robot deployments, use the interface with an IP address on the same subnet as your robot. For sim2sim deployments, use lo (loopback).
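If you prefer a programmatic check, Python's standard library can enumerate interface names on Linux (the names themselves vary by machine):

```python
import socket

# List network interface names known to the kernel.
# Pick the one on the same subnet as the robot, or "lo" for sim2sim.
for index, name in socket.if_nameindex():
    print(index, name)
```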
If a policy was trained with stacked observations (e.g., history length 4), you must pass the same history length at inference time so the observation tensor matches the model's expected input size.
Example:

```bash
python3 src/holosoma_inference/holosoma_inference/run_policy.py inference:g1-29dof-wbt \
  --task.model-path <path-to-model>.onnx \
  --task.interface eth0 \
  --observation.history_length_dict.actor_obs=4
```

The override updates the actor_obs buffer before the ONNX session is initialized, so any policy (locomotion or WBT) can run with longer observation histories, as long as the underlying model was trained that way.
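Conceptually, stacked observations keep a rolling buffer of the last N observation vectors and concatenate them into a single model input. A minimal sketch of that idea (not the actual holosoma buffer implementation):

```python
from collections import deque

import numpy as np

class ObsHistory:
    """Rolling buffer that stacks the last `history_length` observations.

    Illustrative sketch: the real inference code may order or pad
    the history differently.
    """

    def __init__(self, obs_dim, history_length):
        # Pre-fill with zeros so the stacked shape is valid from step one.
        self.buffer = deque(
            [np.zeros(obs_dim, dtype=np.float32)] * history_length,
            maxlen=history_length,
        )

    def push(self, obs):
        """Append the newest observation and return the stacked model input."""
        self.buffer.append(np.asarray(obs, dtype=np.float32))
        # Oldest first, newest last; shape: (obs_dim * history_length,)
        return np.concatenate(self.buffer)
```

With obs_dim=3 and history_length=4, the model input is a 12-dimensional vector, which is why the inference-time history length must match training.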
By default, control gains (kp/kd) are loaded from ONNX model metadata. You can override these values in your configuration:
```bash
python3 src/holosoma_inference/holosoma_inference/run_policy.py inference:g1-29dof-loco \
  --task.model-path src/holosoma_inference/holosoma_inference/models/loco/g1_29dof/fastsac_g1_29dof.onnx \
  --robot.motor-kp 40.2 99.1 40.2 99.1 28.5 28.5 40.2 99.1 40.2 99.1 28.5 28.5 40.2 28.5 28.5 14.3 14.3 14.3 14.3 14.3 16.8 16.8 14.3 14.3 14.3 14.3 14.3 16.8 16.8 \
  --robot.motor-kd 2.6 6.3 2.6 6.3 1.8 1.8 2.6 6.3 2.6 6.3 1.8 1.8 2.6 1.8 1.8 0.9 0.9 0.9 0.9 0.9 1.1 1.1 0.9 0.9 0.9 0.9 0.9 1.1 1.1
```

```bash
python3 src/holosoma_inference/holosoma_inference/run_policy.py inference:t1-29dof-loco \
  --task.model-path src/holosoma_inference/holosoma_inference/models/loco/t1_29dof/ppo_t1_29dof.onnx \
  --robot.motor-kp 5.0 5.0 20.0 20.0 20.0 20.0 20.0 20.0 20.0 20.0 20.0 20.0 20.0 20.0 20.0 20.0 200.0 200.0 200.0 200.0 200.0 50.0 50.0 200.0 200.0 200.0 200.0 50.0 50.0 \
  --robot.motor-kd 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 5.0 5.0 5.0 5.0 5.0 3.0 3.0 5.0 5.0 5.0 5.0 3.0 3.0
```

Note: When control gains are not specified, they are loaded automatically from the ONNX model metadata. This is the recommended approach, since it ensures the gains match those used during training.
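The precedence described above (CLI override first, otherwise ONNX metadata) can be sketched as a small helper. The function and metadata key names here are illustrative, not holosoma's actual API:

```python
def resolve_gains(cli_gains, model_metadata, key="motor_kp"):
    """Return CLI-supplied gains if given, else parse them from model metadata.

    `key` is a hypothetical metadata field name; real models may store
    gains under different keys or encodings.
    """
    if cli_gains is not None:
        return list(cli_gains)  # Explicit override wins
    raw = model_metadata.get(key)
    if raw is None:
        raise KeyError(f"no '{key}' in model metadata and no CLI override given")
    # Assume a whitespace-separated string of per-joint gains.
    return [float(x) for x in raw.split()]
```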