Humanoidbench RL#
We provide a basic RL training example for Humanoidbench tasks.
RL framework: stable-baselines3
RL algorithm: PPO
Simulator: MuJoCo and IsaacGym and IsaacLab
Installation#
pip install stable-baselines3
pip install wandb
pip install tensorboard
Wandb login, enter your wandb account token.
wandb login
Training#
NOTE: You can add
--use_wandb --wandb_entity <your_wandb_entity_name>to use wandb to log the training process.
MuJoCo:
python roboverse_learn/humanoidbench_rl/train_sb3.py --sim mujoco --num_envs 1 --robot=h1 --task humanoidbench:Stand
IsaacGym:
python roboverse_learn/humanoidbench_rl/train_sb3.py --sim isaacgym --num_envs 2 --robot=h1 --task humanoidbench:Stand
IsaacLab:
python roboverse_learn/humanoidbench_rl/train_sb3.py --sim isaaclab --num_envs 2 --robot=h1 --task humanoidbench:Stand
Task list#
Balance
Not Implemented because of collision detection issues.
Crawl
Cube
Door
Highbar
Not implemented due to the need to connect H1 and Highbar.
Hurdle
Not Implemented because of collision detection issues.
Maze
Not Implemented because of collision detection issues.
Package
Pole
Not Implemented because of collision detection issues.
Powerlift
Push
Run
Sit
Slide
Spoon
Not Implemented because of sensor detection issues.
Stair
Stand