HIL-SERL tutorial for simulation fixed and expanded #2

spirosperos · 2025-08-07T14:47:12Z

What this does

This PR fixes two critical bugs in the HIL-SERL simulation training framework and adds comprehensive documentation:

Bug Fixes:

🐛 Bug Fix: Fixed control_time_s parameter not being respected during training - episodes were always 10 seconds regardless of configuration
🐛 Bug Fix: Fixed random_block_position flag not being properly passed to the gym environment, preventing cube randomization

Improvements:

📚 Documentation: Added comprehensive training guide (hil_serl_simulation_training_guide_README.md) for HIL-SERL simulation training
🔧 Enhancement: Added extensive debug logging for easier training monitoring and troubleshooting

Key Changes:

Implemented TimeLimitWrapper class to properly enforce episode time limits based on control_time_s configuration
Added random_block_position parameter to environment configuration and properly passed it to gym environment
Enhanced logging throughout training process with detailed episode progress, time tracking, and environment state information

How it was tested

Time Limit Fix: Verified that setting "control_time_s": 40.0 in configuration now properly limits episodes to 40 seconds instead of default 10 seconds
Cube Randomization Fix: Confirmed that "random_block_position": true now properly randomizes cube positions between episodes
Debug Logging: Tested debug outputs during training to ensure proper monitoring of episode progress, time remaining, and environment state
Documentation: Verified all commands and configurations in the new README work correctly with the fixed implementation

Test Commands:

# Test recording with new time limit
python -m lerobot.scripts.rl.gym_manipulator --config_path examples/hil_serl_simulation_training/hi_rl_test_gamepad.json

# Test training with both fixes
python -m lerobot.scripts.rl.learner --config_path examples/hil_serl_simulation_training/train_gym_hil_env_gamepad.json
python -m lerobot.scripts.rl.actor --config_path examples/hil_serl_simulation_training/train_gym_hil_env_gamepad.json

How to checkout & try? (for the reviewer)

Test Time Limit Fix:

# Modify control_time_s in hi_rl_test_gamepad.json to 20.0 and verify episodes last 20 seconds
python -m lerobot.scripts.rl.gym_manipulator --config_path examples/hil_serl_simulation_training/hi_rl_test_gamepad.json

Test Cube Randomization:

# Set random_block_position to false in config and verify cube stays in same position
# Set to true and verify cube randomizes between episodes
python -m lerobot.scripts.rl.gym_manipulator --config_path examples/hil_serl_simulation_training/hi_rl_test_gamepad.json

Test Training with Both Fixes:

# Terminal 1: Start learner
python -m lerobot.scripts.rl.learner --config_path examples/hil_serl_simulation_training/train_gym_hil_env_gamepad.json

# Terminal 2: Start actor  
python -m lerobot.scripts.rl.actor --config_path examples/hil_serl_simulation_training/train_gym_hil_env_gamepad.json

Review Documentation:

# Check the new training guide
cat examples/hil_serl_simulation_training/hil_serl_simulation_training_guide_README.md

Expected Behavior:

Episodes should respect the control_time_s setting (30s in training config, 40s in recording config)
Cube should randomize position when random_block_position: true
Debug logs should show detailed episode progress and time tracking

spirosperos added 2 commits August 7, 2025 16:46

Initial commit

229eee6

trained model

6a0af00

spirosperos changed the title ~~Initial commit~~ HIL-SERL tutorial for simulation fixed and expanded Aug 7, 2025

lukicdarkoo mentioned this pull request Aug 8, 2025

HIL-SERL tutorial for simulation fixed and expanded #1

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

HIL-SERL tutorial for simulation fixed and expanded #2

HIL-SERL tutorial for simulation fixed and expanded #2

Uh oh!

spirosperos commented Aug 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

HIL-SERL tutorial for simulation fixed and expanded #2

Are you sure you want to change the base?

HIL-SERL tutorial for simulation fixed and expanded #2

Uh oh!

Conversation

spirosperos commented Aug 7, 2025

What this does

How it was tested

How to checkout & try? (for the reviewer)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants