You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+98Lines changed: 98 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,6 +5,30 @@ Scaffold for a modular imitation learning toolkit with dataset loaders, retarget
5
5
6
6
This is **only a skeleton**—fill in each module with real logic as you develop.
7
7
8
+
## Installation
9
+
10
+
We recommend using the `conda-forge` channel for managing dependencies. For a faster experience, use `mamba` to create the environment and `uv` to install packages.
# 3. Reset trajectories at the beginning of a training session
104
+
manager.reset_trajectories()
105
+
106
+
# 4. Fetch reference data in a loop (e.g., inside your RL environment's step function)
107
+
for_in range(1000): # Simulate 1000 steps
108
+
# Get a batch of reference data for the current timestep
109
+
reference_data = manager.get_reference_data()
110
+
111
+
# The data is returned as a TensorDict for easy access
112
+
# Shape: [num_envs, ...]
113
+
com_positions = reference_data["com_pos"]
114
+
joint_positions = reference_data["joint_pos"]
115
+
116
+
# Your agent would use this data to compute actions...
117
+
# print(f"Step {_}: COM Position Batch Shape: {com_positions.shape}")
118
+
119
+
# To reset specific environments that have completed their episode:
120
+
# done_env_ids = torch.tensor([2, 5], device=device) # Example IDs
121
+
# manager.reset_trajectories(done_env_ids)
122
+
123
+
```
124
+
125
+
## Testing
126
+
127
+
To verify that the `loco_mujoco` dataset loader is working correctly, you can run the specific test file for it. This is currently the recommended test to run.
128
+
129
+
First, ensure you have the necessary test dependencies installed:
0 commit comments