- fix(eval_policy): Fix the logic for destroying the video pipeline
episode1.hdf5
├── endpose (float64, shape: [147, 14])
├── joint_action/
│ ├── left_arm (float64, shape: [147, 6])
│ ├── left_gripper (float64, shape: [147])
│ ├── right_arm (float64, shape: [147, 6])
│ ├── right_gripper (float64, shape: [147])
│ └── vector (float64, shape: [147, 14])
├── observation/
│ ├── front_camera/
│ │ ├── cam2world_gl (float32, shape: [147, 4, 4])
│ │ ├── depth (float64, shape: [147, 240, 320])
│ │ ├── extrinsic_cv (float32, shape: [147, 3, 4])
│ │ ├── intrinsic_cv (float32, shape: [147, 3, 3])
│ │ └── rgb (|S17330, shape: [147])
│ ├── head_camera/
│ │ ├── cam2world_gl (float32, shape: [147, 4, 4])
│ │ ├── depth (float64, shape: [147, 240, 320])
│ │ ├── extrinsic_cv (float32, shape: [147, 3, 4])
│ │ ├── intrinsic_cv (float32, shape: [147, 3, 3])
│ │ └── rgb (|S19188, shape: [147])
│ ├── left_camera/
│ │ ├── cam2world_gl (float32, shape: [147, 4, 4])
│ │ ├── depth (float64, shape: [147, 240, 320])
│ │ ├── extrinsic_cv (float32, shape: [147, 3, 4])
│ │ ├── intrinsic_cv (float32, shape: [147, 3, 3])
│ │ └── rgb (|S17970, shape: [147])
│ └── right_camera/
│ ├── cam2world_gl (float32, shape: [147, 4, 4])
│ ├── depth (float64, shape: [147, 240, 320])
│ ├── extrinsic_cv (float32, shape: [147, 3, 4])
│ ├── intrinsic_cv (float32, shape: [147, 3, 3])
│ └── rgb (|S5962, shape: [147])
└── pointcloud (float32, shape: [147, 1024, 6])
/data/sea_disk0/cuihz/code/RoboTwin2.0/RoboTwin/data
This dataset includes two distinct environment settings designed to support robust perception and policy learning:
- The table and surrounding environment have a fixed white appearance
- Lighting is consistent, stable, and well-suited for perception
- No distracting or irrelevant objects are present in the scene
- This subset is being prioritized for organization, and data has been processed up to the task:
open_laptop
- Table textures and environmental materials are randomized
- Scenes may include additional irrelevant objects to increase clutter
- Lighting conditions are highly randomized, with variations in direction, intensity, and possible extreme lighting scenarios
-
Please verify that the command
vulkaninfocan correctly list all four L40 GPUs. -
If
vulkaninfofails to display the GPUs, refer to this GitHub issue for troubleshooting: 👉 NVIDIA Container Toolkit Issue #16 -
The Vulkan ICD (Installable Client Driver) configuration file should look like this:
{ "file_format_version": "1.0.0", "ICD": { "library_path": "libGLX_nvidia.so.0", "api_version": "1.3.277" } }
Docker images typically do not include ffmpeg by default. To enable video/audio processing features, follow these steps:
-
Install
libvpx7first Because our cluster uses theamd64CPU architecture, you must manually installlibvpx7before installingffmpeg.A pre-downloaded
.debpackage is located at:/data/sea_disk0/cuihz/code/libvpx7_1.12.0-1+deb12u4_amd64.debInstall it with:
sudo apt install /data/sea_disk0/cuihz/code/libvpx7_1.12.0-1+deb12u4_amd64.deb
-
Then install FFmpeg:
sudo apt update sudo apt install ffmpeg
Some of our bash scripts rely on commands such as envsubst, which are not available in minimal Docker images by default. These tools are provided by the gettext package.
To ensure full compatibility, run:
sudo apt install gettextSee RoboTwin 2.0 Document (Usage - Install & Download) for installation instructions. It takes about 20 minutes for installation.
See RoboTwin 2.0 Tasks Doc for more details.
Please Refer to RoboTwin 2.0 Document (Usage) for more details.
We provide over 100,000 pre-collected trajectories as part of the open-source release RoboTwin Dataset. However, we strongly recommend users to perform data collection themselves due to the high configurability and diversity of task and embodiment setups.
Running the following command will first search for a random seed for the target collection quantity, and then replay the seed to collect data.
bash collect_data.sh ${task_name} ${task_config} ${gpu_id}
# Example: bash collect_data.sh beat_block_hammer demo_randomized 0
See RoboTwin 2.0 Tasks Configurations Doc for more details.
TinyVLA, DexVLA (Contributed by Media Group)
Deploy Your Policy: guide
⏰ TODO: G3Flow, HybridVLA, DexVLA, OpenVLA-OFT, SmolVLA, AVR, UniVLA
We recommend that the RoboTwin Platform can be used to explore the following topics:
- single - task fine - tuning capability
- visual robustness
- language diversity robustness (language condition)
- multi-tasks capability
- cross-embodiment performance
Coming Soon.
If you find our work useful, please consider citing:
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation
Coming Soon.
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins, accepted to CVPR 2025 (Highlight)
@InProceedings{Mu_2025_CVPR,
author = {Mu, Yao and Chen, Tianxing and Chen, Zanxin and Peng, Shijia and Lan, Zhiqian and Gao, Zeyu and Liang, Zhixuan and Yu, Qiaojun and Zou, Yude and Xu, Mingkun and Lin, Lunkai and Xie, Zhiqiang and Ding, Mingyu and Luo, Ping},
title = {RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins},
booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)},
month = {June},
year = {2025},
pages = {27649-27660}
}
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version), accepted to ECCV Workshop 2024 (Best Paper Award)
@article{mu2024robotwin,
title={RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version)},
author={Mu, Yao and Chen, Tianxing and Peng, Shijia and Chen, Zanxin and Gao, Zeyu and Zou, Yude and Lin, Lunkai and Xie, Zhiqiang and Luo, Ping},
journal={arXiv preprint arXiv:2409.02920},
year={2024}
}
Software Support: D-Robotics, Hardware Support: AgileX Robotics, AIGC Support: Deemos
Code Style: find . -name "*.py" -exec sh -c 'echo "Processing: {}"; yapf -i --style='"'"'{based_on_style: pep8, column_limit: 120}'"'"' {}' \;
Contact Tianxing Chen if you have any questions or suggestions.
This repository is released under the MIT license. See LICENSE for additional details.

