1. Do I need real hardware to use the RL environment?
No. The simulation environments (MuJoCo, IsaacSim, Genesis) run entirely in software. You only need real hardware when you want to validate sim-to-real transfer. The Community and Research tiers are fully software-based.
2. How accurate are the simulation models?
OpenArm 101 sim-to-real position error is under 3mm at the end-effector after calibration. Joint dynamics (friction, damping) are identified from real hardware and validated quarterly. Contact physics are approximate — expect 10-20% performance gap on contact-rich tasks, which is why we recommend real-world fine-tuning for deployment.
3. Can I use my own robot in the SVRC RL environment?
Yes, if you have a URDF file. The environment framework supports arbitrary URDF models. However, benchmark results and baselines are only provided for SVRC hardware. Contact us if you need help creating or validating a URDF for your robot.
4. What RL algorithms do you support?
The environment is algorithm-agnostic — it provides a standard Gymnasium interface. Our example scripts include PPO, SAC, and TD3 implementations. You can use any RL library that supports Gymnasium: Stable-Baselines3, CleanRL, RLlib, or your own implementation.
5. How do I submit benchmark results?
Run the evaluation script with the --submit flag. It records your policy checkpoint hash, evaluation metrics, and environment configuration, then submits to the SVRC Benchmark Leaderboard. Research License or Enterprise access required for submission.
6. Can I use the environment for a university course?
Yes. The Community tier is free and sufficient for most course assignments. For courses that need IsaacSim environments or benchmark submission access, request a Research License with your faculty email. See the educator page for course integration guidance.
7. What GPU do I need?
MuJoCo runs on CPU and is fast enough for most training tasks. IsaacSim requires an NVIDIA GPU (RTX 3070 or better recommended). Genesis runs on both CPU and GPU. For the getting-started example above, no GPU is needed — CPU training on the reach task converges in 10 minutes.
8. How does this relate to SVRC Data Services?
The RL Environment provides simulation and real-hardware environments for training. Data Services provides human-collected teleoperation data. The two complement each other: use teleoperation data to bootstrap RL training, then use RL to fine-tune beyond human performance. The approach page explains our philosophy on combining data sources.