Hardware adapters & EmbodiedGovBench v2
v0.10.0 ships the governance core. v0.11.0 takes that core out in two directions: real-hardware adapters (Unitree G1 + Franka) and a public, multi-LLM benchmark with leaderboard.
- Real Unitree G1 adapter —
RealUnitreeG1Robotimplements the Robot ABC against Unitree SDK 2 RPC.identity_hashbyte-equal acrossg1_sim ↔ g1_realswap. - Real Franka adapter —
RealFrankaRobotagainst thefranka_ros2driver, with workspace-bounds + speed-factor defense in depth. - Humanoid scenario suite — 4 G1 scenarios for
NavigateECM+GraspAndPlaceECM, including an HR-4 enforcement check (operator-relax without signature must be refused). - Multi-LLM provider harness — matrix runner across Claude / GLM / Gemini / local-onnx.
--cheapoffline mode for CI. - Public leaderboard —
docs/leaderboard/v0.11.0.json+docs/LEADERBOARD.md, regenerable, every row commit-pinned. - Paper experiment reproducibility — P11 / P12 / P14 claims now run as
pytest tests/benchmarks/papers/; existing scripts kept as thin wrappers. - Operator docs —
docs/BENCHMARK.md+docs/SCENARIOS.mdcovering full-matrix runs, env vars, cost estimates, scenario authoring. - Test baseline — ~2,347 passing on
main.
The ROS 2 bridge process and the end-to-end real-hardware demo (G1 walks 1m + Franka grasps under live bridge) are deferred until a maintainer has both robots in physical reach. Those land as the v0.11.0 final tag along with the full hardware-track release notes.