GEAR-SONIC encoder+decoder tracking on Unitree G1 using the published HuggingFace ONNX models. The clip is a deterministic marching fixture run through the real tracking stack, not a mock decoder path.
| Metric | Expected | Actual | Severity | Result |
|---|---|---|---|---|
| min_torso_z | ge 0.65 | 0.7433720515840081 | critical | PASS |
| final_tracking_frame | ge 179 | 179.0 | critical | PASS |
| mean_rms_joint_error | le 0.6 | 0.25449907042913966 | major | PASS |
| max_rms_joint_error | le 0.95 | 0.3062019944190979 | major | PASS |
| left_hip_pitch_span | ge 0.1 | 0.6614512205123901 | major | PASS |
| right_hip_pitch_span | ge 0.1 | 0.667294442653656 | major | PASS |
| left_knee_span | ge 0.1 | 0.8977575898170471 | major | PASS |
| right_knee_span | ge 0.1 | 1.0016324520111084 | major | PASS |
| torso_z | ge 0.65 | 0.7960637142120386 | critical | PASS |
| rms_joint_error | le 0.85 | 0.212299644947052 | major | PASS |
| torso_z | ge 0.65 | 0.7950028397257438 | critical | PASS |
| rms_joint_error | le 0.85 | 0.21850714087486267 | major | PASS |
| torso_z | ge 0.65 | 0.7474431603556333 | critical | PASS |
| rms_joint_error | le 0.85 | 0.30155473947525024 | major | PASS |
| torso_z | ge 0.65 | 0.7439395041866083 | critical | PASS |
| rms_joint_error | le 0.85 | 0.2334790974855423 | major | PASS |
Step: 22 | Cameras: global_view, head_camera | Capability: render_camera
global_view
head_camera
Step: 60 | Cameras: global_view, head_camera | Capability: render_camera
global_view
head_camera
Step: 120 | Cameras: global_view, head_camera | Capability: render_camera
global_view
head_camera
Step: 180 | Cameras: global_view, head_camera | Capability: render_camera
global_view
head_camera