Weekly Robotics #358 — Digest for Xiaomi Robotics EI Team

MiaoDX|2026-04-28T00:00:00.000Z
roboticsembodied-aivlnros2

Weekly Robotics #358

🔑 Top picks this week

VLA Foundry — Vision-Language-Action Model Training Framework

Relevance: 9/10 | Embodied AI / VLA

VLA Foundry is an open-source framework from TRI-ML that unifies LLM, VLM, and VLA training in a single codebase. Unlike most open-source VLA efforts that focus only on the action fine-tuning stage, VLA Foundry provides a shared training stack with end-to-end control — from language pretraining all the way to action-expert fine-tuning. Supports both from-scratch training and pretrained backbones from Hugging Face.

Why it matters for Xiaomi EI: The unified training pipeline could significantly streamline VLA research. If we're building or iterating on embodied action models, this framework reduces the integration overhead of stitching together incompatible pretraining stages. Worth a deep dive before starting any new VLA training run.

🔗 Source · GitHub


NCore — NVIDIA's Open-Source Neural 3D Reconstruction & Simulation Platform

Relevance: 8/10 | Simulation / ROS2 / Data Pipeline

NCore is NVIDIA's open-source Python-based platform providing data representations, APIs, and tools for neural 3D reconstruction and simulation, with a focus on robotics and autonomous vehicle data. It natively supports modern neural rendering approaches and serves as a data infrastructure layer for physics-based simulation.

Why it matters for Xiaomi EI: Simulation-to-real gap is a core challenge in robot learning. NCore's focus on neural 3D reconstruction could enable richer environment representations for training. The AV focus is also relevant given that both AV and robotics share sensor-fusion and world-modeling pipelines.

🔗 Source · GitHub


Sony AI Table Tennis Robot ("Ace") — Event Camera for Spin Detection

Relevance: 7/10 | Embodied Perception / Manipulation

Sony AI developed a robot manipulator on a gantry that achieves outstanding table tennis performance. The system uses multiple RGB cameras at 200Hz, with the standout innovation being the use of an event camera to measure ball spin — a genuinely novel sensor application for high-speed manipulation.

Why it matters for Xiaomi EI: Event cameras are underutilized in robotics manipulation pipelines. This demo shows that edge cases like fast-spinning objects (which blur in standard cameras) can be handled with event-based sensing. If our robot platforms encounter similar high-speed, partial-occlusion scenarios, event cameras could be a key sensor upgrade.

🔗 Source


📦 Worth watching

reBot-DevArm — $1,200 Open-Source 6 DoF Robot Arm (SeeedStudio)

A cost-efficient open-source 6 DoF arm at $1,200 with a 4.5 kg weight and 1.5 kg payload. The weight-to-payload ratio (3:1) is impressive for development platforms. Good candidate for research曙 idesk setups or低成本 manipulation experiments. GitHub

WebODM — Drone Mapping Software

Open-source photogrammetry toolkit for drone-captured imagery. If your team is building spatial awareness or 3D mapping capabilities, WebODM provides an end-to-end pipeline from imagery to 3D point clouds and models. Relevant for outdoor robot navigation and environment modeling. webodm.org

Pudu Robotics Raises ~$150M, Targets Industrial Applications

Pudu Technology's $150M funding round values the company at $1.5B+. The company is pivoting from service robots (food service, hospitality) toward industrial applications. This signals intensifying competition in the Chinese industrial robotics market and potentially more open-source or partnership opportunities as Pudu scales its industrial portfolio. Source


📅 Notable events

EventDateLocationNotes
Actuate Global — LondonApr 30, 2026London, UKFoxglove-backed community meetup
Hands on Workshop: Scaling VLA Models with RayApr 30, 2026Pittsburgh, USADirectly relevant to VLA training at scale
Actuate Global — ParisMay 04, 2026Paris, FranceCommunity meetup
Actuate Global — Zurich: Field SessionMay 06, 2026Zurich, SwitzerlandHands-on robotics session
NextGen Robotics LIVEMay 06, 2026Odense, DenmarkDenmark's robotics hub
Actuate Global — MunichMay 07, 2026Munich, GermanyCommunity meetup
Open Hardware SummitMay 22–23, 2026Berlin, GermanyOpen-source hardware
Robotics Summit & Expo 2026May 27–28, 2026Boston, USAHigh priority — major US robotics conference

Recommendation: The Robotics Summit & Expo 2026 (May 27–28, Boston) is the highest-signal event for Xiaomi EI. If budget allows, attending or monitoring publications from this event is strongly advised.


💡 Quick take

This week's issue is VLA-heavy, continuing a clear trend from recent issues. VLA Foundry is the most important item — it's a serious attempt at building a unified, open-source VLA training stack, and teams starting new action model research should evaluate it before building custom pipelines.

On the industry side, Pudu's $150M raise and pivot to industrial applications underscores that Chinese robot companies are aggressively moving up the value chain. The table tennis robot's use of event cameras for spin detection is a practical example of how unconventional sensors can solve real-world perception problems that plague standard RGB approaches.

For simulation, NCore's emphasis on neural 3D reconstruction over traditional mesh-based methods suggests the field is converging on differentiable, learned representations as the foundation for sim-to-real transfer. Teams investing in physics simulation should monitor this space closely.

M
MiaoDX × AI Agents
机器人研发工程师,OPC 实践者 — One Person, plus multi Claws。白天给机器人写 bug,其他时间和 AI Agents 一起做更多的事。