DexNDM: Closing the Reality Gap for Dexterous In-Hand Rotation via Joint-Wise Neural Dynamics Model
Overview
Overall Novelty Assessment
The paper contributes a joint-wise neural dynamics model for sim-to-real transfer in dexterous in-hand object rotation, paired with an autonomous data collection strategy. It resides in the 'Adaptive and Fine-Tuning Transfer' leaf, which contains five papers total including the original work. This leaf sits within the broader 'Sim-to-Real Transfer Methods and Frameworks' branch, indicating a moderately populated research direction. The sibling papers explore online adaptation (Rapid Motor Adaptation), controller refinements (DexCtrl), and privileged information distillation (DROP), suggesting this adaptive transfer area is active but not overcrowded compared to domain randomization approaches.
The taxonomy reveals neighboring directions that contextualize this work's positioning. The adjacent 'Domain Randomization and Direct Transfer' leaf contains six papers emphasizing zero-shot deployment without fine-tuning, while 'Simulation-Guided and Digital Twin Approaches' (three papers) explores world models for transfer. The paper's focus on data-efficient adaptation distinguishes it from purely randomization-based methods, yet shares goals with simulation-guided approaches. The 'Sensory Modalities and Perception' branch (thirteen papers across vision, tactile, and multimodal categories) addresses complementary challenges in state estimation, while this work assumes sensory inputs are available and focuses on dynamics modeling.
Among twenty-three candidates examined across three contributions, no clearly refutable prior work emerged. The joint-wise dynamics model examined three candidates with zero refutations, suggesting this factorized approach to dynamics learning may represent a less-explored angle. The autonomous data collection strategy and the overall sim-to-real framework each examined ten candidates without refutation, indicating these contributions appear novel within the limited search scope. However, the search examined only top-K semantic matches plus citations, not an exhaustive survey, so related work in adjacent subfields (e.g., system identification, residual learning) may exist outside this candidate pool.
The analysis suggests the paper occupies a moderately novel position within adaptive sim-to-real transfer, with no strong prior work overlap detected among examined candidates. The joint-wise factorization and autonomous data collection appear distinctive given the search scope, though the limited candidate pool (twenty-three papers) means adjacent literature in dynamics modeling or automated data collection may not have been fully captured. The taxonomy structure indicates this adaptive transfer direction remains active, with ongoing exploration of data efficiency versus generalization trade-offs.
Taxonomy
Research Landscape Overview
Claimed Contributions
The authors introduce a novel neural dynamics model that factorizes system dynamics across individual joints rather than modeling the whole hand-object system. Each joint's evolution is predicted from its own proprioceptive history, compressing system-wide influences into low-dimensional variables. This design achieves high data efficiency and generalizability across different interaction distributions.
The authors develop an autonomous data collection method called Chaos Box that gathers real-world interaction data by placing the robotic hand in a container with soft balls and replaying actions from the simulated policy. This approach eliminates catastrophic failures and human resets while providing diverse, object-loaded interaction data at scale.
The authors present a complete pipeline combining specialist-to-generalist policy training with their joint-wise dynamics model and autonomous data collection. This framework demonstrates unprecedented generality in rotating challenging objects with complex shapes, high aspect ratios, small sizes, and diverse wrist orientations in real-world settings.
Core Task Comparisons
Comparisons with papers in the same taxonomy category
[2] In-hand object rotation via rapid motor adaptation PDF
[11] Maniptrans: Efficient dexterous bimanual manipulation transfer via residual learning PDF
[21] Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning PDF
[29] DexCtrl: Towards Sim-to-Real Dexterity with Adaptive Controller Learning PDF
Contribution Analysis
Detailed comparisons for each claimed contribution
Joint-wise neural dynamics model for sim-to-real transfer
The authors introduce a novel neural dynamics model that factorizes system dynamics across individual joints rather than modeling the whole hand-object system. Each joint's evolution is predicted from its own proprioceptive history, compressing system-wide influences into low-dimensional variables. This design achieves high data efficiency and generalizability across different interaction distributions.
[51] Design of minimal model-free control structure for fast trajectory tracking of robotic arms PDF
[52] Learning task space actions for bipedal locomotion PDF
[53] Sim-to-Real Latent Dynamics Adaptation in High-Gear-Ratio Humanoid Robots for Robust and Expressive Locomotion * PDF
Fully autonomous data collection strategy
The authors develop an autonomous data collection method called Chaos Box that gathers real-world interaction data by placing the robotic hand in a container with soft balls and replaying actions from the simulated policy. This approach eliminates catastrophic failures and human resets while providing diverse, object-loaded interaction data at scale.
[58] Droid: A large-scale in-the-wild robot manipulation dataset PDF
[59] Efficient Data Collection for Robotic Manipulation via Compositional Generalization PDF
[60] Precise and dexterous robotic manipulation via human-in-the-loop reinforcement learning PDF
[61] Scaling up and distilling down: Language-guided robot skill acquisition PDF
[62] Boosting Robotic Manipulation Generalization with Minimal Costly Data PDF
[63] QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation PDF
[64] Mimicgen: A data generation system for scalable robot learning using human demonstrations PDF
[65] RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation PDF
[66] Self-supervised learning of state estimation for manipulating deformable linear objects PDF
[67] Scaling robot learning with semantically imagined experience PDF
Sim-to-real framework for general-purpose in-hand rotation
The authors present a complete pipeline combining specialist-to-generalist policy training with their joint-wise dynamics model and autonomous data collection. This framework demonstrates unprecedented generality in rotating challenging objects with complex shapes, high aspect ratios, small sizes, and diverse wrist orientations in real-world settings.