DexMachina: Functional Retargeting for Bimanual Dexterous Manipulation

ICLR 2026 Conference Submission

Anonymous Authors

OpenReview Score: 5.5

Dexterous Manipulation; Deep Reinforcement Learning; Robot Learning; Learning in Simulation

Abstract:

We study the problem of functional retargeting: learning dexterous manipulation policies to track object states from human hand-object demonstrations. We focus on long-horizon, bimanual tasks with articulated objects, which are challenging due to large action space, spatiotemporal discontinuities, and the embodiment gap between human and robot hands. We propose DexMachina, a novel curriculum-based algorithm: the key idea is to use virtual object controllers with decaying strength: an object is first driven automatically towards its target states, such that the policy can gradually learn to take over under motion and contact guidance. We release a simulation benchmark with a diverse set of tasks and dexterous hands, and show that DexMachina significantly outperforms baseline methods. Our algorithm and benchmark enable a functional comparison for hardware designs, and we present key findings informed by quantitative and qualitative results. With the recent surge in dexterous hand development, we hope this work will provide a useful platform for identifying desirable hardware capabilities and lower the barrier for contributing to future research. Videos and more at project-dexmachina.github.io

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper introduces DexMachina, a curriculum-based reinforcement learning algorithm for functional retargeting of bimanual dexterous manipulation from human demonstrations. It resides in the 'Functional Retargeting and Embodiment Transfer' leaf, which contains six papers total including the original work. This leaf sits within the broader 'Human Motion Capture and Retargeting' branch, indicating a moderately populated research direction focused on bridging the embodiment gap between human and robot hands while preserving task semantics rather than exact kinematic correspondence.

The taxonomy reveals neighboring leaves addressing related but distinct challenges: 'Hand-Object Motion Extraction from Video' (four papers) focuses on markerless capture, while 'Sensor-Based Motion Capture' (three papers) uses wearable devices. The sibling papers in the same leaf—including Dexh2r, ManipTrans, and Object-centric Dexterous approaches—similarly tackle embodiment transfer but differ in scope. DexMachina's emphasis on bimanual coordination and long-horizon articulated object tasks positions it at the intersection of retargeting methods and spatial coordination learning, a branch with three papers exploring geometric relationships between dual arms.

Of the twenty-five candidates examined, five were compared against the DexMachina curriculum algorithm and none clearly refutes it, suggesting relative novelty in its specific approach. Two of the ten candidates compared against the simulation benchmark contribution are refutable, indicating that evaluation platforms already exist in this space. None of the ten candidates compared against the functional retargeting problem formulation refutes it, though this may reflect the limited search scope rather than absolute novelty. These statistics suggest that, within the examined literature, the algorithmic contribution is more distinctive than the benchmark or the problem framing.
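The counts in the paragraph above can be tallied in a few lines. This is only a bookkeeping sketch: the dictionary keys and variable names are illustrative, and the numbers are copied from this report rather than recomputed from the underlying search.

```python
# Counts copied from this report: (candidates examined, refutable candidates).
# The dictionary keys are illustrative labels, not identifiers used by the report.
contributions = {
    "curriculum algorithm": (5, 0),
    "simulation benchmark": (10, 2),
    "problem formulation": (10, 0),
}

# Total number of candidate papers examined across all three contributions.
total_examined = sum(examined for examined, _ in contributions.values())

# Fraction of examined candidates flagged as refutable, per contribution.
refutable_share = {
    name: refutable / examined
    for name, (examined, refutable) in contributions.items()
}

print(total_examined)
print(refutable_share)
```

A zero share means only that none of the retrieved candidates was flagged; given the limited search scope it does not establish novelty.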

Based on the top-twenty-five semantic matches and taxonomy structure, the work appears to occupy a moderately explored niche within functional retargeting. The curriculum-based approach and bimanual focus differentiate it from sibling papers, though the benchmark contribution overlaps with existing evaluation efforts. The analysis covers a focused subset of the field and does not claim exhaustive coverage of all related work in dexterous manipulation or demonstration learning.

Taxonomy

Research Landscape Overview

Core task: learning bimanual dexterous manipulation from human demonstrations.

The field organizes around several interconnected branches that address distinct facets of transferring human skill to robots. Human Motion Capture and Retargeting focuses on translating observed human movements into robot-executable actions, often dealing with embodiment mismatches between human hands and robotic end-effectors. Data Collection Systems and Teleoperation develops interfaces and hardware for gathering high-quality demonstrations, while Policy Learning from Demonstrations tackles the algorithmic challenge of distilling these examples into robust control policies. Active Perception and Egocentric Vision emphasizes the role of visual feedback during manipulation, and Symbolic Task Modeling and Temporal Constraints captures higher-level task structure and sequencing. Spatial Coordination Learning addresses the geometric relationships between two arms, and Synthetic Data Generation and Augmentation explores ways to expand limited real-world datasets. Specialized Manipulation Contexts, Foundation Models and Cross-Embodiment Transfer, and Benchmarks and Datasets round out the taxonomy by addressing domain-specific challenges, generalization across platforms, and standardized evaluation.

A particularly active line of work centers on functional retargeting methods that preserve task semantics rather than exact kinematic correspondence, exemplified by DexMachina[0], which learns to map human hand motions to dexterous robotic grippers by focusing on object-centric goals. Nearby efforts such as Dexh2r[6] and ManipTrans[11] similarly emphasize bridging the embodiment gap through learned mappings or trajectory adaptation. In contrast, works like Object-centric Dexterous[1] and Teach Once[3] explore how to generalize demonstrations across object instances or enable rapid skill transfer with minimal retraining.

DexMachina[0] sits within this cluster of retargeting approaches but distinguishes itself by targeting bimanual coordination and dexterous manipulation jointly, whereas some neighbors like DexTrack[16] or Hermes[31] prioritize tracking fidelity or cross-embodiment transfer. Open questions remain around balancing kinematic accuracy with task success, scaling to diverse object geometries, and integrating temporal constraints from symbolic models.

Claimed Contributions

DexMachina curriculum-based RL algorithm for functional retargeting

5 retrieved papers

The authors introduce DexMachina, a reinforcement learning algorithm that uses virtual object controllers to automatically drive objects toward target states initially, then gradually decays their strength so the policy learns to take over manipulation under motion and contact guidance. This curriculum approach addresses challenges in learning long-horizon bimanual dexterous manipulation from human demonstrations.
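The curriculum idea described above can be illustrated with a toy example. The following is a minimal sketch under stated assumptions, not the authors' implementation: the 1-D point-mass "object", the PD form of the controller, the gain values, the halving schedule, and all names (`VirtualObjectController`, `rollout`, `no_policy`) are hypothetical and chosen only to show how assistance that decays to zero hands control over to the policy.

```python
from dataclasses import dataclass


@dataclass
class VirtualObjectController:
    """Hypothetical PD controller that pulls an object toward its target state."""
    kp: float = 50.0     # proportional gain (assumed value)
    kd: float = 5.0      # damping gain (assumed value)
    scale: float = 1.0   # curriculum strength in [0, 1]

    def force(self, pos: float, vel: float, target: float) -> float:
        # The assisting force is the PD force multiplied by the curriculum scale.
        return self.scale * (self.kp * (target - pos) - self.kd * vel)

    def decay(self, factor: float = 0.5, floor: float = 1e-3) -> None:
        # Assumed schedule: geometric decay, snapped to zero below a floor
        # so that the final policy acts without any virtual assistance.
        self.scale *= factor
        if self.scale < floor:
            self.scale = 0.0


def rollout(controller, policy_force, target=1.0, steps=200, dt=0.01, mass=1.0):
    """Simulate a 1-D point-mass object and return the final tracking error."""
    pos, vel = 0.0, 0.0
    for _ in range(steps):
        total = controller.force(pos, vel, target) + policy_force(pos, vel, target)
        vel += (total / mass) * dt   # semi-implicit Euler integration
        pos += vel * dt
    return abs(target - pos)


def no_policy(pos, vel, target):
    """Stand-in for an untrained policy that applies no force to the object."""
    return 0.0


controller = VirtualObjectController()
assisted_error = rollout(controller, no_policy)    # object is driven automatically
for _ in range(10):                                # ten curriculum stages
    controller.decay()
unassisted_error = rollout(controller, no_policy)  # nothing moves the object now
```

With full assistance the object reaches its target even though the stand-in policy does nothing. Once the scale has decayed to zero, the tracking error depends entirely on the policy, which is the point at which a learned policy would have to have taken over.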

Simulation benchmark for dexterous manipulation evaluation

Can Refute

10 retrieved papers

The authors release a simulation benchmark containing six dexterous hand models and five articulated objects. This benchmark provides a unified testbed for evaluating functional retargeting algorithms and enables functional comparison across different hardware designs, where new hands and tasks can be easily added and quickly evaluated.

Functional retargeting problem formulation

10 retrieved papers

The authors formalize the functional retargeting problem, which aims to learn feasible dexterous robot policies that manipulate objects to follow demonstrated trajectories from human hand-object demonstrations. This is distinguished from kinematic retargeting by emphasizing task capability and feasibility rather than merely producing human-like motions.

Core Task Comparisons

Comparisons with papers in the same taxonomy category

[1] Object-centric dexterous manipulation from human motion data

Yuanpei Chen, Chen Wang, Yaodong Yang, C. Karen Liu (2024)

[6] Dexh2r: Task-oriented dexterous manipulation from human to robots

Shuqi Zhao, Xinghao Zhu, Yuxin Chen, Chenran Li, Xiang Zhang, Mingyu Ding, Masayoshi Tomizuka (2024)

[11] Maniptrans: Efficient dexterous bimanual manipulation transfer via residual learning

Kailin Li, Puhao Li, Tengyu Liu, Yuyang Li, Siyuan Huang (2025)

[16] DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

Xueyi Liu, Jianibieke Adalibieke, Qianwei Han, Yuzhe Qin, Li Yi (2025) • International Conference on Learning Representations

[31] Hermes: Human-to-robot embodied learning from multi-source motion data for mobile dexterous manipulation

Zhecheng Yuan, Tianming Wei, Pu Hua, Yuanpei Chen, Huazhe Xu (2025)

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

DexMachina curriculum-based RL algorithm for functional retargeting

[51] Demostart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots

Cannot Refute

[52] Acquiring musculoskeletal skills with curriculum-based reinforcement learning

Cannot Refute

[53] AI in Surgical Curriculum Design and Unintended Outcomes for Technical Competencies in Simulation Training

Cannot Refute

[54] ForceGrip: Reference-Free Curriculum Learning for Realistic Grip Force Control in VR Hand Manipulation

Cannot Refute

[55] Curriculum-based Sensing Reduction in Simulation to Real-World Transfer for In-hand Manipulation

Cannot Refute

Contribution

Simulation benchmark for dexterous manipulation evaluation

[60] DexArt: Benchmarking Generalizable Dexterous Manipulation with Articulated Objects

Can Refute

[61] Bi-dexhands: Towards human-level bimanual dexterous manipulation

Can Refute

[10] DexMV: Imitation Learning for Dexterous Manipulation from Human Videos

Cannot Refute

[56] Maniskill2: A unified benchmark for generalizable manipulation skills

Cannot Refute

[57] HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning

Cannot Refute

[58] HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation

Cannot Refute

[59] Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation

Cannot Refute

[62] DexYCB: A Benchmark for Capturing Hand Grasping of Objects

Cannot Refute

[63] TeleOpBench: A Simulator-Centric Benchmark for Dual-Arm Dexterous Teleoperation

Cannot Refute

[64] Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

Cannot Refute

Contribution

Functional retargeting problem formulation

[9] DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning

Cannot Refute

[65] What matters in learning from offline human demonstrations for robot manipulation

Cannot Refute

[66] Concept2Robot: Learning manipulation concepts from instructions and human demonstrations

Cannot Refute

[67] Robot learning from human teachers

Cannot Refute

[68] Universal manipulation interface: In-the-wild robot teaching without in-the-wild robots

Cannot Refute

[69] Language-Conditioned Imitation Learning for Robot Manipulation Tasks

Cannot Refute

[70] Watch and act: Learning robotic manipulation from visual demonstration

Cannot Refute

[71] Skill Learning in Robot-Assisted Micro-Manipulation Through Human Demonstrations with Attention Guidance

Cannot Refute

[72] Data Scaling Laws in Imitation Learning for Robotic Manipulation

Cannot Refute

[73] Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt

Cannot Refute
