DexMachina: Functional Retargeting for Bimanual Dexterous Manipulation

ICLR 2026 Conference SubmissionAnonymous Authors
Dexterous Manipulation; Deep Reinforcement Learning; Robot Learning; Learning in Simulation
Abstract:

We study the problem of functional retargeting: learning dexterous manipulation policies to track object states from human hand-object demonstrations. We focus on long-horizon, bimanual tasks with articulated objects, which are challenging due to large action space, spatiotemporal discontinuities, and the embodiment gap between human and robot hands. We propose DexMachina, a novel curriculum-based algorithm: the key idea is to use virtual object controllers with decaying strength: an object is first driven automatically towards its target states, such that the policy can gradually learn to take over under motion and contact guidance. We release a simulation benchmark with a diverse set of tasks and dexterous hands, and show that DexMachina significantly outperforms baseline methods. Our algorithm and benchmark enable a functional comparison for hardware designs, and we present key findings informed by quantitative and qualitative results. With the recent surge in dexterous hand development, we hope this work will provide a useful platform for identifying desirable hardware capabilities and lower the barrier for contributing to future research. Videos and more at \url{project-dexmachina.github.io}

Disclaimer
This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.
NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper introduces DexMachina, a curriculum-based reinforcement learning algorithm for functional retargeting of bimanual dexterous manipulation from human demonstrations. It resides in the 'Functional Retargeting and Embodiment Transfer' leaf, which contains six papers total including the original work. This leaf sits within the broader 'Human Motion Capture and Retargeting' branch, indicating a moderately populated research direction focused on bridging the embodiment gap between human and robot hands while preserving task semantics rather than exact kinematic correspondence.

The taxonomy reveals neighboring leaves addressing related but distinct challenges: 'Hand-Object Motion Extraction from Video' (four papers) focuses on markerless capture, while 'Sensor-Based Motion Capture' (three papers) uses wearable devices. The sibling papers in the same leaf—including Dexh2r, ManipTrans, and Object-centric Dexterous approaches—similarly tackle embodiment transfer but differ in scope. DexMachina's emphasis on bimanual coordination and long-horizon articulated object tasks positions it at the intersection of retargeting methods and spatial coordination learning, a branch with three papers exploring geometric relationships between dual arms.

Among twenty-five candidates examined, the DexMachina curriculum algorithm shows no clear refutation across five candidates reviewed, suggesting relative novelty in its specific approach. The simulation benchmark contribution encountered two refutable candidates among ten examined, indicating existing evaluation platforms in this space. The functional retargeting problem formulation examined ten candidates with no refutations, though this may reflect the limited search scope rather than absolute novelty. The statistics suggest the algorithmic contribution appears more distinctive than the benchmark or problem framing within the examined literature.

Based on the top-twenty-five semantic matches and taxonomy structure, the work appears to occupy a moderately explored niche within functional retargeting. The curriculum-based approach and bimanual focus differentiate it from sibling papers, though the benchmark contribution overlaps with existing evaluation efforts. The analysis covers a focused subset of the field and does not claim exhaustive coverage of all related work in dexterous manipulation or demonstration learning.

Taxonomy

Core-task Taxonomy Papers
50
3
Claimed Contributions
25
Contribution Candidate Papers Compared
2
Refutable Paper

Research Landscape Overview

Core task: learning bimanual dexterous manipulation from human demonstrations. The field organizes around several interconnected branches that address distinct facets of transferring human skill to robots. Human Motion Capture and Retargeting focuses on translating observed human movements into robot-executable actions, often dealing with embodiment mismatches between human hands and robotic end-effectors. Data Collection Systems and Teleoperation develops interfaces and hardware for gathering high-quality demonstrations, while Policy Learning from Demonstrations tackles the algorithmic challenge of distilling these examples into robust control policies. Active Perception and Egocentric Vision emphasizes the role of visual feedback during manipulation, and Symbolic Task Modeling and Temporal Constraints captures higher-level task structure and sequencing. Spatial Coordination Learning addresses the geometric relationships between two arms, and Synthetic Data Generation and Augmentation explores ways to expand limited real-world datasets. Specialized Manipulation Contexts, Foundation Models and Cross-Embodiment Transfer, and Benchmarks and Datasets round out the taxonomy by addressing domain-specific challenges, generalization across platforms, and standardized evaluation. A particularly active line of work centers on functional retargeting methods that preserve task semantics rather than exact kinematic correspondence, exemplified by DexMachina[0], which learns to map human hand motions to dexterous robotic grippers by focusing on object-centric goals. Nearby efforts such as Dexh2r[6] and ManipTrans[11] similarly emphasize bridging the embodiment gap through learned mappings or trajectory adaptation. In contrast, works like Object-centric Dexterous[1] and Teach Once[3] explore how to generalize demonstrations across object instances or enable rapid skill transfer with minimal retraining. DexMachina[0] sits within this cluster of retargeting approaches but distinguishes itself by targeting bimanual coordination and dexterous manipulation jointly, whereas some neighbors like DexTrack[16] or Hermes[31] prioritize tracking fidelity or cross-embodiment transfer. Open questions remain around balancing kinematic accuracy with task success, scaling to diverse object geometries, and integrating temporal constraints from symbolic models.

Claimed Contributions

DexMachina curriculum-based RL algorithm for functional retargeting

The authors introduce DexMachina, a reinforcement learning algorithm that uses virtual object controllers to automatically drive objects toward target states initially, then gradually decays their strength so the policy learns to take over manipulation under motion and contact guidance. This curriculum approach addresses challenges in learning long-horizon bimanual dexterous manipulation from human demonstrations.

5 retrieved papers
Simulation benchmark for dexterous manipulation evaluation

The authors release a simulation benchmark containing six dexterous hand models and five articulated objects. This benchmark provides a unified testbed for evaluating functional retargeting algorithms and enables functional comparison across different hardware designs, where new hands and tasks can be easily added and quickly evaluated.

10 retrieved papers
Can Refute
Functional retargeting problem formulation

The authors formalize the functional retargeting problem, which aims to learn feasible dexterous robot policies that manipulate objects to follow demonstrated trajectories from human hand-object demonstrations. This is distinguished from kinematic retargeting by emphasizing task capability and feasibility rather than merely producing human-like motions.

10 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

DexMachina curriculum-based RL algorithm for functional retargeting

The authors introduce DexMachina, a reinforcement learning algorithm that uses virtual object controllers to automatically drive objects toward target states initially, then gradually decays their strength so the policy learns to take over manipulation under motion and contact guidance. This curriculum approach addresses challenges in learning long-horizon bimanual dexterous manipulation from human demonstrations.

Contribution

Simulation benchmark for dexterous manipulation evaluation

The authors release a simulation benchmark containing six dexterous hand models and five articulated objects. This benchmark provides a unified testbed for evaluating functional retargeting algorithms and enables functional comparison across different hardware designs, where new hands and tasks can be easily added and quickly evaluated.

Contribution

Functional retargeting problem formulation

The authors formalize the functional retargeting problem, which aims to learn feasible dexterous robot policies that manipulate objects to follow demonstrated trajectories from human hand-object demonstrations. This is distinguished from kinematic retargeting by emphasizing task capability and feasibility rather than merely producing human-like motions.