Morphology-Consistent Humanoid Interaction through Robot-Centric Video Synthesis

cs.CV, end-to-end, CV trending fetch, embodied AI, multimodal

Dream2Act Team

March 20, 2026

arXiv: 2603.19709v1

Number of authors

1

Number of tags

5

Content status

PDF included

Abstract

This paper presents Dream2Act, a robot-centric framework enabling zero-shot interaction through generative video synthesis for humanoid robots. The approach addresses the morphology gap in traditional human-to-robot motion retargeting by synthesizing robot-native motion directly. Given a third-person image of the robot and target object, video generation models envision the robot completing tasks with morphology-consistent motion. A high-fidelity pose extraction system recovers physically feasible joint trajectories from synthesized videos, which are subsequently executed via a general-purpose whole-body controller.
