作者人数
标签数量
内容状态
原文 + 中文
同页查看标题和摘要的双语信息
PDF 预览
直接在详情页阅读或下载论文全文
深度分析
继续下钻到 AI 生成的结构化解读
摘要 / Abstract
This paper presents Dream2Act, a robot-centric framework enabling zero-shot interaction through generative video synthesis for humanoid robots. The approach addresses the morphology gap in traditional human-to-robot motion retargeting by synthesizing robot-native motion directly. Given a third-person image of the robot and target object, video generation models envision the robot completing tasks with morphology-consistent motion. A high-fidelity pose extraction system recovers physically feasible joint trajectories from synthesized videos, which are subsequently executed via a general-purpose whole-body controller.
本文提出了Dream2Act,一个机器人中心化框架,通过生成式视频合成实现人形机器人的零样本交互。该方法通过直接合成机器人原生运动来解决传统人体到机器人动作重定向中的形态差异问题。高精度姿态提取系统从合成视频中恢复物理可行的关节轨迹,随后通过通用全身控制器执行。
分类 / Categories
深度分析
AI 深度理解论文内容,生成具有洞见性的总结