Bayesian Active Object Recognition and 6D Pose Estimation from Multimodal Contact Sensing
Haodong Zheng, Gabriele M. Caddeo, Andrei C. Jalba, Wijnand A. IJsselsteijn, Lorenzo Natale, Raymond H. Cuijpers
March 23, 2026
arXiv: 2603.21410v1

Abstract

We present an active tactile exploration framework for joint object recognition and 6D pose estimation. The proposed method integrates wrist force/torque sensing, GelSight tactile sensing, and free-space constraints within a Bayesian inference framework that maintains a belief over object class and pose during active tactile exploration. By combining contact and non-contact evidence, the framework reduces ambiguity and improves robustness in the joint class-pose estimation problem. To enable efficient inference in the large hypothesis space, we employ a customized particle filter that progressively samples particles based on new observations. The inferred belief is further used to guide active exploration by selecting informative next touches under reachability constraints. For effective data collection, a motion planning and control framework is developed to plan and execute feasible paths for tactile exploration, handle unexpected contacts and GelSight-surface alignment with tactile servoing. We evaluate the framework in simulation and on a Franka Panda robot using 11 YCB objects. Results show that incorporating tactile and free-space information substantially improves recognition and pose estimation accuracy and stability, while reducing the number of action cycles compared with force/torque-only baselines. Code, dataset, and supplementary material will be made available online.
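The belief update and next-touch selection described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a hypothetical 1D world in which each object class is a segment of known width, the "pose" is the segment's centre, and a probe returns a noisy contact/free-space reading. All names (`WIDTHS`, `P_CORRECT`, `predicts_contact`, `next_probe`, and so on) and all numeric values are invented for illustration. The filter is a plain resampling particle filter, not the customized progressive sampler the paper mentions.

```python
import math
import random

# Hypothetical toy world (not from the paper): each class is a segment of
# known width on a 1D line, and the pose is the segment's centre.
WIDTHS = {"mug": 2.0, "box": 4.0}
P_CORRECT = 0.9  # assumed probability that a probe reports the true contact state


def predicts_contact(cls, centre, probe_x):
    """True if a probe at probe_x would touch an object of class cls at centre."""
    return abs(probe_x - centre) <= WIDTHS[cls] / 2


def init_particles(n, rng):
    """Sample (class, centre) hypotheses uniformly; weights start equal."""
    classes = sorted(WIDTHS)
    particles = [(rng.choice(classes), rng.uniform(-5.0, 5.0)) for _ in range(n)]
    return particles, [1.0 / n] * n


def update(particles, weights, probe_x, contact):
    """Bayes update: reweight each particle by the observation likelihood.

    A contact reading supports hypotheses that occupy probe_x; a free-space
    reading supports hypotheses that leave probe_x empty.
    """
    new = []
    for (cls, centre), w in zip(particles, weights):
        agrees = predicts_contact(cls, centre, probe_x) == contact
        new.append(w * (P_CORRECT if agrees else 1.0 - P_CORRECT))
    total = sum(new)
    return [w / total for w in new]


def resample(particles, weights, rng, jitter=0.05):
    """Multinomial resampling with small pose jitter to keep diversity."""
    n = len(particles)
    picked = rng.choices(particles, weights=weights, k=n)
    return [(c, x + rng.gauss(0.0, jitter)) for c, x in picked], [1.0 / n] * n


def class_posterior(particles, weights):
    """Marginalise the joint belief over pose to get a per-class probability."""
    post = {c: 0.0 for c in WIDTHS}
    for (cls, _), w in zip(particles, weights):
        post[cls] += w
    return post


def next_probe(particles, weights, candidates):
    """Pick the candidate probe whose predicted outcome is most uncertain."""
    def outcome_entropy(x):
        p = sum(w for (c, m), w in zip(particles, weights)
                if predicts_contact(c, m, x))
        p = min(max(p, 1e-12), 1.0 - 1e-12)
        return -(p * math.log(p) + (1.0 - p) * math.log(1.0 - p))
    return max(candidates, key=outcome_entropy)


def run(true_cls="box", true_centre=1.0, steps=15, seed=0):
    """Simulate a short exploration episode against a hidden ground truth."""
    rng = random.Random(seed)
    particles, weights = init_particles(2000, rng)
    candidates = [i * 0.5 for i in range(-12, 13)]  # assumed reachable probe points
    for _ in range(steps):
        x = next_probe(particles, weights, candidates)
        truth = predicts_contact(true_cls, true_centre, x)
        contact = truth if rng.random() < P_CORRECT else not truth
        weights = update(particles, weights, x, contact)
        particles, weights = resample(particles, weights, rng)
    return class_posterior(particles, weights), particles, weights
```

Calling `run()` returns the class posterior together with the final particle set. With the symmetric noise model assumed here, choosing the probe with the most uncertain predicted outcome is equivalent to maximising the expected information gain of a single binary reading; a real system would also need 6D poses, force/torque and tactile-image likelihoods, and reachability checks, none of which this sketch attempts.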



Categories

cs.CV, cs.AI
