作者人数
标签数量
内容状态
原文 + 中文
同页查看标题和摘要的双语信息
PDF 预览
直接在详情页阅读或下载论文全文
深度分析
继续下钻到 AI 生成的结构化解读
摘要 / Abstract
This paper presents Gau-Occ, a multi-modal framework for 3D semantic occupancy prediction in autonomous driving that models scenes as compact collections of semantic 3D Gaussians, bypassing traditional dense volumetric processing. The proposed LiDAR Completion Diffuser recovers missing structures from sparse LiDAR point clouds to initialize robust Gaussian anchors, while Gaussian Anchor Fusion efficiently integrates multi-view image semantics through geometry-aligned 2D sampling and cross-modal alignment. By refining these compact Gaussian descriptors, the method achieves both spatial consistency and semantic discriminability for comprehensive 3D scene understanding in autonomous vehicles.
本文提出Gau-Occ,一种用于自动驾驶3D语义占用预测的多模态框架。该框架将场景建模为紧凑的语义3D高斯集合以规避传统密集体素处理,通过LiDAR Completion Diffuser恢复稀疏点云缺失结构以初始化高斯锚点,并利用几何对齐采样和跨模态对齐融合多视角语义,实现空间一致性和语义可区分性的全面场景理解。
分类 / Categories
深度分析
AI 深度理解论文内容,生成具有洞见性的总结