返回论文列表
Paper Detail
Gau-Occ: Geometry-Completed Gaussians for Multi-Modal 3D Occupancy PredictionGau-Occ:用于多模态3D占用预测的几何补全高斯方法
cs.CV自动驾驶CV热门获取3D检测多模态
Chengxin Lv, Yihui Li, Hongyu Yang, YunHong Wang
2026年03月24日
arXiv: 2603.22852v1

作者人数

4

标签数量

5

内容状态

含 PDF

原文 + 中文

同页查看标题和摘要的双语信息

PDF 预览

直接在详情页阅读或下载论文全文

深度分析

继续下钻到 AI 生成的结构化解读

摘要 / Abstract

This paper presents Gau-Occ, a multi-modal framework for 3D semantic occupancy prediction in autonomous driving that models scenes as compact collections of semantic 3D Gaussians, bypassing traditional dense volumetric processing. The proposed LiDAR Completion Diffuser recovers missing structures from sparse LiDAR point clouds to initialize robust Gaussian anchors, while Gaussian Anchor Fusion efficiently integrates multi-view image semantics through geometry-aligned 2D sampling and cross-modal alignment. By refining these compact Gaussian descriptors, the method achieves both spatial consistency and semantic discriminability for comprehensive 3D scene understanding in autonomous vehicles.

本文提出Gau-Occ,一种用于自动驾驶3D语义占用预测的多模态框架。该框架将场景建模为紧凑的语义3D高斯集合以规避传统密集体素处理,通过LiDAR Completion Diffuser恢复稀疏点云缺失结构以初始化高斯锚点,并利用几何对齐采样和跨模态对齐融合多视角语义,实现空间一致性和语义可区分性的全面场景理解。

PDF 预览
1
在 arXiv 查看下载 PDF

分类 / Categories

cs.CVcs.RO

深度分析

AI 深度理解论文内容,生成具有洞见性的总结