RoadBench: Benchmarking MLLMs on Fine-Grained Spatial Understanding and Reasoning under Urban Road Scenarios

cs.CV, Autonomous Driving, CV, Transformer

March 30, 2026

Abstract

This paper presents RoadBench, a comprehensive benchmark for evaluating Multi-Modal Large Language Models (MLLMs) on fine-grained spatial understanding and reasoning in urban road scenarios. The benchmark includes diverse urban road images, detailed annotations, and challenging questions that require precise spatial perception and reasoning. It targets a gap in existing benchmarks, which lack a focus on these capabilities under complex urban road conditions. Experimental results show that current MLLMs still face significant challenges on these tasks.
