作者人数
标签数量
内容状态
原文 + 中文
同页查看标题和摘要的双语信息
PDF 预览
直接在详情页阅读或下载论文全文
深度分析
继续下钻到 AI 生成的结构化解读
摘要 / Abstract
This paper presents RoadBench, a comprehensive benchmark designed to evaluate Multi-Modal Large Language Models (MLLMs) on fine-grained spatial understanding and reasoning tasks in urban road scenarios. The benchmark includes diverse urban road images, detailed annotations, and challenging questions that require precise spatial perception and reasoning. RoadBench aims to address the gap in existing benchmarks that lack focus on fine-grained spatial understanding and reasoning under complex urban road conditions. Experimental results demonstrate that current MLLMs still face significant challenges in handling fine-grained spatial understanding and reasoning in urban road scenarios.
本文提出了RoadBench,这是一个综合基准测试,旨在评估多模态大语言模型在城市道路场景中的细粒度空间理解与推理任务。该基准测试包含多样化的城市道路图像、详细标注以及需要精确空间感知和推理的挑战性问题。实验结果表明,当前多模态大语言模型在城市道路场景中处理细粒度空间理解与推理方面仍面临重大挑战。
分类 / Categories
深度分析
AI 深度理解论文内容,生成具有洞见性的总结