作者人数
标签数量
内容状态
原文 + 中文
同页查看标题和摘要的双语信息
PDF 预览
直接在详情页阅读或下载论文全文
深度分析
继续下钻到 AI 生成的结构化解读
摘要 / Abstract
This paper presents Oph-Guid-RAG, a multimodal visual retrieval-augmented generation system designed for ophthalmology clinical decision support. The system treats guideline pages as independent evidence units and retrieves page images while preserving visual elements like tables and flowcharts. It implements a controllable retrieval framework with routing and filtering mechanisms to reduce noise while selectively introducing external evidence. The system combines query decomposition, rewriting, retrieval, reranking, and multimodal reasoning to provide traceable outputs with guideline references. Evaluated on HealthBench with doctor-based scoring, the approach significantly outperforms GPT-5.2 and GPT-5.4 on hard subsets, achieving +30.0% improvement in overall score and +10.4% to +24.4% gains in accuracy.
本文提出了Oph-Guid-RAG,这是一种用于眼科临床决策支持的多模态视觉检索增强生成系统。该系统将指南页面视为独立证据单元,在检索页面图像时保留表格和流程图等视觉元素,并结合查询分解、重写、检索、重排序和多模态推理技术,提供带指南引用的可追溯输出。在基于医生评分的HealthBench评估中,该方法在困难子集上显著优于GPT-5.2和GPT-5.4,总体得分提升30.0%,准确率提升10.4%至24.4%。
分类 / Categories
深度分析
AI 深度理解论文内容,生成具有洞见性的总结