Paper Detail

SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation

cs.CV, Large Language Models, CV, Trending, Object Detection, Multimodal

Anonymous

March 24, 2026

arXiv: 2603.22228v1

Number of authors: 1

Number of tags: 5

Content status: PDF included


Abstract

This paper presents SpatialReward, a novel verifiable reward model designed to evaluate fine-grained spatial relationships in text-to-image generation. The approach employs a multi-stage pipeline consisting of a Prompt Decomposer for extracting spatial metadata, expert detectors for visual grounding of object positions, and a vision-language model with chain-of-thought reasoning to assess complex spatial relations. By focusing on spatial layout evaluation rather than just semantic alignment, this work addresses a critical gap in current reward modeling approaches for generative AI systems.
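
The three stages named in the abstract can be illustrated with a toy sketch. This is not the paper's implementation. It assumes a regex stands in for the Prompt Decomposer, that detector output is already available as a name-to-box mapping, and that a simple box-center comparison stands in for the vision-language model's chain-of-thought judgment. All names here (`Box`, `decompose`, `holds`, `spatial_reward`) and the four-relation vocabulary are hypothetical.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned box in pixel coordinates, origin at the top-left corner."""
    x1: float
    y1: float
    x2: float
    y2: float

    @property
    def center(self):
        return ((self.x1 + self.x2) / 2, (self.y1 + self.y2) / 2)


# Hypothetical stand-in for the Prompt Decomposer: matches phrases such as
# "a cat to the left of a dog" and yields (subject, relation, object).
_PATTERN = re.compile(
    r"\b(?:a|an|the)\s+(\w+)\s+(?:is\s+)?(?:to\s+the\s+)?"
    r"(left of|right of|above|below)\s+(?:a|an|the)\s+(\w+)"
)


def decompose(prompt):
    """Stage 1 (assumed): extract spatial triples from the prompt text."""
    return [m.groups() for m in _PATTERN.finditer(prompt.lower())]


def holds(relation, subj, obj):
    """Stage 3 stand-in: a geometric rule instead of VLM reasoning."""
    sx, sy = subj.center
    ox, oy = obj.center
    if relation == "left of":
        return sx < ox
    if relation == "right of":
        return sx > ox
    if relation == "above":
        return sy < oy  # image y grows downward
    if relation == "below":
        return sy > oy
    raise ValueError(f"unsupported relation: {relation}")


def spatial_reward(prompt, detections):
    """Fraction of extracted relations satisfied by the detected boxes.

    `detections` plays the role of stage 2 (expert detector output) and maps
    an object name to its Box. A relation whose subject or object was not
    detected counts as unsatisfied. Returns None if no relation is found.
    """
    triples = decompose(prompt)
    if not triples:
        return None
    satisfied = 0
    for subj, relation, obj in triples:
        if subj in detections and obj in detections:
            satisfied += holds(relation, detections[subj], detections[obj])
    return satisfied / len(triples)


if __name__ == "__main__":
    boxes = {"cat": Box(10, 40, 60, 90), "dog": Box(120, 40, 180, 90)}
    print(spatial_reward("A cat to the left of a dog", boxes))
```

The score is verifiable in the sense that each relation is checked against explicit coordinates rather than inferred from an overall similarity score, which is the contrast with semantic alignment that the abstract draws.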
