Paper Detail

VIGIL: Part-Grounded Structured Reasoning for Generalizable Deepfake DetectionVIGIL：基于部件的结构化推理实现可泛化的Deepfake检测

cs.CVCVTransformer热门获取具身智能多模态

VIGIL Authors

2026年03月23日

arXiv: 2603.21526v1

作者人数

1

标签数量

5

内容状态

含 PDF

原文 + 中文

同页查看标题和摘要的双语信息

PDF 预览

直接在详情页阅读或下载论文全文

深度分析

继续下钻到 AI 生成的结构化解读

摘要 / Abstract

This paper presents VIGIL, a novel part-centric structured forensic framework for deepfake detection using multimodal large language models. The approach employs a plan-then-examine pipeline where the model first plans which facial parts warrant inspection based on global visual cues, then examines each part with independently sourced forensic evidence. A stage-gated injection mechanism delivers part-level forensic evidence only during examination to ensure unbiased part selection. The framework is inspired by expert forensic practice and aims to improve the reliability of deepfake detection by separating evidence generation from manipulation localization, addressing the issue of hallucinated explanations in current MLLM-based methods.

摘要 / Abstract

分类 / Categories

深度分析