A Two-stage Transformer Framework for Temporal Localization of Distracted Driver Behaviors

cs.CV, Autonomous Driving, CV, Transformer

Anonymous Authors

March 22, 2026

arXiv: 2603.21048v1

Author count: 1

Tag count: 4

Content status: PDF included

Abstract

This paper presents a temporal action localization framework specifically designed for driver monitoring systems in autonomous driving applications. The framework employs a two-stage pipeline combining VideoMAE-based feature extraction with an Augmented Self-Mask Attention detector to identify hazardous driving behaviors from in-cabin video streams. A Spatial Pyramid Pooling-Fast module captures multi-scale temporal features for improved localization accuracy. The approach is optimized for transportation safety checkpoints and fleet management assessment systems, demonstrating a trade-off between model capacity and computational efficiency.
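The abstract names a Spatial Pyramid Pooling-Fast (SPPF) module for multi-scale temporal features but gives no implementation details. As a rough illustration only, the sketch below applies the general SPPF idea (a cascade of stride-1 max pools whose outputs are concatenated with the input) to a single-channel temporal sequence. The function names `max_pool_1d` and `sppf_temporal`, the kernel size of 5, and the use of plain Python lists are all assumptions made for this example; the paper's module presumably works on multi-channel features with learned projections, which are omitted here.

```python
from typing import List


def max_pool_1d(seq: List[float], kernel: int = 5) -> List[float]:
    """Stride-1 max pooling over time; windows are truncated at the edges,
    so the output has the same length as the input."""
    half = kernel // 2
    n = len(seq)
    return [max(seq[max(0, t - half):min(n, t + half + 1)]) for t in range(n)]


def sppf_temporal(seq: List[float], kernel: int = 5) -> List[List[float]]:
    """Hypothetical single-channel SPPF-style block (illustration only).

    Three pools are applied in sequence, so with kernel=5 they cover
    temporal windows of 5, 9, and 13 steps. Each time step is returned as
    [input, pool1, pool2, pool3], i.e. the multi-scale concatenation.
    """
    p1 = max_pool_1d(seq, kernel)
    p2 = max_pool_1d(p1, kernel)
    p3 = max_pool_1d(p2, kernel)
    return [[x, a, b, c] for x, a, b, c in zip(seq, p1, p2, p3)]


if __name__ == "__main__":
    # A single activation spike in the middle of a 13-step sequence.
    scores = [0.0] * 6 + [1.0] + [0.0] * 6
    for t, feats in enumerate(sppf_temporal(scores)):
        print(t, feats)
```

Reusing the same small kernel in a cascade is what makes the "fast" variant cheaper than running separate pools with large kernels: two stacked stride-1 pools of size 5 give the same result as one pool of size 9.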
