作者人数
标签数量
内容状态
原文 + 中文
同页查看标题和摘要的双语信息
PDF 预览
直接在详情页阅读或下载论文全文
深度分析
继续下钻到 AI 生成的结构化解读
摘要 / Abstract
This paper presents a temporal action localization framework specifically designed for driver monitoring systems in autonomous driving applications. The framework employs a two-stage pipeline combining VideoMAE-based feature extraction with an Augmented Self-Mask Attention detector to identify hazardous driving behaviors from in-cabin video streams. A Spatial Pyramid Pooling-Fast module captures multi-scale temporal features for improved localization accuracy. The approach is optimized for transportation safety checkpoints and fleet management assessment systems, demonstrating a trade-off between model capacity and computational efficiency.
本文提出了一种专为自动驾驶应用中驾驶员监控系统设计的时序动作定位框架。该框架采用两阶段流水线,结合基于VideoMAE的特征提取与增强自掩码注意力检测器,从车内视频流中识别危险驾驶行为。空间金字塔池化-快速模块捕获多尺度时序特征以提高定位精度。该方法针对交通安全检查站和车队管理评估系统进行了优化,在模型容量与计算效率之间实现了权衡。
分类 / Categories
深度分析
AI 深度理解论文内容,生成具有洞见性的总结