Paper Detail

SLURP-TN: Resource for Tunisian Dialect Spoken Language Understanding

cs.CL · Large Language Models · End-to-End · Transformer · Trending · Multimodal

Elyadata Research Team

March 23, 2026

arXiv: 2603.21940v1

Number of authors: 1

Number of tags: 5

Content status: includes PDF

Abstract

This paper introduces SLURP-TN, a comprehensive spoken language understanding resource specifically designed for the Tunisian dialect. The dataset comprises 4165 sentences recorded from 55 native speakers, totaling approximately 5 hours of acoustic material. By translating sentences from six SLURP domains, the authors address the critical gap in SLU resources for low-resource languages. The research develops baseline Automatic Speech Recognition and SLU models that leverage deep neural networks and pre-trained language models to extract semantic information from speech utterances in task-oriented dialogue systems. This work enables the Tunisian-speaking population to benefit from recent advances in natural language processing and speech recognition technology.