Paper Detail

SPA: A Simple but Tough-to-Beat Baseline for Knowledge InjectionSPA：一种简洁但难以超越的知识注入基线方法

cs.CL大语言模型Transformer热门获取

Anonymous

2026年03月24日

arXiv: 2603.22213v1

作者人数

1

标签数量

3

内容状态

含 PDF

原文 + 中文

同页查看标题和摘要的双语信息

PDF 预览

直接在详情页阅读或下载论文全文

深度分析

继续下钻到 AI 生成的结构化解读

摘要 / Abstract

This paper addresses the challenge of incomplete knowledge coverage in large language models, particularly in specialized and data-scarce domains. The authors propose SPA (Scaling Prompt-engineered Augmentation), a method that uses carefully designed prompts to generate large-scale synthetic training data for knowledge injection. Through systematic comparisons, SPA demonstrates superior performance over strong baselines. The research identifies key limitations in existing approaches: RL-based methods suffer from diversity collapse at scale, while multi-stage prompting advantages disappear after careful tuning. These findings provide valuable insights for optimizing knowledge injection strategies in language models.

摘要 / Abstract

分类 / Categories

深度分析