Paper Detail
A transformer architecture alteration to incentivise externalised reasoning
Tags: cs.CL, Autonomous Driving, End-to-End, Transformer
Anonymous
March 23, 2026
arXiv: 2603.21376v1

Abstract

We propose a novel architectural modification and post-training pipeline for enhancing large language model reasoning capabilities by teaching models to truncate forward passes early. Our approach augments the standard transformer architecture with an early-exit mechanism at intermediate layers, enabling the model to exit at shallower layers when tokens can be predicted without deep computation. Through a calibration stage followed by reinforcement learning, we incentivize the model to exit as early as possible while preserving task performance. Preliminary experiments on small reasoning models demonstrate adaptive computation reduction across tokens, suggesting that at appropriate scale, this approach can minimize excess computation for non-myopic planning using internal activations, reserving deep computation only for difficult-to-predict tokens.

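The abstract gives no implementation details, but the mechanism it describes (a per-layer exit decision that truncates the forward pass, reserving deep layers for hard-to-predict tokens) can be sketched as follows. This is an illustrative reconstruction under assumptions, not the paper's code: the module names (`Block`, `ExitGate`, `EarlyExitTransformer`), the sigmoid confidence gate, the fixed exit threshold, and the shared unembedding at every exit point are all assumptions, and the calibration stage and reinforcement-learning incentive mentioned in the abstract are not shown.

```python
# Hypothetical sketch of a decoder-only transformer with per-layer early-exit
# gates. All module names, the gating form, and the shared unembedding are
# assumptions made for illustration; they are not taken from the paper.
import torch
import torch.nn as nn


class Block(nn.Module):
    """A standard pre-norm transformer decoder block with causal attention."""
    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        self.ln1 = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ln2 = nn.LayerNorm(d_model)
        self.mlp = nn.Sequential(
            nn.Linear(d_model, 4 * d_model), nn.GELU(), nn.Linear(4 * d_model, d_model)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        T = x.size(1)
        # Causal mask: True marks positions a token may not attend to.
        mask = torch.triu(torch.ones(T, T, dtype=torch.bool, device=x.device), diagonal=1)
        h = self.ln1(x)
        x = x + self.attn(h, h, h, attn_mask=mask, need_weights=False)[0]
        x = x + self.mlp(self.ln2(x))
        return x


class ExitGate(nn.Module):
    """Scores how confident the model is that the current hidden state already
    determines the next token. In the abstract's pipeline such gates would be
    shaped by calibration and then reinforcement learning (not shown here)."""
    def __init__(self, d_model: int):
        super().__init__()
        self.score = nn.Linear(d_model, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.sigmoid(self.score(x))  # (batch, seq, 1), values in [0, 1]


class EarlyExitTransformer(nn.Module):
    def __init__(self, vocab: int, d_model: int = 256, n_heads: int = 4, n_layers: int = 8):
        super().__init__()
        self.embed = nn.Embedding(vocab, d_model)
        self.blocks = nn.ModuleList([Block(d_model, n_heads) for _ in range(n_layers)])
        self.gates = nn.ModuleList([ExitGate(d_model) for _ in range(n_layers)])
        self.ln_f = nn.LayerNorm(d_model)
        self.unembed = nn.Linear(d_model, vocab, bias=False)  # shared across all exits

    @torch.no_grad()
    def generate_next(self, tokens: torch.Tensor, threshold: float = 0.9):
        """Run the forward pass for a single sequence (batch size 1), exiting at
        the first layer whose gate clears `threshold` for the last position.
        Returns (next-token logits, index of the layer that produced them)."""
        x = self.embed(tokens)
        for i, (block, gate) in enumerate(zip(self.blocks, self.gates)):
            x = block(x)
            if gate(x)[:, -1, 0].item() >= threshold:
                return self.unembed(self.ln_f(x))[:, -1], i  # truncated forward pass
        return self.unembed(self.ln_f(x))[:, -1], len(self.blocks) - 1  # full depth
```

With a suitably calibrated threshold, easy tokens exit after a few blocks while harder tokens run the full stack, giving the adaptive per-token computation the abstract reports; the reinforcement-learning stage would then reward earlier exits subject to preserving task performance.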

Categories

cs.CL, cs.AI
