Abstract
This paper investigates whether large language models demonstrate genuine moral reasoning capabilities or merely produce superficially convincing reasoning-like outputs. The study analyzes responses from 13 different LLMs across six classical moral dilemmas using Kohlberg's stages of moral development as an evaluation framework. Through an LLM-as-judge scoring pipeline validated across three judge models, over 600 responses were classified and analyzed. The research reveals a significant finding that LLM responses predominantly exhibit post-conventional reasoning patterns (Stages 5-6), which contradicts typical human developmental trajectories where such reasoning emerges later in moral development. This inversion suggests that alignment training may produce outputs that mimic advanced moral reasoning without the underlying developmental progression characteristic of human moral cognition.
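The abstract describes an LLM-as-judge scoring pipeline validated across three judge models. The paper does not give implementation details, so the following is only a hypothetical sketch of what such an aggregation step might look like: each judge assigns a Kohlberg stage (1-6) to a response, and a label is kept only when a majority of judges agree. All function names and the stub judges are assumptions for illustration, not taken from the paper.

```python
from collections import Counter

def aggregate_stage(judge_labels):
    """Return the majority Kohlberg stage across judges, or None if no majority."""
    counts = Counter(judge_labels)
    stage, votes = counts.most_common(1)[0]
    return stage if votes > len(judge_labels) // 2 else None

def classify_corpus(responses, judges):
    """Label each response with every judge model, then keep the majority vote."""
    results = {}
    for rid, text in responses.items():
        labels = [judge(text) for judge in judges]
        results[rid] = aggregate_stage(labels)
    return results

# Stub judges standing in for real judge models (assumption):
# two rate the response at Stage 5, one at Stage 6.
judges = [lambda text: 5, lambda text: 5, lambda text: 6]
print(classify_corpus({"r1": "An LLM response to a moral dilemma ..."}, judges))
# → {'r1': 5}
```

Requiring a strict majority (rather than, say, averaging stage numbers) is one plausible way to make the three-judge validation concrete; responses mapped to `None` would be flagged for manual review or excluded.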