Computer ScienceInternational Conference on Learning Representations (ICLR) 2024
LARGE LANGUAGE MODELS CANNOT SELF-CORRECT REASONING YET
J. Huang, X. Chen, et al.
This paper critically examines self-correction in Large Language Models, focusing on intrinsic self-correction—when a model tries to fix its own answers without external feedback. The authors find that LLMs often struggle to self-correct during reasoning and can even degrade after attempting fixes, and they offer directions for future research and applications. This research was conducted by Jie Huang, Xinyun Chen, Swaroop Mishra, Huaixiu Steven Zheng, Adams Wei Yu, Xinying Song, and Denny Zhou.
Related Publications
Explore these studies to deepen your understanding
Adjacent work that informs or extends this paper's methodology and findings.
Computer Science
Confidence in the Reasoning of Large Language Models
Y. Pawitan and C. Holmes
Computer Science
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation
J. Liu, S. Chunqiu, et al.
Computer Science
AI-AI bias: Large language models favor communications generated by large language models
W. Laurito, B. Davis, et al.
Linguistics and Languages
DISSOCIATING LANGUAGE AND THOUGHT IN LARGE LANGUAGE MODELS
K. Mahowald, I. A. Blank, et al.

