Computer Science
LARGE LANGUAGE MODELS CANNOT SELF-CORRECT REASONING YET
J. Huang, X. Chen, et al.
This paper critically examines self-correction in Large Language Models, focusing on intrinsic self-correction—when a model tries to fix its own answers without external feedback. The authors find that LLMs often struggle to self-correct during reasoning and can even degrade after attempting fixes, and they offer directions for future research and applications. This research was conducted by Jie Huang, Xinyun Chen, Swaroop Mishra, Huaixiu Steven Zheng, Adams Wei Yu, Xinying Song, and Denny Zhou.
Related Publications
Explore these studies to deepen your understanding of the subject.

