
Computer Science
Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models
J. Chen, Y. Zhang, et al.
This groundbreaking research by Jie Chen, Yupeng Zhang, Bingning Wang, Wayne Xin Zhao, Ji-Rong Wen, and Weipeng Chen explores the flaws in synthetic question-answer data used for training large language models. It highlights how these issues can lead to performance drops and proposes innovative unlearning techniques to mitigate these effects effectively and affordably.
Playback language: English
Related Publications
Explore these studies to deepen your understanding of the subject.