logo
Loading...
SemEval-2024 Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes
Computer ScienceProceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)

SemEval-2024 Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes

T. Mickus, E. Zosa, et al.

SHROOM presents a shared task on detecting hallucinations—fluent but inaccurate NLG outputs—using a newly constructed dataset of 4,000 model outputs labeled by five annotators across machine translation, paraphrase generation, and definition modeling. The research was conducted by Timothee Mickus, Elaine Zosa, Raúl Vázquez, Teemu Vahtola, Jörg Tiedemann, Vincent Segonne, Alessandro Raganato, and Marianna Apidianaki.... show more
Introduction
Literature Review
Methodology
Key Findings
Discussion
Conclusion
Limitations
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 22+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny