Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
Computer Science, arXiv

W. Chiang, L. Zheng, et al.

Chatbot Arena is an open, crowdsourced platform that evaluates Large Language Models (LLMs) through pairwise human-preference comparisons, drawing on more than 240K votes and rigorous statistical ranking. The research was conducted by Wei-Lin Chiang, Lianmin Zheng, Ying Sheng, Anastasios N. Angelopoulos, Tianle Li, Dacheng Li, Banghua Zhu, Hao Zhang, Michael I. Jordan, Joseph E. Gonzalez, and Ion Stoica.
Introduction
Literature Review
Methodology
Key Findings
Discussion
Conclusion
Limitations