Holistic Evaluation of Language Models
Percy Liang(Stanford University), Yuta Koreeda, Binhang Yuan, Ce Zhang, Michihiro Yasunaga, Nathan Kim, Yifan Mai, Mirac Süzgün, Ryan Chi, Ananya Kumar, Omar Khattab, Tatsunori Hashimoto, Yuhuai Wu, Keshav Santhanam, Drew A. Hudson, Dimitris Tsipras, Niladri S. Chatterji, Benjamin T. Newman, Diana Acosta-Navas, Christopher D. Manning(Stanford University), Peter Henderson(McGill University), Frieda Rong, Huaxiu Yao, Lucia Zheng, Vishrav Chaudhary, Laurel Orr, Surya Ganguli, Bobby Yan, Mert Yüksekgönül, Eric Zelikman, Neel Guha, Shibani Santurkar, Sang Michael Xie, Rishi Bommasani, Tong Lee, Christian Cosgrove, Faisal Ladhak, Thomas Icard, Deepak Narayanan, Dilara Soylu, Tianyi Zhang, Jue Wang, Yuhui Zhang, William Yang Wang(Massachusetts Institute of Technology), Xuechen Li, Qian Huang, Hong‐Yu Ren, Yian Zhang, Christopher Ré, Esin Durmus
Cited by 119
Related Papers
On the Opportunities and Risks of Foundation Models
|arXiv (Cornell University)|2021|2.2k
Deep Reinforcement Learning That Matters
|Proceedings of the AAAI Conference on Artificial Intelligence|2018|1.5k
Lost in the Middle: How Language Models Use Long Contexts
|Transactions of the Association for Computational Linguistics|2024|844