Performance of ChatGPT on USMLE: Potential for AI-Assisted Medical Education Using Large Language Models

Tiffany H. Kung; Morgan Cheatham; ChatGPT; Arielle Medenilla; Czarina Sillos; Lorie De Leon; Camille Elepaño; Maria Madriaga; Rimel Aggabao; Giezel Diaz-Candido; James Maningo; Victor Tseng

doi:10.1101/2022.12.19.22283643

Performance of ChatGPT on USMLE: Potential for AI-Assisted Medical Education Using Large Language Models

Tiffany H. Kung(Massachusetts General Hospital), Morgan Cheatham(Brown University), ChatGPT, Arielle Medenilla, Czarina Sillos, Lorie De Leon, Camille Elepaño, Maria Madriaga, Rimel Aggabao, Giezel Diaz-Candido, James Maningo(SeaWorld Entertainment), Victor Tseng(SeaWorld Entertainment)

medRxiv

December 20, 2022

10.1101/2022.12.19.22283643

Cited by 617Open Access

Full Text

Abstract

ABSTRACT We evaluated the performance of a large language model called ChatGPT on the United States Medical Licensing Exam (USMLE), which consists of three exams: Step 1, Step 2CK, and Step 3. ChatGPT performed at or near the passing threshold for all three exams without any specialized training or reinforcement. Additionally, ChatGPT demonstrated a high level of concordance and insight in its explanations. These results suggest that large language models may have the potential to assist with medical education, and potentially, clinical decision-making.

Related Papers

No related papers found

Powered by citation graph analysis