Bridging Machine Learning and Thermodynamics for Accurate p <i>K</i> <sub>a</sub> Prediction

Weiliang Luo(Massachusetts Institute of Technology), Gengmo Zhou(Renmin University of China), Zhengdan Zhu, Yannan Yuan, Guolin Ke, Zhewei Wei(Renmin University of China), Zhifeng Gao, Hang Zheng
JACS Au
July 17, 2024
Cited by 27Open Access
Full Text

Abstract

Integrating scientific principles into machine learning models to enhance their predictive performance and generalizability is a central challenge in the development of AI for Science. Herein, we introduce Uni-pKa, a novel framework that successfully incorporates thermodynamic principles into machine learning modeling, achieving high-precision predictions of acid dissociation constants (pKa), a crucial task in the rational design of drugs and catalysts, as well as a modeling challenge in computational physical chemistry for small organic molecules. Uni-pKa utilizes a comprehensive free energy model to represent molecular protonation equilibria accurately. It features a structure enumerator that reconstructs molecular configurations from pKa data, coupled with a neural network that functions as a free energy predictor, ensuring high-throughput, data-driven prediction while preserving thermodynamic consistency. Employing a pretraining-finetuning strategy with both predicted and experimental pKa data, Uni-pKa not only achieves state-of-the-art accuracy in chemoinformatics but also shows comparable precision to quantum mechanics-based methods.


Related Papers

No related papers found

Powered by citation graph analysis