TransFoxMol: predicting molecular property with focused attention

Jian Gao(Zhejiang University), Zheyuan Shen(Zhejiang University), Yufeng Xie(Zhejiang University), Jialiang Lu(Zhejiang University), Yang Lu(Zhejiang University), Sikang Chen(Zhejiang University), Qingyu Bian(Zhejiang University), Yue Guo(Zhejiang Lab), Liteng Shen(Zhejiang University), Jian Wu(Zhejiang University), Binbin Zhou(Zhejiang University), Tingjun Hou(Zhejiang Lab), Qiaojun He(Zhejiang Cancer Hospital), Jinxin Che(Zhejiang University), Xiaowu Dong(Zhejiang Cancer Hospital)
Briefings in Bioinformatics
August 21, 2023
Cited by 40Open Access
Full Text

Abstract

Predicting the biological properties of molecules is crucial in computer-aided drug development, yet it's often impeded by data scarcity and imbalance in many practical applications. Existing approaches are based on self-supervised learning or 3D data and using an increasing number of parameters to improve performance. These approaches may not take full advantage of established chemical knowledge and could inadvertently introduce noise into the respective model. In this study, we introduce a more elegant transformer-based framework with focused attention for molecular representation (TransFoxMol) to improve the understanding of artificial intelligence (AI) of molecular structure property relationships. TransFoxMol incorporates a multi-scale 2D molecular environment into a graph neural network + Transformer module and uses prior chemical maps to obtain a more focused attention landscape compared to that obtained using existing approaches. Experimental results show that TransFoxMol achieves state-of-the-art performance on MoleculeNet benchmarks and surpasses the performance of baselines that use self-supervised learning or geometry-enhanced strategies on small-scale datasets. Subsequent analyses indicate that TransFoxMol's predictions are highly interpretable and the clever use of chemical knowledge enables AI to perceive molecules in a simple but rational way, enhancing performance.


Related Papers

No related papers found

Powered by citation graph analysis