Molecular Similarity Searching Using Atom Environments, Information-Based Feature Selection, and a Naïve Bayesian Classifier

Andreas Bender(Unilever (United Kingdom)), Hamse Y. Mussa(Unilever (United Kingdom)), Robert C. Glen(Unilever (United Kingdom)), Stephan Reiling
Journal of Chemical Information and Computer Sciences
December 24, 2003
Cited by 247

Abstract

A novel technique for similarity searching is introduced. Molecules are represented by atom environments, which are fed into an information-gain-based feature selection. A naïve Bayesian classifier is then employed for compound classification. The new method is tested by its ability to retrieve five sets of active molecules seeded in the MDL Drug Data Report (MDDR). In comparison experiments, the algorithm outperforms all current retrieval methods assessed here using two- and three-dimensional descriptors and offers insight into the significance of structural components for binding.


Related Papers

No related papers found

Powered by citation graph analysis