MutationFinder: a high-performance system for extracting point mutation mentions from text

J. Gregory Caporaso(University of Colorado Health), William A. Baumgartner(University of Colorado Health), David A. Randolph(University of Colorado Health), Kevin Bretonnel Cohen(University of Colorado Health), Lawrence Hunter(University of Colorado Health)
Bioinformatics
May 11, 2007
Cited by 163Open Access
Full Text

Abstract

Discussion of point mutations is ubiquitous in biomedical literature, and manually compiling databases or literature on mutations in specific genes or proteins is tedious. We present an open-source, rule-based system, MutationFinder, for extracting point mutation mentions from text. On blind test data, it achieves nearly perfect precision and a markedly improved recall over a baseline. AVAILABILITY: MutationFinder, along with a high-quality gold standard data set, and a scoring script for mutation extraction systems have been made publicly available. Implementations, source code and unit tests are available in Python, Perl and Java. MutationFinder can be used as a stand-alone script, or imported by other applications. PROJECT URL: http://bionlp.sourceforge.net.


Related Papers

No related papers found

Powered by citation graph analysis