Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters

Christopher Funk(University of Colorado Denver), William A. Baumgartner(University of Colorado Denver), Benjamin J. Garcia(National Jewish Health), Christophe Roeder(University of Colorado Denver), Michael Bada(University of Colorado Denver), Kevin Bretonnel Cohen(University of Colorado Denver), Lawrence Hunter(University of Colorado Denver), Karin Verspoor(Data61)
BMC Bioinformatics
February 26, 2014
Cited by 130Open Access
Full Text

Abstract

BACKGROUND: Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general approach for concept recognition is an open problem. RESULTS: Three dictionary-based systems (MetaMap, NCBO Annotator, and ConceptMapper) are evaluated on eight biomedical ontologies in the Colorado Richly Annotated Full-Text (CRAFT) Corpus. Over 1,000 parameter combinations are examined, and best-performing parameters for each system-ontology pair are presented. CONCLUSIONS: Baselines for concept recognition by three systems on eight biomedical ontologies are established (F-measures range from 0.14-0.83). Out of the three systems we tested, ConceptMapper is generally the best-performing system; it produces the highest F-measure of seven out of eight ontologies. Default parameters are not ideal for most systems on most ontologies; by changing parameters F-measure can be increased by up to 0.4. Not only are best performing parameters presented, but suggestions for choosing the best parameters based on ontology characteristics are presented.


Related Papers

No related papers found

Powered by citation graph analysis