The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS)

Bjoern Menze(Institut national de recherche en sciences et technologies du numérique), András Jakab(University of Debrecen), Stefan Bauer(University of Bern), Jayashree Kalpathy–Cramer(Harvard University), Keyvan Farahani(National Institutes of Health), Justin Kirby(National Institutes of Health), Yuliya Burren(University Hospital of Bern), Nicole Porz(University Hospital of Bern), Johannes Slotboom(University Hospital of Bern), Roland Wiest(University Hospital of Bern), Levente Lánczi(University of Debrecen), Elizabeth R. Gerstner(Harvard University), Marc‐André Weber(Heidelberg University), Tal Arbel(McGill University), Brian Avants(University of Pennsylvania), Nicholas Ayache(Institut national de recherche en sciences et technologies du numérique), Patricia Buendia, D. Louis Collins(McGill Genome Centre), Nicolas Cordier(Institut national de recherche en sciences et technologies du numérique), Jason J. Corso(Buffalo State University), Antonio Criminisi(Microsoft Research (United Kingdom)), Tilak Das(Cambridge University Hospitals NHS Foundation Trust), Hervé Delingette(Institut national de recherche en sciences et technologies du numérique), Çağatay Demiralp(Stanford University), Christopher R. Durst(University of Virginia), Michel Dojat(Institut national de recherche en sciences et technologies du numérique), Senan Doyle(Institut national de recherche en sciences et technologies du numérique), Joana Festa(University of Minho), Florence Forbes(Institut national de recherche en sciences et technologies du numérique), Ezequiel Geremia(Institut national de recherche en sciences et technologies du numérique), Ben Glocker(Imperial College London), Polina Golland(Massachusetts Institute of Technology), Xiaotao Guo(Columbia University), Andaç Hamamcı(Sabancı Üniversitesi), Khan M. Iftekharuddin(Old Dominion University), Raj Jena(Cambridge University Hospitals NHS Foundation Trust), Nigel John(University of Miami), Ender Konukoğlu(Harvard University), Danial Lashkari(Massachusetts Institute of Technology), José Mariz(University of Minho), Raphael Meier(University of Bern), Sérgio Pereira(University of Minho), Doina Precup(McGill University), Stephen J. Price(Cambridge University Hospitals NHS Foundation Trust), Tammy Riklin Raviv(Ben-Gurion University of the Negev), Syed M. S. Reza(Old Dominion University), Michael J. Ryan, Duygu Sarıkaya(Buffalo State University), Lawrence H. Schwartz(Columbia University), Hoo-Chang Shin(Ben-Gurion University of the Negev), Jamie Shotton(Microsoft Research (United Kingdom)), Carlos A. Silva(University of Minho), Nuno Sousa(University of Minho), Nagesh K. Subbanna(Heidelberg University), Gábor Székely(ETH Zurich), Thomas J. Taylor, Owen Thomas(Cambridge University Hospitals NHS Foundation Trust), Nicholas J. Tustison(University of Virginia), Gözde Ünal(Sabancı Üniversitesi), Flor Vasseur(Institut national de recherche en sciences et technologies du numérique), Max Wintermark(University of Virginia), Dong Hye Ye(Purdue University West Lafayette), Liang Zhao(Buffalo State University), Binsheng Zhao(Columbia University), Darko Zikic(Microsoft Research (United Kingdom)), Marcel Prastawa(GE Global Research (United States)), Mauricio Reyes(University of Bern), Koen Van Leemput(Harvard University)
IEEE Transactions on Medical Imaging
December 4, 2014
Cited by 6,461Open Access
Full Text

Abstract

In this paper we report the set-up and results of the Multimodal Brain Tumor Image Segmentation Benchmark (BRATS) organized in conjunction with the MICCAI 2012 and 2013 conferences. Twenty state-of-the-art tumor segmentation algorithms were applied to a set of 65 multi-contrast MR scans of low- and high-grade glioma patients-manually annotated by up to four raters-and to 65 comparable scans generated using tumor image simulation software. Quantitative evaluations revealed considerable disagreement between the human raters in segmenting various tumor sub-regions (Dice scores in the range 74%-85%), illustrating the difficulty of this task. We found that different algorithms worked best for different sub-regions (reaching performance comparable to human inter-rater variability), but that no single algorithm ranked in the top for all sub-regions simultaneously. Fusing several good algorithms using a hierarchical majority vote yielded segmentations that consistently ranked above all individual algorithms, indicating remaining opportunities for further methodological improvements. The BRATS image data and manual annotations continue to be publicly available through an online evaluation system as an ongoing benchmarking resource.


Related Papers

No related papers found

Powered by citation graph analysis