Cloud-Based Evaluation of Anatomical Structure Segmentation and Landmark Detection Algorithms: VISCERAL Anatomy Benchmarks

Oscar Jiménez–del–Toro(University of Geneva), Henning Müller(University of Geneva), Markus Krenn(Medical University of Vienna), Katharina Gruenberg(University Hospital Heidelberg), Abdel Aziz Taha(TU Wien), Marianne Winterstein(University Hospital Heidelberg), Ivan Eggel(HES-SO University of Applied Sciences and Arts Western Switzerland), Antonio Foncubierta–Rodríguez(ETH Zurich), Orçun Göksel(ETH Zurich), András Jakab(Medical University of Vienna), Georgios Kontokotsios(TU Wien), Georg Langs(Medical University of Vienna), Bjoern Menze(ETH Zurich), Tomás Salas(Agència de Qualitat i Avaluació Sanitàries de Catalunya), Roger Schaer(HES-SO University of Applied Sciences and Arts Western Switzerland), Anna Walleyo(University Hospital Heidelberg), Marc‐André Weber(University Hospital Heidelberg), Yashin Dicente Cid(University Hospital of Geneva), Tobias Gass(ETH Zurich), Mattias P. Heinrich‬(University of Lübeck), Fucang Jia(Shenzhen Institutes of Advanced Technology), Fredrik Kahl(Chalmers University of Technology), Razmig Kéchichian(Université Claude Bernard Lyon 1), Dominic Mai(University of Freiburg), Assaf B. Spanier(Hebrew University of Jerusalem), G.R. Vincent, Chunliang Wang(KTH Royal Institute of Technology), Daniel Wyeth, Allan Hanbury(TU Wien)
IEEE Transactions on Medical Imaging
June 9, 2016
Cited by 167

Abstract

Variations in the shape and appearance of anatomical structures in medical images are often relevant radiological signs of disease. Automatic tools can help automate parts of this manual process. A cloud-based evaluation framework is presented in this paper including results of benchmarking current state-of-the-art medical imaging algorithms for anatomical structure segmentation and landmark detection: the VISCERAL Anatomy benchmarks. The algorithms are implemented in virtual machines in the cloud where participants can only access the training data and can be run privately by the benchmark administrators to objectively compare their performance in an unseen common test set. Overall, 120 computed tomography and magnetic resonance patient volumes were manually annotated to create a standard Gold Corpus containing a total of 1295 structures and 1760 landmarks. Ten participants contributed with automatic algorithms for the organ segmentation task, and three for the landmark localization task. Different algorithms obtained the best scores in the four available imaging modalities and for subsets of anatomical structures. The annotation framework, resulting data set, evaluation setup, results and performance analysis from the three VISCERAL Anatomy benchmarks are presented in this article. Both the VISCERAL data set and Silver Corpus generated with the fusion of the participant algorithms on a larger set of non-manually-annotated medical images are available to the research community.


Related Papers

No related papers found

Powered by citation graph analysis