Automatic lung segmentation in routine imaging is primarily a data diversity problem, not a methodology problemJohannes Hofmanninger, Florian Prayer, Jeanny Pan et al.|European Radiology Experimental|2020 BACKGROUND: Automated segmentation of anatomical structures is a crucial step in image analysis. For lung segmentation in computed tomography, a variety of approaches exists, involving sophisticated pipelines trained and validated on different datasets. However, the clinical applicability of these approaches across diseases remains limited. METHODS: We compared four generic deep learning approaches trained on various datasets and two readily available lung segmentation algorithms. We performed evaluation on routine imaging data with more than six different disease patterns and three published data sets. RESULTS: Using different deep learning approaches, mean Dice similarity coefficients (DSCs) on test datasets varied not over 0.02. When trained on a diverse routine dataset (n = 36), a standard approach (U-net) yields a higher DSC (0.97 ± 0.05) compared to training on public datasets such as the Lung Tissue Research Consortium (0.94 ± 0.13, p = 0.024) or Anatomy 3 (0.92 ± 0.15, p = 0.001). Trained on routine data (n = 231) covering multiple diseases, U-net compared to reference methods yields a DSC of 0.98 ± 0.03 versus 0.94 ± 0.12 (p = 0.024). CONCLUSIONS: The accuracy and reliability of lung segmentation algorithms on demanding cases primarily relies on the diversity of the training data, highlighting the importance of data diversity compared to model choice. Efforts in developing new datasets and providing trained models to the public are critical. By releasing the trained model under General Public License 3.0, we aim to foster research on lung diseases by providing a readily available tool for segmentation of pathological lungs.
Variability of computed tomography radiomics features of fibrosing interstitial lung disease: A test-retest studyUnsupervised machine learning identifies predictive progression markers of IPFOBJECTIVES: To identify and evaluate predictive lung imaging markers and their pathways of change during progression of idiopathic pulmonary fibrosis (IPF) from sequential data of an IPF cohort. To test if these imaging markers predict outcome. METHODS: We studied radiological disease progression in 76 patients with IPF, including overall 190 computed tomography (CT) examinations of the chest. An algorithm identified candidates for imaging patterns marking progression by computationally clustering visual CT features. A classification algorithm selected clusters associated with radiological disease progression by testing their value for recognizing the temporal sequence of examinations. This resulted in radiological disease progression signatures, and pathways of lung tissue change accompanying progression observed across the cohort. Finally, we tested if the dynamics of marker patterns predict outcome, and performed an external validation study on a cohort from a different center. RESULTS: Progression marker patterns were identified and exhibited high stability in a repeatability experiment with 20 random sub-cohorts of the overall cohort. The 4 top-ranked progression markers were consistently selected as most informative for progression across all random sub-cohorts. After spatial image registration, local tracking of lung pattern transitions revealed a network of tissue transition pathways from healthy to a sequence of disease tissues. The progression markers were predictive for outcome, and the model achieved comparable results on a replication cohort. CONCLUSIONS: Unsupervised learning can identify radiological disease progression markers that predict outcome. Local tracking of pattern transitions reveals pathways of radiological disease progression from healthy lung tissue through a sequence of diseased tissue types. KEY POINTS: • Unsupervised learning can identify radiological disease progression markers that predict outcome in patients with idiopathic pulmonary fibrosis. • Local tracking of pattern transitions reveals pathways of radiological disease progression from healthy lung tissue through a sequence of diseased tissue types. • The progression markers achieved comparable results on a replication cohort.
Künstliche Intelligenz in der Bildgebung der LungeCLINICAL/METHODICAL ISSUE: Artificial intelligence (AI) has the potential to improve diagnostic accuracy and management in patients with lung disease through automated detection, quantification, classification, and prediction of disease progression. STANDARD RADIOLOGICAL METHODS: Owing to unspecific symptoms, few well-defined CT disease patterns, and varying prognosis, interstitial lungs disease represents a focus of AI-based research. METHODICAL INNOVATIONS: Supervised and unsupervised machine learning can identify CT disease patterns using features which may allow the analysis of associations with specific diseases and outcomes. PERFORMANCE: Machine learning on the one hand improves computer-aided detection of pulmonary nodules. On the other hand it enables further characterization of pulmonary nodules, which may improve resource effectiveness regarding lung cancer screening programs. ACHIEVEMENTS: There are several challenges regarding AI-based CT data analysis. Besides the need for powerful algorithms, expert annotations and extensive training data sets that reflect physiologic and pathologic variability are required for effective machine learning. Comparability and reproducibility of AI research deserve consideration due to a lack of standardization in this emerging field. PRACTICAL RECOMMENDATIONS: This review article presents the state of the art and the challenges concerning AI in lung imaging with special consideration of interstitial lung disease, and detection and consideration of pulmonary nodules.
Disentanglement of Biological and Technical Factors via Latent Space Rotation in Clinical Imaging Improves Disease Pattern DiscoveryJeanny Pan, Philipp Seeböck, Christoph Fürböck et al.|Lecture notes in computer science|2026 Abstract Identifying new disease-related patterns in medical imaging data with the help of machine learning enlarges the vocabulary of recognizable findings. This supports diagnostic and prognostic assessment. However, image appearance varies not only due to biological differences, but also due to imaging technology linked to vendors, scanning- or reconstruction parameters. The resulting domain shifts impedes data representation learning strategies and the discovery of biologically meaningful cluster appearances. To address these challenges, we introduce an approach to actively learn the domain shift via post-hoc rotation of the data latent space, enabling disentanglement of biological and technical factors. Results on real-world heterogeneous clinical data showcase that the learned disentangled representation leads to stable clusters representing tissue-types across different acquisition settings. Cluster consistency is improved by +19.01% (ARI), +16.85% (NMI), and +12.39% (Dice) compared to the entangled representation, outperforming four state-of-the-art harmonization methods. When using the clusters to quantify tissue composition on idiopathic pulmonary fibrosis patients, the learned profiles enhance Cox survival prediction. This indicates that the proposed label-free framework facilitates biomarker discovery in multi-center routine imaging data. Code is available on GitHub https://github.com/cirmuw/latent-space-rotation-disentanglement .