Segment anything in medical imagesJun Ma, Yuting He, Feifei Li et al.|Nature Communications|2024 Medical image segmentation is a critical component in clinical practice, facilitating accurate diagnosis, treatment planning, and disease monitoring. However, existing methods, often tailored to specific modalities or disease types, lack generalizability across the diverse spectrum of medical image segmentation tasks. Here we present MedSAM, a foundation model designed for bridging this gap by enabling universal medical image segmentation. The model is developed on a large-scale medical image dataset with 1,570,263 image-mask pairs, covering 10 imaging modalities and over 30 cancer types. We conduct a comprehensive evaluation on 86 internal validation tasks and 60 external validation tasks, demonstrating better accuracy and robustness than modality-wise specialist models. By delivering accurate and efficient segmentation across a wide spectrum of tasks, MedSAM holds significant potential to expedite the evolution of diagnostic tools and the personalization of treatment plans.
Segment Anything in Medical ImagesJun Ma, Yuting He, Feifei Li et al.|arXiv (Cornell University)|2023 Medical image segmentation is a critical component in clinical practice, facilitating accurate diagnosis, treatment planning, and disease monitoring. However, existing methods, often tailored to specific modalities or disease types, lack generalizability across the diverse spectrum of medical image segmentation tasks. Here we present MedSAM, a foundation model designed for bridging this gap by enabling universal medical image segmentation. The model is developed on a large-scale medical image dataset with 1,570,263 image-mask pairs, covering 10 imaging modalities and over 30 cancer types. We conduct a comprehensive evaluation on 86 internal validation tasks and 60 external validation tasks, demonstrating better accuracy and robustness than modality-wise specialist models. By delivering accurate and efficient segmentation across a wide spectrum of tasks, MedSAM holds significant potential to expedite the evolution of diagnostic tools and the personalization of treatment plans.
Segment Anything in Medical Images and Videos: Benchmark and DeploymentJun Ma, Sumin Kim, Feifei Li et al.|arXiv (Cornell University)|2024 Recent advances in segmentation foundation models have enabled accurate and efficient segmentation across a wide range of natural images and videos, but their utility to medical data remains unclear. In this work, we first present a comprehensive benchmarking of the Segment Anything Model 2 (SAM2) across 11 medical image modalities and videos and point out its strengths and weaknesses by comparing it to SAM1 and MedSAM. Then, we develop a transfer learning pipeline and demonstrate SAM2 can be quickly adapted to medical domain by fine-tuning. Furthermore, we implement SAM2 as a 3D slicer plugin and Gradio API for efficient 3D image and video segmentation. The code has been made publicly available at \url{https://github.com/bowang-lab/MedSAM}.
MOVING OBJECT DETECTION METHOD OF VIDEO SATELLITE BASED ON TRACKING CORRECTION DETECTIONX. Yang, Feifei Li, Mu Lu et al.|ISPRS annals of the photogrammetry, remote sensing and spatial information sciences|2020 Abstract. It is the focus of current research that how to realize high precision and real-time dynamic monitoring and tracking of moving targets by video satellites because of instantaneous and dynamic continuous observation of targets in a certain area by the video satellites. The existing detection and tracking methods for moving objects have target misdetection and missed detection, which reduces the accuracy of moving object detection. In this paper, a Tracking Correction Detection Correction (TCD) method is proposed to solve these problems. Firstly, the background model is established by using the improved ViBe target detection algorithm, and the moving target mask is obtained by adaptive threshold calculation. By using pyramid structure iterative algorithm, the moving object can be classified as noise or real object according to the set of detection results of different detection windows. The high-order correlation vector tracking method is used to modify the detection result of the moving target acquired in the previous frame, and finally the accurate detection result of the moving target is obtained. The comparison analysis between the frame difference (FD) method, GMM method, ViBe method and TCD method shows that the TCD method has better robustness for noise, light and background dynamic changes, and the test results of TCD method are more complete and the real-time is better. It is proved by this work that the accuracy of the target detection of TCD method has reached 85%, which has a high engineering application value.
Efficient MedSAMs: Segment Anything in Medical Images on LaptopJun Ma, Feifei Li, Sumin Kim et al.|arXiv (Cornell University)|2024 Promptable segmentation foundation models have emerged as a transformative approach to addressing the diverse needs in medical images, but most existing models require expensive computing, posing a big barrier to their adoption in clinical practice. In this work, we organized the first international competition dedicated to promptable medical image segmentation, featuring a large-scale dataset spanning nine common imaging modalities from over 20 different institutions. The top teams developed lightweight segmentation foundation models and implemented an efficient inference pipeline that substantially reduced computational requirements while maintaining state-of-the-art segmentation accuracy. Moreover, the post-challenge phase advanced the algorithms through the design of performance booster and reproducibility tasks, resulting in improved algorithms and validated reproducibility of the winning solution. Furthermore, the best-performing algorithms have been incorporated into the open-source software with a user-friendly interface to facilitate clinical adoption. The data and code are publicly available to foster the further development of medical image segmentation foundation models and pave the way for impactful real-world applications.