Pulmonary embolism (PE) represents a blood clot that travels to the blood vessels in the lung, causing vascular obstruction, and in some patients, death. CT pulmonary angiography (CTPA), is the most common type of medical imaging to evaluate patients with suspected PE. These CT scans consist of hundreds of images that require detailed review to identify clots within the pulmonary arteries. Recent research in deep learning across academia and industry produced numerous architectures, various model initializations, and distinct learning paradigms. It has resulted in many competing approaches to Computer-aided Diagnosis (CAD) implementation in medical imaging and produced great confusion in the CAD community. We have conducted extensive experiments with various deep learning architectures, model initializations, learning paradigms, and data pre-processing techniques applicable for PE diagnosis at both the slice and exam levels.
Seeking an Optimal Approach for Computer-aided Pulmonary Embolism Detection
Nahid Ul Islam1, Shiv Gehlot1, Zongwei Zhou1, Michael B Gotway2, and Jianming Liang1
1Arizona State University, 2Mayo Clinic
International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2021); Machine Learning in Medical Imaging (MLMI2021)
Paper | Graphical Abstract | Poster | Code | Slides | Presentation
Seeking an Optimal Approach for Computer-aided Diagnosis of Pulmonary Embolism
Nahid Ul Islam1, Zongwei Zhou1, Shiv Gehlot1, Michael B Gotway2, and Jianming Liang1
1Arizona State University, 2Mayo Clinic
Medical Image Analysis (MedIA)
Paper | Graphical Abstract | Poster | Code | Slides | Presentation)
- RSNA PE
- CAD-PE Challenge Dataset
- Ferdowsi University of Mashhad's PE dataset
- Our in-house PE-CAD dataset
1. Transfer learning boosts performance despite differences between photographic images and CTPA scans
2. Squeeze & excitation blocks enhance CNN performance
3. Self-supervised pre-training can surpass (fully) supervised pre-training
4. Model trained on the RSNA PE dataset demonstrates promising performance when tested on unseen datasets for slice-level PE classification
5. Proposed Embedding-based ViT (E-ViT) utilizes class and exam-level embeddings generated by the transformer encoder and outperforms BiGRU
6. Vessel-oriented image representation and self-supervised pre-training both enhance performance for PE false positive reduction across image dimensions.
This framework was built on top of the first place solution. The necessary data organization as well as the preprocessing steps can be found here.
Insprired from SeResNet50 and SeResNext50, we added squeeze and excitation (SE) block to Xception architecture and pre-trained it on ImageNet dataset. The pre-trained SeXception model can be found here.
You can download the pretrained models used/developed in our paper from here.
If you use this code or use our pre-trained models for your research, please cite our paper:
@inproceedings{islam2021seeking,
title={Seeking an optimal approach for computer-aided pulmonary embolism detection},
author={Islam, Nahid Ul and Gehlot, Shiv and Zhou, Zongwei and Gotway, Michael B and Liang, Jianming},
booktitle={Machine Learning in Medical Imaging: 12th International Workshop, MLMI 2021, Held in Conjunction with MICCAI 2021, Strasbourg, France, September 27, 2021, Proceedings 12},
pages={692--702},
year={2021},
organization={Springer}
}
@article{islam2023seeking,
title={Seeking an optimal approach for Computer-aided Diagnosis of Pulmonary Embolism},
author={Islam, Nahid Ul and Zhou, Zongwei and Gehlot, Shiv and Gotway, Michael B and Liang, Jianming},
journal={Medical Image Analysis},
pages={102988},
year={2023},
publisher={Elsevier}
}
This research has been supported in part by ASU and Mayo Clinic through a Seed Grant and an Innovation Grant, and in part by the NIH under Award Number R01HL128785. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. This work has utilized the GPUs, made avail-able in part by ASU Research Computing, Bridges-2 at the Pittsburgh Supercomputing Center (allocated under BCS190015), and Anvil at Purdue University (allocated under MED220025), all of which are sup-ported by the Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support (ACCESS) program. This program is supported by National Science Foundation grants #2138259, #2138286, #2138307, #2137603, and #2138296. We thank Ruibin Feng for aggregating 19self-supervised pre-trained models and Jae Shin and Douglas Amoo-Sargon for creating the 3D VOIR dataset. We also acknowledge the exploration and preliminary experiments by Utkarsh Nath, which have been redesigned and replaced in this version. We extend our gratitude to Zuwei Guo for his efforts in preparing the pre-trained models for the latest 3D self-supervised approaches and for his contributions in experimenting with the 3D VOIR target task. The content of this paper is covered by patents pending.
Released under the ASU GitHub Project License