Improving Medical X-ray Imaging Diagnosis with Attention Mechanisms and Robust Transfer Learning Techniques
Article
| Article Title | Improving Medical X-ray Imaging Diagnosis with Attention Mechanisms and Robust Transfer Learning Techniques |
|---|---|
| ERA Journal ID | 210567 |
| Article Category | Article |
| Authors | Das, Isha, Sheakh, Md. Alif, Abdulla, Shahab, Tahosin, Mst. Sazia, Bhuiyan, Md. Mehedi, Zaman, Sadika and Shukla, Abhay |
| Journal Title | IEEE Access |
| Journal Citation | 11, pp. 159002-159027 |
| Number of Pages | 26 |
| Year | 2025 |
| Publisher | IEEE (Institute of Electrical and Electronics Engineers) |
| Place of Publication | IEEE |
| ISSN | 2169-3536 |
| Digital Object Identifier (DOI) | https://doi.org/10.1109/ACCESS.2025.3607639 |
| Web Address (URL) | https://ieeexplore.ieee.org/document/11153815 |
| Abstract | X-ray imaging remains a cornerstone in medical diagnostics for conditions such as bone fractures, knee osteoarthritis, and lung diseases. However, variability in image quality and dataset diversity presents significant challenges for automated analysis using deep learning models. This study addresses these issues by proposing an EfficientNet B0 architecture enhanced with a Convolutional Block Attention Module (CBAM) to improve classification accuracy and interpretability across multiple X-ray datasets: FracAtlas, Knee, and Lung X-ray. A robust preprocessing pipeline comprising LAB color space conversion, morphological filtering, gamma correction, Non-Local Means denoising, resizing, and normalization was applied to optimize image quality, with each step’s effectiveness verified through established image quality metrics. Additionally, geometric augmentation techniques were performed to increase dataset variability and improve model generalization. Comparative experiments with transfer learning, transformer-based, and attention-based models identified the attention-based EfficientNet B0 as the best performer. An extensive ablation study on the Knee X-ray dataset refined hyperparameters to maximize performance. The optimized model achieved high test accuracies of 98.09%, 97.12%, and 99.51% on FracAtlas, Knee, and Lung datasets, respectively. Further analyses, including noise robustness testing, k-fold cross-validation, and Grad-CAM visualization, demonstrated the model’s resilience, consistency, and explainability. The findings highlight the critical role of attention mechanisms in enhancing feature representation and generalization across heterogeneous medical imaging tasks. This work lays a foundation for reliable, interpretable AI systems that can support clinical decision-making, with future efforts focusing on expanding dataset diversity and real-world clinical validation to accelerate adoption. |
| Keywords | attention mechanism; X-ray image classification; transfer learning; explainable AI; medical image analysis |
| Article Publishing Charge (APC) Funding | Researcher |
| Contains Sensitive Content | Does not contain sensitive content |
| ANZSRC Field of Research 2020 | 420308. Health informatics and information systems |
| Byline Affiliations | Chittagong University of Engineering and Technology, Bangladesh |
| Daffodil International University, Bangladesh | |
| UniSQ College | |
| University of South Australia | |
| North Western University, Bangladesh | |
| Axis Colleges, India |
https://research.usq.edu.au/item/zzyvx/improving-medical-x-ray-imaging-diagnosis-with-attention-mechanisms-and-robust-transfer-learning-techniques
Download files
Published Version
| Improving_Medical_X-Ray_Imaging_Diagnosis_With_Attention_Mechanisms_and_Robust_Transfer_Learning_Techniques.pdf | ||
| License: CC BY 4.0 | ||
| File access level: Anyone | ||
15
total views4
total downloads0
views this month0
downloads this month