版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Department of Computer Science and Engineering Daffodil International University Dhaka Bangladesh Department of Computer Science and Engineering Mawlana Bhashani Science and Technology University Tangail Bangladesh
出 版 物:《Neural Computing and Applications》 (Neural Comput. Appl.)
年 卷 期:2025年第37卷第14期
页 面:8479-8507页
核心收录:
学科分类:08[工学] 0803[工学-光学工程] 0812[工学-计算机科学与技术(可授工学、理学学位)]
基 金:Not applicable
主 题:Magnetic resonance imaging
摘 要:Cancer remains a leading cause of mortality worldwide, with early detection and accurate diagnosis critical to improving patient outcomes. While computer-aided diagnosis systems powered by deep learning have shown considerable promise, their widespread clinical adoption faces significant challenges in maintaining consistent performance across diverse imaging modalities and datasets. This research addresses the critical challenge of developing robust, generalizable deep learning models by proposing a comprehensive framework utilizing seven diverse medical imaging datasets encompassing fundus photography, histopathology, endoscopy, and MRI, covering diseases such as ocular toxoplasmosis, endometrial cancer, colorectal cancer, gastrointestinal disease, breast cancer, brain tumor, and tympanic membrane conditions. Our methodology combines customized data augmentation strategies (photometric, geometric, and elastic transformations) with an optimized vision transformer with external attention (MViTX) architecture. The MViTX model demonstrated exceptional performance with test accuracies ranging from 94.1 to 99.1% across all datasets, achieving superior metrics in accuracy, precision, recall, F1-score, and AUC compared to state-of-the-art CNNs. The model s effectiveness was further validated through ablation studies and explainable AI techniques, while its practical utility was demonstrated through deployment as a user-friendly web application. Our research establishes the effectiveness of combining tailored data augmentation with attention-based transformer architectures for medical image analysis, representing a significant step toward enhancing healthcare professionals diagnostic capabilities and ultimately improving patient care outcomes. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2025.