Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Oct 14;13(11):5813-5835.
doi: 10.1364/BOE.472176. eCollection 2022 Nov 1.

Deep attentive convolutional neural network for automatic grading of imbalanced diabetic retinopathy in retinal fundus images

Affiliations

Deep attentive convolutional neural network for automatic grading of imbalanced diabetic retinopathy in retinal fundus images

Feng Li et al. Biomed Opt Express. .

Abstract

Automated fine-grained diabetic retinopathy (DR) grading was of great significance for assisting ophthalmologists in monitoring DR and designing tailored treatments for patients. Nevertheless, it is a challenging task as a result of high intra-class variations, high inter-class similarities, small lesions, and imbalanced data distributions. The pivotal factor for the success in fine-grained DR grading is to discern more subtle associated lesion features, such as microaneurysms (MA), Hemorrhages (HM), soft exudates (SE), and hard exudates (HE). In this paper, we constructed a simple yet effective deep attentive convolutional neural network (DACNN) for DR grading and lesion discovery with only image-wise supervision. Designed as a top-down architecture, our model incorporated stochastic atrous spatial pyramid pooling (sASPP), global attention mechanism (GAM), category attention mechanism (CAM), and learnable connected module (LCM) to better extract lesion-related features and maximize the DR grading performance. To be concrete, we devised sASPP combining randomness with atrous spatial pyramid pooling (ASPP) to accommodate the various scales of the lesions and struggle against the co-adaptation of multiple atrous convolutions. Then, GAM was introduced to extract class-agnostic global attention feature details, whilst CAM was explored for seeking class-specific distinctive region-level lesion feature information and regarding each DR severity grade in an equal way, which tackled the problem of imbalance DR data distributions. Further, the LCM was designed to automatically and adaptively search the optimal connections among layers for better extracting detailed small lesion feature representations. The proposed approach obtained high accuracy of 88.0% and kappa score of 88.6% for multi-class DR grading task on the EyePACS dataset, respectively, while 98.5% AUC, 93.8% accuracy, 87.9% kappa, 90.7% recall, 94.6% precision, and 92.6% F1-score for referral and non-referral classification on the Messidor dataset. Extensive experimental results on three challenging benchmarks demonstrated that the proposed approach achieved competitive performance in DR grading and lesion discovery using retinal fundus images compared with existing cutting-edge methods, and had good generalization capacity for unseen DR datasets. These promising results highlighted its potential as an efficient and reliable tool to assist ophthalmologists in large-scale DR screening.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest related to this article.

Figures

Fig. 1.
Fig. 1.
The overall structure of the proposed DACNN.
Fig. 2.
Fig. 2.
The schematic illustration of the designed sASPP. (a) sASPP consisted of four atrous convolutions with dilation rates ranging from 6 to 24. The kernel size was 1, 3, 3, and 3 respectively. (b) sASPP in a certain state at the stage of training. Dilation rate and kernel size were the same as (a). bi denoted whether the feature map existed, where bi=0 was defined as discard state and vice versa. (c) sASPP at the stage of testing. All the feature maps generated by four atrous convolutions were reserved at the stage of testing in terms of their magnitudes scaled by the retainable probability during the training stage.
Fig. 3.
Fig. 3.
Description of the Global Attention Mechanism (GAM)
Fig. 4.
Fig. 4.
Illustration of Category Attention Mechanism (CAM)
Fig. 5.
Fig. 5.
Visualization results between baseline, sASPP, GAM, CAM, and LCM. The six rows denoted the original images, heatmaps of sASPP, GAM, CAM, and LCM, respectively.

Similar articles

References

    1. Antonetti D. A., Silva P. S., Stitt A. W., “Current understanding of the molecular and cellular pathology of diabetic retinopathy,” Nat. Rev. Endocrinol. 17(4), 195–206 (2021).10.1038/s41574-020-00451-4 - DOI - PMC - PubMed
    1. Rego S., Dutra-Medeiros M., Soares F., Monteiro-Soares M., “Screening for diabetic retinopathy using an automated diagnostic system based on deep learning: diagnostic accuracy assessment,” Ophthalmologica 244(3), 250–257 (2021).10.1159/000512638 - DOI - PubMed
    1. Farooq M. S., Arooj A., Alroobaea R., Baqasah A. M., Jabarulla M. Y., Singh D., Sardar R., “Untangling computer-aided diagnostic system for screening diabetic retinopathy based on deep learning techniques,” Sensors 22(19), C1 (2022).10.1109/JSEN.2022.3206996 - DOI - PMC - PubMed
    1. Ting D. S. W., Cheung C. Y. L., Lim G., Tan G. S. W., Quang N. D., Gan A., Hamzah H., Garcia-Franco R., Yeo I. Y. S., Lee S. Y., “Development and validation of a deep learning system for diabetic retinopathy and related eye diseases using retinal images from multiethnic populations with diabetes,” JAMA 318(22), 2211–2223 (2017).10.1001/jama.2017.18152 - DOI - PMC - PubMed
    1. Ipp E., Liljenquist D., Bode B., Shah V. N., Silverstein S., Regillo C. D., Lim J. I., Sadda S., Domalpally A., Gray G., Bhaskaranand M., Ramachandra C., Solanki K., “Pivotal evaluation of an artificial intelligence system for autonomous detection of referrable and vision-threatening diabetic retinopathy,” Jama Netw. Open. 4(11), e2134254 (2021).10.1001/jamanetworkopen.2021.34254 - DOI - PMC - PubMed

LinkOut - more resources

  NODES
Note 3
twitter 2