Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program
- PMID: 31304372
- PMCID: PMC6550283
- DOI: 10.1038/s41746-019-0099-8
Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program
Erratum in
-
Erratum: Author Correction: Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program.NPJ Digit Med. 2019 Jul 23;2:68. doi: 10.1038/s41746-019-0146-5. eCollection 2019. NPJ Digit Med. 2019. PMID: 31341955 Free PMC article.
Abstract
Deep learning algorithms have been used to detect diabetic retinopathy (DR) with specialist-level accuracy. This study aims to validate one such algorithm on a large-scale clinical population, and compare the algorithm performance with that of human graders. A total of 25,326 gradable retinal images of patients with diabetes from the community-based, nationwide screening program of DR in Thailand were analyzed for DR severity and referable diabetic macular edema (DME). Grades adjudicated by a panel of international retinal specialists served as the reference standard. Relative to human graders, for detecting referable DR (moderate NPDR or worse), the deep learning algorithm had significantly higher sensitivity (0.97 vs. 0.74, p < 0.001), and a slightly lower specificity (0.96 vs. 0.98, p < 0.001). Higher sensitivity of the algorithm was also observed for each of the categories of severe or worse NPDR, PDR, and DME (p < 0.001 for all comparisons). The quadratic-weighted kappa for determination of DR severity levels by the algorithm and human graders was 0.85 and 0.78 respectively (p < 0.001 for the difference). Across different severity levels of DR for determining referable disease, deep learning significantly reduced the false negative rate (by 23%) at the cost of slightly higher false positive rates (2%). Deep learning algorithms may serve as a valuable tool for DR screening.
Keywords: Developing world; Diabetes complications.
Conflict of interest statement
Competing interestsJ.K., R.S., K.W., B.J.L.C., G.S.C., L.P., S.P. and D.R.W. are Google employees and receive salary and stock as a part of the standard compensation package. O.K., J.J. and J.T. are consultants for Google.
Figures
Similar articles
-
Grader Variability and the Importance of Reference Standards for Evaluating Machine Learning Models for Diabetic Retinopathy.Ophthalmology. 2018 Aug;125(8):1264-1272. doi: 10.1016/j.ophtha.2018.01.034. Epub 2018 Mar 13. Ophthalmology. 2018. PMID: 29548646
-
Impact of Gold-Standard Label Errors on Evaluating Performance of Deep Learning Models in Diabetic Retinopathy Screening: Nationwide Real-World Validation Study.J Med Internet Res. 2024 Aug 14;26:e52506. doi: 10.2196/52506. J Med Internet Res. 2024. PMID: 39141915 Free PMC article.
-
Accuracy of Integrated Artificial Intelligence Grading Using Handheld Retinal Imaging in a Community Diabetic Eye Screening Program.Ophthalmol Sci. 2023 Dec 15;4(3):100457. doi: 10.1016/j.xops.2023.100457. eCollection 2024 May-Jun. Ophthalmol Sci. 2023. PMID: 38317871 Free PMC article.
-
Telemedicine for detecting diabetic retinopathy: a systematic review and meta-analysis.Br J Ophthalmol. 2015 Jun;99(6):823-31. doi: 10.1136/bjophthalmol-2014-305631. Epub 2015 Jan 6. Br J Ophthalmol. 2015. PMID: 25563767 Free PMC article. Review.
-
Association of Diabetic Macular Edema and Proliferative Diabetic Retinopathy With Cardiovascular Disease: A Systematic Review and Meta-analysis.JAMA Ophthalmol. 2017 Jun 1;135(6):586-593. doi: 10.1001/jamaophthalmol.2017.0988. JAMA Ophthalmol. 2017. PMID: 28472362 Free PMC article. Review.
Cited by
-
Trends and hotspots in the field of diabetic retinopathy imaging research from 2000-2023.Front Med (Lausanne). 2024 Oct 9;11:1481088. doi: 10.3389/fmed.2024.1481088. eCollection 2024. Front Med (Lausanne). 2024. PMID: 39444814 Free PMC article.
-
Big data to guide glaucoma treatment.Taiwan J Ophthalmol. 2023 Jul 28;14(3):333-339. doi: 10.4103/tjo.TJO-D-23-00068. eCollection 2024 Jul-Sep. Taiwan J Ophthalmol. 2023. PMID: 39430357 Free PMC article. Review.
-
Effectiveness of artificial intelligence for diabetic retinopathy screening in community in Binh Dinh Province, Vietnam.Taiwan J Ophthalmol. 2024 Sep 13;14(3):394-402. doi: 10.4103/tjo.TJO-D-23-00101. eCollection 2024 Jul-Sep. Taiwan J Ophthalmol. 2024. PMID: 39430352 Free PMC article.
-
Effects of artificial intelligence implementation on efficiency in medical imaging-a systematic literature review and meta-analysis.NPJ Digit Med. 2024 Sep 30;7(1):265. doi: 10.1038/s41746-024-01248-9. NPJ Digit Med. 2024. PMID: 39349815 Free PMC article. Review.
-
Artificial Intelligence-Based Screening System for Diabetic Retinopathy in Primary Care.Diagnostics (Basel). 2024 Sep 9;14(17):1992. doi: 10.3390/diagnostics14171992. Diagnostics (Basel). 2024. PMID: 39272776 Free PMC article.
References
-
- Jenchitr W, et al. The national survey of blindness low vision and visual impairment in thailand 2006–2007. Thai. J. Pub. Hlth. Ophthalmol. 2007;21:10.
LinkOut - more resources
Full Text Sources