Random forests ensemble classifier trained with data resampling strategy to improve cardiac arrhythmia diagnosis
- PMID: 21419401
- DOI: 10.1016/j.compbiomed.2011.03.001
Abstract
Supervised classification algorithms are commonly used in the design of computer-aided diagnosis systems. In this study, we present a Random Forests (RF) ensemble classifier trained with a resampling strategy to improve the diagnosis of cardiac arrhythmia. Random forests is an ensemble classifier that consists of many decision trees and outputs the class that is the mode of the classes output by the individual trees. In this way, an RF ensemble classifier achieves better classification performance than a single tree. In general, multiclass datasets with an unbalanced distribution of sample sizes are difficult to analyze in terms of class discrimination. The cardiac arrhythmia dataset is such a dataset: it has multiple classes with small sample sizes and is therefore suitable for testing our resampling-based training strategy. The dataset contains 452 samples covering fourteen types of arrhythmia, and eleven of these classes have fewer than 15 samples. Our diagnosis strategy consists of two parts: (i) a correlation-based feature selection algorithm is used to select relevant features from the cardiac arrhythmia dataset; (ii) the RF machine learning algorithm is used to evaluate the performance of the selected features with and without simple random sampling, in order to assess the efficiency of the proposed training strategy. The resulting accuracy of the classifier is 90.0%, which is a high diagnostic performance for cardiac arrhythmia. Furthermore, three case studies, i.e., thyroid, cardiotocography and audiology, are used to benchmark the effectiveness of the proposed method. The experimental results demonstrate the efficiency of the random sampling strategy in training the RF ensemble classification algorithm.
Copyright © 2011 Elsevier Ltd. All rights reserved.
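The two mechanisms named in the abstract, majority voting across trees and random resampling of small classes, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not specify the resampling scheme, so the sketch assumes random oversampling with replacement up to the size of the largest class. The function names (`majority_vote`, `resample_to_balance`) and the toy data are hypothetical and chosen only for illustration.

```python
import random
from collections import Counter, defaultdict


def majority_vote(tree_predictions):
    """Return the mode of the class labels predicted by the individual trees."""
    counts = Counter(tree_predictions)
    # most_common(1) yields the label with the highest count; ties go to
    # the label that was encountered first.
    return counts.most_common(1)[0][0]


def resample_to_balance(samples, labels, seed=0):
    """Oversample every class with replacement up to the largest class size.

    Assumption: this is only one plausible reading of "simple random
    sampling"; the abstract does not state the exact scheme.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for x, y in zip(samples, labels):
        by_class[y].append(x)
    target = max(len(xs) for xs in by_class.values())
    new_samples, new_labels = [], []
    for y, xs in by_class.items():
        # Keep the original samples, then draw the shortfall with replacement.
        extra = [rng.choice(xs) for _ in range(target - len(xs))]
        for x in xs + extra:
            new_samples.append(x)
            new_labels.append(y)
    return new_samples, new_labels


# Toy data invented for illustration; not taken from the arrhythmia dataset.
toy_samples = [[0.1], [0.2], [0.3], [0.4], [0.5], [0.9]]
toy_labels = ["normal"] * 5 + ["arrhythmia"]

balanced_samples, balanced_labels = resample_to_balance(toy_samples, toy_labels)
class_counts = Counter(balanced_labels)

# Three hypothetical trees vote on one sample.
vote = majority_vote(["normal", "arrhythmia", "arrhythmia"])
```

In this sketch the resampling is applied to the training data only; evaluating on resampled data would duplicate minority samples across the train and test splits and inflate the measured accuracy.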