文章基本信息

标题：Machine Learning Model to Analyze Telemonitoring Dyphosia Factors of Parkinson’s Disease
本地全文：下载
作者：Mohimenol Islam Fahim ; Syful Islam ; Sumaiya Tun Noor 等
期刊名称：International Journal of Advanced Computer Science and Applications(IJACSA)
印刷版ISSN：2158-107X
电子版ISSN：2156-5570
出版年度：2021
卷号：12
期号：8
DOI：10.14569/IJACSA.2021.0120890
语种：English
出版社：Science and Information Society (SAI)
摘要：For many years, lots of people have been suffering from Parkinson’s disease all over the world, and some datasets are generated by recording important PD features for reliable decision-making diagnostics. But a dataset can contain correlated data points and outliers that can affect the dataset’s output. In this work, a framework is proposed where the performance of an original dataset is compared to the performance of its reduced version after removing correlated features and outliers. The dataset is collected from UCI Machine Learning Repository, and many machine learning (ML) classifiers are used to evaluate its performance in various categories. The same process is repeated on the reduced dataset, and some improvement in prediction accuracy is noticed. Among ANOVA F-test, RFE, MIFS, and CSFS methods, the Logistic Regression classifier along with RFE-based feature selection technique outperforms all other classifiers. We observed that our improved system demonstrates 82.94%accuracy, 82.74% ROC, 82.9% F-measure, along with 17.46%false positive rate and 17.05% false negative rate, which are better compared to the primary dataset prediction accuracy metric values. Therefore, we hope that this model can be beneficial for physicians to diagnose PD more explicitly.
关键词：Parkinson’s disease; correlation; outliers; machine learning; RFE-based analysis