期刊名称:International Journal of Innovative Research in Computer and Communication Engineering
印刷版ISSN:2320-9798
电子版ISSN:2320-9801
出版年度:2021
卷号:9
期号:7
页码:8223-8227
DOI:10.15680/IJIRCCE.2021.0907062
语种:English
出版社:S&S Publications
摘要:Diabetes causes a large number of deaths every year and a large number of people living with the disease do not realize their health condition early enough. Early prediction of diabetes is an important issue in Health Care Services (HCS). In this study, a model for early diagnosis and prediction of diabetes using the Pima Indians Diabetes dataset is proposed. Various techniques and algorithms are designed for application in extracting knowledge and information in the diagnosis and treatment of disease from medical databases. This proposed model comprises PCA (Principal Component Analysis), K-means and Logistic Regression algorithm. To enhance the K-means clustering algorithm, PCA will be used to reduce the dataset to a lower dimension. Logistic regression algorithm is used to classify data items into categories. The model is useful for automatically predicting diabetes using patient electronic health records data.