期刊名称:International Journal of Hybrid Information Technology
印刷版ISSN:1738-9968
出版年度:2016
卷号:9
期号:4
页码:59-68
DOI:10.14257/ijhit.2016.9.4.06
出版社:SERSC
摘要:In computer aided diagnosis (CAD) process, one of the most challenging problems is data sparsity, which leads to the diagnosis results are not reliable. This paper proposes a clustering collaborative filtering based algorithm to solve the problem of data sparsity. In this paper, we use k-means clustering algorithm to cluster the same type of patients, and then adopt collaborative filtering method to fill the missing data values for each cluster, in this way to reduce the complexity of similarity calculation of collaborative filtering. The proposed method makes full use of the information-sharing mechanism of "similar patient population" to predict and fill the missing values. A hepatitis dataset is used for evaluating the performance of the algorithm. Results indicate that the proposed algorithm has better performance for medical record data sparsity problem.
关键词:Collaborative Filtering; Data sparsity; Computer Aided Diagnosis