期刊名称:IJAIN (International Journal of Advances in Intelligent Informatics)
印刷版ISSN:2442-6571
电子版ISSN:2548-3161
出版年度:2021
卷号:7
期号:3
页码:304-317
DOI:10.26555/ijain.v7i3.696
语种:English
出版社:Universitas Ahmad Dahlan
摘要:K-nearest neighbors (KNN) has been extensively used as imputation algorithm to substitute missing data with plausible values. One of the successes of KNN imputation is the ability to measure the missing data simulated from its nearest neighbors robustly. However, despite the favorable points, KNN still imposes undesirable circumstances. KNN suffers from high time complexity, choosing the right k, and different functions. Thus, this paper proposes a novel method for imputation of missing data, named KNNGOA, which optimized the KNN imputation technique based on the grasshopper optimization algorithm. Our GOA is designed to find the best value of k and optimize the imputed value from KNN that maximizes the imputation accuracy. Experimental evaluation for different types of datasets collected from UCI, with various rates of missing values ranging from 10%, 30%, and 50%. Our proposed algorithm has achieved promising results from the experiment conducted, which outperformed other methods, especially in terms of accuracy. .
关键词:Grasshopper;KNN;Imputation accuracy;GOA;Missing data