文章基本信息

标题：An Efficient Data Preprocessing Procedure for Support Vector Clustering
作者：Jeen-Shing Wang ; Jen-Chieh Chiang
期刊名称：Journal of Universal Computer Science
印刷版ISSN：0948-6968
出版年度：2009
卷号：15
期号：4
页码：705-721
出版社：Graz University of Technology and Know-Center
摘要：This paper presents an efficient data preprocessing procedure for the of support vector clustering (SVC) to reduce the size of a training dataset. Solving the optimization problem and labeling the data points with cluster labels are time-consuming in the SVC training procedure. This makes using SVC to process large datasets inefficient. We proposed a data preprocessing procedure to solve the problem. The procedure contains a shared nearest neighbor (SNN) algorithm, and utilizes the concept of unit vectors for eliminating insignificant data points from the dataset. Computer simulations have been conducted on artificial and benchmark datasets to demonstrate the effectiveness of the proposed method.