首页    期刊浏览 2026年01月03日 星期六
登录注册

文章基本信息

  • 标题:An algorithm for reducing the dimension and size of a sample for data exploration procedures
  • 本地全文:下载
  • 作者:Piotr Kulczycki ; Szymon Łukasik
  • 期刊名称:International Journal of Applied Mathematics and Computer Science
  • 电子版ISSN:2083-8492
  • 出版年度:2014
  • 卷号:24
  • 期号:1
  • DOI:10.2478/amcs-2014-0011
  • 出版社:De Gruyter Open
  • 摘要:The paper deals with the issue of reducing the dimension and size of a data set (random sample) for exploratory data analysis procedures. The concept of the algorithm investigated here is based on linear transformation to a space of a smaller dimension, while retaining as much as possible the same distances between particular elements. Elements of the transformation matrix are computed using the metaheuristics of parallel fast simulated annealing. Moreover, elimination of or a decrease in importance is performed on those data set elements which have undergone a significant change in location in relation to the others. The presented method can have universal application in a wide range of data exploration problems, offering flexible customization, possibility of use in a dynamic data environment, and comparable or better performance with regards to the principal component analysis. Its positive features were verified in detail for the domain’s fundamental tasks of clustering, classification and detection of atypical elements (outliers).
  • 关键词:dimension reduction; sample size reduction; linear transformation; simulated annealing; data mining
国家哲学社会科学文献中心版权所有