首页    期刊浏览 2024年10月05日 星期六
登录注册

文章基本信息

  • 标题:Fast Sketch-based Recovery of Correlation Outliers
  • 作者:Graham Cormode ; Jacques Dark
  • 期刊名称:LIPIcs : Leibniz International Proceedings in Informatics
  • 电子版ISSN:1868-8969
  • 出版年度:2018
  • 卷号:98
  • 页码:13:1-13:18
  • DOI:10.4230/LIPIcs.ICDT.2018.13
  • 出版社:Schloss Dagstuhl -- Leibniz-Zentrum fuer Informatik
  • 摘要:Many data sources can be interpreted as time-series, and a key problem is to identify which pairs out of a large collection of signals are highly correlated. We expect that there will be few, large, interesting correlations, while most signal pairs do not have any strong correlation. We abstract this as the problem of identifying the highly correlated pairs in a collection of n mostly pairwise uncorrelated random variables, where observations of the variables arrives as a stream. Dimensionality reduction can remove dependence on the number of observations, but further techniques are required to tame the quadratic (in n) cost of a search through all possible pairs. We develop a new algorithm for rapidly finding large correlations based on sketch techniques with an added twist: we quickly generate sketches of random combinations of signals, and use these in concert with ideas from coding theory to decode the identity of correlated pairs. We prove correctness and compare performance and effectiveness with the best LSH (locality sensitive hashing) based approach.
  • 关键词:correlation; sketching; streaming; dimensionality reduction
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有