首页    期刊浏览 2024年09月14日 星期六
登录注册

文章基本信息

  • 标题:A Framework for Estimating Stream Expression Cardinalities
  • 本地全文:下载
  • 作者:Anirban Dasgupta ; Kevin J. Lang ; Lee Rhodes
  • 期刊名称:LIPIcs : Leibniz International Proceedings in Informatics
  • 电子版ISSN:1868-8969
  • 出版年度:2016
  • 卷号:48
  • 页码:6:1-6:17
  • DOI:10.4230/LIPIcs.ICDT.2016.6
  • 出版社:Schloss Dagstuhl -- Leibniz-Zentrum fuer Informatik
  • 摘要:Given m distributed data streams A_1,..., A_m, we consider the problem of estimating the number of unique identifiers in streams defined by set expressions over A_1,..., A_m. We identify a broad class of algorithms for solving this problem, and show that the estimators output by any algorithm in this class are perfectly unbiased and satisfy strong variance bounds. Our analysis unifies and generalizes a variety of earlier results in the literature. To demonstrate its generality, we describe several novel sampling algorithms in our class, and show that they achieve a novel tradeoff between accuracy, space usage, update speed, and applicability.
  • 关键词:sketching; data stream algorithms; mergeability; distinct elements; set operations
国家哲学社会科学文献中心版权所有