首页    期刊浏览 2024年07月19日 星期五
登录注册

文章基本信息

  • 标题:True Randomness from Big Data
  • 本地全文:下载
  • 作者:Periklis A. Papakonstantinou ; David P. Woodruff ; Guang Yang
  • 期刊名称:Scientific Reports
  • 电子版ISSN:2045-2322
  • 出版年度:2016
  • 卷号:6
  • 期号:1
  • DOI:10.1038/srep33740
  • 语种:English
  • 出版社:Springer Nature
  • 摘要:Generating random bits is a difficult task, which is important for physical systems simulation, cryptography, and many applications that rely on high-quality random bits. Our contribution is to show how to generate provably random bits from uncertain events whose outcomes are routinely recorded in the form of massive data sets. These include scientific data sets, such as in astronomics, genomics, as well as data produced by individuals, such as internet search logs, sensor networks, and social network feeds. We view the generation of such data as the sampling process from a big source, which is a random variable of size at least a few gigabytes. Our view initiates the study of big sources in the randomness extraction literature. Previous approaches for big sources rely on statistical assumptions about the samples. We introduce a general method that provably extracts almost-uniform random bits from big sources and extensively validate it empirically on real data sets. The experimental findings indicate that our method is efficient enough to handle large enough sources, while previous extractor constructions are not efficient enough to be practical. Quality-wise, our method at least matches quantum randomness expanders and classical world empirical extractors as measured by standardized tests.
国家哲学社会科学文献中心版权所有