Basic article information

Title: Racing to unleash the full potential of big data with the latest statistical and machine-learning techniques.
Authors: Arun Kumar; Feng Niu; Christopher Ré; et al.
Journal: ACM Queue (Online): tomorrow's computing today
Electronic ISSN: 1542-7749
Publication year: 2013
Volume: 11
Issue: 1
Language: English
Publisher: Association for Computing Machinery
Abstract: Arun Kumar, Feng Niu, and Christopher Ré (Department of Computer Sciences, University of Wisconsin-Madison). The rise of big data presents both big opportunities and big challenges in domains ranging from enterprises to sciences. The opportunities include better-informed business decisions, more efficient supply-chain management and resource allocation, more effective targeting of products and advertisements, better ways to "organize the world's information," and faster turnaround of scientific discoveries. The challenges are also tremendous. For one, more and more data comes in diverse forms: text, audio, video, OCR (optical character recognition) output, sensor data, etc. While existing data management systems predominantly assume that data has rigid, precise semantics, a growing share of data, though valuable, contains imprecision or inconsistency. For another, the proliferation of ever-evolving algorithms for gaining insights from data (under names including machine learning, data mining, and statistical analysis) can be daunting to a developer with a particular data set and specific goals: the developer not only has to keep up with the state of the art, but also must expend significant development effort experimenting with different algorithms.