出版社:State Statistics Service of Ukraine, the National Academy of Statistics, Accounting and Audit (NASAA), the National Academy for Public Administration (NAPA) under the President of Ukraine
摘要:Розглянуто питання, пов’язані з потенційною можливістю використання в офіційній (державній) статистиці так званих “великих даних”. Висвітлено їх переваги, серед яких своєчасність, широке охоплення певних частин цільових сукупностей, скорочення витрат на їх отримання. Окреслено проблеми, які необхідно вирішувати при використанні “великих даних”. Наведено аргументи щодо наявності у прикладній та в офіційній статистиці прототипів інструментів, які за належного їх розвитку та адаптації дадуть можливість розв’язати основні з указаних проблем.↓Рассмотрены вопросы, связанные с потенциальной возможностью использования в официальной (государственной) статистике так называемых “больших данных”. Освещены их преимущества, среди которых своевременность, широкий охват определенных частей целевых совокупностей, сокращение расходов на их получение. Обозначены проблемы, которые необходимо решать при использовании "больших данных". Аргументировано наличие в прикладной и в официальной статистике прототипов инструментов, которые при надлежащем их развитии и адаптации позволят решить основные из указанных проблем.
其他摘要:Issues are discussed, related with potential use by official statistics of the so called “Big Data”, which refers to data extracted from websites, mobile phones, cash machines in retail sales networks, traffic surveillance cameras etc. These data are nicknamed as “big” mainly due to large scopes, not enabling for their processing by standard statistical tools but requiring special software and techniques. It is argued that “Big Data” have advantages such as timeliness, wide coverage of targeted population segments; their collection does not require special questionnaires or surveys, training or recruiting numerous paid personnel like supervisors or interviewers. When “Big Data” are used, accuracy requirements can be loosened, analysis of phenomena and processes can be made by quite simple procedures. As scopes of these data are increasing incessantly, often second by second, the only thing to do is to process them in a proper way, to analyze and use the output information. It is emphasized that use of “Big Data” is complicated due to the need to address problems like indeterminacy of the covered data sets; bias of estimates; accessibility of data, because they are mostly collected by private companies or belong to them; protection of private data, storage of large scopes of “Big Data” and their processing; statistical incorporation of numerous large data sets; risks of potential manipulation with data etc. Arguments are given that applied and official statistics have prototypes of tools capable to solve a major part of the above problems, once properly developed and adapted. They include methods for calibration of survey results, statistical aggregation of data, or model-based assessment of data. As regard “cloud” technologies for data storage and processing, their use can solve the problems of weak capacity of data carriers in statistical offices, and the problems of storage of private and confidential data. Results of studies conducted by leading statisticians of our days demonstrate that official statistics has no alternatives to use of “Bid Data”. The sooner this advanced field of statistics and information technologies comes in focus of the State Statistics Service, universities and research institutions, the easier new information sources and new statistical toolkit can be integrated in the official statistics within the forthcoming ten or fifteen years.
关键词:“Big Data”; information sources; statistical toolkit; o fficial statistics; in formation technologies.
其他关键词:“Big Data”; information sources; statistical toolkit; o fficial statistics; in formation technologies.