期刊名称:Journal of Computer Sciences and Applications
印刷版ISSN:2328-7268
电子版ISSN:2328-725X
出版年度:2015
卷号:3
期号:6
页码:177-180
DOI:10.12691/jcsa-3-6-13
出版社:Science and Education Publishing
摘要:With the rapid growth of emerging applications like social network, semantic web, sensor networks and LBS (Location Based Service) applications, a variety of data to be processed continues to witness a quick increase. Effective management and processing of large-scale data poses an interesting but critical challenge. Recently, big data has attracted a lot of attention from academia, industry as well as government. This paper introduces several big data processing techniques from system and application aspects. First, from the view of cloud data management and big data processing mechanisms, we present the key issues of big data processing, including definition of big data, big data management platform, big data service models, distributed file system, data storage, data virtualization platform and distributed applications. Following the Map Reduce parallel processing framework, we introduce some MapReduce optimization strategies reported in the literature. Finally, we discuss the open issues and challenges, and deeply explore the research directions in the future on big data processing in cloud computing environments.
关键词:big data; cloud computing; data management; distributed processing