首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:A Survey on Big Data privacy using Hadoop Architecture
  • 本地全文:下载
  • 作者:Priyank Jain ; Manasi Gyanchandani ; Nilay Khare
  • 期刊名称:International Journal of Computer Science and Network Security
  • 印刷版ISSN:1738-7906
  • 出版年度:2017
  • 卷号:17
  • 期号:2
  • 页码:148-155
  • 出版社:International Journal of Computer Science and Network Security
  • 摘要:Big Data is the term for any gathering of datasets so vast and complex that it gets to be distinctly troublesome to process using traditional data processing applications. The challenges include analysis, catch, curation, look, sharing, stockpiling, exchange, perception, and security infringement. Big data is a set of techniques and technologies that require new forms of integration to uncover huge concealed qualities from substantial datasets that are assorted, complex, and of a huge scale. Big data environment is used to acquire, organize and analyze the various types of data. Data that is so substantial in volume, so differing in assortment or moving with such speed is called big data. Analyzing Big Data is a challenging task as it involves large distributed file systems which should be fault tolerant, flexible and scalable. For such data-intensive applications, the Apache Hadoop Framework has recently attracted a lot of attention. This framework Adopted MapReduce, it is a programming model and a related execution for preparing and producing large data sets. The technologies used by big data application to handle the massive data are Hadoop, Map Reduce, Apache Hive, No SQL and HPCC. To begin with, we introduce the meaning of enormous information and discuss big data challenges. Hadoop is the core platform for structuring Big Data, and tackles the issue of making it helpful for examination purposes. Hadoop is an open source programming project that enables the distributed processing of large data sets across clusters of commodity servers. It is intended to scale up from a solitary server to a great many machines, with an extremely high degree of fault tolerance. This paper refer privacy and security aspects healthcare in big data. Next, we present Existing techniques of anonymization using MapReduce framework of big data privacy is also done as well.
  • 关键词:Big Data; Hadoop; HDFS; MapReduce; Hadoop Components; Hive; NoSQL; Hpcc
国家哲学社会科学文献中心版权所有