Journal: International Journal of Engineering and Computer Science
Print ISSN: 2319-7242
Publication year: 2015
Volume: 4
Issue: 4
Pages: 11273-11278
Publisher: IJECS
Abstract: HBase is also called the Hadoop database because it is a NoSQL database that runs on top of Hadoop. It combines the scalability of Hadoop, by running on the Hadoop Distributed File System (HDFS), with real-time data access as a key-value store and the deep analytic capabilities of MapReduce. This article introduces HBase, describes how it organizes and manages data, and then demonstrates how to set up a local HBase environment and interact with data using the HBase shell. HBase is a data model similar to Google's Bigtable, designed to provide quick random access to huge amounts of structured data. It leverages the fault tolerance provided by HDFS. It is a part of the Hadoop ecosystem that sits on top of HDFS and provides random, real-time read and write access to the data stored there. Data can be stored in HDFS either directly or through HBase, and data consumers read and access the data in HDFS randomly using HBase. HBase is a key-value store; specifically, it is a reliable, distributed, multidimensional, sorted map. HBase stores data in the form of cells. Cells are grouped by a row key into something that looks like a row, but each cell is stored individually, so the storage is sparse. The article reports HBase performance testing on a single-node cluster set-up. Because query performance is slow on a single node, a multi-node cluster set-up is introduced to improve query performance in the HBase cluster environment.
Keywords: Hadoop; MapReduce; HDFS; HBase; NoSQL; multi-node cluster environment
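
Illustrative note: the abstract characterizes HBase as a sparse, sorted, multidimensional map in which cells are grouped by row key. The following is a minimal in-memory sketch of that idea in Python, not the HBase API and not code from the article. The class name `SparseSortedTable`, its methods, and the "family:qualifier" column strings are all hypothetical names chosen for illustration, and explicit integer timestamps are assumed.

```python
from collections import defaultdict


class SparseSortedTable:
    """Toy model of a sparse, sorted, multidimensional map (hypothetical, not HBase)."""

    def __init__(self):
        # row key -> column ("family:qualifier") -> timestamp -> value
        # Only cells that were actually written occupy space (sparse storage).
        self._rows = defaultdict(lambda: defaultdict(dict))

    def put(self, row, column, value, ts):
        """Write one versioned cell."""
        self._rows[row][column][ts] = value

    def get(self, row, column):
        """Return the newest value of a cell, or None if it was never written."""
        versions = self._rows.get(row, {}).get(column)
        if not versions:
            return None
        return versions[max(versions)]

    def scan(self, start=None, stop=None):
        """Yield (row, {column: newest value}) in row-key order.

        start is inclusive and stop is exclusive.
        """
        for row in sorted(self._rows):
            if start is not None and row < start:
                continue
            if stop is not None and row >= stop:
                break
            cells = self._rows[row]
            yield row, {col: vers[max(vers)] for col, vers in sorted(cells.items())}


if __name__ == "__main__":
    table = SparseSortedTable()
    table.put("user2", "info:name", "Bob", ts=1)
    table.put("user1", "info:name", "Alice", ts=1)
    table.put("user1", "info:name", "Alicia", ts=2)  # newer version of the same cell
    for row, cells in table.scan():
        print(row, cells)
```

Rows come back ordered by key regardless of insertion order, and a row holds only the columns that were written to it, which is the sense in which the abstract calls the storage sparse.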