首页    期刊浏览 2024年09月21日 星期六
登录注册

文章基本信息

  • 标题:Inverted Indexing In Big Data Using Hadoop Multiple Node Cluster
  • 本地全文:下载
  • 作者:Kaushik Velusamy ; Deepthi Venkitaramanan ; Nivetha Vijayaraju
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2013
  • 卷号:4
  • 期号:11
  • DOI:10.14569/IJACSA.2013.041122
  • 出版社:Science and Information Society (SAI)
  • 摘要:Inverted Indexing is an efficient, standard data structure, most suited for search operation over an exhaustive set of data. The huge set of data is mostly unstructured and does not fit into traditional database categories. Large scale processing of such data needs a distributed framework such as Hadoop where computational resources could easily be shared and accessed. An implementation of a search engine in Hadoop over millions of Wikipedia documents using an inverted index data structure would be carried out for making search operation more accomplished. Inverted index data structure is used for mapping a word in a file or set of files to their corresponding locations. A hash table is used in this data structure which stores each word as index and their corresponding locations as its values thereby providing easy lookup and retrieval of data making it suitable for search operations.
  • 关键词:thesai; IJACSA; thesai.org; journal; IJACSA papers; Hadoop; Big data; inverted indexing; data structure
国家哲学社会科学文献中心版权所有