首页    期刊浏览 2024年07月09日 星期二
登录注册

文章基本信息

  • 标题:A review on compressed pattern matching
  • 本地全文:下载
  • 作者:Surya Prakash Mishra ; Surya Prakash Mishra ; Col. Gurmit Singh
  • 期刊名称:Perspectives in Science
  • 印刷版ISSN:2213-0209
  • 电子版ISSN:2213-0209
  • 出版年度:2016
  • 卷号:8
  • 页码:727-729
  • DOI:10.1016/j.pisc.2016.06.071
  • 语种:English
  • 出版社:Elsevier
  • 摘要:Summary Compressed pattern matching (CPM) refers to the task of locating all the occurrences of a pattern (or set of patterns) inside the body of compressed text. In this type of matching, pattern may or may not be compressed. CPM is very useful in handling large volume of data especially over the network. It has many applications in computational biology, where it is useful in finding similar trends in DNA sequences; intrusion detection over the networks, big data analytics etc. Various solutions have been provided by researchers where pattern is matched directly over the uncompressed text. Such solution requires lot of space and consumes lot of time when handling the big data. Various researchers have proposed the efficient solutions for compression but very few exist for pattern matching over the compressed text. Considering the future trend where data size is increasing exponentially day-by-day, CPM has become a desirable task. This paper presents a critical review on the recent techniques on the compressed pattern matching. The covered techniques includes: Word based Huffman codes, Word Based Tagged Codes; Wavelet Tree Based Indexing. We have presented a comparative analysis of all the techniques mentioned above and highlighted their advantages and disadvantages.
  • 关键词:Pattern matching; Compressed pattern matching; Wavelet tree; Word based tagged code and big data;
国家哲学社会科学文献中心版权所有