首页    期刊浏览 2025年08月09日 星期六
登录注册

文章基本信息

  • 标题:Using Inverted Files to Compress Text
  • 本地全文:下载
  • 作者:Ristov, Strahil
  • 期刊名称:Journal of Computing and Information Technology
  • 印刷版ISSN:1330-1136
  • 电子版ISSN:1846-3908
  • 出版年度:2002
  • 卷号:10
  • 期号:3
  • 页码:157-161
  • 出版社:SRCE - Sveučilišni računski centar
  • 摘要:This is the first report on a new approach to text compression. It consists of representing the text file with compressed inverted file index in conjunction with very compact lexicon, where lexicon includes every word in the text. The index is compressed using standard index compression techniques, and lexicon is compressed by original dictionary compression method that gives better compression results than existing procedures. Compression procedure is complex, but decompression time is linear with the file size, although it requires two passes and hence can not be performed online. First experiments show that this method, when refined, can be competitive for larger texts that only need to be decompressed in the real time.
国家哲学社会科学文献中心版权所有