Article Information

  • Title: Enhancement of Accuracy of ANN Data Mining Algorithm for Protein Classification Based on Architecture Optimization
  • Authors: Nandika Salwan; Jasmine Kaur
  • Journal: International Journal of Computer Science & Technology
  • Print ISSN: 2229-4333
  • Electronic ISSN: 0976-8491
  • Year of publication: 2014
  • Volume: 5
  • Issue: 1
  • Pages: 123-125
  • Language: English
  • Publisher: Ayushmaan Technologies
  • Abstract: In order to cope with the vast diversity of book content and typefaces, it is important for OCR systems to leverage the strong consistency within a book but adapt to variations across books. In this work, we describe a system that combines two parallel correction paths using document-specific image and language models. Each model adapts to shapes and vocabularies within a book to identify inconsistencies as correction hypotheses, but relies on the other for effective cross-validation. Using the open source Tesseract engine as baseline, results on a large dataset of scanned books demonstrate that word error rates can be reduced by 25% using this approach.
  • Keywords: Proteins; Data Mining; Artificial Neural Network