首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:Mining Revision Log of Language Learning SNS for Automated Japanese Error Correction
  • 本地全文:下载
  • 作者:Tomoya Mizumoto ; Mamoru Komachi ; Masaaki Nagata
  • 期刊名称:人工知能学会論文誌
  • 印刷版ISSN:1346-0714
  • 电子版ISSN:1346-8030
  • 出版年度:2013
  • 卷号:28
  • 期号:5
  • 页码:420-432
  • DOI:10.1527/tjsai.28.420
  • 出版社:The Japanese Society for Artificial Intelligence
  • 摘要:Recently, natural language processing research has begun to pay attention to second language learning. However, it is not easy to acquire a large-scale learners' corpus, which is important for a research for second language learning by natural language processing. We present an attempt to extract a large-scale Japanese learners' corpus from the revision log of a language learning social network service.This corpus is easy to obtain in large-scale, covers a wide variety of topics and styles, and can be a great source of knowledge for both language learners and instructors. We also demonstrate that the extracted learners' corpus of Japanese as a second language can be used as training data for learners' error correction using a statistical machine translation approach.We evaluate different granularities of tokenization to alleviate the problem of word segmentation errors caused by erroneous input from language learners.We propose a character-based SMT approach to alleviate the problem of erroneous input from language learners.Experimental results show that the character-based model outperforms the word-based model when corpus size is small and test data is written by the learners whose L1 is English.
  • 关键词:Japanese error correction ; mining revision log ; language learning SNS ; second language learning
国家哲学社会科学文献中心版权所有