文章基本信息

标题：On SemEval-2010 Japanese WSD Task
本地全文：下载
作者：Manabu Okumura ; Kiyoaki Shirai ; Kanako Komiya 等
期刊名称：Information and Media Technologies
电子版ISSN：1881-0896
出版年度：2011
卷号：6
期号：3
页码：730-744
DOI：10.11185/imt.6.730
出版社：Information and Media Technologies Editorial Board
摘要：An overview of the SemEval-2 Japanese WSD task is presented. The new characteristics of our task are (1) the task will use the first balanced Japanese sense-tagged corpus, and (2) the task will take into account not only the instances that have a sense in the given set but also the instances that have a sense that cannot be found in the set. It is a lexical sample task, and word senses are defined according to a Japanese dictionary, the Iwanami Kokugo Jiten. This dictionary and a training corpus were distributed to participants. The number of target words was 50, with 22 nouns, 23 verbs, and 5 adjectives. Fifty instances of each target word were provided, consisting of a total of 2,500 instances for the evaluation. Nine systems from four organizations participated in the task.
关键词：Word Sense Disambiguation;Balanced Corpus;Text Genre;New Word Sense