Article Information

Title: The DOP Estimation Method Is Biased and Inconsistent
Author: Mark Johnson
Journal: Computational Linguistics
Print ISSN: 0891-2017
Electronic ISSN: 1530-9312
Publication year: 2002
Volume: 28
Issue: 1
Pages: 71-76
DOI: 10.1162/089120102317341783
Language: English
Publisher: MIT Press
Abstract: A data-oriented parsing or DOP model for statistical parsing associates fragments of linguistic representations with numerical weights, where these weights are estimated by normalizing the empirical frequency of each fragment in a training corpus (see Bod [1998] and references cited therein). This note observes that this estimation method is biased and inconsistent: that is, the estimated distribution does not in general converge on the true distribution as the size of the training corpus increases.
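
As a rough illustration of the estimation step the abstract describes, the sketch below turns fragment counts into weights by normalization. It assumes counts are normalized among fragments that share the same root label, as in the usual DOP1 formulation. The function name `relative_frequency_weights`, the `(root, fragment)` key format, and the toy counts are hypothetical and do not come from the paper.

```python
from collections import Counter, defaultdict


def relative_frequency_weights(fragment_counts):
    """Normalize fragment counts so weights sum to 1 for each root label.

    Assumption: keys are (root_label, fragment) pairs and values are the
    number of times the fragment occurs in the training corpus.
    """
    root_totals = defaultdict(int)
    for (root, _fragment), count in fragment_counts.items():
        root_totals[root] += count
    return {
        key: count / root_totals[key[0]]
        for key, count in fragment_counts.items()
    }


# Toy counts (hypothetical, for illustration only).
toy_counts = Counter({
    ("S", "S -> NP VP"): 3,
    ("S", "S -> (NP she) VP"): 1,
    ("NP", "NP -> she"): 2,
})
weights = relative_frequency_weights(toy_counts)
```

The note's claim concerns what happens to weights of this kind as the corpus grows: the resulting distribution over trees need not converge to the distribution that generated the corpus.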