期刊名称:DESIDOC Journal of Library & Information Technology
电子版ISSN:0976-4658
出版年度:2007
卷号:27
期号:4
页码:51-58
DOI:10.14429/djlit.27.4.187
语种:English
出版社:DESIDOC, Ministry of Defence, India
摘要:Zipf’s law has attracted infometricians time and again. There have been many studies, which have explored the application of Zipf’s law to various areas. However, there are a few parameters, which largely affect a study. These parameters are the power law embedded in Zipf’s law, the ranking method, the type of text taken for the study and the behaviour of extreme regions in the Zipf’s curve. This paper tries to address all these points by taking a random text in English language from computer science literature. The selected text is called random because of its highly specific nature of technical words. The paper studies the properties of this text and compares the product of rank and frequency for three ranking procedures. It also analyses the performance of data in the extreme regions of the Zipf’s curve. It is observed that ranking procedure and type of text have definite bearings on the performance of Zipf’s curve.http://dx.doi.org/10.14429/djlit.27.4.187
关键词:Zipf’s law; zipf’s curve; infometrics; power law ; computer science