Article Information

Title: Development and Validation of a Corpus of Written Parliamentary Questions in the Hellenic Parliament
Authors: Fotios Fitsilis; George Mikros
Journal: Journal of Open Humanities Data
Electronic ISSN: 2059-481X
Publication year: 2021
Volume: 7
DOI: 10.5334/johd.45
Language: English
Publisher: Ubiquity Press
Abstract: This paper presents the development of the first parliamentary corpus of written questions in the Hellenic Parliament. Moreover, we discuss a well-defined end-to-end process that has been streamlined and optimised to produce high-quality open text data based on parliamentary documents. Based on the above methodology, a representative sample of 2,000 questions from four parliamentary periods in the Hellenic Parliament has been extracted, validated, and placed into an open data repository. Furthermore, open data production is analysed, and several degrees of freedom in its application in alternative data sets are proposed and discussed. Consequently, the authors argue that this method constitutes a transferable and scalable practice that can be used by other representative institutions for the creation and subsequent study of their open data.