首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:Retrieving, classifying and analysing narrative commentary in unstructured (glossy) annual reports published as PDF files
  • 本地全文:下载
  • 作者:Mahmoud El-Haj ; Paulo Alves ; Paul Rayson
  • 期刊名称:Accounting and Business Research
  • 印刷版ISSN:0001-4788
  • 电子版ISSN:2159-4260
  • 出版年度:2020
  • 卷号:50
  • 期号:1
  • 页码:6-34
  • DOI:10.1080/00014788.2019.1609346
  • 语种:English
  • 出版社:Taylor & Francis Group
  • 摘要:We provide a methodological contribution by developing, describing and evaluating a methodfor automatically retrieving and analysing text from digital PDF annual report files publishedby firms listed on the London Stock Exchange (LSE). The retrieval method retains informationon document structure, enabling clear delineation between narrative and financial statementcomponents of reports, and between individual sections within the narratives component.Retrieval accuracy exceeds 95% for manual validations using a random sample of 586reports. Large-sample statistical validations using a comprehensive sample of reportspublished by non-financial LSE firms confirm that report length, narrative tone and (to alesser degree) readability vary predictably with economic and regulatory factors. Wedemonstrate how the method is adaptable to non-English language documents and differentregulatory regimes using a case study of Portuguese reports. We use the procedure toconstruct new research resources including corpora for commonly occurring annual reportsections and a dataset of text properties for over 26,000 U.K. annual reports.
  • 关键词:Annual reports;textual analysis;unstructured documents;narrative reporting
国家哲学社会科学文献中心版权所有