
Article information

  • Title: The Metadata Coverage Index (MCI): A standardized metric for quantifying database metadata richness
  • Authors: Konstantinos Liolios; Lynn Schriml; Lynette Hirschman
  • Journal: Environmental Microbiome
  • Print ISSN: 2524-6372
  • Publication year: 2012
  • Volume: 6
  • Issue: 3
  • Pages: 444-453
  • DOI: 10.4056/sigs.2675953
  • Abstract: Variability in the extent of the descriptions of data (‘metadata’) held in public repositories forces users to assess the quality of records individually, which rapidly becomes impractical. The scoring of records on the richness of their description provides a simple, objective proxy measure for quality that enables filtering that supports downstream analysis. Pivotally, such descriptions should spur on improvements. Here, we introduce such a measure, the ‘Metadata Coverage Index’ (MCI): the percentage of available fields actually filled in a record or description. MCI scores can be calculated across a database, for individual records or for their component parts (e.g., fields of interest). There are many potential uses for this simple metric, for example: to filter, rank or search for records; to assess the metadata availability of an ad hoc collection; to determine the frequency with which fields in a particular record type are filled, especially with respect to standards compliance; to assess the utility of specific tools and resources, and of data capture practice more generally; to prioritize records for further curation; to serve as performance metrics of funded projects; or to quantify the value added by curation. Here we demonstrate the utility of MCI scores using metadata from the Genomes Online Database (GOLD), including records compliant with the ‘Minimum Information about a Genome Sequence’ (MIGS) standard developed by the Genomic Standards Consortium. We discuss challenges and address the further application of MCI scores: to show improvements in annotation quality over time, to inform the work of standards bodies and repository providers on the usability and popularity of their products, and to assess and credit the work of curators. Such an index provides a step towards putting metadata capture practices and, in the future, standards compliance into a quantitative and objective framework.
  • Keywords: Individual Record; Human Microbiome Project; NCBI Taxonomy; Genomic Standard Consortium; Genome Online Database
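
The abstract defines the MCI as the percentage of available fields that are actually filled in a record, and notes that it can be computed for individual records, for single fields, or across a database. The following is a minimal Python sketch of that arithmetic, not the authors' implementation. The field names, the sample records, and the rule for what counts as "filled" (anything other than `None` or a blank string) are assumptions made for illustration, as is the choice to score a collection by averaging its per-record scores.

```python
from typing import Any, Mapping, Sequence


def is_filled(value: Any) -> bool:
    # Assumption: None and blank/whitespace-only strings count as unfilled.
    if value is None:
        return False
    if isinstance(value, str):
        return value.strip() != ""
    return True


def record_mci(record: Mapping[str, Any], fields: Sequence[str]) -> float:
    """Percentage of the available fields that are filled in one record."""
    if not fields:
        raise ValueError("fields must not be empty")
    filled = sum(1 for name in fields if is_filled(record.get(name)))
    return 100.0 * filled / len(fields)


def field_fill_rate(records: Sequence[Mapping[str, Any]], field: str) -> float:
    """Percentage of records in which one field of interest is filled."""
    if not records:
        raise ValueError("records must not be empty")
    filled = sum(1 for rec in records if is_filled(rec.get(field)))
    return 100.0 * filled / len(records)


def collection_mci(
    records: Sequence[Mapping[str, Any]], fields: Sequence[str]
) -> float:
    """Assumption: a collection's score is the mean of its record scores."""
    if not records:
        raise ValueError("records must not be empty")
    return sum(record_mci(rec, fields) for rec in records) / len(records)


# Hypothetical field list and records, for illustration only.
FIELDS = ["organism", "isolation_site", "sequencing_method", "habitat"]
RECORDS = [
    {
        "organism": "Escherichia coli",
        "isolation_site": "gut",
        "sequencing_method": "",  # present but blank, so not filled
        "habitat": "host-associated",
    },
    {"organism": "Bacillus subtilis", "isolation_site": None},
]

if __name__ == "__main__":
    for rec in RECORDS:
        print(rec["organism"], record_mci(rec, FIELDS))
    print("collection", collection_mci(RECORDS, FIELDS))
    print("habitat fill rate", field_fill_rate(RECORDS, "habitat"))
```

Because every record is scored against the same field list, averaging the per-record scores gives the same number as counting filled cells over the whole collection. Scoring against a standard-specific field list (for example, only the fields a checklist requires) would only change the `fields` argument.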