期刊名称:Libellarium : Journal for the history of writing, books, and memory institutions
印刷版ISSN:1846-8527
电子版ISSN:1846-9213
出版年度:2017
卷号:9
期号:2
页码:0-0
语种:English
出版社:Department of Library and Information Science
摘要:This paper presents a method to facilitate decision making for the preservation of digital content in libraries and archives using institutional risk profiles that highlight endangered files formats (in danger of becoming inaccessible or unusable). The primary contribution of this work is the combined use of both machine-mined data and human-expert input to select and configure institution-specific preservation risk profiles. The machine-mined data used the developed File Format Metadata Aggregator (FFMA), and the crowdsourced expert input was collected via two surveys of digital preservation practitioners. A by-product of this endeavor is the ability to visualize risk factors for analysis. The underlying decision support system used the Cosine Similarity algorithm to provide recommendations for matching risk profiles to selected institutional risk settings. This method improves the interpretability of risk factor values and the quality of a digital preservation process. The aggregated information about the risk factors is presented as a multidimensional vector that shows a particular analysis focus and its resulting impact on selected file formats. Sample risk profile calculations and the visualization of risk factor dimensions are shared in the evaluation section.
关键词:digital preservation; file format; institutional risk profiles; decision support system; information aggregation