{"title":"一个便于数字保存文件格式选择的决策支持系统","authors":"Roman Graf, H. Ryan, Tibaut Houzanme, S. Gordea","doi":"10.15291/LIBELLARIUM.V9I2.274","DOIUrl":null,"url":null,"abstract":"This paper presents a method to facilitate decision making for the preservation of digital content in libraries and archives using institutional risk profiles that highlight endangered files formats (in danger of becoming inaccessible or unusable). The primary contribution of this work is the combined use of both machine-mined data and human-expert input to select and configure institution-specific preservation risk profiles. The machine-mined data used the developed File Format Metadata Aggregator (FFMA), and the crowdsourced expert input was collected via two surveys of digital preservation practitioners. A by-product of this endeavor is the ability to visualize risk factors for analysis. The underlying decision support system used the Cosine Similarity algorithm to provide recommendations for matching risk profiles to selected institutional risk settings. This method improves the interpretability of risk factor values and the quality of a digital preservation process. The aggregated information about the risk factors is presented as a multidimensional vector that shows a particular analysis focus and its resulting impact on selected file formats. Sample risk profile calculations and the visualization of risk factor dimensions are shared in the evaluation section.","PeriodicalId":30549,"journal":{"name":"Libellarium Journal for the Research of Writing Books and Cultural Heritage Institutions","volume":" ","pages":"0-0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Decision Support System to Facilitate File Format Selection for Digital Preservation\",\"authors\":\"Roman Graf, H. Ryan, Tibaut Houzanme, S. Gordea\",\"doi\":\"10.15291/LIBELLARIUM.V9I2.274\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a method to facilitate decision making for the preservation of digital content in libraries and archives using institutional risk profiles that highlight endangered files formats (in danger of becoming inaccessible or unusable). The primary contribution of this work is the combined use of both machine-mined data and human-expert input to select and configure institution-specific preservation risk profiles. The machine-mined data used the developed File Format Metadata Aggregator (FFMA), and the crowdsourced expert input was collected via two surveys of digital preservation practitioners. A by-product of this endeavor is the ability to visualize risk factors for analysis. The underlying decision support system used the Cosine Similarity algorithm to provide recommendations for matching risk profiles to selected institutional risk settings. This method improves the interpretability of risk factor values and the quality of a digital preservation process. The aggregated information about the risk factors is presented as a multidimensional vector that shows a particular analysis focus and its resulting impact on selected file formats. Sample risk profile calculations and the visualization of risk factor dimensions are shared in the evaluation section.\",\"PeriodicalId\":30549,\"journal\":{\"name\":\"Libellarium Journal for the Research of Writing Books and Cultural Heritage Institutions\",\"volume\":\" \",\"pages\":\"0-0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-03-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Libellarium Journal for the Research of Writing Books and Cultural Heritage Institutions\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.15291/LIBELLARIUM.V9I2.274\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Libellarium Journal for the Research of Writing Books and Cultural Heritage Institutions","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15291/LIBELLARIUM.V9I2.274","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Decision Support System to Facilitate File Format Selection for Digital Preservation
This paper presents a method to facilitate decision making for the preservation of digital content in libraries and archives using institutional risk profiles that highlight endangered files formats (in danger of becoming inaccessible or unusable). The primary contribution of this work is the combined use of both machine-mined data and human-expert input to select and configure institution-specific preservation risk profiles. The machine-mined data used the developed File Format Metadata Aggregator (FFMA), and the crowdsourced expert input was collected via two surveys of digital preservation practitioners. A by-product of this endeavor is the ability to visualize risk factors for analysis. The underlying decision support system used the Cosine Similarity algorithm to provide recommendations for matching risk profiles to selected institutional risk settings. This method improves the interpretability of risk factor values and the quality of a digital preservation process. The aggregated information about the risk factors is presented as a multidimensional vector that shows a particular analysis focus and its resulting impact on selected file formats. Sample risk profile calculations and the visualization of risk factor dimensions are shared in the evaluation section.