技术辅助敏感性评价中敏感性分类有效性如何影响审稿人

Proceedings of the 2019 Conference on Human Information Interaction and Retrieval Pub Date : 2019-03-08 DOI:10.1145/3295750.3298962

G. Mcdonald, C. Macdonald, I. Ounis

{"title":"技术辅助敏感性评价中敏感性分类有效性如何影响审稿人","authors":"G. Mcdonald, C. Macdonald, I. Ounis","doi":"10.1145/3295750.3298962","DOIUrl":null,"url":null,"abstract":"All government documents that are released to the public must first be manually reviewed to identify and protect any sensitive information, e.g. confidential information. However, the unassisted manual sensitivity review of born-digital documents is not practical due to, for example, the volume of documents that are created. Previous work has shown that sensitivity classification can be effective for predicting if a document contains sensitive information. However, since all of the released documents must be manually reviewed, it is important to know if sensitivity classification can assist sensitivity reviewers in making their sensitivity judgements. Hence, in this paper, we conduct a digital sensitivity review user study, to investigate if the accuracy of sensitivity classification effects the number of documents that a reviewer correctly judges to be sensitive or not (reviewer accuracy) and the time that it takes to sensitivity review a document (reviewing speed). Our results show that providing reviewers with sensitivity classification predictions, from a classifier that achieves 0.7 Balanced Accuracy, results in a 38% increase in mean reviewer accuracy and an increase of 72% in mean reviewing speeds, compared to when reviewers are not provided with predictions. Overall, our findings demonstrate that sensitivity classification is a viable technology for assisting with the sensitivity review of born-digital government documents.","PeriodicalId":187771,"journal":{"name":"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval","volume":"274 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"How Sensitivity Classification Effectiveness Impacts Reviewers in Technology-Assisted Sensitivity Review\",\"authors\":\"G. Mcdonald, C. Macdonald, I. Ounis\",\"doi\":\"10.1145/3295750.3298962\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"All government documents that are released to the public must first be manually reviewed to identify and protect any sensitive information, e.g. confidential information. However, the unassisted manual sensitivity review of born-digital documents is not practical due to, for example, the volume of documents that are created. Previous work has shown that sensitivity classification can be effective for predicting if a document contains sensitive information. However, since all of the released documents must be manually reviewed, it is important to know if sensitivity classification can assist sensitivity reviewers in making their sensitivity judgements. Hence, in this paper, we conduct a digital sensitivity review user study, to investigate if the accuracy of sensitivity classification effects the number of documents that a reviewer correctly judges to be sensitive or not (reviewer accuracy) and the time that it takes to sensitivity review a document (reviewing speed). Our results show that providing reviewers with sensitivity classification predictions, from a classifier that achieves 0.7 Balanced Accuracy, results in a 38% increase in mean reviewer accuracy and an increase of 72% in mean reviewing speeds, compared to when reviewers are not provided with predictions. Overall, our findings demonstrate that sensitivity classification is a viable technology for assisting with the sensitivity review of born-digital government documents.\",\"PeriodicalId\":187771,\"journal\":{\"name\":\"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval\",\"volume\":\"274 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-03-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3295750.3298962\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3295750.3298962","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

所有向公众发布的政府文件都必须首先经过人工审查，以识别和保护任何敏感信息，例如机密信息。然而，由于所创建的文档的数量等原因，对出生的数字文档进行无辅助的手动灵敏度审查是不切实际的。先前的工作表明，敏感性分类可以有效地预测文档是否包含敏感信息。然而，由于所有发布的文件都必须人工审核，因此了解敏感性分类是否可以帮助敏感性审核员做出敏感性判断是很重要的。因此，在本文中，我们进行了一项数字敏感性审查用户研究，研究敏感性分类的准确性是否影响审稿人正确判断为敏感或不敏感的文件数量(审稿人准确性)和敏感审查文件所需的时间(审查速度)。我们的研究结果表明，与没有提供预测的审稿人相比，为审稿人提供灵敏度分类预测，从达到0.7平衡精度的分类器中，平均审稿人准确率提高了38%，平均审稿速度提高了72%。总体而言，我们的研究结果表明，敏感性分类是一种可行的技术，有助于对出生的数字政府文件进行敏感性审查。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

How Sensitivity Classification Effectiveness Impacts Reviewers in Technology-Assisted Sensitivity Review

All government documents that are released to the public must first be manually reviewed to identify and protect any sensitive information, e.g. confidential information. However, the unassisted manual sensitivity review of born-digital documents is not practical due to, for example, the volume of documents that are created. Previous work has shown that sensitivity classification can be effective for predicting if a document contains sensitive information. However, since all of the released documents must be manually reviewed, it is important to know if sensitivity classification can assist sensitivity reviewers in making their sensitivity judgements. Hence, in this paper, we conduct a digital sensitivity review user study, to investigate if the accuracy of sensitivity classification effects the number of documents that a reviewer correctly judges to be sensitive or not (reviewer accuracy) and the time that it takes to sensitivity review a document (reviewing speed). Our results show that providing reviewers with sensitivity classification predictions, from a classifier that achieves 0.7 Balanced Accuracy, results in a 38% increase in mean reviewer accuracy and an increase of 72% in mean reviewing speeds, compared to when reviewers are not provided with predictions. Overall, our findings demonstrate that sensitivity classification is a viable technology for assisting with the sensitivity review of born-digital government documents.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2019 Conference on Human Information Interaction and Retrieval

自引率

0.00%

发文量

期刊最新文献

Assisting Health Consumers While Searching the Web through Medical Annotations Do Metrics Matter? Understanding Context for Tasks and Activities NotifyMeHere: Intelligent Notification Delivery in Multi-Device Environments Computational Surprise, Perceptual Surprise, and Personal Background in Text Understanding