{"title":"A Fact-checking Assistant System for Textual Documents*","authors":"Tomoya Furuta, Yumiko Suzuki","doi":"10.1109/MIPR51284.2021.00046","DOIUrl":null,"url":null,"abstract":"This paper proposes a system for identifying which parts of textual documents the editors should do fact-checking. Using our system, we can reduce editors’ time and efforts by identifying descriptions that need fact-checking. To accomplish this purpose, we construct a machine-learning-based classifier of sentences, which classifies a part of documents into four classes: according to the necessity of fact-checking. We assume that there are typical descriptions which contain misinformation. Therefore, if we collect the documents and their revised documents, and labels whether their revisions are corrections or not, we can construct the classifier by learning the dataset. To construct this classifier, we build a dataset that includes a set of sentences which are revised more than once, from Wikipedia edit history. The labels indicate the degree of sentence corrections by editors. We develop a Web-based system for demonstrating our proposed approach. When we input texts, the system predicts which parts of the texts the editors should re-confirm the facts.","PeriodicalId":139543,"journal":{"name":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIPR51284.2021.00046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper proposes a system for identifying which parts of textual documents the editors should do fact-checking. Using our system, we can reduce editors’ time and efforts by identifying descriptions that need fact-checking. To accomplish this purpose, we construct a machine-learning-based classifier of sentences, which classifies a part of documents into four classes: according to the necessity of fact-checking. We assume that there are typical descriptions which contain misinformation. Therefore, if we collect the documents and their revised documents, and labels whether their revisions are corrections or not, we can construct the classifier by learning the dataset. To construct this classifier, we build a dataset that includes a set of sentences which are revised more than once, from Wikipedia edit history. The labels indicate the degree of sentence corrections by editors. We develop a Web-based system for demonstrating our proposed approach. When we input texts, the system predicts which parts of the texts the editors should re-confirm the facts.