{"title":"基于规则的胸部x光报告问答数据集开发方法","authors":"Jie Wang, Hairong Lv, R. Jiang, Zhen Xie","doi":"10.1109/CBMS.2019.00016","DOIUrl":null,"url":null,"abstract":"Available and objective clinical documents are important for research of assistant diagnosis, development of algorithms, and education. To facilitate the readability and variability of clinical documents, this paper presents a rule-based approach to develop a question-answer dataset for chest X-rays from a public collection of radiology examinations, including both images and radiologist narrative reports. Our method simplified the complicated reports via hand-selected keywords, generated more than 63 thousand question-answer pairs via hand-written patterns, and augmented the question-answer dataset to more than 130 thousand pairs via rule-based question answering. To the best of our knowledge, this is the first generated question-answer dataset for chest X-rays by rule-based method. The dataset is promising for future researches and applications such as visual question answering, computer-aided diagnosis and so on.","PeriodicalId":311634,"journal":{"name":"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Rule-Based Method to Develop Question-Answer Dataset from Chest X-Ray Reports\",\"authors\":\"Jie Wang, Hairong Lv, R. Jiang, Zhen Xie\",\"doi\":\"10.1109/CBMS.2019.00016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Available and objective clinical documents are important for research of assistant diagnosis, development of algorithms, and education. To facilitate the readability and variability of clinical documents, this paper presents a rule-based approach to develop a question-answer dataset for chest X-rays from a public collection of radiology examinations, including both images and radiologist narrative reports. Our method simplified the complicated reports via hand-selected keywords, generated more than 63 thousand question-answer pairs via hand-written patterns, and augmented the question-answer dataset to more than 130 thousand pairs via rule-based question answering. To the best of our knowledge, this is the first generated question-answer dataset for chest X-rays by rule-based method. The dataset is promising for future researches and applications such as visual question answering, computer-aided diagnosis and so on.\",\"PeriodicalId\":311634,\"journal\":{\"name\":\"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CBMS.2019.00016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS.2019.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Rule-Based Method to Develop Question-Answer Dataset from Chest X-Ray Reports
Available and objective clinical documents are important for research of assistant diagnosis, development of algorithms, and education. To facilitate the readability and variability of clinical documents, this paper presents a rule-based approach to develop a question-answer dataset for chest X-rays from a public collection of radiology examinations, including both images and radiologist narrative reports. Our method simplified the complicated reports via hand-selected keywords, generated more than 63 thousand question-answer pairs via hand-written patterns, and augmented the question-answer dataset to more than 130 thousand pairs via rule-based question answering. To the best of our knowledge, this is the first generated question-answer dataset for chest X-rays by rule-based method. The dataset is promising for future researches and applications such as visual question answering, computer-aided diagnosis and so on.