{"title":"Inclusion-Exclusion Knowledge Filtering Approach for Conversation-Based Preliminary Diagnosis","authors":"Binghong Chen, Jenhui Chen","doi":"10.1109/ECBIOS57802.2023.10218646","DOIUrl":null,"url":null,"abstract":"Using natural language processing (NLP) techniques, we conducted a preliminary diagnosis of the disease from the patient syndrome description. Because patients are not medical professionals, they cannot accurately describe all symptoms. To solve this issue, we build a medical knowledge graph (KG) by constructing symptom-disease relation triples for pre-processing the patient syndrome description. According to the medical KG, the descriptions were reconstructed into KG embedding representation. To avoid the knowledge noise issue, we investigate an inclusion-exclusion knowledge filtering approach (IKFA) for symptom-to-disease triples to load them to a pretrained language model (PLM), i.e., bidirectional encoder representations from Transformers (BERT). To train the IKFA, we built a medical diagnosis question-answer dataset (MDQA dataset), which contains large-scale and high-quality questions (patient symptom description) and answers (diagnosis) (Q&A) corpus with 1.63 million entries in the size of 213 MB. The KG was built based on 8,731 diseases with detailed syndrome descriptions in the size of 1.98 MB. The experimental results showed that the IKFA preliminarily diagnosed 8,731 different diseases based on the patient's initial symptom description with an accuracy of 0.9894.","PeriodicalId":334600,"journal":{"name":"2023 IEEE 5th Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability (ECBIOS)","volume":"268 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 5th Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability (ECBIOS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECBIOS57802.2023.10218646","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Using natural language processing (NLP) techniques, we conducted a preliminary diagnosis of the disease from the patient syndrome description. Because patients are not medical professionals, they cannot accurately describe all symptoms. To solve this issue, we build a medical knowledge graph (KG) by constructing symptom-disease relation triples for pre-processing the patient syndrome description. According to the medical KG, the descriptions were reconstructed into KG embedding representation. To avoid the knowledge noise issue, we investigate an inclusion-exclusion knowledge filtering approach (IKFA) for symptom-to-disease triples to load them to a pretrained language model (PLM), i.e., bidirectional encoder representations from Transformers (BERT). To train the IKFA, we built a medical diagnosis question-answer dataset (MDQA dataset), which contains large-scale and high-quality questions (patient symptom description) and answers (diagnosis) (Q&A) corpus with 1.63 million entries in the size of 213 MB. The KG was built based on 8,731 diseases with detailed syndrome descriptions in the size of 1.98 MB. The experimental results showed that the IKFA preliminarily diagnosed 8,731 different diseases based on the patient's initial symptom description with an accuracy of 0.9894.