Combining deep learning and fuzzy logic to predict rare ICD-10 codes from clinical notes
T. Chomutare, A. Budrionis, H. Dalianis
2022 IEEE International Conference on Digital Health (ICDH), published 2022-07-01
DOI: 10.1109/ICDH55609.2022.00033 (https://doi.org/10.1109/ICDH55609.2022.00033)
Computer-assisted coding (CAC) of clinical text into standardized classifications such as ICD-10 remains an important challenge. Deep learning approaches have been quite successful for frequently used ICD-10 codes, but the problem is still unsolved for rare codes. To improve performance on rare codes, we propose a pipeline that exploits the ICD-10 code hierarchy to combine the semantic capabilities of deep learning with the flexibility of fuzzy logic. The data are Swedish discharge summaries from the medical speciality of gastrointestinal diseases. With our pipeline, fuzzy matching computation time is reduced and the accuracy of the top 10 hits for rare codes is improved. While the method is promising, further work is required before the pipeline can be part of a usable prototype. Code repository: https://github.com/icd-coding/zeroshot.
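
The abstract does not specify which level of the ICD-10 hierarchy is used or which fuzzy matching algorithm is applied, so the following is only a minimal sketch of the general idea under stated assumptions: a first stage narrows the search to one three-character category, and fuzzy string matching then ranks only the codes inside that category. The code table, the keyword-based `predict_category` stand-in for the deep learning model, and the use of `difflib.SequenceMatcher` as the similarity measure are all illustrative choices, not taken from the paper or its repository.

```python
from difflib import SequenceMatcher

# Hypothetical toy code table (illustrative English paraphrases, not the
# Swedish data or the code list used in the paper).
ICD10_DESCRIPTIONS = {
    "K25.0": "gastric ulcer, acute with haemorrhage",
    "K25.4": "gastric ulcer, chronic or unspecified with haemorrhage",
    "K26.0": "duodenal ulcer, acute with haemorrhage",
    "K50.0": "crohn disease of small intestine",
    "K50.1": "crohn disease of large intestine",
    "K51.0": "ulcerative pancolitis",
}


def predict_category(note: str) -> str:
    """Stand-in for the deep learning stage (assumption).

    A real system would use a trained classifier here; simple keyword
    rules keep the sketch runnable without any model.
    """
    text = note.lower()
    if "crohn" in text:
        return "K50"
    if "duodenal" in text:
        return "K26"
    if "colitis" in text:
        return "K51"
    return "K25"


def fuzzy_rank(note: str, candidates: dict, k: int = 10) -> list:
    """Rank candidate codes by string similarity between note and description."""
    scored = [
        (SequenceMatcher(None, note.lower(), desc.lower()).ratio(), code)
        for code, desc in candidates.items()
    ]
    # Highest similarity first; ties broken by code for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [code for _, code in scored[:k]]


def suggest_codes(note: str, k: int = 10) -> list:
    """Restrict fuzzy matching to the predicted category, then return top-k codes."""
    category = predict_category(note)
    candidates = {
        code: desc
        for code, desc in ICD10_DESCRIPTIONS.items()
        if code.startswith(category)
    }
    return fuzzy_rank(note, candidates, k)


if __name__ == "__main__":
    print(suggest_codes("Patient with Crohn disease of the small intestine"))
```

Because the fuzzy stage only compares the note against codes in the predicted category, the number of string comparisons shrinks with the size of that category, which is one plausible reading of why the hierarchy reduces fuzzy matching time. Rare codes can still surface in the top-k list as long as the first stage picks the right category, even if the model never saw them during training.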