{"title":"一种改进的基于词向量的病案症状提取方法","authors":"Zhongmin Liu, Zhiming Luo, Jiajun Xu, Shaozi Li","doi":"10.1109/ITME53901.2021.00082","DOIUrl":null,"url":null,"abstract":"Extracting and standardizing symptoms from traditional Chinese medical records plays an important role in intelligent diagnosis. Recently, abundant word vector models have been developed and used in natural language processing tasks due to their powerful performance. However, simply using a word vector model as core to analysis text is hard to satisfy both time and precision requirements. To improve this situation, we introduce an improved word vector-based symptom extraction method for traditional Chinese medicine which can extract and standardize symptoms in original medical texts written in Chinese. We design this method into three parts, Word Segmentation, Word Vector Generation, and Term Substitution. Experimental results on our dataset show that our method has a good effect in extracting medical symptoms and discarding redundant words. Compared to other baseline models of word vector representation, our method performs well in general performance of efficiency and accuracy.","PeriodicalId":6774,"journal":{"name":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","volume":"17 1","pages":"379-384"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An Improved Word Vector-Based Symptom Extraction Method for Traditional Chinese Medical Record Analysis\",\"authors\":\"Zhongmin Liu, Zhiming Luo, Jiajun Xu, Shaozi Li\",\"doi\":\"10.1109/ITME53901.2021.00082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Extracting and standardizing symptoms from traditional Chinese medical records plays an important role in intelligent diagnosis. Recently, abundant word vector models have been developed and used in natural language processing tasks due to their powerful performance. However, simply using a word vector model as core to analysis text is hard to satisfy both time and precision requirements. To improve this situation, we introduce an improved word vector-based symptom extraction method for traditional Chinese medicine which can extract and standardize symptoms in original medical texts written in Chinese. We design this method into three parts, Word Segmentation, Word Vector Generation, and Term Substitution. Experimental results on our dataset show that our method has a good effect in extracting medical symptoms and discarding redundant words. Compared to other baseline models of word vector representation, our method performs well in general performance of efficiency and accuracy.\",\"PeriodicalId\":6774,\"journal\":{\"name\":\"2021 11th International Conference on Information Technology in Medicine and Education (ITME)\",\"volume\":\"17 1\",\"pages\":\"379-384\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 11th International Conference on Information Technology in Medicine and Education (ITME)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITME53901.2021.00082\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITME53901.2021.00082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Improved Word Vector-Based Symptom Extraction Method for Traditional Chinese Medical Record Analysis
Extracting and standardizing symptoms from traditional Chinese medical records plays an important role in intelligent diagnosis. Recently, abundant word vector models have been developed and used in natural language processing tasks due to their powerful performance. However, simply using a word vector model as core to analysis text is hard to satisfy both time and precision requirements. To improve this situation, we introduce an improved word vector-based symptom extraction method for traditional Chinese medicine which can extract and standardize symptoms in original medical texts written in Chinese. We design this method into three parts, Word Segmentation, Word Vector Generation, and Term Substitution. Experimental results on our dataset show that our method has a good effect in extracting medical symptoms and discarding redundant words. Compared to other baseline models of word vector representation, our method performs well in general performance of efficiency and accuracy.