{"title":"基于多特征融合与GCN的中医辨证研究","authors":"Boting Liu, Weili Guan, Zhijie Fang","doi":"10.1117/12.2682399","DOIUrl":null,"url":null,"abstract":"Syndrome differentiation (SD) is a basic task in TCM (Traditional Chinese Medicine) diagnosis and treatment. TCM syndrome differentiation is very complex and time-consuming. Meanwhile, the accuracy of the results depends on the experience of TCM practitioners. To help TCM practitioners differentiate syndrome more quickly, we propose a syndrome differentiation method of deep learning based on multi-feature fusion. We extracted char, word and POS (Part of Speech) from TCM diagnosis and treatment records. The vector representation of char feature is obtained by ZY-BERT (Zhong Yi BERT), ZY-BERT was pre-trained on large datasets of TCM-SD (TCM Syndrome Differentiation). The vector representation of word and POS is obtained by Word2vec (Word to vector). We construct text graphs of char, word and POS according to context. GCN (Graph Convolutional Networks) is used to extract spatial structure information between multiple features to achieve multi-feature fusion. The experiment was carried out on TCM-SD. The experimental results showed that the accuracy of the proposed method was 81.52%, which was better than the comparison method. This method is helpful in the development of TCM modernization.","PeriodicalId":440430,"journal":{"name":"International Conference on Electronic Technology and Information Science","volume":"6 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on TCM syndrome differentiation based on multi-feature fusion and GCN\",\"authors\":\"Boting Liu, Weili Guan, Zhijie Fang\",\"doi\":\"10.1117/12.2682399\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Syndrome differentiation (SD) is a basic task in TCM (Traditional Chinese Medicine) diagnosis and treatment. TCM syndrome differentiation is very complex and time-consuming. Meanwhile, the accuracy of the results depends on the experience of TCM practitioners. To help TCM practitioners differentiate syndrome more quickly, we propose a syndrome differentiation method of deep learning based on multi-feature fusion. We extracted char, word and POS (Part of Speech) from TCM diagnosis and treatment records. The vector representation of char feature is obtained by ZY-BERT (Zhong Yi BERT), ZY-BERT was pre-trained on large datasets of TCM-SD (TCM Syndrome Differentiation). The vector representation of word and POS is obtained by Word2vec (Word to vector). We construct text graphs of char, word and POS according to context. GCN (Graph Convolutional Networks) is used to extract spatial structure information between multiple features to achieve multi-feature fusion. The experiment was carried out on TCM-SD. The experimental results showed that the accuracy of the proposed method was 81.52%, which was better than the comparison method. This method is helpful in the development of TCM modernization.\",\"PeriodicalId\":440430,\"journal\":{\"name\":\"International Conference on Electronic Technology and Information Science\",\"volume\":\"6 3\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Electronic Technology and Information Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2682399\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Electronic Technology and Information Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2682399","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
辨证论治是中医诊治的一项基本任务。中医辨证过程复杂,耗时长。同时,结果的准确性取决于中医从业人员的经验。为了帮助中医更快地辨证,我们提出了一种基于多特征融合的深度学习辨证方法。从中医诊疗记录中提取字符、单词和词类。通过ZY-BERT (Zhong Yi BERT)获得字符特征的向量表示,ZY-BERT在TCM- sd (TCM Syndrome Differentiation)大数据集上进行预训练。word和POS的向量表示由Word2vec (word to vector)获得。我们根据上下文构造char、word和POS的文本图。采用GCN(图卷积网络)提取多个特征之间的空间结构信息,实现多特征融合。实验采用中药- sd进行。实验结果表明,该方法的准确率为81.52%,优于对比方法。这种方法有助于中医现代化的发展。
Research on TCM syndrome differentiation based on multi-feature fusion and GCN
Syndrome differentiation (SD) is a basic task in TCM (Traditional Chinese Medicine) diagnosis and treatment. TCM syndrome differentiation is very complex and time-consuming. Meanwhile, the accuracy of the results depends on the experience of TCM practitioners. To help TCM practitioners differentiate syndrome more quickly, we propose a syndrome differentiation method of deep learning based on multi-feature fusion. We extracted char, word and POS (Part of Speech) from TCM diagnosis and treatment records. The vector representation of char feature is obtained by ZY-BERT (Zhong Yi BERT), ZY-BERT was pre-trained on large datasets of TCM-SD (TCM Syndrome Differentiation). The vector representation of word and POS is obtained by Word2vec (Word to vector). We construct text graphs of char, word and POS according to context. GCN (Graph Convolutional Networks) is used to extract spatial structure information between multiple features to achieve multi-feature fusion. The experiment was carried out on TCM-SD. The experimental results showed that the accuracy of the proposed method was 81.52%, which was better than the comparison method. This method is helpful in the development of TCM modernization.