Yuqi Yang, Guangzhi Zhang, R. Bie, Sungjoong Kim, Dongil Shin
{"title":"跨语言医学术语比对关键技术","authors":"Yuqi Yang, Guangzhi Zhang, R. Bie, Sungjoong Kim, Dongil Shin","doi":"10.1109/IIKI.2016.26","DOIUrl":null,"url":null,"abstract":"Health is closely related to everyone. Integrating different medical data sets will bring tremendous value for human. Basing on Chinese and English disease medical term, we use text mining technique in terms of two dimensions of the disease from the name and text description of the semantic clustering to achieve initial alignment disease terminology. First, we translate the Chinese data set through the API translation. Then we assign weights for each feature item to obtain feature vector for each disease node disease. Finally, we calculate the similarity of diseases and K-means clustering. We conduct experiments to evaluate the method on real-world and authoritative dataset, and the results prove that it has better rationality and superiority. The method can be extended to the initial alignment of multilingual texts with the same concept after improving.","PeriodicalId":371106,"journal":{"name":"2016 International Conference on Identification, Information and Knowledge in the Internet of Things (IIKI)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Key Techniques of Cross-Language Medical Term Alignment\",\"authors\":\"Yuqi Yang, Guangzhi Zhang, R. Bie, Sungjoong Kim, Dongil Shin\",\"doi\":\"10.1109/IIKI.2016.26\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Health is closely related to everyone. Integrating different medical data sets will bring tremendous value for human. Basing on Chinese and English disease medical term, we use text mining technique in terms of two dimensions of the disease from the name and text description of the semantic clustering to achieve initial alignment disease terminology. First, we translate the Chinese data set through the API translation. Then we assign weights for each feature item to obtain feature vector for each disease node disease. Finally, we calculate the similarity of diseases and K-means clustering. We conduct experiments to evaluate the method on real-world and authoritative dataset, and the results prove that it has better rationality and superiority. The method can be extended to the initial alignment of multilingual texts with the same concept after improving.\",\"PeriodicalId\":371106,\"journal\":{\"name\":\"2016 International Conference on Identification, Information and Knowledge in the Internet of Things (IIKI)\",\"volume\":\"116 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Identification, Information and Knowledge in the Internet of Things (IIKI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IIKI.2016.26\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Identification, Information and Knowledge in the Internet of Things (IIKI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIKI.2016.26","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Key Techniques of Cross-Language Medical Term Alignment
Health is closely related to everyone. Integrating different medical data sets will bring tremendous value for human. Basing on Chinese and English disease medical term, we use text mining technique in terms of two dimensions of the disease from the name and text description of the semantic clustering to achieve initial alignment disease terminology. First, we translate the Chinese data set through the API translation. Then we assign weights for each feature item to obtain feature vector for each disease node disease. Finally, we calculate the similarity of diseases and K-means clustering. We conduct experiments to evaluate the method on real-world and authoritative dataset, and the results prove that it has better rationality and superiority. The method can be extended to the initial alignment of multilingual texts with the same concept after improving.