{"title":"基于知识的生物医学词义消歧:评价及其在临床文献分类中的应用","authors":"Vijay Garla, C. Brandt","doi":"10.1109/HISB.2012.12","DOIUrl":null,"url":null,"abstract":"Motivation: Word Sense Disambiguation (WSD) methods automatically assign an unambiguous concept to an ambiguous term based on context, and are important to many text processing tasks. In this study, we developed and evaluated a knowledge-based WSD method that uses semantic similarity measures derived from the Unified Medical Language System (UMLS), and we evaluated the contribution of WSD to clinical text classification. Results: We evaluated our system on biomedical WSD datasets; our system compares favorably to other knowledge-based methods. We evaluated the contribution of our WSD system to clinical document classification on the 2007 Computational Medicine Challenge corpus. Machine learning classifiers trained on disambiguated concepts significantly outperformed those trained using all concepts. Availability: We integrated our WSD system with MetaMap and cTAKES, two popular biomedical natural language processing systems. We released all code required to reproduce our results and all tools developed as part of this study as open source, available under http://code.google.com/p/ytex.","PeriodicalId":375089,"journal":{"name":"2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"44","resultStr":"{\"title\":\"Knowledge-Based Biomedical Word Sense Disambiguation: An Evaluation and Application to Clinical Document Classification\",\"authors\":\"Vijay Garla, C. Brandt\",\"doi\":\"10.1109/HISB.2012.12\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Motivation: Word Sense Disambiguation (WSD) methods automatically assign an unambiguous concept to an ambiguous term based on context, and are important to many text processing tasks. In this study, we developed and evaluated a knowledge-based WSD method that uses semantic similarity measures derived from the Unified Medical Language System (UMLS), and we evaluated the contribution of WSD to clinical text classification. Results: We evaluated our system on biomedical WSD datasets; our system compares favorably to other knowledge-based methods. We evaluated the contribution of our WSD system to clinical document classification on the 2007 Computational Medicine Challenge corpus. Machine learning classifiers trained on disambiguated concepts significantly outperformed those trained using all concepts. Availability: We integrated our WSD system with MetaMap and cTAKES, two popular biomedical natural language processing systems. We released all code required to reproduce our results and all tools developed as part of this study as open source, available under http://code.google.com/p/ytex.\",\"PeriodicalId\":375089,\"journal\":{\"name\":\"2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"44\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HISB.2012.12\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HISB.2012.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Knowledge-Based Biomedical Word Sense Disambiguation: An Evaluation and Application to Clinical Document Classification
Motivation: Word Sense Disambiguation (WSD) methods automatically assign an unambiguous concept to an ambiguous term based on context, and are important to many text processing tasks. In this study, we developed and evaluated a knowledge-based WSD method that uses semantic similarity measures derived from the Unified Medical Language System (UMLS), and we evaluated the contribution of WSD to clinical text classification. Results: We evaluated our system on biomedical WSD datasets; our system compares favorably to other knowledge-based methods. We evaluated the contribution of our WSD system to clinical document classification on the 2007 Computational Medicine Challenge corpus. Machine learning classifiers trained on disambiguated concepts significantly outperformed those trained using all concepts. Availability: We integrated our WSD system with MetaMap and cTAKES, two popular biomedical natural language processing systems. We released all code required to reproduce our results and all tools developed as part of this study as open source, available under http://code.google.com/p/ytex.