Daniel Sánchez-Cisneros, Paloma Martínez, Isabel Segura-Bedmar
{"title":"结合字典和本体论在生物医学文本中的药物名称识别","authors":"Daniel Sánchez-Cisneros, Paloma Martínez, Isabel Segura-Bedmar","doi":"10.1145/2512089.2512100","DOIUrl":null,"url":null,"abstract":"Two approaches have been commonly used for recognizing Drug Name Entities in biomedical texts: machine learning-based and domain specific resources-based approaches. In this work we focus on the second one by combining (1) a dictionary-based approach that collects terms from different pharmacological data sources such as DrugBank, MeSH, RxNorm and ATC index; and (2) an ontology-based approach that maps each text unit of a source text into one or more domain-specific concepts, providing rich semantic knowledge of domain name entities using Metamap and Mgrep analyzer. The aim is to take advantage of the best of each resource used. The combined system obtains an F1 measure of 0, 667 over exact matching span evaluation.","PeriodicalId":143937,"journal":{"name":"Data and Text Mining in Bioinformatics","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"Combining dictionaries and ontologies for drug name recognition in biomedical texts\",\"authors\":\"Daniel Sánchez-Cisneros, Paloma Martínez, Isabel Segura-Bedmar\",\"doi\":\"10.1145/2512089.2512100\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Two approaches have been commonly used for recognizing Drug Name Entities in biomedical texts: machine learning-based and domain specific resources-based approaches. In this work we focus on the second one by combining (1) a dictionary-based approach that collects terms from different pharmacological data sources such as DrugBank, MeSH, RxNorm and ATC index; and (2) an ontology-based approach that maps each text unit of a source text into one or more domain-specific concepts, providing rich semantic knowledge of domain name entities using Metamap and Mgrep analyzer. The aim is to take advantage of the best of each resource used. The combined system obtains an F1 measure of 0, 667 over exact matching span evaluation.\",\"PeriodicalId\":143937,\"journal\":{\"name\":\"Data and Text Mining in Bioinformatics\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data and Text Mining in Bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2512089.2512100\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data and Text Mining in Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2512089.2512100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Combining dictionaries and ontologies for drug name recognition in biomedical texts
Two approaches have been commonly used for recognizing Drug Name Entities in biomedical texts: machine learning-based and domain specific resources-based approaches. In this work we focus on the second one by combining (1) a dictionary-based approach that collects terms from different pharmacological data sources such as DrugBank, MeSH, RxNorm and ATC index; and (2) an ontology-based approach that maps each text unit of a source text into one or more domain-specific concepts, providing rich semantic knowledge of domain name entities using Metamap and Mgrep analyzer. The aim is to take advantage of the best of each resource used. The combined system obtains an F1 measure of 0, 667 over exact matching span evaluation.