Arantza Casillas, Koldo Gojenola, Alicia Pérez, M. Oronoz
{"title":"有效提取药物过敏反应的临床文本挖掘","authors":"Arantza Casillas, Koldo Gojenola, Alicia Pérez, M. Oronoz","doi":"10.1109/BIBM.2016.7822651","DOIUrl":null,"url":null,"abstract":"This work focuses on the extraction of allergic drug reactions in electronic health records. The goal is to annotate a sub-class of cause-effect events, those in which drugs are causing allergies. Little work has carried out in this field, seldom for Spanish clinical text mining, which is, indeed, the aim of this work. We present two approaches: a rule-based method and another one based on machine learning. Both approaches incorporate semantic knowledge derived from FreeLing-Med, a software explicitly developed to parse texts in the medical domain. Having recognised the medical entities for a given record, the challenge stands on triggering the underlying allergies. To this end, the knowledge is expressed as a set of semantic, syntactic and structural features. Our best system, based on machine learning, obtained a precision of 0.90 with a recall of 0.87, outperforming a rule-based approach.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Clinical text mining for efficient extraction of drug-allergy reactions\",\"authors\":\"Arantza Casillas, Koldo Gojenola, Alicia Pérez, M. Oronoz\",\"doi\":\"10.1109/BIBM.2016.7822651\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work focuses on the extraction of allergic drug reactions in electronic health records. The goal is to annotate a sub-class of cause-effect events, those in which drugs are causing allergies. Little work has carried out in this field, seldom for Spanish clinical text mining, which is, indeed, the aim of this work. We present two approaches: a rule-based method and another one based on machine learning. Both approaches incorporate semantic knowledge derived from FreeLing-Med, a software explicitly developed to parse texts in the medical domain. Having recognised the medical entities for a given record, the challenge stands on triggering the underlying allergies. To this end, the knowledge is expressed as a set of semantic, syntactic and structural features. Our best system, based on machine learning, obtained a precision of 0.90 with a recall of 0.87, outperforming a rule-based approach.\",\"PeriodicalId\":345384,\"journal\":{\"name\":\"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBM.2016.7822651\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2016.7822651","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Clinical text mining for efficient extraction of drug-allergy reactions
This work focuses on the extraction of allergic drug reactions in electronic health records. The goal is to annotate a sub-class of cause-effect events, those in which drugs are causing allergies. Little work has carried out in this field, seldom for Spanish clinical text mining, which is, indeed, the aim of this work. We present two approaches: a rule-based method and another one based on machine learning. Both approaches incorporate semantic knowledge derived from FreeLing-Med, a software explicitly developed to parse texts in the medical domain. Having recognised the medical entities for a given record, the challenge stands on triggering the underlying allergies. To this end, the knowledge is expressed as a set of semantic, syntactic and structural features. Our best system, based on machine learning, obtained a precision of 0.90 with a recall of 0.87, outperforming a rule-based approach.