Author: G. Angelova
Venue: International Conference on Computer Systems and Technologies
Published: 2013-06-28
DOI: https://doi.org/10.1145/2516775.2516777
Automatic information extraction from patient records in Bulgarian language
Natural Language Processing (NLP) has been viewed as a promising technology in medical informatics for decades. Despite the gradually improving quality of automatic text analysis, however, clinical NLP systems are still rarely used outside research labs, for the following reasons: (i) their development is very expensive, so most of them remain prototypes or proof-of-concept demonstrators; (ii) real-world deployment of NLP modules would require constant maintenance of the underlying linguistic resources and tuning of the systems to new text types; (iii) the technology is potentially highly accurate, but some results might be erroneous and misleading [1]. On the other hand, the rapid adoption of Electronic Health Records worldwide implies constant growth in the volume of electronic narratives containing patient-related information. In established medical practice, the most important findings about patients are still kept as free text in various documents and languages. As a result, so-called Information Extraction (IE) has become the dominant language technology currently applied to biomedical texts. The main idea is to automatically extract important entities, with accuracy as high as possible, and to operate on these entities while skipping the remaining text fragments. IE is based on shallow analysis only, but even progress in partial text understanding is expected to enable radical improvements in clinical decision support, biomedical research, and healthcare in general.