Maximum entropy modeling for mining patient medication status from free text.

Proceedings. AMIA Symposium Pub Date : 2002-01-01
Serguei V Pakhomov, Alexander Ruggieri, Christopher G Chute
{"title":"Maximum entropy modeling for mining patient medication status from free text.","authors":"Serguei V Pakhomov,&nbsp;Alexander Ruggieri,&nbsp;Christopher G Chute","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Using a classification scheme of patient medication status we sought to recognize and categorize medications mentioned in the unrestricted text of clinical documents generated in clinical practice. The categories refer to the patient's status with respect to the medication such as discontinuation, start or initiation, and continuation of a given medication. This categorization is performed with a machine learning technique, Maximum Entropy (ME), that is well suited to incorporating heterogeneous sources of information necessary for classifying patient's medication status. We use hand labeled training data to generate ME models and test 5 different training feature sets. Our results show that the most optimal feature set includes a combination of the following: two words preceding and following the mention of the drug, the subject of the sentence in which the drug mention occurs, the 2 words following the subject, and a binary feature vector of lexicalized semantic cues indicative of medication status or its change. The average predictive power of a model trained on these features is approximately 89%.</p>","PeriodicalId":79712,"journal":{"name":"Proceedings. AMIA Symposium","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2002-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2244576/pdf/procamiasymp00001-0628.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. AMIA Symposium","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Using a classification scheme of patient medication status we sought to recognize and categorize medications mentioned in the unrestricted text of clinical documents generated in clinical practice. The categories refer to the patient's status with respect to the medication such as discontinuation, start or initiation, and continuation of a given medication. This categorization is performed with a machine learning technique, Maximum Entropy (ME), that is well suited to incorporating heterogeneous sources of information necessary for classifying patient's medication status. We use hand labeled training data to generate ME models and test 5 different training feature sets. Our results show that the most optimal feature set includes a combination of the following: two words preceding and following the mention of the drug, the subject of the sentence in which the drug mention occurs, the 2 words following the subject, and a binary feature vector of lexicalized semantic cues indicative of medication status or its change. The average predictive power of a model trained on these features is approximately 89%.

分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从自由文本中挖掘患者用药状态的最大熵模型。
使用患者用药状态的分类方案,我们试图识别和分类在临床实践中产生的临床文件的不受限制的文本中提到的药物。这些类别是指患者与药物有关的状态,例如停药,开始或开始,以及继续服用给定药物。这种分类是通过机器学习技术进行的,最大熵(ME),它非常适合合并对患者药物状态进行分类所需的异构信息源。我们使用手工标记的训练数据来生成ME模型,并测试了5个不同的训练特征集。我们的研究结果表明,最优特征集包括以下组合:提到药物的前后两个词,提到药物的句子的主语,主语后面的两个词,以及指示药物状态或其变化的词汇化语义线索的二进制特征向量。在这些特征上训练的模型的平均预测能力约为89%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Electronic Patient Record Medical informatics as a market for IS/IT Perceived Information Needs and Communication Difficulties of Inpatient Physicians and Nurses Disambiguation Data: Extracting Information from Anonymized Sources The Operating Room Charge Nurse: Coordinator and Communicator
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1