An automated algorithm using free-text clinical notes to improve identification of transgender people.

IF 2.5 4区 医学 Q2 HEALTH CARE SCIENCES & SERVICES Informatics for Health & Social Care Pub Date : 2021-03-02 Epub Date: 2020-11-17 DOI:10.1080/17538157.2020.1828890
Fagen Xie, Darios Getahun, Virginia P Quinn, Theresa M Im, Richard Contreras, Michael J Silverberg, Tisha C Baird, Rebecca Nash, Lee Cromwell, Douglas Roblin, Trenton Hoffman, Michael Goodman
{"title":"An automated algorithm using free-text clinical notes to improve identification of transgender people.","authors":"Fagen Xie,&nbsp;Darios Getahun,&nbsp;Virginia P Quinn,&nbsp;Theresa M Im,&nbsp;Richard Contreras,&nbsp;Michael J Silverberg,&nbsp;Tisha C Baird,&nbsp;Rebecca Nash,&nbsp;Lee Cromwell,&nbsp;Douglas Roblin,&nbsp;Trenton Hoffman,&nbsp;Michael Goodman","doi":"10.1080/17538157.2020.1828890","DOIUrl":null,"url":null,"abstract":"<p><p>Accurate identification of transgender persons is a critical first step in conducting transgender health studies. To develop an automated algorithm for identifying transgender individuals from electronic medical records (EMR) using free-text clinical notes. The development and validation of the algorithm was based on data from an integrated healthcare system that served as a participating site in the multicenter Study of Transition Outcomes and Gender. The training and test datasets each contained a total of 300 individuals identified between 2006 and 2014. Both datasets underwent a full medical record review by experienced research abstractors. The validated algorithm was then implemented to identify transgender individuals in the EMR using all clinical notes of patients that received care between January 1, 2015 and June 30, 2018. Validation of the algorithm against the full chart review demonstrated a high degree of accuracy with 97% sensitivity, 95% specificity, 94% positive predictive value, and 97% negative predictive value. The algorithm classified 7,409 individuals (3.5%) as \"Definitely transgender\" and 679 individuals (0.3%) as \"Probably transgender\" out of 212,138 candidates with a total of 378,641 clinical notes. The computerized NLP algorithm can support essential efforts to improve the health of transgender people.</p>","PeriodicalId":54984,"journal":{"name":"Informatics for Health & Social Care","volume":"46 1","pages":"18-28"},"PeriodicalIF":2.5000,"publicationDate":"2021-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/17538157.2020.1828890","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics for Health & Social Care","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/17538157.2020.1828890","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/11/17 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 6

Abstract

Accurate identification of transgender persons is a critical first step in conducting transgender health studies. To develop an automated algorithm for identifying transgender individuals from electronic medical records (EMR) using free-text clinical notes. The development and validation of the algorithm was based on data from an integrated healthcare system that served as a participating site in the multicenter Study of Transition Outcomes and Gender. The training and test datasets each contained a total of 300 individuals identified between 2006 and 2014. Both datasets underwent a full medical record review by experienced research abstractors. The validated algorithm was then implemented to identify transgender individuals in the EMR using all clinical notes of patients that received care between January 1, 2015 and June 30, 2018. Validation of the algorithm against the full chart review demonstrated a high degree of accuracy with 97% sensitivity, 95% specificity, 94% positive predictive value, and 97% negative predictive value. The algorithm classified 7,409 individuals (3.5%) as "Definitely transgender" and 679 individuals (0.3%) as "Probably transgender" out of 212,138 candidates with a total of 378,641 clinical notes. The computerized NLP algorithm can support essential efforts to improve the health of transgender people.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一个使用自由文本临床记录的自动算法,以提高对变性人的识别。
准确识别跨性别者是开展跨性别健康研究的关键第一步。开发一种自动算法,用于使用自由文本临床记录从电子医疗记录(EMR)中识别跨性别者。该算法的开发和验证基于一个综合医疗保健系统的数据,该系统作为多中心过渡结果和性别研究的参与站点。训练和测试数据集各包含2006年至2014年间确定的300个个体。两个数据集都由经验丰富的研究摘要人员进行了完整的医疗记录审查。然后实施经过验证的算法,使用2015年1月1日至2018年6月30日期间接受治疗的患者的所有临床记录,在EMR中识别跨性别者。对整个图表的验证表明,该算法具有很高的准确性,灵敏度为97%,特异性为95%,阳性预测值为94%,阴性预测值为97%。该算法将212138名候选人中的7409人(3.5%)分类为“绝对跨性别者”,679人(0.3%)分类为“可能跨性别者”,总共有378641份临床记录。计算机化的NLP算法可以支持改善变性人健康的必要努力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.10
自引率
4.20%
发文量
21
审稿时长
>12 weeks
期刊介绍: Informatics for Health & Social Care promotes evidence-based informatics as applied to the domain of health and social care. It showcases informatics research and practice within the many and diverse contexts of care; it takes personal information, both its direct and indirect use, as its central focus. The scope of the Journal is broad, encompassing both the properties of care information and the life-cycle of associated information systems. Consideration of the properties of care information will necessarily include the data itself, its representation, structure, and associated processes, as well as the context of its use, highlighting the related communication, computational, cognitive, social and ethical aspects. Consideration of the life-cycle of care information systems includes full range from requirements, specifications, theoretical models and conceptual design through to sustainable implementations, and the valuation of impacts. Empirical evidence experiences related to implementation are particularly welcome. Informatics in Health & Social Care seeks to consolidate and add to the core knowledge within the disciplines of Health and Social Care Informatics. The Journal therefore welcomes scientific papers, case studies and literature reviews. Examples of novel approaches are particularly welcome. Articles might, for example, show how care data is collected and transformed into useful and usable information, how informatics research is translated into practice, how specific results can be generalised, or perhaps provide case studies that facilitate learning from experience.
期刊最新文献
Development and validation of the infodemic scale. Personalized medicine meets artificial intelligence: beyond “hype”, towards the metaverse Technological acceptance and features needed in mobile health apps development for people living with dementia and their caregivers in Indonesia Alzheimer’s in the modern age: Ethical challenges in the use of digital monitoring to identify cognitive changes Self-care intervention using mobile apps for sexual and reproductive health in the WHO Eastern Mediterranean Region.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1