Leveraging free-text diagnoses to identify patients with diabetes mellitus, obesity or dyslipidaemia - a cross-sectional study in a large Swiss primary care database.

IF 1.9 4区 医学 Q2 MEDICINE, GENERAL & INTERNAL Swiss medical weekly Pub Date : 2025-02-13 DOI:10.57187/s.3360
Donika Balaj, Jakob M Burgstaller, Audrey Wallnöfer, Katja Weiss, Oliver Senn, Thomas Rosemann, Thomas Grischott, Stefan Markun
{"title":"Leveraging free-text diagnoses to identify patients with diabetes mellitus, obesity or dyslipidaemia - a cross-sectional study in a large Swiss primary care database.","authors":"Donika Balaj, Jakob M Burgstaller, Audrey Wallnöfer, Katja Weiss, Oliver Senn, Thomas Rosemann, Thomas Grischott, Stefan Markun","doi":"10.57187/s.3360","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Electronic medical records (EMRs) in general practice provide various methods for identifying patients with specific diagnoses. While several studies have focused on case identification via structured EMR components, diagnoses in general practice are frequently documented as unstructured free-text entries, making their use for research challenging. Furthermore, diagnoses may remain undocumented even when evidence of the underlying disease exists within structured EMR data.</p><p><strong>Objective: </strong>This study aimed to quantify the extent to which free-text diagnoses contribute to identifying additional cases of diabetes mellitus, obesity and dyslipidaemia (target diseases) and assess the cases missed when relying exclusively on free-text entries.</p><p><strong>Methods: </strong>This cross-sectional study utilised EMR data from all consultations up to 2019 for 6,000 patients across 10 general practices in Switzerland. Diagnoses documented in a free-text entry field for diagnoses were manually coded for target diseases. Cases were defined as patients with a corresponding coded free-text diagnosis or meeting predefined criteria in structured EMR components (medication data or clinical and laboratory parameters). For each target disease, prevalence was calculated along with the proportion of cases identified exclusively via free-text diagnoses and the proportion missed when using free-text diagnoses alone.</p><p><strong>Results: </strong>The prevalence estimates for diabetes mellitus, obesity and dyslipidaemia were 8.8%, 16.2% and 38.9%, respectively. Few cases relied exclusively on free-text diagnoses for identification, but a substantial proportion of cases were missed when relying solely on free-text diagnoses, particularly for obesity (19.5% exclusively identified; 50.7% missed) and dyslipidaemia (8.7% exclusively identified; 53.3% missed).</p><p><strong>Conclusion: </strong>Free-text diagnoses were of limited utility for case identification of diabetes mellitus, obesity or dyslipidaemia, suggesting that manual coding of free-text diagnoses may not always be justified. Relying solely on free-text diagnoses for case identification is not recommended, as substantial proportions of cases may remain undetected, leading to biased prevalence estimates.</p>","PeriodicalId":22111,"journal":{"name":"Swiss medical weekly","volume":"155 ","pages":"3360"},"PeriodicalIF":1.9000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Swiss medical weekly","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.57187/s.3360","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Electronic medical records (EMRs) in general practice provide various methods for identifying patients with specific diagnoses. While several studies have focused on case identification via structured EMR components, diagnoses in general practice are frequently documented as unstructured free-text entries, making their use for research challenging. Furthermore, diagnoses may remain undocumented even when evidence of the underlying disease exists within structured EMR data.

Objective: This study aimed to quantify the extent to which free-text diagnoses contribute to identifying additional cases of diabetes mellitus, obesity and dyslipidaemia (target diseases) and assess the cases missed when relying exclusively on free-text entries.

Methods: This cross-sectional study utilised EMR data from all consultations up to 2019 for 6,000 patients across 10 general practices in Switzerland. Diagnoses documented in a free-text entry field for diagnoses were manually coded for target diseases. Cases were defined as patients with a corresponding coded free-text diagnosis or meeting predefined criteria in structured EMR components (medication data or clinical and laboratory parameters). For each target disease, prevalence was calculated along with the proportion of cases identified exclusively via free-text diagnoses and the proportion missed when using free-text diagnoses alone.

Results: The prevalence estimates for diabetes mellitus, obesity and dyslipidaemia were 8.8%, 16.2% and 38.9%, respectively. Few cases relied exclusively on free-text diagnoses for identification, but a substantial proportion of cases were missed when relying solely on free-text diagnoses, particularly for obesity (19.5% exclusively identified; 50.7% missed) and dyslipidaemia (8.7% exclusively identified; 53.3% missed).

Conclusion: Free-text diagnoses were of limited utility for case identification of diabetes mellitus, obesity or dyslipidaemia, suggesting that manual coding of free-text diagnoses may not always be justified. Relying solely on free-text diagnoses for case identification is not recommended, as substantial proportions of cases may remain undetected, leading to biased prevalence estimates.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用自由文本诊断识别糖尿病、肥胖症或血脂异常患者--一项在瑞士大型初级保健数据库中进行的横断面研究。
背景:电子病历(EMRs)在全科实践中提供了多种方法来识别具有特定诊断的患者。虽然有几项研究侧重于通过结构化电子病历组件进行病例识别,但在一般实践中,诊断经常被记录为非结构化的自由文本条目,这使得它们在研究中的使用具有挑战性。此外,即使在结构化电子病历数据中存在潜在疾病的证据,诊断也可能没有记录。目的:本研究旨在量化自由文本诊断在多大程度上有助于识别糖尿病、肥胖和血脂异常(目标疾病)的额外病例,并评估完全依赖自由文本条目时遗漏的病例。方法:这项横断面研究利用了截至2019年瑞士10个全科诊所6000名患者的所有咨询的电子病历数据。在诊断的自由文本输入字段中记录的诊断是针对目标疾病手动编码的。病例定义为具有相应的编码自由文本诊断或满足结构化电子病历组件(药物数据或临床和实验室参数)中预定义标准的患者。对于每种目标疾病,患病率与仅通过自由文本诊断确定的病例比例以及单独使用自由文本诊断时遗漏的比例一起计算。结果:糖尿病患病率为8.8%,肥胖患病率为16.2%,血脂异常患病率为38.9%。很少有病例完全依赖于自由文本诊断进行识别,但当完全依赖于自由文本诊断时,有相当大比例的病例被遗漏,特别是肥胖(19.5%完全被识别;50.7%未检出)和血脂异常(8.7%完全确诊;错过了53.3%)。结论:自由文本诊断在糖尿病、肥胖或血脂异常的病例识别中应用有限,提示手工编码自由文本诊断可能并不总是合理的。不建议仅仅依靠自由文本诊断来确定病例,因为相当大比例的病例可能仍未被发现,导致患病率估计有偏差。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Swiss medical weekly
Swiss medical weekly 医学-医学:内科
CiteScore
5.00
自引率
0.00%
发文量
0
审稿时长
3-8 weeks
期刊介绍: The Swiss Medical Weekly accepts for consideration original and review articles from all fields of medicine. The quality of SMW publications is guaranteed by a consistent policy of rigorous single-blind peer review. All editorial decisions are made by research-active academics.
期刊最新文献
Supplementum 295: Abstracts of the 48th IHF World Hospital Congress (Geneva, November 10-13, 2025). Effectiveness of community mental health nursing on psychiatric hospitalisations in Switzerland: a mirror-image analysis. Passive RSV immunisation using nirsevimab in neonatal care: a structured multidisciplinary approach and immunisation data from a Swiss tertiary centre. Limitation of therapeutic efforts in internal medicine: a retrospective analysis of evolution and determinants (2013-2023) in a Swiss university hospital. Monitoring of antimicrobial stewardship implementation in Swiss hospitals: results from a national survey.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1