首页 > 最新文献

Methods of Information in Medicine最新文献

英文 中文
A Semi-Automated Term Harmonization Pipeline Applied to Pulmonary Arterial Hypertension Clinical Trials. 半自动化术语协调管道在肺动脉高压临床试验中的应用。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-05-01 DOI: 10.1055/s-0041-1739361
Ryan J Urbanowicz, John H Holmes, Dina Appleby, Vanamala Narasimhan, Stephen Durborow, Nadine Al-Naamani, Melissa Fernando, Steven M Kawut

Objective: Data harmonization is essential to integrate individual participant data from multiple sites, time periods, and trials for meta-analysis. The process of mapping terms and phrases to an ontology is complicated by typographic errors, abbreviations, truncation, and plurality. We sought to harmonize medical history (MH) and adverse events (AE) term records across 21 randomized clinical trials in pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension.

Methods: We developed and applied a semi-automated harmonization pipeline for use with domain-expert annotators to resolve ambiguous term mappings using exact and fuzzy matching. We summarized MH and AE term mapping success, including map quality measures, and imputation of a generalizing term hierarchy as defined by the applied Medical Dictionary for Regulatory Activities (MedDRA) ontology standard.

Results: Over 99.6% of both MH (N = 37,105) and AE (N = 58,170) records were successfully mapped to MedDRA low-level terms. Automated exact matching accounted for 74.9% of MH and 85.5% of AE mappings. Term recommendations from fuzzy matching in the pipeline facilitated annotator mapping of the remaining 24.9% of MH and 13.8% of AE records. Imputation of the generalized MedDRA term hierarchy was unambiguous in 85.2% of high-level terms, 99.4% of high-level group terms, and 99.5% of system organ class in MH, and 75% of high-level terms, 98.3% of high-level group terms, and 98.4% of system organ class in AE.

Conclusion: This pipeline dramatically reduced the burden of manual annotation for MH and AE term harmonization and could be adapted to other data integration efforts.

目的:数据协调对于整合来自多个地点、时间段和试验的个体参与者数据进行meta分析至关重要。将术语和短语映射到本体的过程由于排版错误、缩写、截断和复数而变得复杂。我们试图协调21项随机临床试验中肺动脉高压和慢性血栓栓塞性肺动脉高压的病史(MH)和不良事件(AE)记录。方法:我们开发并应用了一个半自动的协调管道,用于与领域专家注释器一起使用精确和模糊匹配来解决模棱两可的术语映射。我们总结了MH和AE术语映射的成功,包括地图质量测量,以及根据应用医学词典监管活动(MedDRA)本体标准定义的广义术语层次的插入。结果:99.6%以上的MH (N = 37105)和AE (N = 58170)记录均成功映射到MedDRA低水平项。自动精确匹配占MH的74.9%和AE映射的85.5%。管道中模糊匹配的术语推荐有助于注释者对剩余24.9%的MH和13.8%的AE记录进行映射。广义MedDRA术语层次在MH中85.2%的高级术语、99.4%的高级组术语和99.5%的系统器官类别中是明确的,在AE中75%的高级术语、98.3%的高级组术语和98.4%的系统器官类别中是明确的。结论:该管道极大地减轻了手工标注MH和AE术语协调的负担,可以适应其他数据集成工作。
{"title":"A Semi-Automated Term Harmonization Pipeline Applied to Pulmonary Arterial Hypertension Clinical Trials.","authors":"Ryan J Urbanowicz,&nbsp;John H Holmes,&nbsp;Dina Appleby,&nbsp;Vanamala Narasimhan,&nbsp;Stephen Durborow,&nbsp;Nadine Al-Naamani,&nbsp;Melissa Fernando,&nbsp;Steven M Kawut","doi":"10.1055/s-0041-1739361","DOIUrl":"https://doi.org/10.1055/s-0041-1739361","url":null,"abstract":"<p><strong>Objective: </strong>Data harmonization is essential to integrate individual participant data from multiple sites, time periods, and trials for meta-analysis. The process of mapping terms and phrases to an ontology is complicated by typographic errors, abbreviations, truncation, and plurality. We sought to harmonize medical history (MH) and adverse events (AE) term records across 21 randomized clinical trials in pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension.</p><p><strong>Methods: </strong>We developed and applied a semi-automated harmonization pipeline for use with domain-expert annotators to resolve ambiguous term mappings using exact and fuzzy matching. We summarized MH and AE term mapping success, including map quality measures, and imputation of a generalizing term hierarchy as defined by the applied Medical Dictionary for Regulatory Activities (MedDRA) ontology standard.</p><p><strong>Results: </strong>Over 99.6% of both MH (<i>N</i> = 37,105) and AE (<i>N</i> = 58,170) records were successfully mapped to MedDRA low-level terms. Automated exact matching accounted for 74.9% of MH and 85.5% of AE mappings. Term recommendations from fuzzy matching in the pipeline facilitated annotator mapping of the remaining 24.9% of MH and 13.8% of AE records. Imputation of the generalized MedDRA term hierarchy was unambiguous in 85.2% of high-level terms, 99.4% of high-level group terms, and 99.5% of system organ class in MH, and 75% of high-level terms, 98.3% of high-level group terms, and 98.4% of system organ class in AE.</p><p><strong>Conclusion: </strong>This pipeline dramatically reduced the burden of manual annotation for MH and AE term harmonization and could be adapted to other data integration efforts.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 1-02","pages":"3-10"},"PeriodicalIF":1.7,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9978994/pdf/nihms-1873072.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10276427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Security and Privacy in Distributed Health Care Environments. 分布式医疗保健环境中的安全和隐私。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-05-01 DOI: 10.1055/a-1768-2966
Stephen V Flowerday, Christos Xenakis
N.A.
{"title":"Security and Privacy in Distributed Health Care Environments.","authors":"Stephen V Flowerday,&nbsp;Christos Xenakis","doi":"10.1055/a-1768-2966","DOIUrl":"https://doi.org/10.1055/a-1768-2966","url":null,"abstract":"N.A.","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 1-02","pages":"1-2"},"PeriodicalIF":1.7,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9608470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Automated Identification of Immunocompromised Status in Critically Ill Children. 危重儿童免疫受损状态的自动识别。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-04-05 DOI: 10.1055/a-1817-7208
Swaminathan Kandaswamy, Evan W. Orenstein, Elizabeth Quincer, A. Fernandez, Mark D. Gonzalez, LY Lu, R. Kamaleswaran, I. Banerjee, P. Jaggi
BACKGROUNDEasy identification of immunocompromised hosts (ICH) would allow for stratification of culture results based on host type.METHODSWe utilized antimicrobial stewardship (ASP) team notes written during handshake stewardship rounds in the pediatric intensive care unit as the gold standard for host status; clinical notes from the primary team, medication orders during the encounter, problem list and billing diagnoses documented prior to the ASP documentation were extracted to develop models that predict host status. We calculated performance for three models based on diagnoses/medications, with and without natural language processing from clinical notes. The susceptibility of pathogens causing bacteremia to commonly used empiric antibiotic regimens was then stratified by host status.RESULTSWe identified 844 antimicrobial episodes from 666 unique patients; 160 (18.9%) were identified as an ICH. We randomly selected 675 initiations (80%) for model training and 169 initiations (20%) for testing. A rule-based model using diagnoses and medications alone yielded sensitivity of 0.87 (08.6-0.88), specificity of 0.93 (0.92-0.93), and positive predictive value (PPV) of 0.74 (0.73-0.75). Adding clinical notes into XGBoost model led to improved specificity of 0.98 (0.98 - 0.98) and PPV of 0.9 (0.88 - 0.91), but with decreased sensitivity 0.77 (0.76 - 0.79). There were 77 bacteremia episodes during the study period identified and a host specific visualization was created.CONCLUSIONSAn EHR phenotype based on notes, diagnoses and medications identifies ICH in the PICU with high specificity.
背景免疫功能低下宿主(ICH)的简单鉴定将允许根据宿主类型对培养结果进行分层。方法我们利用儿科重症监护室握手管理期间写的抗菌管理(ASP)团队笔记作为宿主状态的金标准;提取初级团队的临床笔记、就诊期间的医嘱、ASP文档之前记录的问题列表和账单诊断,以开发预测宿主状态的模型。我们根据诊断/药物计算了三个模型的性能,包括是否从临床笔记中进行自然语言处理。然后根据宿主状态对引起菌血症的病原体对常用经验性抗生素方案的易感性进行分层。结果我们从666名独特的患者中鉴定出844例抗微生物事件;160例(18.9%)被鉴定为脑出血。我们随机选择675个初始(80%)进行模型训练,169个初始(20%)进行测试。仅使用诊断和药物的基于规则的模型的敏感性为0.87(08.6-0.88),特异性为0.93(0.92-0.93),阳性预测值(PPV)为0.74(0.73-0.75)。在XGBoost模型中添加临床注释可提高特异性0.98(0.98-0.98)和PPV 0.9(0.88-0.91),但敏感性降低0.77(0.76-0.79)。在研究期间发现了77次菌血症发作,并创建了宿主特异性可视化。结论基于注释、诊断和药物的EHR表型以高特异性鉴定PICU中的ICH。
{"title":"Automated Identification of Immunocompromised Status in Critically Ill Children.","authors":"Swaminathan Kandaswamy, Evan W. Orenstein, Elizabeth Quincer, A. Fernandez, Mark D. Gonzalez, LY Lu, R. Kamaleswaran, I. Banerjee, P. Jaggi","doi":"10.1055/a-1817-7208","DOIUrl":"https://doi.org/10.1055/a-1817-7208","url":null,"abstract":"BACKGROUND\u0000Easy identification of immunocompromised hosts (ICH) would allow for stratification of culture results based on host type.\u0000\u0000\u0000METHODS\u0000We utilized antimicrobial stewardship (ASP) team notes written during handshake stewardship rounds in the pediatric intensive care unit as the gold standard for host status; clinical notes from the primary team, medication orders during the encounter, problem list and billing diagnoses documented prior to the ASP documentation were extracted to develop models that predict host status. We calculated performance for three models based on diagnoses/medications, with and without natural language processing from clinical notes. The susceptibility of pathogens causing bacteremia to commonly used empiric antibiotic regimens was then stratified by host status.\u0000\u0000\u0000RESULTS\u0000We identified 844 antimicrobial episodes from 666 unique patients; 160 (18.9%) were identified as an ICH. We randomly selected 675 initiations (80%) for model training and 169 initiations (20%) for testing. A rule-based model using diagnoses and medications alone yielded sensitivity of 0.87 (08.6-0.88), specificity of 0.93 (0.92-0.93), and positive predictive value (PPV) of 0.74 (0.73-0.75). Adding clinical notes into XGBoost model led to improved specificity of 0.98 (0.98 - 0.98) and PPV of 0.9 (0.88 - 0.91), but with decreased sensitivity 0.77 (0.76 - 0.79). There were 77 bacteremia episodes during the study period identified and a host specific visualization was created.\u0000\u0000\u0000CONCLUSIONS\u0000An EHR phenotype based on notes, diagnoses and medications identifies ICH in the PICU with high specificity.","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"1 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43013005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A methodological approach to validate pneumonia encounters from radiology reports using Natural Language Processing (NLP). 使用自然语言处理(NLP)验证放射学报告中肺炎遭遇的方法学方法。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-04-05 DOI: 10.1055/a-1817-7008
A. Panny, H. Hegde, I. Glurich, F. Scannapieco, J. Vedre, J. Vanwormer, J. Miecznikowski, A. Acharya
INTRODUCTIONPneumonia is caused by microbes that establish an infectious process in the lungs. The gold standard for pneumonia diagnosis is radiologist-documented pneumonia-related features in radiology notes that are captured in electronic health records in an unstructured format.OBJECTIVEThe study objective was to develop a methodological approach for assessing validity of a pneumonia diagnosis based on identifying presence or absence of key radiographic features in radiology reports with subsequent rendering of diagnostic decisions into a structured format.METHODSA pneumonia-specific Natural Language Processing (NLP) pipeline was strategically developed applying cTAKES to validate pneumonia diagnoses following development of a pneumonia feature-specific lexicon. Radiographic reports of study-eligible subjects identified by International Classification of Diseases (ICD) codes were parsed through the NLP pipeline. Classification rules were developed to assign each pneumonia episode into one of three categories: "positive", "negative" or "not classified: requires manual review" based on tagged concepts that support or refute diagnostic codes.RESULTSA total of 91,998 pneumonia episodes diagnosed in 65,904 patients were retrieved retrospectively. Approximately 89% (81,707/91,998) of the total pneumonia episodes were documented by 225,893 chest x-ray reports. NLP classified and validated 33% (26,800/81,707) of pneumonia episodes classified as 'Pneumonia-positive', 19% as (15401/81,707) as 'Pneumonia-negative' and 48% (39,209/81,707) as ''episode classification pending further manual review'. NLP pipeline performance metrics included accuracy (76.3%), sensitivity (88%), and specificity (75%).CONCLUSIONThe pneumonia-specific NLP pipeline exhibited good performance comparable to other pneumonia-specific NLP systems developed to date.
肺炎是由在肺部建立感染过程的微生物引起的。肺炎诊断的金标准是放射科医生在放射学记录中记录的肺炎相关特征,这些特征以非结构化格式在电子健康记录中捕获。研究目的是开发一种方法学方法来评估肺炎诊断的有效性,该方法基于确定放射学报告中关键放射学特征的存在或缺失,随后将诊断决策呈现为结构化格式。方法在开发肺炎特征特异性词典后,战略性地开发了一个肺炎特异性自然语言处理(NLP)管道,应用ctake来验证肺炎诊断。通过NLP管道对国际疾病分类(ICD)代码识别的符合研究条件的受试者的放射学报告进行解析。制定了分类规则,根据支持或反驳诊断代码的标记概念,将每次肺炎发作分为三类:“阳性”、“阴性”或“未分类:需要人工审查”。结果回顾性检索65,904例确诊的91,998次肺炎发作。大约89%(81707 / 91998)的肺炎发作记录在225893份胸片报告中。NLP将33%(26,800/81,707)的肺炎事件分类为“肺炎阳性”,19%(15401/81,707)为“肺炎阴性”,48%(39,209/81,707)为“有待进一步人工审查的事件分类”。NLP管道性能指标包括准确性(76.3%)、灵敏度(88%)和特异性(75%)。结论与迄今为止开发的其他肺炎特异性NLP系统相比,该肺炎特异性NLP管道具有良好的性能。
{"title":"A methodological approach to validate pneumonia encounters from radiology reports using Natural Language Processing (NLP).","authors":"A. Panny, H. Hegde, I. Glurich, F. Scannapieco, J. Vedre, J. Vanwormer, J. Miecznikowski, A. Acharya","doi":"10.1055/a-1817-7008","DOIUrl":"https://doi.org/10.1055/a-1817-7008","url":null,"abstract":"INTRODUCTION\u0000Pneumonia is caused by microbes that establish an infectious process in the lungs. The gold standard for pneumonia diagnosis is radiologist-documented pneumonia-related features in radiology notes that are captured in electronic health records in an unstructured format.\u0000\u0000\u0000OBJECTIVE\u0000The study objective was to develop a methodological approach for assessing validity of a pneumonia diagnosis based on identifying presence or absence of key radiographic features in radiology reports with subsequent rendering of diagnostic decisions into a structured format.\u0000\u0000\u0000METHODS\u0000A pneumonia-specific Natural Language Processing (NLP) pipeline was strategically developed applying cTAKES to validate pneumonia diagnoses following development of a pneumonia feature-specific lexicon. Radiographic reports of study-eligible subjects identified by International Classification of Diseases (ICD) codes were parsed through the NLP pipeline. Classification rules were developed to assign each pneumonia episode into one of three categories: \"positive\", \"negative\" or \"not classified: requires manual review\" based on tagged concepts that support or refute diagnostic codes.\u0000\u0000\u0000RESULTS\u0000A total of 91,998 pneumonia episodes diagnosed in 65,904 patients were retrieved retrospectively. Approximately 89% (81,707/91,998) of the total pneumonia episodes were documented by 225,893 chest x-ray reports. NLP classified and validated 33% (26,800/81,707) of pneumonia episodes classified as 'Pneumonia-positive', 19% as (15401/81,707) as 'Pneumonia-negative' and 48% (39,209/81,707) as ''episode classification pending further manual review'. NLP pipeline performance metrics included accuracy (76.3%), sensitivity (88%), and specificity (75%).\u0000\u0000\u0000CONCLUSION\u0000The pneumonia-specific NLP pipeline exhibited good performance comparable to other pneumonia-specific NLP systems developed to date.","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43521863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Identifying Pneumonia Sub-types from Electronic Health Records Using Rule-based Algorithms. 使用基于规则的算法从电子健康记录中识别肺炎亚型。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-03-17 DOI: 10.1055/a-1801-2718
H. Hegde, I. Glurich, A. Panny, J. Vedre, J. Vanwormer, R. Berg, F. Scannapieco, J. Miecznikowski, A. Acharya
BACKGROUNDInternational Classification of Disease (ICD) coding for pneumonia classification is based on causal organism or use of general pneumonia codes, creating challenges for epidemiological evaluations, where pneumonia is standardly subtyped by settings, exposures and time of emergence. Pneumonia subtype classification requires data available in electronic health records (EHR), frequently in non-structured formats including radiological interpretation or clinical notes that complicate electronic classification.OBJECTIVEThe current study undertook development of a rule-based pneumonia subtyping algorithm for stratifying pneumonia by the setting in which it emerged using information documented in the EHR.METHODSPneumonia subtype classification was developed by interrogating patient information within the EHR of a large private Health System. ICD coding was mined in the EHR applying requirements for 'rule of two' pneumonia-related codes or one ICD code and radiologically-confirmed pneumonia validated by natural language processing and/or documented antibiotic prescriptions. A rule-based algorithm flow chart was created to support sub-classification based on features including symptomatic patient point of entry into the healthcare system timing of pneumonia emergence and identification of clinical, laboratory or medication orders that informed definition of the pneumonia sub-classification algorithm.RESULTSData from 65,904 study-eligible patients with 91,998 episodes of pneumonia diagnoses documented by 380,509 encounters were analyzed, while 8,611 episodes were excluded following NLP classification of pneumonia status as 'negative' or 'unknown'. Subtyping of 83,387 episodes identified: community acquired (54.5%), hospital-acquired (20%), aspiration-related (10.7%), healthcare-acquired (5%), ventilator-associated (0.4%) cases, and 9.4% were not classifiable by the algorithm.CONCLUSIONStudy outcome indicated capacity to achieve electronic pneumonia subtype classification based on interrogation of big data available in the EHR. Examination of portability of the algorithm to achieve rule-based pneumonia classification in other health systems remains to be explored.
背景国际疾病分类(ICD)对肺炎分类的编码是基于病因或通用肺炎编码的使用,这给流行病学评估带来了挑战,因为肺炎通常按环境、暴露和出现时间进行分型。肺炎亚型分类需要电子健康记录(EHR)中的可用数据,通常是非结构化格式,包括使电子分类复杂化的放射学解释或临床记录。目的本研究开发了一种基于规则的肺炎亚型算法,用于根据EHR中记录的信息对肺炎进行分层。方法肺炎亚型分类是通过询问大型私人卫生系统EHR中的患者信息而开发的。ICD编码在EHR中进行了挖掘,应用了“二规则”肺炎相关代码或一个ICD代码的要求,以及通过自然语言处理和/或记录的抗生素处方验证的放射学证实的肺炎。创建了一个基于规则的算法流程图,以支持基于以下特征的子分类,这些特征包括有症状的患者进入医疗保健系统的时间、肺炎出现的时间以及为肺炎子分类算法的定义提供信息的临床、实验室或药物订单的识别。结果分析了65904名符合研究条件的患者的数据,这些患者在380509次接触中记录了91998次肺炎发作,而在NLP将肺炎状态分类为“阴性”或“未知”后,8611次发作被排除在外。确定的83387例发作的亚型:社区获得性(54.5%)、医院获得性(20%)、抽吸相关(10.7%)、医疗保健获得性(5%)、呼吸机相关(0.4%)和9.4%无法通过算法进行分类。结论研究结果表明,有能力根据EHR中可用的大数据实现电子肺炎亚型分类。在其他卫生系统中检查算法的可移植性以实现基于规则的肺炎分类仍有待探索。
{"title":"Identifying Pneumonia Sub-types from Electronic Health Records Using Rule-based Algorithms.","authors":"H. Hegde, I. Glurich, A. Panny, J. Vedre, J. Vanwormer, R. Berg, F. Scannapieco, J. Miecznikowski, A. Acharya","doi":"10.1055/a-1801-2718","DOIUrl":"https://doi.org/10.1055/a-1801-2718","url":null,"abstract":"BACKGROUND\u0000International Classification of Disease (ICD) coding for pneumonia classification is based on causal organism or use of general pneumonia codes, creating challenges for epidemiological evaluations, where pneumonia is standardly subtyped by settings, exposures and time of emergence. Pneumonia subtype classification requires data available in electronic health records (EHR), frequently in non-structured formats including radiological interpretation or clinical notes that complicate electronic classification.\u0000\u0000\u0000OBJECTIVE\u0000The current study undertook development of a rule-based pneumonia subtyping algorithm for stratifying pneumonia by the setting in which it emerged using information documented in the EHR.\u0000\u0000\u0000METHODS\u0000Pneumonia subtype classification was developed by interrogating patient information within the EHR of a large private Health System. ICD coding was mined in the EHR applying requirements for 'rule of two' pneumonia-related codes or one ICD code and radiologically-confirmed pneumonia validated by natural language processing and/or documented antibiotic prescriptions. A rule-based algorithm flow chart was created to support sub-classification based on features including symptomatic patient point of entry into the healthcare system timing of pneumonia emergence and identification of clinical, laboratory or medication orders that informed definition of the pneumonia sub-classification algorithm.\u0000\u0000\u0000RESULTS\u0000Data from 65,904 study-eligible patients with 91,998 episodes of pneumonia diagnoses documented by 380,509 encounters were analyzed, while 8,611 episodes were excluded following NLP classification of pneumonia status as 'negative' or 'unknown'. Subtyping of 83,387 episodes identified: community acquired (54.5%), hospital-acquired (20%), aspiration-related (10.7%), healthcare-acquired (5%), ventilator-associated (0.4%) cases, and 9.4% were not classifiable by the algorithm.\u0000\u0000\u0000CONCLUSION\u0000Study outcome indicated capacity to achieve electronic pneumonia subtype classification based on interrogation of big data available in the EHR. Examination of portability of the algorithm to achieve rule-based pneumonia classification in other health systems remains to be explored.","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44828042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Towards the Representation of Network Assets in Health Care Environments Using Ontologies. 面向医疗保健环境中使用本体的网络资产表示。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-12-01 DOI: 10.1055/s-0041-1735621
Lucía Prieto Santamaría, David Fernández Lobón, Antonio Jesús Díaz-Honrubia, Ernestina Menasalvas Ruiz, Sokratis Nifakos, Alejandro Rodríguez-González

Objectives: The aim of the study is to design an ontology model for the representation of assets and its features in distributed health care environments. Allow the interchange of information about these assets through the use of specific vocabularies based on the use of ontologies.

Methods: Ontologies are a formal way to represent knowledge by means of triples composed of a subject, a predicate, and an object. Given the sensitivity of network assets in health care institutions, this work by using an ontology-based representation of information complies with the FAIR principles. Federated queries to the ontology systems, allow users to obtain data from multiple sources (i.e., several hospitals belonging to the same public body). Therefore, this representation makes it possible for network administrators in health care institutions to have a clear understanding of possible threats that may emerge in the network.

Results: As a result of this work, the "Software Defined Networking Description Language-CUREX Asset Discovery Tool Ontology" (SDNDL-CAO) has been developed. This ontology uses the main concepts in network assets to represent the knowledge extracted from the distributed health care environments: interface, device, port, service, etc. CONCLUSION:  The developed SDNDL-CAO ontology allows to represent the aforementioned knowledge about the distributed health care environments. Network administrators of these institutions will benefit as they will be able to monitor emerging threats in real-time, something critical when managing personal medical information.

目的:研究的目的是为分布式医疗环境中资产及其特征的表示设计一个本体模型。通过使用基于本体的特定词汇表,允许交换关于这些资产的信息。方法:本体论是一种通过由主语、谓语和宾语组成的三元组来表示知识的形式化方法。考虑到医疗机构网络资产的敏感性,使用基于本体的信息表示的这项工作符合FAIR原则。对本体系统的联邦查询允许用户从多个来源(即,属于同一公共机构的几家医院)获取数据。因此,这种表示使医疗保健机构的网络管理员能够清楚地了解网络中可能出现的威胁。结果:通过本工作,开发了“软件定义网络描述语言- curex资产发现工具本体”(SDNDL-CAO)。该本体使用网络资产中的主要概念来表示从分布式医疗环境中提取的知识:接口、设备、端口、服务等。结论:开发的SDNDL-CAO本体可以表示上述关于分布式医疗环境的知识。这些机构的网络管理员将从中受益,因为他们将能够实时监控新出现的威胁,这在管理个人医疗信息时至关重要。
{"title":"Towards the Representation of Network Assets in Health Care Environments Using Ontologies.","authors":"Lucía Prieto Santamaría,&nbsp;David Fernández Lobón,&nbsp;Antonio Jesús Díaz-Honrubia,&nbsp;Ernestina Menasalvas Ruiz,&nbsp;Sokratis Nifakos,&nbsp;Alejandro Rodríguez-González","doi":"10.1055/s-0041-1735621","DOIUrl":"https://doi.org/10.1055/s-0041-1735621","url":null,"abstract":"<p><strong>Objectives: </strong>The aim of the study is to design an ontology model for the representation of assets and its features in distributed health care environments. Allow the interchange of information about these assets through the use of specific vocabularies based on the use of ontologies.</p><p><strong>Methods: </strong>Ontologies are a formal way to represent knowledge by means of triples composed of a subject, a predicate, and an object. Given the sensitivity of network assets in health care institutions, this work by using an ontology-based representation of information complies with the FAIR principles. Federated queries to the ontology systems, allow users to obtain data from multiple sources (i.e., several hospitals belonging to the same public body). Therefore, this representation makes it possible for network administrators in health care institutions to have a clear understanding of possible threats that may emerge in the network.</p><p><strong>Results: </strong>As a result of this work, the \"Software Defined Networking Description Language-CUREX Asset Discovery Tool Ontology\" (SDNDL-CAO) has been developed. This ontology uses the main concepts in network assets to represent the knowledge extracted from the distributed health care environments: interface, device, port, service, etc. CONCLUSION:  The developed SDNDL-CAO ontology allows to represent the aforementioned knowledge about the distributed health care environments. Network administrators of these institutions will benefit as they will be able to monitor emerging threats in real-time, something critical when managing personal medical information.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"60 S 02","pages":"e89-e102"},"PeriodicalIF":1.7,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/9b/d5/10-1055-s-0041-1735621.PMC8714298.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9254222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Ontology Engineering for Gastric Dystemperament in Persian Medicine. 波斯医学胃异常的本体工程。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-12-01 Epub Date: 2021-08-26 DOI: 10.1055/s-0041-1735168
Hassan Shojaee-Mend, Haleh Ayatollahi, Azam Abdolahadi

Objective: Developing an ontology can help collecting and sharing information in traditional medicine including Persian medicine in a well-defined format. The present study aimed to develop an ontology for gastric dystemperament in the Persian medicine.

Methods: This was a mixed-methods study conducted in 2019. The first stage was related to providing an ontology requirements specification document. In the second stage, important terms, concepts, and their relationships were identified via literature review and expert panels. Then, the results derived from the second stage were refined and validated using the Delphi method in three rounds. Finally, in the fourth stage, the ontology was evaluated in terms of consistency and coherence.

Results: In this study, 241 concepts related to different types of gastric dystemperament, diagnostic criteria, and treatments in the Persian medicine were identified through literature review and expert panels, and 12 new concepts were suggested during the Delphi study. In total, after performing three rounds of the Delphi study, 233 concepts were identified. Finally, an ontology was developed with 71 classes, and the results of the evaluation study revealed that the ontology was consistent and coherent.

Conclusion: In this study, an ontology was created for gastric dystemperament in the Persian medicine. This ontology can be used for designing future systems, such as case-based reasoning and expert systems. Moreover, the use of other evaluation methods is suggested to construct a more complete and precise ontology.

目的:建立传统医学(包括波斯医学)的本体论,有助于以定义良好的格式收集和共享信息。本研究旨在建立波斯医学胃病本体论。方法:这是一项于2019年进行的混合方法研究。第一个阶段与提供本体需求规范文档有关。在第二阶段,通过文献回顾和专家小组来确定重要的术语、概念及其关系。然后,使用德尔菲法对第二阶段得出的结果进行三轮细化和验证。最后,在第四阶段,对本体进行一致性和相干性评价。结果:本研究通过文献回顾和专家评议,确定了241个与波斯医学中不同类型胃不适、诊断标准和治疗相关的概念,并通过德尔菲研究提出了12个新概念。在进行了三轮德尔菲研究之后,总共确定了233个概念。最后,建立了包含71个类的本体,评价研究结果表明,本体具有一致性和连贯性。结论:本研究建立了波斯医学胃病本体论。该本体可用于设计未来的系统,如基于案例的推理和专家系统。此外,建议使用其他评价方法来构建更完整、更精确的本体。
{"title":"Ontology Engineering for Gastric Dystemperament in Persian Medicine.","authors":"Hassan Shojaee-Mend,&nbsp;Haleh Ayatollahi,&nbsp;Azam Abdolahadi","doi":"10.1055/s-0041-1735168","DOIUrl":"https://doi.org/10.1055/s-0041-1735168","url":null,"abstract":"<p><strong>Objective: </strong>Developing an ontology can help collecting and sharing information in traditional medicine including Persian medicine in a well-defined format. The present study aimed to develop an ontology for gastric dystemperament in the Persian medicine.</p><p><strong>Methods: </strong>This was a mixed-methods study conducted in 2019. The first stage was related to providing an ontology requirements specification document. In the second stage, important terms, concepts, and their relationships were identified via literature review and expert panels. Then, the results derived from the second stage were refined and validated using the Delphi method in three rounds. Finally, in the fourth stage, the ontology was evaluated in terms of consistency and coherence.</p><p><strong>Results: </strong>In this study, 241 concepts related to different types of gastric dystemperament, diagnostic criteria, and treatments in the Persian medicine were identified through literature review and expert panels, and 12 new concepts were suggested during the Delphi study. In total, after performing three rounds of the Delphi study, 233 concepts were identified. Finally, an ontology was developed with 71 classes, and the results of the evaluation study revealed that the ontology was consistent and coherent.</p><p><strong>Conclusion: </strong>In this study, an ontology was created for gastric dystemperament in the Persian medicine. This ontology can be used for designing future systems, such as case-based reasoning and expert systems. Moreover, the use of other evaluation methods is suggested to construct a more complete and precise ontology.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"60 5-06","pages":"162-170"},"PeriodicalIF":1.7,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39371833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Status of AI-Enabled Clinical Decision Support Systems Implementations in China. 中国人工智能临床决策支持系统实施现状
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-12-01 Epub Date: 2021-10-25 DOI: 10.1055/s-0041-1736461
Mengting Ji, Xiaoyun Chen, Georgi Z Genchev, Mingyue Wei, Guangjun Yu

Background: AI-enabled Clinical Decision Support Systems (AI + CDSSs) were heralded to contribute greatly to the advancement of health care services. There is an increased availability of monetary funds and technical expertise invested in projects and proposals targeting the building and implementation of such systems. Therefore, understanding the actual system implementation status in clinical practice is imperative.

Objectives: The aim of the study is to understand (1) the current situation of AI + CDSSs clinical implementations in Chinese hospitals and (2) concerns regarding AI + CDSSs current and future implementations.

Methods: We investigated 160 tertiary hospitals from six provinces and province-level cities. Descriptive analysis, two-sided Fisher exact test, and Mann-Whitney U-test were utilized for analysis.

Results: Thirty-eight of the surveyed hospitals (23.75%) had implemented AI + CDSSs. There were statistical differences on grade, scales, and medical volume between the two groups of hospitals (implemented vs. not-implemented AI + CDSSs, p <0.05). On the 5-point Likert scale, 81.58% (31/38) of respondents rated their overall satisfaction with the systems as "just neutral" to "satisfied." The three most common concerns were system functions improvement and integration into the clinical process, data quality and availability, and methodological bias.

Conclusion: While AI + CDSSs were not yet widespread in Chinese clinical settings, professionals recognize the potential benefits and challenges regarding in-hospital AI + CDSSs.

背景:人工智能支持的临床决策支持系统(AI + cdss)被认为对卫生保健服务的进步做出了巨大贡献。为建立和实施这些系统而投资的项目和建议的货币资金和技术专门知识的可用性有所增加。因此,了解系统在临床实践中的实际实施情况势在必行。目的:本研究的目的是了解(1)中国医院人工智能+ cdss临床实施的现状(2)对人工智能+ cdss目前和未来实施的关注。方法:对全国6个省、市的160家三级医院进行调查。采用描述性分析、双侧Fisher精确检验和Mann-Whitney u检验进行分析。结果:38家医院(23.75%)实施了人工智能+ cdss。两组医院在级别、规模和医疗量方面存在统计学差异(实施与未实施AI + cdss)。结论:虽然AI + cdss在中国临床环境中尚未普及,但专业人员认识到院内AI + cdss的潜在益处和挑战。
{"title":"Status of AI-Enabled Clinical Decision Support Systems Implementations in China.","authors":"Mengting Ji,&nbsp;Xiaoyun Chen,&nbsp;Georgi Z Genchev,&nbsp;Mingyue Wei,&nbsp;Guangjun Yu","doi":"10.1055/s-0041-1736461","DOIUrl":"https://doi.org/10.1055/s-0041-1736461","url":null,"abstract":"<p><strong>Background: </strong>AI-enabled Clinical Decision Support Systems (AI + CDSSs) were heralded to contribute greatly to the advancement of health care services. There is an increased availability of monetary funds and technical expertise invested in projects and proposals targeting the building and implementation of such systems. Therefore, understanding the actual system implementation status in clinical practice is imperative.</p><p><strong>Objectives: </strong>The aim of the study is to understand (1) the current situation of AI + CDSSs clinical implementations in Chinese hospitals and (2) concerns regarding AI + CDSSs current and future implementations.</p><p><strong>Methods: </strong>We investigated 160 tertiary hospitals from six provinces and province-level cities. Descriptive analysis, two-sided Fisher exact test, and Mann-Whitney <i>U</i>-test were utilized for analysis.</p><p><strong>Results: </strong>Thirty-eight of the surveyed hospitals (23.75%) had implemented AI + CDSSs. There were statistical differences on grade, scales, and medical volume between the two groups of hospitals (implemented vs. not-implemented AI + CDSSs, <i>p</i> <0.05). On the 5-point Likert scale, 81.58% (31/38) of respondents rated their overall satisfaction with the systems as \"just neutral\" to \"satisfied.\" The three most common concerns were system functions improvement and integration into the clinical process, data quality and availability, and methodological bias.</p><p><strong>Conclusion: </strong>While AI + CDSSs were not yet widespread in Chinese clinical settings, professionals recognize the potential benefits and challenges regarding in-hospital AI + CDSSs.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"60 5-06","pages":"123-132"},"PeriodicalIF":1.7,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39569421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
The Acceptance of Interruptive Medication Alerts in an Electronic Decision Support System Differs between Different Alert Types. 在电子决策支持系统中,不同警报类型对中断药物警报的接受程度不同。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-12-01 Epub Date: 2021-08-27 DOI: 10.1055/s-0041-1735169
Janina A Bittmann, Elisabeth K Rein, Michael Metzner, Walter E Haefeli, Hanna M Seidling

Background: Through targeted medication alerts, clinical decision support systems (CDSS) help users to identify medication errors such as disregarded drug-drug interactions (DDIs). Override rates of such alerts are high; however, they can be mitigated by alert tailoring or workflow-interrupting display of severe alerts that need active user acceptance or overriding. Yet, the extent to which the displayed alert interferes with the prescribers' workflow showed inconclusive impact on alert acceptance.

Objectives: We aimed to assess whether and how often prescriptions were changed as a potential result of interruptive alerts on different (contraindicated) prescription constellations with particularly high risks for adverse drug events (ADEs).

Methods: We retrospectively collected data of all interruptive alerts issued between March 2016 and August 2020 in the local CDSS (AiDKlinik) at Heidelberg University Hospital. The alert battery consisted of 31 distinct alerts for contraindicated DDI with simvastatin, potentially inappropriate medication for patients > 65 years (PIM, N = 14 drugs and 36 drug combinations), and contraindicated drugs in hyperkalemia (N = 5) that could be accepted or overridden giving a reason in free-text form.

Results: In 935 prescribing sessions of 500 274 total sessions, at least one interruptive alert was fired. Of all interruptive alerts, about half of the sessions were evaluable whereof in total 57.5% (269 of 468 sessions) were accepted while 42.5% were overridden. The acceptance rate of interruptive alerts differed significantly depending on the alert type (p <0.0001), reaching 85.7% for DDI alerts (N = 185), 65.3% for contraindicated drugs in hyperkalemia (N = 98), and 25.1% for PIM alerts (N = 185).

Conclusion: A total of 57.5% of the interruptive medication alerts with particularly high risks for ADE in our setting were accepted while the acceptance rate differed according to the alert type with contraindicated simvastatin DDI alerts being accepted most frequently.

背景:通过有针对性的药物警报,临床决策支持系统(CDSS)帮助用户识别药物错误,如忽视药物-药物相互作用(ddi)。这类警报的覆盖率很高;但是,可以通过警报裁剪或显示需要主动用户接受或覆盖的严重警报来缓解这些问题。然而,显示的警报干扰处方者工作流程的程度对警报接受的影响尚无定论。目的:我们旨在评估不同(禁忌症)处方星座的药物不良事件(ADEs)风险特别高的潜在中断警报是否以及多久改变一次处方。方法:回顾性收集海德堡大学医院当地CDSS (AiDKlinik)在2016年3月至2020年8月期间发布的所有中断警报的数据。警报组由31个不同的警报组成,包括辛伐他汀禁忌症DDI, > 65岁患者可能不适当的药物(PIM, N = 14种药物和36种药物组合),以及高钾血症禁忌症药物(N = 5),可以接受或撤销,并以自由文本形式给出理由。结果:在500 274次处方的935次处方中,至少触发了一次中断警报。在所有中断警报中,大约一半的会话是可评估的,其中总共57.5%(468个会话中的269个)被接受,而42.5%被覆盖。不同警报类型的中断警报的接受率差异显著(p N = 185),高钾血症禁忌症药物的接受率为65.3% (N = 98), PIM警报的接受率为25.1% (N = 185)。结论:本组ADE高危中断用药警报的通过率为57.5%,不同警报类型的通过率不同,以辛伐他汀类DDI禁忌症警报的通过率最高。
{"title":"The Acceptance of Interruptive Medication Alerts in an Electronic Decision Support System Differs between Different Alert Types.","authors":"Janina A Bittmann,&nbsp;Elisabeth K Rein,&nbsp;Michael Metzner,&nbsp;Walter E Haefeli,&nbsp;Hanna M Seidling","doi":"10.1055/s-0041-1735169","DOIUrl":"https://doi.org/10.1055/s-0041-1735169","url":null,"abstract":"<p><strong>Background: </strong>Through targeted medication alerts, clinical decision support systems (CDSS) help users to identify medication errors such as disregarded drug-drug interactions (DDIs). Override rates of such alerts are high; however, they can be mitigated by alert tailoring or workflow-interrupting display of severe alerts that need active user acceptance or overriding. Yet, the extent to which the displayed alert interferes with the prescribers' workflow showed inconclusive impact on alert acceptance.</p><p><strong>Objectives: </strong>We aimed to assess whether and how often prescriptions were changed as a potential result of interruptive alerts on different (contraindicated) prescription constellations with particularly high risks for adverse drug events (ADEs).</p><p><strong>Methods: </strong>We retrospectively collected data of all interruptive alerts issued between March 2016 and August 2020 in the local CDSS (AiD<i>Klinik</i>) at Heidelberg University Hospital. The alert battery consisted of 31 distinct alerts for contraindicated DDI with simvastatin, potentially inappropriate medication for patients > 65 years (PIM, <i>N</i> = 14 drugs and 36 drug combinations), and contraindicated drugs in hyperkalemia (<i>N</i> = 5) that could be accepted or overridden giving a reason in free-text form.</p><p><strong>Results: </strong>In 935 prescribing sessions of 500 274 total sessions, at least one interruptive alert was fired. Of all interruptive alerts, about half of the sessions were evaluable whereof in total 57.5% (269 of 468 sessions) were accepted while 42.5% were overridden. The acceptance rate of interruptive alerts differed significantly depending on the alert type (<i>p</i> <0.0001), reaching 85.7% for DDI alerts (<i>N</i> = 185), 65.3% for contraindicated drugs in hyperkalemia (<i>N</i> = 98), and 25.1% for PIM alerts (<i>N</i> = 185).</p><p><strong>Conclusion: </strong>A total of 57.5% of the interruptive medication alerts with particularly high risks for ADE in our setting were accepted while the acceptance rate differed according to the alert type with contraindicated simvastatin DDI alerts being accepted most frequently.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"60 5-06","pages":"180-184"},"PeriodicalIF":1.7,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39358833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Evaluation Metrics for Health Chatbots: A Delphi Study. 健康聊天机器人的评估指标:德尔菲研究。
IF 1.7 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-12-01 Epub Date: 2021-10-31 DOI: 10.1055/s-0041-1736664
Kerstin Denecke, Alaa Abd-Alrazaq, Mowafa Househ, Jim Warren

Background: In recent years, an increasing number of health chatbots has been published in app stores and described in research literature. Given the sensitive data they are processing and the care settings for which they are developed, evaluation is essential to avoid harm to users. However, evaluations of those systems are reported inconsistently and without using a standardized set of evaluation metrics. Missing standards in health chatbot evaluation prevent comparisons of systems, and this may hamper acceptability since their reliability is unclear.

Objectives: The objective of this paper is to make an important step toward developing a health-specific chatbot evaluation framework by finding consensus on relevant metrics.

Methods: We used an adapted Delphi study design to verify and select potential metrics that we retrieved initially from a scoping review. We invited researchers, health professionals, and health informaticians to score each metric for inclusion in the final evaluation framework, over three survey rounds. We distinguished metrics scored relevant with high, moderate, and low consensus. The initial set of metrics comprised 26 metrics (categorized as global metrics, metrics related to response generation, response understanding and aesthetics).

Results: Twenty-eight experts joined the first round and 22 (75%) persisted to the third round. Twenty-four metrics achieved high consensus and three metrics achieved moderate consensus. The core set for our framework comprises mainly global metrics (e.g., ease of use, security content accuracy), metrics related to response generation (e.g., appropriateness of responses), and related to response understanding. Metrics on aesthetics (font type and size, color) are less well agreed upon-only moderate or low consensus was achieved for those metrics.

Conclusion: The results indicate that experts largely agree on metrics and that the consensus set is broad. This implies that health chatbot evaluation must be multifaceted to ensure acceptability.

背景:近年来,越来越多的健康聊天机器人在应用商店中发布,并在研究文献中进行描述。鉴于它们正在处理的敏感数据和开发它们的护理环境,评估对于避免对用户造成伤害至关重要。然而,这些系统的评估报告不一致,而且没有使用一套标准化的评估量度。缺乏健康聊天机器人评估的标准阻碍了系统的比较,这可能会妨碍可接受性,因为它们的可靠性尚不清楚。目的:本文的目的是通过在相关指标上达成共识,朝着开发特定于健康的聊天机器人评估框架迈出重要的一步。方法:我们采用适应性德尔菲研究设计来验证和选择我们最初从范围审查中检索到的潜在指标。我们邀请了研究人员、卫生专业人员和卫生信息学家对每个指标进行评分,以便在三轮调查中纳入最终评估框架。我们区分了与高、中、低共识相关的指标得分。最初的度量标准包括26个度量标准(分类为全局度量标准、与响应生成、响应理解和美学相关的度量标准)。结果:28名专家参加第一轮,22名(75%)专家坚持到第三轮。24个指标达到高度一致,3个指标达到中等一致。我们框架的核心集主要包括全局度量(例如,易用性、安全性内容准确性)、与响应生成相关的度量(例如,响应的适当性)以及与响应理解相关的度量。关于美学的指标(字体类型和大小,颜色)还没有达成一致,只有中等或较低的共识。结论:结果表明,专家在很大程度上同意的指标,共识集是广泛的。这意味着健康聊天机器人的评估必须是多方面的,以确保可接受性。
{"title":"Evaluation Metrics for Health Chatbots: A Delphi Study.","authors":"Kerstin Denecke,&nbsp;Alaa Abd-Alrazaq,&nbsp;Mowafa Househ,&nbsp;Jim Warren","doi":"10.1055/s-0041-1736664","DOIUrl":"https://doi.org/10.1055/s-0041-1736664","url":null,"abstract":"<p><strong>Background: </strong>In recent years, an increasing number of health chatbots has been published in app stores and described in research literature. Given the sensitive data they are processing and the care settings for which they are developed, evaluation is essential to avoid harm to users. However, evaluations of those systems are reported inconsistently and without using a standardized set of evaluation metrics. Missing standards in health chatbot evaluation prevent comparisons of systems, and this may hamper acceptability since their reliability is unclear.</p><p><strong>Objectives: </strong>The objective of this paper is to make an important step toward developing a health-specific chatbot evaluation framework by finding consensus on relevant metrics.</p><p><strong>Methods: </strong>We used an adapted Delphi study design to verify and select potential metrics that we retrieved initially from a scoping review. We invited researchers, health professionals, and health informaticians to score each metric for inclusion in the final evaluation framework, over three survey rounds. We distinguished metrics scored relevant with high, moderate, and low consensus. The initial set of metrics comprised 26 metrics (categorized as global metrics, metrics related to response generation, response understanding and aesthetics).</p><p><strong>Results: </strong>Twenty-eight experts joined the first round and 22 (75%) persisted to the third round. Twenty-four metrics achieved high consensus and three metrics achieved moderate consensus. The core set for our framework comprises mainly global metrics (e.g., ease of use, security content accuracy), metrics related to response generation (e.g., appropriateness of responses), and related to response understanding. Metrics on aesthetics (font type and size, color) are less well agreed upon-only moderate or low consensus was achieved for those metrics.</p><p><strong>Conclusion: </strong>The results indicate that experts largely agree on metrics and that the consensus set is broad. This implies that health chatbot evaluation must be multifaceted to ensure acceptability.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"60 5-06","pages":"171-179"},"PeriodicalIF":1.7,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39577534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
Methods of Information in Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1