Journal of healthcare informatics research最新文献_第3页

Individualized Modeling to Distinguish Between High and Low Arousal States Using Physiological Data. 使用生理数据区分高唤醒状态和低唤醒状态的个性化建模。

IF 3.7 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of healthcare informatics research

Pub Date : 2020-01-22 eCollection Date: 2020-03-01 DOI: 10.1007/s41666-019-00064-1

Ame Osotsi, Zita Oravecz, Qunhua Li, Joshua Smyth, Timothy R Brick

With wearable, relatively unobtrusive health monitors and smartphone sensors, it is increasingly easy to collect continuously streaming physiological data in a passive mode without placing much burden on participants. At the same time, smartphones provide the ability to survey participants to provide "ground-truth" reporting on psychological states, although this comes at an increased cost in participant burden. In this paper, we examined how analytical approaches from the field of machine learning could allow us to distill the collected physiological data into actionable decision rules about each individual's psychological state, with the eventual goal of identifying important psychological states (e.g., risk moments) without the need for ongoing burdensome active assessment (e.g., self-report). As a first step towards this goal, we compared two methods: (1) a k-nearest neighbor classifier that uses dynamic time warping distance, and (2) a random forests classifier to predict low and high states of affective arousal states based on features extracted using the tsfresh python package. Then, we compared random-forest-based predictive models tailored for the individual with individual-general models. Results showed that the individual-specific model outperformed the general one. Our results support the feasibility of using passively collected wearable data to predict psychological states, suggesting that by relying on both types of data, the active collection can be reduced or eliminated.

有了可穿戴的、相对不显眼的健康监测器和智能手机传感器，在被动模式下收集连续不断的生理数据变得越来越容易，而不会给参与者带来太多负担。与此同时，智能手机提供了调查参与者的能力，以提供关于心理状态的“基本事实”报告，尽管这增加了参与者负担的成本。在本文中，我们研究了机器学习领域的分析方法如何使我们能够将收集到的生理数据提炼成关于每个人心理状态的可操作决策规则，最终目标是识别重要的心理状态（例如，风险时刻），而不需要持续繁重的主动评估（例如，自我报告）。作为实现这一目标的第一步，我们比较了两种方法：(1)使用动态时间规整距离的k近邻分类器和(2)基于使用tsfresh python包提取的特征预测情感唤醒状态的低状态和高状态的随机森林分类器。然后，我们比较了为个体量身定制的基于随机森林的预测模型和个体-一般模型。结果表明，个体特异性模型优于一般模型。我们的研究结果支持使用被动收集的可穿戴数据来预测心理状态的可行性，这表明通过依赖这两种类型的数据，可以减少或消除主动收集。

{"title":"Individualized Modeling to Distinguish Between High and Low Arousal States Using Physiological Data.","authors":"Ame Osotsi, Zita Oravecz, Qunhua Li, Joshua Smyth, Timothy R Brick","doi":"10.1007/s41666-019-00064-1","DOIUrl":"https://doi.org/10.1007/s41666-019-00064-1","url":null,"abstract":"With wearable, relatively unobtrusive health monitors and smartphone sensors, it is increasingly easy to collect continuously streaming physiological data in a passive mode without placing much burden on participants. At the same time, smartphones provide the ability to survey participants to provide \"ground-truth\" reporting on psychological states, although this comes at an increased cost in participant burden. In this paper, we examined how analytical approaches from the field of machine learning could allow us to distill the collected physiological data into actionable decision rules about each individual's psychological state, with the eventual goal of identifying important psychological states (e.g., risk moments) without the need for ongoing burdensome active assessment (e.g., self-report). As a first step towards this goal, we compared two methods: (1) a k-nearest neighbor classifier that uses dynamic time warping distance, and (2) a random forests classifier to predict low and high states of affective arousal states based on features extracted using the tsfresh python package. Then, we compared random-forest-based predictive models tailored for the individual with individual-general models. Results showed that the individual-specific model outperformed the general one. Our results support the feasibility of using passively collected wearable data to predict psychological states, suggesting that by relying on both types of data, the active collection can be reduced or eliminated.","PeriodicalId":101413,"journal":{"name":"Journal of healthcare informatics research","volume":"4 1","pages":"91-109"},"PeriodicalIF":3.7,"publicationDate":"2020-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982753/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144755592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Detection of Surgical Site Infection Utilizing Automated Feature Generation in Clinical Notes. 利用临床记录中的自动特征生成检测手术部位感染。

IF 3.7 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of healthcare informatics research

Pub Date : 2019-09-01 Epub Date: 2018-11-06 DOI: 10.1007/s41666-018-0042-9

Feichen Shen, David W Larson, James M Naessens, Elizabeth B Habermann, Hongfang Liu, Sunghwan Sohn

Postsurgical complications (PSCs) are known as a deviation from the normal postsurgical course and categorized by severity and treatment requirements. Surgical site infection (SSI) is one of major PSCs and the most common healthcare-associated infection, resulting in increased length of hospital stay and cost. In this work, we proposed an automated way to generate keyword features using sublanguage analysis with heuristics to detect SSI from cohort in clinical notes and evaluated these keywords with medical experts. To further valid our approach, we also applied different machine learning algorithms on cohort using automatically generated keywords. The results showed that our approach was able to identify SSI keywords from clinical narratives and can be used as a foundation to develop an information extraction system or support search-based natural language processing (NLP) approaches by augmenting search queries.

术后并发症（PSCs）被认为是偏离正常的术后过程，并根据严重程度和治疗要求进行分类。手术部位感染（SSI）是主要的PSCs之一，也是最常见的卫生保健相关感染，导致住院时间和费用增加。在这项工作中，我们提出了一种自动生成关键字特征的方法，使用启发式子语言分析来检测临床记录中的队列SSI，并与医学专家一起评估这些关键字。为了进一步验证我们的方法，我们还使用自动生成的关键字在队列上应用了不同的机器学习算法。结果表明，我们的方法能够从临床叙述中识别出SSI关键字，并可作为开发信息提取系统或通过增加搜索查询来支持基于搜索的自然语言处理（NLP）方法的基础。

引用次数: 0

Emotional Awareness and Decision-Making in the Context of Computer-Mediated Psychotherapy. 计算机辅助心理疗法背景下的情感意识和决策制定。

IF 5.4 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of healthcare informatics research

Pub Date : 2019-03-21 eCollection Date: 2019-09-01 DOI: 10.1007/s41666-019-00050-7

Ebrahim Oshni Alvandi, George Van Doorn, Mark Symmons

Emotional awareness has been previously investigated among clinicians. In this work, we bring to the fore of research the interest to uncover emotional awareness of clinicians during the tele-mental health session. The study reported here aimed at determining whether clinicians process their own emotions, as well as those of the client, in a computer-mediated context. Also, clinicians' decision-making process was assessed because such action appears to be related to the way they feel and recognise how those emotions may change their thinking and impact their interaction with clients. We estimated that such ability in clinicians' would be contrasted when the psychotherapy-session level is conducted via various technologies. Participant of the study were presented by stimuli in different modes of delivery (e.g. text, audio, and video). The experiment indicates that the ability to manage, perceive, and utilise emotions was as being satisfactory during all modes of delivery. In essence, the findings contribute to the field of remote therapy suggesting emotional awareness as a key cognitive factor in diagnosis.

以前曾对临床医生的情感意识进行过调查。在这项工作中，我们将揭示临床医生在远程心理健康会话中的情感意识作为研究重点。本文所报告的研究旨在确定临床医生是否在以计算机为媒介的环境中处理自己和客户的情绪。此外，我们还对临床医生的决策过程进行了评估，因为这种行为似乎与他们的感受有关，并能识别这些情绪会如何改变他们的思维并影响他们与客户的互动。我们估计，当通过各种技术进行心理治疗时，临床医生的这种能力将形成鲜明对比。这项研究的参与者受到了不同传播方式（如文字、音频和视频）的刺激。实验结果表明，在所有传递模式下，参与者管理、感知和利用情绪的能力都令人满意。从本质上讲，这些发现有助于远程治疗领域，表明情绪意识是诊断中的一个关键认知因素。

{"title":"Emotional Awareness and Decision-Making in the Context of Computer-Mediated Psychotherapy.","authors":"Ebrahim Oshni Alvandi, George Van Doorn, Mark Symmons","doi":"10.1007/s41666-019-00050-7","DOIUrl":"https://doi.org/10.1007/s41666-019-00050-7","url":null,"abstract":"Emotional awareness has been previously investigated among clinicians. In this work, we bring to the fore of research the interest to uncover emotional awareness of clinicians during the tele-mental health session. The study reported here aimed at determining whether clinicians process their own emotions, as well as those of the client, in a computer-mediated context. Also, clinicians' decision-making process was assessed because such action appears to be related to the way they feel and recognise how those emotions may change their thinking and impact their interaction with clients. We estimated that such ability in clinicians' would be contrasted when the psychotherapy-session level is conducted via various technologies. Participant of the study were presented by stimuli in different modes of delivery (e.g. text, audio, and video). The experiment indicates that the ability to manage, perceive, and utilise emotions was as being satisfactory during all modes of delivery. In essence, the findings contribute to the field of remote therapy suggesting emotional awareness as a key cognitive factor in diagnosis.","PeriodicalId":101413,"journal":{"name":"Journal of healthcare informatics research","volume":"3 3","pages":"345-370"},"PeriodicalIF":5.4,"publicationDate":"2019-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982805/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142057789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Discovering Oculometric Patterns to Detect Cognitive Performance Changes in Healthy Youth Football Athletes. 发现视力模式，检测健康青少年足球运动员的认知能力变化。

IF 5.4 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of healthcare informatics research

Pub Date : 2019-02-08 eCollection Date: 2019-12-01 DOI: 10.1007/s41666-019-00045-4

Gaurav N Pradhan, Jamie M Bogle, Michael J Cevette, Jan Stepanek

In this paper, we focus on the application of oculometric patterns extracted from raw eye movements during a mental workload task to assess changes in cognitive performance in healthy youth athletes over the course of a typical sport season. Oculometric features pertaining to fixations and saccades were measured on 116 athletes in pre- and post-season testing. Participants were between 7 and 14 years of age at pre-season testing. Due to varied developmental rates, there were large interindividual performance differences during a mental workload task consisting of reading numbers. Based on different reading speeds, we classified three profiles (slow, moderate, and fast) and established their corresponding baselines for oculometric data. Within each profile, we describe changes in oculomotor function based on changes in cognitive performance during the season. To visualize these changes in multidimensional oculometric data, we also present a multidimensional visualization tool named DiViTo (diagnostic visualization tool). These experimental, computational informatics and visualization methodologies may serve to utilize oculometric information to detect changes in cognitive performance due to mild or severe cognitive impairment such as concussion/mild traumatic brain injury, as well as possibly other disorders such as attention deficit hyperactivity disorders, learning/reading disabilities, impairment of alertness, and neurocognitive function.

在本文中，我们重点研究了在一项脑力劳动任务中从原始眼球运动中提取的眼球测量模式的应用，以评估健康青少年运动员在一个典型运动赛季中认知能力的变化。在赛季前和赛季后的测试中，对 116 名运动员的定点和眼球移动进行了眼球测量。参加季前测试的运动员年龄在 7 至 14 岁之间。由于发育速度不同，在完成阅读数字的脑力劳动任务时，个体间的表现差异很大。根据不同的阅读速度，我们将其分为三类（慢速、中速和快速），并建立了相应的视力数据基线。在每种情况下，我们都会根据季节中认知能力的变化来描述眼球运动功能的变化。为了将这些多维眼球测量数据的变化可视化，我们还推出了一个名为 DiViTo（诊断可视化工具）的多维可视化工具。这些实验、计算信息学和可视化方法可用于利用眼球测量信息检测轻度或严重认知障碍（如脑震荡/轻度脑外伤）以及其他可能的疾病（如注意力缺陷多动障碍、学习/阅读障碍、警觉性受损和神经认知功能障碍）导致的认知表现变化。

{"title":"Discovering Oculometric Patterns to Detect Cognitive Performance Changes in Healthy Youth Football Athletes.","authors":"Gaurav N Pradhan, Jamie M Bogle, Michael J Cevette, Jan Stepanek","doi":"10.1007/s41666-019-00045-4","DOIUrl":"https://doi.org/10.1007/s41666-019-00045-4","url":null,"abstract":"In this paper, we focus on the application of oculometric patterns extracted from raw eye movements during a mental workload task to assess changes in cognitive performance in healthy youth athletes over the course of a typical sport season. Oculometric features pertaining to fixations and saccades were measured on 116 athletes in pre- and post-season testing. Participants were between 7 and 14 years of age at pre-season testing. Due to varied developmental rates, there were large interindividual performance differences during a mental workload task consisting of reading numbers. Based on different reading speeds, we classified three profiles (slow, moderate, and fast) and established their corresponding baselines for oculometric data. Within each profile, we describe changes in oculomotor function based on changes in cognitive performance during the season. To visualize these changes in multidimensional oculometric data, we also present a multidimensional visualization tool named DiViTo (diagnostic visualization tool). These experimental, computational informatics and visualization methodologies may serve to utilize oculometric information to detect changes in cognitive performance due to mild or severe cognitive impairment such as concussion/mild traumatic brain injury, as well as possibly other disorders such as attention deficit hyperactivity disorders, learning/reading disabilities, impairment of alertness, and neurocognitive function.","PeriodicalId":101413,"journal":{"name":"Journal of healthcare informatics research","volume":"3 4","pages":"371-392"},"PeriodicalIF":5.4,"publicationDate":"2019-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982780/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142057790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Salience of Medical Concepts of Inside Clinical Texts and Outside Medical Records for Referred Cardiovascular Patients. 转诊心血管病人的临床文献和外部医疗记录中医疗概念的显著性。

Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of healthcare informatics research

Pub Date : 2019-01-28 eCollection Date: 2019-06-01 DOI: 10.1007/s41666-019-00044-5

Sungrim Moon, Sijia Liu, David Chen, Yanshan Wang, Douglas L Wood, Rajeev Chaudhry, Hongfang Liu, Paul Kingsbury

Outside medical records (OMRs) accompanying referred patients are frequently sent as faxes from external healthcare providers. Accessing useful and relevant information from these OMRs in a timely manner is a challenging task due to a combination of the presence of machine-illegible information and the limited system interoperability inherent in healthcare. Little research has been done on investigating information in OMRs. This paper evaluated overlapping and non-overlapping medical concepts captured from digitally faxed OMRs for patients transferring to the Department of Cardiovascular Medicine and from clinical consultant notes generated at the Mayo Clinic. We used optical character recognition (OCR) techniques to make faxed OMRs machine-readable and used natural language processing (NLP) techniques to capture clinical concepts from both machine-readable OMRs and Mayo clinical notes. We measured the level of overlap in medical concepts between OMRs and Mayo clinical narratives in the quantitative approaches and assessed the salience of concepts specific to Cardiovascular Medicine by calculating the ratio of those mentioned concepts relative to an independent clinical corpus. Among the concepts collected from the OMRs, 11.19% of those were also present in the Mayo clinical narratives that were generated within the 3 months after their initial encounter at the Mayo Clinic. For those common concepts, 73.97% were identified in initial consultant notes (ICNs) and 26.03% were captured over subsequent follow-up consultant notes (FCNs). These findings implied that information collected from the OMRs is potentially informative for patient care, but some valuable information (additionally identified in FCNs) collected from the OMRs is not fully used in an earlier stage of the care process. The concepts collected from the ICNs have the highest salience to Cardiovascular Medicine (0.112) compared to concepts in OMRs and concepts in FCNs. Additionally, unique concepts captured in ICNs (unseen in OMRs or FCNs) carried the most salient information (0.094), which demonstrated that ICNs provided the most informative concepts for the care of transferred patients.

转诊病人的外部医疗记录（OMR）经常以传真形式从外部医疗机构发送过来。由于存在机器无法识别的信息和医疗保健系统固有的有限互操作性，及时从这些外部医疗记录中获取有用的相关信息是一项具有挑战性的任务。目前，有关 OMR 中信息调查的研究还很少。本文评估了从转到心血管内科的患者的数字传真 OMR 和梅奥诊所生成的临床顾问笔记中获取的重叠和非重叠医疗概念。我们使用光学字符识别 (OCR) 技术使传真的 OMR 具有机器可读性，并使用自然语言处理 (NLP) 技术从机器可读的 OMR 和梅奥临床笔记中捕获临床概念。我们在定量方法中测量了 OMR 和梅奥临床叙述之间医学概念的重叠程度，并通过计算相对于独立临床语料库的被提及概念的比例来评估心血管内科特有概念的显著性。在从 OMR 中收集到的概念中，有 11.19% 也出现在梅奥临床叙述中，这些叙述是在梅奥诊所初次就诊后 3 个月内产生的。在这些常见的概念中，73.97% 是在首次咨询记录（ICN）中发现的，26.03% 是在随后的随访咨询记录（FCN）中发现的。这些研究结果表明，从手术记录中收集到的信息对病人护理具有潜在的参考价值，但从手术记录中收集到的一些有价值的信息（另外在 FCN 中也有发现）并没有在护理过程的早期阶段得到充分利用。与 OMR 和 FCN 中的概念相比，ICN 中收集的概念对心血管内科的显著性最高（0.112）。此外，在 ICN 中捕捉到的独特概念（在 OMR 或 FCN 中未出现过）具有最显著的信息（0.094），这表明 ICN 为转院病人的护理提供了最有价值的概念。

{"title":"Salience of Medical Concepts of Inside Clinical Texts and Outside Medical Records for Referred Cardiovascular Patients.","authors":"Sungrim Moon, Sijia Liu, David Chen, Yanshan Wang, Douglas L Wood, Rajeev Chaudhry, Hongfang Liu, Paul Kingsbury","doi":"10.1007/s41666-019-00044-5","DOIUrl":"https://doi.org/10.1007/s41666-019-00044-5","url":null,"abstract":"Outside medical records (OMRs) accompanying referred patients are frequently sent as faxes from external healthcare providers. Accessing useful and relevant information from these OMRs in a timely manner is a challenging task due to a combination of the presence of machine-illegible information and the limited system interoperability inherent in healthcare. Little research has been done on investigating information in OMRs. This paper evaluated overlapping and non-overlapping medical concepts captured from digitally faxed OMRs for patients transferring to the Department of Cardiovascular Medicine and from clinical consultant notes generated at the Mayo Clinic. We used optical character recognition (OCR) techniques to make faxed OMRs machine-readable and used natural language processing (NLP) techniques to capture clinical concepts from both machine-readable OMRs and Mayo clinical notes. We measured the level of overlap in medical concepts between OMRs and Mayo clinical narratives in the quantitative approaches and assessed the salience of concepts specific to Cardiovascular Medicine by calculating the ratio of those mentioned concepts relative to an independent clinical corpus. Among the concepts collected from the OMRs, 11.19% of those were also present in the Mayo clinical narratives that were generated within the 3 months after their initial encounter at the Mayo Clinic. For those common concepts, 73.97% were identified in initial consultant notes (ICNs) and 26.03% were captured over subsequent follow-up consultant notes (FCNs). These findings implied that information collected from the OMRs is potentially informative for patient care, but some valuable information (additionally identified in FCNs) collected from the OMRs is not fully used in an earlier stage of the care process. The concepts collected from the ICNs have the highest salience to Cardiovascular Medicine (0.112) compared to concepts in OMRs and concepts in FCNs. Additionally, unique concepts captured in ICNs (unseen in OMRs or FCNs) carried the most salient information (0.094), which demonstrated that ICNs provided the most informative concepts for the care of transferred patients.","PeriodicalId":101413,"journal":{"name":"Journal of healthcare informatics research","volume":"3 2","pages":"200-219"},"PeriodicalIF":0.0,"publicationDate":"2019-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982748/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140913664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

What Happened to Me while I Was in the Hospital? Challenges and Opportunities for Generating Patient-Friendly Hospitalization Summaries. 我在医院的时候发生了什么？生成病人友好型住院摘要的挑战和机遇。

IF 3.7 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of healthcare informatics research

Pub Date : 2018-10-12 eCollection Date: 2019-03-01 DOI: 10.1007/s41666-018-0036-7

Sabita Acharya, Andrew D Boyd, Richard Cameron, Karen Dunn Lopez, Pamela Martyn-Nemeth, Carolyn Dickens, Amer Ardati, Jose D Flores, Matt Baumann, Betty Welland, Barbara Di Eugenio

Comprehending medical information is a challenging task, especially for people who have not received formal medical education. When patients are discharged from the hospital, they are provided with lengthy medical documents that contain intricate terminologies. Studies have shown that if people do not understand the content of their health documents, they will neither look for new information regarding their illness nor will they take actions to prevent or recover from their health issue. In this article, we highlight the need for generating personalized hospital-stay summaries and several research challenges associated with this task. The proposed directions are directly informed by our ongoing work in generating concise and comprehensible hospitalization summaries that are tailored to suit the patient's understanding of medical terminologies and level of engagement in improving their own health. Our preliminary evaluation shows that our summaries effectively present required medical concepts.

理解医学信息是一项具有挑战性的任务，特别是对于没有接受过正规医学教育的人。当病人出院时，他们会得到冗长的医疗文件，其中包含复杂的术语。研究表明，如果人们不了解他们的健康文件的内容，他们既不会寻找有关他们疾病的新信息，也不会采取行动来预防或从他们的健康问题中恢复过来。在本文中，我们强调了生成个性化住院摘要的必要性以及与此任务相关的几个研究挑战。我们正在进行的工作直接通知了建议的方向，即生成简明易懂的住院摘要，这些摘要是根据患者对医学术语的理解和改善自身健康的参与程度量身定制的。我们的初步评估表明，我们的摘要有效地呈现了所需的医学概念。

{"title":"What Happened to Me while I Was in the Hospital? Challenges and Opportunities for Generating Patient-Friendly Hospitalization Summaries.","authors":"Sabita Acharya, Andrew D Boyd, Richard Cameron, Karen Dunn Lopez, Pamela Martyn-Nemeth, Carolyn Dickens, Amer Ardati, Jose D Flores, Matt Baumann, Betty Welland, Barbara Di Eugenio","doi":"10.1007/s41666-018-0036-7","DOIUrl":"https://doi.org/10.1007/s41666-018-0036-7","url":null,"abstract":"Comprehending medical information is a challenging task, especially for people who have not received formal medical education. When patients are discharged from the hospital, they are provided with lengthy medical documents that contain intricate terminologies. Studies have shown that if people do not understand the content of their health documents, they will neither look for new information regarding their illness nor will they take actions to prevent or recover from their health issue. In this article, we highlight the need for generating personalized hospital-stay summaries and several research challenges associated with this task. The proposed directions are directly informed by our ongoing work in generating concise and comprehensible hospitalization summaries that are tailored to suit the patient's understanding of medical terminologies and level of engagement in improving their own health. Our preliminary evaluation shows that our summaries effectively present required medical concepts.","PeriodicalId":101413,"journal":{"name":"Journal of healthcare informatics research","volume":"3 1","pages":"107-123"},"PeriodicalIF":3.7,"publicationDate":"2018-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982693/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145093425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Individual Mobility and Uncertain Geographic Context: Real-time Versus Neighborhood Approximated Exposure to Retail Tobacco Outlets Across the US. 个人流动性和不确定的地理环境：美国各地烟草零售店的实时暴露与邻近地区的近似暴露。

IF 5.4 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of healthcare informatics research

Pub Date : 2018-10-10 eCollection Date: 2019-03-01 DOI: 10.1007/s41666-018-0035-8

Thomas R Kirchner, Hong Gao, Daniel J Lewis, Andrew Anesetti-Rothermel, Heather A Carlos, Brian House

There is growing interest in the way exposure to neighborhood risk and protective factors affects the health of residents. Although multiple approaches have been reported, empirical methods for contrasting the spatial uncertainty of exposure estimates are not well established. The objective of this paper was to contrast real-time versus neighborhood approximated exposure to the landscape of tobacco outlets across the contiguous US. A nationwide density surface of tobacco retail outlet locations was generated using kernel density estimation (KDE). This surface was linked to participants' (N _p = 363) inferred residential location, as well as to their real-time geographic locations, recorded every 10 min over 180 days. Real-time exposure was estimated as the hourly product of radius of gyration and average tobacco outlet density (N _hour = 304, 164 h). Ordinal logit modeling was used to assess the distribution of real-time exposure estimates as a function of each participant's residential exposure. Overall, 61.3% of real-time, hourly exposures were of relatively low intensity, and after controlling for temporal and seasonal variation, 72.8% of the variance among these low-level exposures was accounted for by residence in one of the two lowest residential exposure quintiles. Most moderate to high intensity exposures (38.7% of all real-time, hourly exposures) were no more likely to have been contributed by subjects from any single residential exposure cluster than another. Altogether, 55.2% of the variance in real-time exposures was not explained by participants' residential exposure cluster. Calculating hourly exposure estimates made it possible to directly contrast real-time observations with static residential exposure estimates. Results document the substantial degree that real-time exposures can be misclassified by residential approximations, especially in residential areas characterized by moderate to high retail density levels.

人们越来越关注暴露于邻里风险和保护因素对居民健康的影响。虽然已有多种方法的报道，但对比暴露估计值空间不确定性的经验方法还不成熟。本文旨在对比美国毗邻地区烟草零售点景观的实时暴露与邻近地区近似暴露。利用核密度估计（KDE）生成了烟草零售点位置的全国密度面。该表面与参与者（N p = 363）推断的居住地以及他们的实时地理位置（在 180 天内每 10 分钟记录一次）相关联。实时暴露量根据每小时回旋半径与平均烟草销售点密度的乘积估算（N 小时 = 304，164 小时）。采用正态对数模型来评估实时暴露估计值与每位参与者居住地暴露量的分布关系。总体而言，61.3%的每小时实时暴露强度相对较低，在控制了时间和季节变化后，72.8%的低水平暴露差异是由居住在两个最低居住暴露五分位数之一的居民所造成的。大多数中高强度的暴露（占所有实时、每小时暴露的 38.7%）并不比任何一个住宅暴露群组的受试者更有可能造成这些暴露。总之，55.2% 的实时暴露变异不是由参与者的居住暴露群组解释的。通过计算每小时的暴露估计值，可以直接将实时观测结果与静态居民暴露估计值进行对比。结果表明，实时暴露量在很大程度上会被住宅近似值误分类，尤其是在零售密度处于中高水平的住宅区。

{"title":"Individual Mobility and Uncertain Geographic Context: Real-time Versus Neighborhood Approximated Exposure to Retail Tobacco Outlets Across the US.","authors":"Thomas R Kirchner, Hong Gao, Daniel J Lewis, Andrew Anesetti-Rothermel, Heather A Carlos, Brian House","doi":"10.1007/s41666-018-0035-8","DOIUrl":"https://doi.org/10.1007/s41666-018-0035-8","url":null,"abstract":"There is growing interest in the way exposure to neighborhood risk and protective factors affects the health of residents. Although multiple approaches have been reported, empirical methods for contrasting the spatial uncertainty of exposure estimates are not well established. The objective of this paper was to contrast real-time versus neighborhood approximated exposure to the landscape of tobacco outlets across the contiguous US. A nationwide density surface of tobacco retail outlet locations was generated using kernel density estimation (KDE). This surface was linked to participants' (N p = 363) inferred residential location, as well as to their real-time geographic locations, recorded every 10 min over 180 days. Real-time exposure was estimated as the hourly product of radius of gyration and average tobacco outlet density (N hour = 304, 164 h). Ordinal logit modeling was used to assess the distribution of real-time exposure estimates as a function of each participant's residential exposure. Overall, 61.3% of real-time, hourly exposures were of relatively low intensity, and after controlling for temporal and seasonal variation, 72.8% of the variance among these low-level exposures was accounted for by residence in one of the two lowest residential exposure quintiles. Most moderate to high intensity exposures (38.7% of all real-time, hourly exposures) were no more likely to have been contributed by subjects from any single residential exposure cluster than another. Altogether, 55.2% of the variance in real-time exposures was not explained by participants' residential exposure cluster. Calculating hourly exposure estimates made it possible to directly contrast real-time observations with static residential exposure estimates. Results document the substantial degree that real-time exposures can be misclassified by residential approximations, especially in residential areas characterized by moderate to high retail density levels.","PeriodicalId":101413,"journal":{"name":"Journal of healthcare informatics research","volume":"3 1","pages":"70-85"},"PeriodicalIF":5.4,"publicationDate":"2018-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982724/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142057788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Bi-submodular Optimization (BSMO) for Detecting Drug-Drug Interactions (DDIs) from On-line Health Forums. 从在线健康论坛检测药物相互作用（DDI）的双次模块优化（BSMO）。

IF 5.4 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of healthcare informatics research

Pub Date : 2018-08-30 eCollection Date: 2019-03-01 DOI: 10.1007/s41666-018-0032-y

Yan Hu, Rui Wang, Feng Chen

Online health discussion forums as information exchange repository are used by different patient groups for sharing experience and seeking advice. Their accessibility is tremendously expanded in the last decade with the rapid growth of mobile internet. Among many popular topics, "drug-drug interactions" (DDIs) forum embeds a large number of DDIs hazards patient experienced however not published. In this paper, we intend to uncover the potential DDIs from the online forums and formulate the task as a sub-graph detection problem, such that co-mentioned drugs and symptoms are modeled as vertices, along with the occurrences are modeled as weighted edges. Therefore, a connected sub-graph consisting of both symptoms and drug vertices reveals DDIs occurrence. We then propose a novel bi-submodular function to characterize the likelihood of DDI occurrence within a connected sub-graph and apply an approximated algorithm to resolve the bi-submodular optimization (BSMO). The complexity of the algorithm is nearly linear. Our extensive experiments demonstrate the effectiveness and efficiency of the proposed approach.

在线健康论坛作为信息交流库，被不同的患者群体用来分享经验和寻求建议。近十年来，随着移动互联网的迅猛发展，这些论坛的可访问性大大增加。在众多热门话题中，"药物相互作用"（DDIs）论坛包含了大量患者经历过但未公布的 DDIs 危害。在本文中，我们打算从在线论坛中发现潜在的 DDIs，并将这一任务表述为一个子图检测问题，即共同提及的药物和症状被建模为顶点，同时出现的情况被建模为加权边。因此，由症状和药物顶点组成的连通子图揭示了 DDIs 的发生。然后，我们提出了一种新的双子模块化函数来描述连通子图中出现 DDI 的可能性，并应用近似算法来解决双子模块化优化问题（BSMO）。该算法的复杂度接近线性。我们的大量实验证明了所提方法的有效性和效率。

{"title":"Bi-submodular Optimization (BSMO) for Detecting Drug-Drug Interactions (DDIs) from On-line Health Forums.","authors":"Yan Hu, Rui Wang, Feng Chen","doi":"10.1007/s41666-018-0032-y","DOIUrl":"https://doi.org/10.1007/s41666-018-0032-y","url":null,"abstract":"Online health discussion forums as information exchange repository are used by different patient groups for sharing experience and seeking advice. Their accessibility is tremendously expanded in the last decade with the rapid growth of mobile internet. Among many popular topics, \"drug-drug interactions\" (DDIs) forum embeds a large number of DDIs hazards patient experienced however not published. In this paper, we intend to uncover the potential DDIs from the online forums and formulate the task as a sub-graph detection problem, such that co-mentioned drugs and symptoms are modeled as vertices, along with the occurrences are modeled as weighted edges. Therefore, a connected sub-graph consisting of both symptoms and drug vertices reveals DDIs occurrence. We then propose a novel bi-submodular function to characterize the likelihood of DDI occurrence within a connected sub-graph and apply an approximated algorithm to resolve the bi-submodular optimization (BSMO). The complexity of the algorithm is nearly linear. Our extensive experiments demonstrate the effectiveness and efficiency of the proposed approach.","PeriodicalId":101413,"journal":{"name":"Journal of healthcare informatics research","volume":"3 1","pages":"19-42"},"PeriodicalIF":5.4,"publicationDate":"2018-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982730/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142057787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The State of Data in Healthcare: Path Towards Standardization. 医疗保健数据现状：走向标准化之路。

IF 5.4 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of healthcare informatics research

Pub Date : 2018-05-22 eCollection Date: 2018-09-01 DOI: 10.1007/s41666-018-0019-8

Keith Feldman, Reid A Johnson, Nitesh V Chawla

Coupled with the rise of data science and machine learning, the increasing availability of digitized health and wellness data has provided an exciting opportunity for complex analyses of problems throughout the healthcare domain. Whereas many early works focused on a particular aspect of patient care, often drawing on data from a specific clinical or administrative source, it has become clear such a single-source approach is insufficient to capture the complexity of the human condition. Instead, adequately modeling health and wellness problems requires the ability to draw upon data spanning multiple facets of an individual's biology, their care, and the social aspects of their life. Although such an awareness has greatly expanded the breadth of health and wellness data collected, the diverse array of data sources and intended uses often leave researchers and practitioners with a scattered and fragmented view of any particular patient. As a result, there exists a clear need to catalogue and organize the range of healthcare data available for analysis. This work represents an effort at developing such an organization, presenting a patient-centric framework deemed the Healthcare Data Spectrum (HDS). Comprised of six layers, the HDS begins with the innermost micro-level omics and macro-level demographic data that directly characterize a patient, and extends at its outermost to aggregate population-level data derived from attributes of care for each individual patient. For each level of the HDS, this manuscript will examine the specific types of constituent data, provide examples of how the data aid in a broad set of research problems, and identify the primary terminology and standards used to describe the data.

随着数据科学和机器学习的兴起，越来越多的数字化健康和保健数据为对整个医疗保健领域的问题进行复杂分析提供了令人兴奋的机会。早期的许多研究都侧重于患者护理的某个方面，通常利用特定临床或行政来源的数据，但这种单一来源的方法显然不足以捕捉人类状况的复杂性。相反，要对健康和保健问题进行充分建模，就必须能够利用涵盖个人生物学、护理和社会生活等多个方面的数据。虽然这种意识极大地扩展了健康和保健数据收集的广度，但数据来源和预期用途的多样性往往使研究人员和从业人员对任何特定病人的了解都是分散和零碎的。因此，显然有必要对可用于分析的各种医疗保健数据进行编目和组织。这项工作体现了开发此类组织的努力，提出了一个以患者为中心的框架，即医疗保健数据频谱（HDS）。HDS 由六个层次组成，从直接描述患者特征的最内层微观层面的全息数据和宏观层面的人口统计数据开始，最外层扩展到从每个患者的护理属性中得出的总体人口层面的数据。对于 HDS 的每个层次，本手稿将研究组成数据的具体类型，举例说明数据如何帮助解决一系列广泛的研究问题，并确定用于描述数据的主要术语和标准。

{"title":"The State of Data in Healthcare: Path Towards Standardization.","authors":"Keith Feldman, Reid A Johnson, Nitesh V Chawla","doi":"10.1007/s41666-018-0019-8","DOIUrl":"https://doi.org/10.1007/s41666-018-0019-8","url":null,"abstract":"Coupled with the rise of data science and machine learning, the increasing availability of digitized health and wellness data has provided an exciting opportunity for complex analyses of problems throughout the healthcare domain. Whereas many early works focused on a particular aspect of patient care, often drawing on data from a specific clinical or administrative source, it has become clear such a single-source approach is insufficient to capture the complexity of the human condition. Instead, adequately modeling health and wellness problems requires the ability to draw upon data spanning multiple facets of an individual's biology, their care, and the social aspects of their life. Although such an awareness has greatly expanded the breadth of health and wellness data collected, the diverse array of data sources and intended uses often leave researchers and practitioners with a scattered and fragmented view of any particular patient. As a result, there exists a clear need to catalogue and organize the range of healthcare data available for analysis. This work represents an effort at developing such an organization, presenting a patient-centric framework deemed the Healthcare Data Spectrum (HDS). Comprised of six layers, the HDS begins with the innermost micro-level omics and macro-level demographic data that directly characterize a patient, and extends at its outermost to aggregate population-level data derived from attributes of care for each individual patient. For each level of the HDS, this manuscript will examine the specific types of constituent data, provide examples of how the data aid in a broad set of research problems, and identify the primary terminology and standards used to describe the data.","PeriodicalId":101413,"journal":{"name":"Journal of healthcare informatics research","volume":"2 3","pages":"248-271"},"PeriodicalIF":5.4,"publicationDate":"2018-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982788/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142057786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

EpiK: A Knowledge Base for Epidemiological Modeling and Analytics of Infectious Diseases. EpiK：传染病流行病学建模和分析知识库。

IF 5.4 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of healthcare informatics research

Pub Date : 2017-11-06 eCollection Date: 2017-12-01 DOI: 10.1007/s41666-017-0010-9

S M Shamimul Hasan, Edward A Fox, Keith Bisset, Madhav V Marathe

Computational epidemiology seeks to develop computational methods to study the distribution and determinants of health-related states or events (including disease), and the application of this study to the control of diseases and other health problems. Recent advances in computing and data sciences have led to the development of innovative modeling environments to support this important goal. The datasets used to drive the dynamic models as well as the data produced by these models presents unique challenges owing to their size, heterogeneity and diversity. These datasets form the basis of effective and easy to use decision support and analytical environments. As a result, it is important to develop scalable data management systems to store, manage and integrate these datasets. In this paper, we develop EpiK-a knowledge base that facilitates the development of decision support and analytical environments to support epidemic science. An important goal is to develop a framework that links the input as well as output datasets to facilitate effective spatio-temporal and social reasoning that is critical in planning and intervention analysis before and during an epidemic. The data management framework links modeling workflow data and its metadata using a controlled vocabulary. The metadata captures information about storage, the mapping between the linked model and the physical layout, and relationships to support services. EpiK is designed to support agent-based modeling and analytics frameworks-aggregate models can be seen as special cases and are thus supported. We use semantic web technologies to create a representation of the datasets that encapsulates both the location and the schema heterogeneity. The choice of RDF as a representation language is motivated by the diversity and growth of the datasets that need to be integrated. A query bank is developed-the queries capture a broad range of questions that can be posed and answered during a typical case study pertaining to disease outbreaks. The queries are constructed using SPARQL Protocol and RDF Query Language (SPARQL) over the EpiK. EpiK can hide schema and location heterogeneity while efficiently supporting queries that span the computational epidemiology modeling pipeline: from model construction to simulation output. We show that the performance of benchmark queries varies significantly with respect to the choice of hardware underlying the database and resource description framework (RDF) engine.

计算流行病学旨在发展计算方法来研究与健康有关的状态或事件（包括疾病）的分布和决定因素，并将这项研究应用于疾病和其他健康问题的控制。计算和数据科学的最新进展导致了创新建模环境的发展，以支持这一重要目标。用于驱动动态模型的数据集以及由这些模型产生的数据由于其规模、异质性和多样性而呈现出独特的挑战。这些数据集构成了有效且易于使用的决策支持和分析环境的基础。因此，开发可扩展的数据管理系统来存储、管理和集成这些数据集非常重要。在本文中，我们开发了epik -一个知识库，促进决策支持和分析环境的发展，以支持流行病科学。一个重要目标是制定一个框架，将输入和输出数据集联系起来，以促进有效的时空和社会推理，这对于流行病之前和期间的规划和干预分析至关重要。数据管理框架使用受控词汇表链接建模工作流数据及其元数据。元数据捕获有关存储、链接模型与物理布局之间的映射以及支持服务的关系的信息。EpiK旨在支持基于代理的建模和分析框架——聚合模型可以被视为特殊情况，因此得到了支持。我们使用语义web技术来创建数据集的表示，它封装了位置和模式的异构性。选择RDF作为表示语言的动机是需要集成的数据集的多样性和增长。开发了一个查询库——这些查询捕获了一系列广泛的问题，这些问题可以在与疾病爆发有关的典型案例研究中提出和回答。查询是在EpiK上使用SPARQL协议和RDF查询语言（SPARQL）构建的。EpiK可以隐藏模式和位置异质性，同时有效地支持跨越计算流行病学建模管道的查询：从模型构建到仿真输出。我们展示了基准查询的性能随着数据库和资源描述框架（RDF）引擎底层硬件的选择而显著不同。

{"title":"EpiK: A Knowledge Base for Epidemiological Modeling and Analytics of Infectious Diseases.","authors":"S M Shamimul Hasan, Edward A Fox, Keith Bisset, Madhav V Marathe","doi":"10.1007/s41666-017-0010-9","DOIUrl":"https://doi.org/10.1007/s41666-017-0010-9","url":null,"abstract":"Computational epidemiology seeks to develop computational methods to study the distribution and determinants of health-related states or events (including disease), and the application of this study to the control of diseases and other health problems. Recent advances in computing and data sciences have led to the development of innovative modeling environments to support this important goal. The datasets used to drive the dynamic models as well as the data produced by these models presents unique challenges owing to their size, heterogeneity and diversity. These datasets form the basis of effective and easy to use decision support and analytical environments. As a result, it is important to develop scalable data management systems to store, manage and integrate these datasets. In this paper, we develop EpiK-a knowledge base that facilitates the development of decision support and analytical environments to support epidemic science. An important goal is to develop a framework that links the input as well as output datasets to facilitate effective spatio-temporal and social reasoning that is critical in planning and intervention analysis before and during an epidemic. The data management framework links modeling workflow data and its metadata using a controlled vocabulary. The metadata captures information about storage, the mapping between the linked model and the physical layout, and relationships to support services. EpiK is designed to support agent-based modeling and analytics frameworks-aggregate models can be seen as special cases and are thus supported. We use semantic web technologies to create a representation of the datasets that encapsulates both the location and the schema heterogeneity. The choice of RDF as a representation language is motivated by the diversity and growth of the datasets that need to be integrated. A query bank is developed-the queries capture a broad range of questions that can be posed and answered during a typical case study pertaining to disease outbreaks. The queries are constructed using SPARQL Protocol and RDF Query Language (SPARQL) over the EpiK. EpiK can hide schema and location heterogeneity while efficiently supporting queries that span the computational epidemiology modeling pipeline: from model construction to simulation output. We show that the performance of benchmark queries varies significantly with respect to the choice of hardware underlying the database and resource description framework (RDF) engine.","PeriodicalId":101413,"journal":{"name":"Journal of healthcare informatics research","volume":"1 2","pages":"260-303"},"PeriodicalIF":5.4,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982844/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143417135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0