Anderson F. B. F. da Costa, Larissa Moreira, D. Andrade, Adriano Veloso, N. Ziviani
Modeling from data usually has two distinct facets: building sound explanatory models or creating powerful predictive models for a system or phenomenon. Most of the recent literature does not exploit the relationship between explanation and prediction when learning models from data. Recent algorithms do not take advantage of the fact that many phenomena are actually defined by diverse sub-populations and local structures, so there are many possible predictive models providing contrasting interpretations or competing explanations for the same phenomenon. In this article, we propose to explore a complementary link between explanation and prediction. Our main intuition is that models whose decisions are explained by the same factors are likely to make better predictions for data points within the same local structures. We evaluate our methodology by modeling the evolution of pain relief in patients suffering from chronic pain under usual guideline-based treatment. The ensembles generated using our framework are compared with all-in-one approaches using algorithms robust to high-dimensional data, such as Random Forests and XGBoost. Chronic pain can be primary or secondary to diseases. Its symptomatology can be classified as nociceptive, nociplastic, or neuropathic, and is generally associated with many different causal structures, challenging typical modeling methodologies. Our data include 631 patients receiving pain treatment. We considered 338 features providing information about pain sensation, socioeconomic status, and prescribed treatments. Our goal is to predict, using data from the first consultation only, whether the patient's treatment for chronic pain relief will be successful. As a result of this work, we built ensembles that consistently improve performance by up to 33% compared to models trained using all the available features. We also obtained relevant gains in interpretability, with the resulting ensembles using only 15% of the total number of features. We show that we can effectively generate ensembles from competing explanations, promoting diversity in ensemble learning and leading to significant gains in accuracy by enforcing a stable scenario in which models that are dissimilar in their predictions are also dissimilar in their explanation factors.
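The grouping intuition can be sketched in a few lines. The sketch below is purely illustrative: it uses per-sample contributions x·w from a global linear fit as a cheap stand-in for real attribution methods (e.g., SHAP values), and the data, group-size cutoff, and choice of local model are all hypothetical.

```python
import numpy as np

# Illustrative sketch only: group training points by the factor that
# "explains" them, then fit one local model per group. Real attributions
# would replace the x * w proxy used here.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
# two latent sub-populations governed by different factors
y = np.where(X[:, 0] > 0, X[:, 1], X[:, 2]) + 0.1 * rng.normal(size=200)

w_global, *_ = np.linalg.lstsq(X, y, rcond=None)
dominant = np.abs(X * w_global).argmax(axis=1)   # dominant explaining factor

local_models = {}
for g in np.unique(dominant):
    idx = dominant == g
    if idx.sum() >= 10:                           # hypothetical size cutoff
        local_models[g], *_ = np.linalg.lstsq(X[idx], y[idx], rcond=None)
```

At inference, each point would be routed to the local model of its own dominant factor, yielding an ensemble whose members differ both in their predictions and in their explanations.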
Predicting the Evolution of Pain Relief. ACM Transactions on Computing for Healthcare, vol. 2, no. 1, pp. 1–28, 2021-09-14. DOI: 10.1145/3466781.
We develop a model of infection spread that takes into account the existence of a vulnerable group as well as the variability of individuals' social relations. We develop a compartmentalized power-law model, with power-law connections between the vulnerable and the general population, treating these connections, as well as the connections among the vulnerable, as parameters that we vary in our tests. We use the model to study a number of vaccination strategies under two hypotheses: first, we assume a limited availability of vaccine but an infinite vaccination capacity, so that all the available doses can be administered in a short time (negligible with respect to the evolution of the epidemic); second, we assume a limited vaccination capacity, so that the doses are administered over a period that is not negligible with respect to the evolution of the epidemic. We develop optimal strategies for the various social parameters, where a strategy consists of (1) the fraction of vaccine that is administered to the vulnerable population and (2) the criterion used to administer it to the general population. In the case of a limited vaccination capacity, the fraction (1) is a function of time, and we study how to optimize it to obtain a maximal reduction in the number of fatalities.
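As a toy illustration of the first hypothesis (all doses administered at once), the sketch below couples a vulnerable group to the general population in a discrete-time two-group SIR model. The contact matrix, fatality ratios, and dose counts are invented for illustration and do not reproduce the paper's power-law connectivity.

```python
import numpy as np

# Toy two-group SIR sketch (general, vulnerable); NOT the paper's
# power-law model. All doses are given at t = 0, split by
# `frac_to_vulnerable` (the infinite-vaccination-capacity case).
def simulate(frac_to_vulnerable, doses=2000.0, days=300):
    N = np.array([9000.0, 1000.0])            # group sizes (assumed)
    ifr = np.array([0.001, 0.05])             # fatality ratios (assumed)
    beta = np.array([[0.25, 0.05],            # transmission matrix (assumed)
                     [0.05, 0.10]])
    gamma = 0.1                               # recovery rate
    I = np.array([10.0, 1.0])                 # initial infections
    v = np.array([1.0 - frac_to_vulnerable, frac_to_vulnerable]) * doses
    S = np.maximum(N - I - v, 0.0)            # vaccinated go straight to "removed"
    deaths = 0.0
    for _ in range(days):
        force = beta @ (I / N.sum())          # per-capita infection pressure
        new_inf = np.minimum(force * S, S)
        recovered = gamma * I
        S -= new_inf
        I += new_inf - recovered
        deaths += float((ifr * recovered).sum())
    return deaths
```

With these illustrative numbers, directing the doses to the vulnerable group lowers the fatality count, the qualitative trade-off the strategies above optimise.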
Optimal COVID-19 Vaccination Strategies with Limited Vaccine and Delivery Capabilities. S. Santini. ACM Transactions on Computing for Healthcare, vol. 2, no. 1, pp. 1–16, 2021-09-14. DOI: 10.1145/3466622.
M. R. Ali, Taylan K. Sen, Qianyi Li, Raina Langevin, Taylor Myers, E. Dorsey, Saloni Sharma, E. Hoque
We developed an intelligent web interface that guides users to perform several Parkinson’s disease (PD) motion assessment tests in front of their webcam. After gathering data from 329 participants (N = 199 with PD, N = 130 without PD), we developed a methodology for measuring head motion randomness based on the frequency distribution of the motion. We found PD is associated with significantly higher randomness in side-to-side head motion as measured by the variance and number of large frequency components compared to the age-matched non-PD control group (p = 0.001, d = 0.13). Additionally, in participants taking levodopa (N = 151), the most common drug to treat Parkinson’s, the degree of random side-to-side head motion was found to follow an exponential-decay activity model following the time of the last dose taken (r = −0.404, p = 6e-5). A logistic regression model for classifying PD vs. non-PD groups identified that higher frequency components are more associated with PD. Our findings could potentially be useful toward objectively quantifying differences in head motions that may be due to either PD or PD medications.
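The frequency-based randomness measure can be sketched as follows; the sampling rate, power threshold, and the use of a normalised power spectrum are assumptions for illustration, not the paper's exact procedure.

```python
import numpy as np

# Sketch: summarise a side-to-side head-angle series by its variance and
# by the number of "large" components in its normalised power spectrum.
def head_motion_randomness(yaw, power_thresh=0.01):
    yaw = np.asarray(yaw, dtype=float)
    yaw = yaw - yaw.mean()                      # remove constant head pose
    power = np.abs(np.fft.rfft(yaw)) ** 2
    power = power / (power.sum() + 1e-12)       # normalised power distribution
    return float(np.var(yaw)), int((power > power_thresh).sum())

t = np.arange(300) / 30.0                       # 10 s at an assumed 30 fps
_, n_regular = head_motion_randomness(np.sin(2 * np.pi * t))   # smooth 1 Hz sway
_, n_random = head_motion_randomness(np.random.default_rng(0).normal(size=300))
```

A smooth oscillation concentrates its power in one spectral bin, while random motion spreads it over many, so `n_random` exceeds `n_regular`, which is the direction of the group difference reported above.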
Analyzing Head Pose in Remotely Collected Videos of People with Parkinson's Disease. ACM Transactions on Computing for Healthcare, vol. 2, no. 1, pp. 1–13, 2021-09-14. DOI: 10.1145/3459669.
Maura Bellio, D. Furniss, N. Oxtoby, Sara Garbarino, Nicholas C. Firth, A. Ribbens, D. Alexander, A. Blandford
Clinical decision-support tools (DSTs) represent a valuable resource in healthcare. However, a lack of human-factors considerations and early design research has often limited their successful adoption. To complement previous, technically focused work, we studied adoption opportunities for a future DST built on a predictive model of Alzheimer's Disease (AD) progression. Our aim is two-fold: exploring adoption opportunities for DSTs in AD clinical care, and testing a novel combination of methods to support this process. We focused on understanding current clinical needs and practices, and the potential for such a tool to be integrated into the setting, prior to its development. Our user-centred approach was based on field observations and semi-structured interviews, analysed through workflow analysis, user profiles, and a design-reality gap model. The first two are common practice, whilst the latter provided added value in highlighting specific adoption needs. We identified the likely early adopters of the tool as psychiatrists and neurologists based in research-oriented clinical settings. We defined ten key requirements for the translation and adoption of DSTs for AD, covering IT, user, and contextual factors. Future work can build on these requirements to give such tools a greater chance of being adopted in the clinical setting.
Opportunities and Barriers for Adoption of a Decision-Support Tool for Alzheimer's Disease. ACM Transactions on Computing for Healthcare, vol. 2, no. 1, pp. 1–19, 2021-09-14. DOI: 10.1145/3462764.
Zhaohong Sun, Wei Dong, Jinlong Shi, K. He, Zhengxing Huang
Survival analysis has profound implications for health service management. Traditional approaches to survival analysis presuppose a particular time-to-event probability distribution and seldom consider patients' sequential visits to medical facilities. Although recent studies leverage deep learning techniques to capture non-linear features and long-term dependencies across multiple visits, their lack of interpretability prevents deep learning models from being applied in clinical practice. To address this challenge, this article proposes a novel attention-based deep recurrent model, named AttenSurv, for clinical survival analysis. Specifically, a global attention mechanism is proposed to extract critical risk factors, improving interpretability. Thereafter, a Bi-directional Long Short-Term Memory network is employed to capture long-term dependencies across a series of patient visits. To further improve both the prediction performance and the interpretability of the proposed model, we propose another model, named GNNAttenSurv, which incorporates a graph neural network into AttenSurv to extract latent correlations between risk factors. We validated our solution on three public follow-up datasets and two electronic health record datasets. The results demonstrate that our proposed models yield consistent improvements over state-of-the-art survival analysis baselines.
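The global-attention pooling at the heart of such models can be sketched in plain numpy; the context vector and dimensions below are placeholders (in the real model these would come from trained parameters and BiLSTM hidden states).

```python
import numpy as np

# Sketch: score each visit embedding against a context vector, softmax
# the scores, and pool the visits into one patient representation. The
# weights indicate which visits the prediction attends to.
def attend(visits, context):
    scores = visits @ context                    # (T,) relevance per visit
    weights = np.exp(scores - scores.max())      # numerically stable softmax
    weights = weights / weights.sum()
    return weights @ visits, weights

rng = np.random.default_rng(0)
visits = rng.normal(size=(6, 8))                 # 6 visits, 8-dim embeddings
context = rng.normal(size=8)                     # stands in for a learned vector
pooled, weights = attend(visits, context)
```

Inspecting `weights` is what gives attention models their interpretability: a clinician can see which visits drove the risk estimate.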
Attention-Based Deep Recurrent Model for Survival Prediction. ACM Transactions on Computing for Healthcare, vol. 2, no. 1, pp. 1–18, 2021-09-14. DOI: 10.1145/3466782.
Pratyay Banerjee, Kuntal Kumar Pal, M. Devarakonda, Chitta Baral
In this work, we formulated the named entity recognition (NER) task as a multi-answer knowledge guided question-answer task (KGQA) and showed that the knowledge guidance helps to achieve state-of-the-art results for 11 of 18 biomedical NER datasets. We prepended five different knowledge contexts—entity types, questions, definitions, and examples—to the input text and trained and tested BERT-based neural models on such input sequences from a combined dataset of the 18 different datasets. This novel formulation of the task (a) improved named entity recognition and illustrated the impact of different knowledge contexts, (b) reduced system confusion by limiting prediction to a single entity-class for each input token (i.e., B, I, O only) compared to multiple entity-classes in traditional NER (i.e., Bentity1, Bentity2, Ientity1, I, O), (c) made detection of nested entities easier, and (d) enabled the models to jointly learn NER-specific features from a large number of datasets. We performed extensive experiments of this KGQA formulation on the biomedical datasets, and through the experiments, we showed when knowledge improved named entity recognition. We analyzed the effect of the task formulation, the impact of the different knowledge contexts, the multi-task aspect of the generic format, and the generalization ability of KGQA. We also probed the model to better understand the key contributors for these improvements.
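The reformulation can be illustrated with a toy input builder; the template and special tokens below follow the usual BERT convention and are not necessarily the paper's exact prompts.

```python
# Toy sketch of the KGQA input format: a knowledge context (entity type,
# question, definition, or examples) is prepended to the text, and the
# model then tags tokens with plain B/I/O for that single entity class.
def build_kgqa_input(knowledge_context, text):
    return f"[CLS] {knowledge_context} [SEP] {text} [SEP]"

query = build_kgqa_input(
    "What are the disease mentions? Disease: a disorder of structure or function.",
    "Patients with type 2 diabetes received metformin.",
)
```

Because each query targets a single entity class, the label set collapses from Bentity1, Bentity2, … to just B, I, O, which is the confusion-reduction point (b) above.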
Biomedical Named Entity Recognition via Knowledge Guidance and Question Answering. ACM Transactions on Computing for Healthcare, pp. 1–24, 2021-07-18. DOI: 10.1145/3465221.
Seyma Kucukozer-Cavdar, T. Taşkaya-Temizel, Abhinav Mehrotra, Mirco Musolesi, P. Tiňo
Understanding the circumstances in which office workers take rest breaks is important for delivering effective mobile notifications and for making inferences about their daily lifestyle, e.g., whether they are active and/or sedentary. Previous studies designed for office workers show the effectiveness of rest breaks in preventing work-related conditions. In this article, we propose a hybrid personalised model, combining a kernel density estimation model and a generalised linear mixed model, to model office workers' available moments for rest breaks during working hours. We adopt the experience-based sampling method, collecting office workers' responses regarding their availability through a mobile application, with contextual information extracted by means of the phone's sensors. The experiment lasted 10 workdays and involved 19 office workers, yielding a total of 528 responses. Our results show that time, location, ringer mode, and activity are effective features for predicting office workers' availability. Our method can address sparse-sample issues when building individual predictive behavioural models from limited and unbalanced data. In particular, the proposed method can be considered a potential solution to the "cold-start problem," i.e., the negative impact of the lack of individual data when a new application is installed.
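The kernel-density half of the hybrid can be sketched directly; the Gaussian kernel, bandwidth, and hour-of-day feature are illustrative choices (the GLMM half, which pools across workers to cope with sparse individual data, is omitted).

```python
import numpy as np

# Sketch: estimate one worker's availability density over the hour of
# day from the times of their "available" responses, then score a
# candidate notification time by that density.
def kde_availability(available_hours, bandwidth=1.0):
    hours = np.asarray(available_hours, dtype=float)
    def density(t):
        z = (t - hours) / bandwidth              # Gaussian kernel per response
        return float(np.mean(np.exp(-0.5 * z ** 2)) /
                     (bandwidth * np.sqrt(2.0 * np.pi)))
    return density

density = kde_availability([10.0, 10.5, 11.0, 15.0, 15.25])  # toy responses
```

With these toy responses, a mid-morning time such as 10.5 scores higher than 13.0, so a notification scheduler would prefer the former.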
Designing Robust Models for Behaviour Prediction Using Sparse Data from Mobile Sensing. ACM Transactions on Computing for Healthcare, pp. 1–33, 2021-07-18. DOI: 10.1145/3458753.
Konstantinos Malavazos, M. Papadogiorgaki, Pavlos Malakonakis, I. Papaefstathiou
An emerging trend in biomedical systems research is the development of models that take full advantage of increasingly available computational power to manage and analyze new biological data and to model complex biological processes. Such biomedical models require significant computational resources, since they process and analyze large amounts of data, such as medical image sequences. We present a family of advanced computational models for predicting the spatio-temporal evolution of glioma, together with their novel implementation on state-of-the-art FPGA devices. Glioma is a rapidly evolving type of brain cancer, well known for its aggressive and diffusive behavior. The developed system simulates glioma tumor growth in brain tissue, which consists of different anatomic structures, by utilizing MRI slices. The presented models have proved highly accurate in predicting tumor growth, while the innovative hardware system, when implemented on a low-end, low-cost FPGA, is up to 85% faster than a high-end server with 20 physical cores (and 40 virtual ones) and more than 28× more energy-efficient than that server; the energy efficiency grows to up to 50× and the speedup to up to 14× when the presented designs are implemented on a high-end FPGA. Moreover, the proposed reconfigurable system, when implemented on a large FPGA, is significantly faster than a high-end GPU (from 80% up to 250% faster) for the majority of the models, while also being significantly better (from 80% to over 1,600%) in terms of power efficiency for all the implemented models.
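The abstract does not spell out the model equations, but a standard choice for simulated glioma growth is Fisher-Kolmogorov reaction-diffusion, ∂u/∂t = D∇²u + ρu(1−u), with diffusivity D varying by anatomic structure. The explicit-Euler sketch below is an assumed software baseline of the kind such FPGA designs accelerate, with all parameters invented for illustration.

```python
import numpy as np

# Explicit-Euler Fisher-Kolmogorov sketch: u is normalised tumor cell
# density on a 2D grid, D a per-voxel diffusivity map (tissue-dependent),
# rho the proliferation rate. Parameters are illustrative only.
def grow(u, D, rho=0.05, dt=0.1, steps=100):
    for _ in range(steps):
        lap = (np.roll(u, 1, 0) + np.roll(u, -1, 0) +
               np.roll(u, 1, 1) + np.roll(u, -1, 1) - 4.0 * u)
        u = u + dt * (D * lap + rho * u * (1.0 - u))
    return np.clip(u, 0.0, 1.0)

u0 = np.zeros((64, 64))
u0[32, 32] = 0.5                      # seed lesion
D = np.full((64, 64), 0.2)            # uniform diffusivity stand-in
u_final = grow(u0, D)
```

On real data, D would be derived from segmented MRI slices (e.g., different values for white and grey matter), and the per-voxel stencil updates are exactly the regular, memory-bound workload that FPGA pipelines exploit.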
Novel Reconfigurable Hardware Systems for Tumor Growth Prediction. ACM Transactions on Computing for Healthcare, vol. 2, no. 1, pp. 1–27, 2021-07-18. DOI: 10.1145/3454126.
{"title":"Triage of 2D Mammographic Images Using Multi-view Multi-task Convolutional Neural Networks","authors":"T. Kyono, F. Gilbert, M. Schaar","doi":"10.1145/3453166","DOIUrl":"https://doi.org/10.1145/3453166","url":null,"abstract":"With an aging and growing population, the number of women receiving mammograms is increasing. However, existing techniques for autonomous diagnosis do not surpass a well-trained radiologist. Therefore, to reduce the number of mammograms that require examination by a radiologist, subject to preserving the diagnostic accuracy observed in current clinical practice, we develop Man and Machine Mammography Oracle (MAMMO)—a clinical decision support system capable of determining whether its predicted diagnoses require further radiologist examination. We first introduce a novel multi-view convolutional neural network (CNN) trained using multi-task learning (MTL) to diagnose mammograms and predict the radiological assessments known to be associated with cancer. MTL improves diagnostic performance and triage efficiency while providing an additional layer of model interpretability. Furthermore, we introduce a novel triage network that takes as input the radiological assessment and diagnostic predictions of the multi-view CNN and determines whether the radiologist or CNN will most likely provide the correct diagnosis. 
Results obtained on a dataset of over 7,000 patients show that MAMMO reduced the number of diagnostic mammograms requiring radiologist reading by 42.8% while improving the overall diagnostic accuracy in comparison to readings done by radiologists alone.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":" ","pages":"1 - 24"},"PeriodicalIF":0.0,"publicationDate":"2021-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3453166","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46394461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
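The triage network described above is a learned model that consumes the multi-view CNN's radiological assessments and diagnostic predictions. As a much simplified illustration of the triage idea only (not MAMMO's actual network), a confidence-threshold rule that defers uncertain cases to the radiologist might look like this; the threshold value is an arbitrary assumption:

```python
import numpy as np

def triage(cnn_probs, threshold=0.9):
    """Toy stand-in for the triage step: accept the CNN's diagnosis only
    when its predicted malignancy probability is far from the 0.5
    decision boundary; defer all other mammograms to a radiologist."""
    confidence = np.maximum(cnn_probs, 1.0 - cnn_probs)
    return confidence >= threshold  # True = machine read, False = defer

probs = np.array([0.02, 0.55, 0.97, 0.40, 0.99])
mask = triage(probs)
print(mask.tolist())  # [True, False, True, False, True]
```

Raising the threshold trades triage efficiency (fewer machine reads) for safety, mirroring the accuracy-versus-workload trade-off the paper evaluates.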
{"title":"A Fast ECG Diagnosis by Using Non-Uniform Spectral Analysis and the Artificial Neural Network","authors":"K. Chen, Po-Chen Chien, Zi-Jie Gao, Chi-Hsun Wu","doi":"10.1145/3453174","DOIUrl":"https://doi.org/10.1145/3453174","url":null,"abstract":"The electrocardiogram (ECG) has proven to be an efficient diagnostic tool for monitoring the electrical activity of the heart and has become a widely used clinical approach to diagnosing heart diseases. In practice, the ECG signal can be decomposed into P, Q, R, S, and T waves. Based on features of these waves, such as the amplitude and the interval between waves, many types of heart disease can be detected with neural network (NN)-based ECG analysis. However, because a large amount of computation is needed to preprocess the raw ECG signal, analyzing the signal in the time domain is time consuming. In addition, the non-linearity of the ECG signal further complicates diagnosis. To solve this problem, we propose a fast ECG diagnosis approach based on spectral analysis and an artificial neural network. In contrast to conventional time-domain approaches, the proposed approach analyzes the ECG signal only in the frequency domain. Because most of the noise in the raw ECG signal is concentrated at high frequencies, it is necessary to extract more features from the low-frequency spectrum and fewer from the high-frequency spectrum. Hence, a non-uniform feature extraction approach is proposed in this article. Since less data preprocessing is required in the frequency domain than in the time domain, the proposed approach reduces both the total diagnosis latency and the computing power consumption of ECG diagnosis. To verify the proposed approach, the well-known MIT-BIH arrhythmia database is used in this work. The experimental results show that the proposed approach can reduce ECG diagnosis latency by 47% to 52% compared with conventional ECG analysis methods at similar diagnostic accuracy. In addition, owing to the reduced data preprocessing, the proposed approach achieves 22% to 29% lower area overhead and 29% to 34% lower computing power consumption than related works, making it well suited for portable medical devices.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":" ","pages":"1 - 21"},"PeriodicalIF":0.0,"publicationDate":"2021-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3453174","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47935708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
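The non-uniform feature extraction described above can be sketched as follows: compute the magnitude spectrum, then average it into many bins below a cutoff frequency and few bins above it. The function name, bin counts, and the 40 Hz split are illustrative assumptions, not values taken from the paper; only the MIT-BIH sampling rate of 360 Hz is standard:

```python
import numpy as np

def nonuniform_spectral_features(signal, fs, n_low=24, n_high=8, split_hz=40.0):
    """Extract more features from the low-frequency band of an ECG
    spectrum and fewer from the high-frequency band, where most of
    the noise is concentrated."""
    spectrum = np.abs(np.fft.rfft(signal))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / fs)
    low = spectrum[freqs < split_hz]
    high = spectrum[freqs >= split_hz]
    # Average each band over equal-width bins: many below the split
    # frequency, few above it.
    low_feats = [seg.mean() for seg in np.array_split(low, n_low)]
    high_feats = [seg.mean() for seg in np.array_split(high, n_high)]
    return np.array(low_feats + high_feats)

# Synthetic 1-second "ECG" at 360 Hz (the MIT-BIH sampling rate):
# a low-frequency component plus 50 Hz powerline-like noise.
fs = 360
t = np.arange(fs) / fs
sig = np.sin(2 * np.pi * 1.2 * t) + 0.1 * np.sin(2 * np.pi * 50 * t)
feats = nonuniform_spectral_features(sig, fs)
print(feats.shape)  # (32,)
```

The resulting fixed-length vector (here 32 values instead of 181 raw spectral bins) is what would feed the NN classifier, which is where the preprocessing savings come from.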