首页 > 最新文献

SHB'12 : proceedings of the 2012 ACM International Workshop on Smart Health and Wellbeing : October 29, 2012, Maui, Hawaii, USA. International Workshop on Smart Health and Wellbeing (2012 : Maui, Hawaii)最新文献

英文 中文
An automated data utility clustering methodology using data constraint rules 一种使用数据约束规则的自动数据实用程序聚类方法
Stuart Morton, M. Mahoui, P. Gibson
Many data privacy models have been created in the last few years using the k-anonymization methodology including l-diversity, p-sensitive k-anonymity, and t-closeness. While these methods differ in their approaches and quality of the results, they all focus on ensuring the anonymization of the data while at the same time attempt to protect the quality of the data by minimizing the loss of the information contained in the original data set. In this paper, we propose an automated k-anonymity approach that uses clustering to maximize the utility of the data while ensuring that the data privacy is maintained. Our method employs data constraint rules, which are defined by the data research expert to represent especially informative distributions in categorical attributes or inflections points in a continuous attribute. The values of the data constraints are an integral component of our utility function, which is used to maximize the utility of the anonymized dataset. Finally, we present our experimental results that show that our approach meets or exceeds existing methods that do not incorporate data constraint rules.
在过去的几年里,许多数据隐私模型都是使用k匿名化方法创建的,包括l多样性、p敏感k匿名和t接近。虽然这些方法在方法和结果质量上有所不同,但它们都侧重于确保数据的匿名化,同时试图通过最小化原始数据集中包含的信息的丢失来保护数据的质量。在本文中,我们提出了一种自动k-匿名方法,该方法使用聚类来最大化数据的效用,同时确保数据隐私得到维护。我们的方法使用由数据研究专家定义的数据约束规则来表示分类属性或连续属性拐点中的特别信息分布。数据约束的值是我们的效用函数的一个组成部分,它用于最大化匿名数据集的效用。最后,我们展示了我们的实验结果,表明我们的方法满足或超过了不包含数据约束规则的现有方法。
{"title":"An automated data utility clustering methodology using data constraint rules","authors":"Stuart Morton, M. Mahoui, P. Gibson","doi":"10.1145/2389707.2389710","DOIUrl":"https://doi.org/10.1145/2389707.2389710","url":null,"abstract":"Many data privacy models have been created in the last few years using the k-anonymization methodology including l-diversity, p-sensitive k-anonymity, and t-closeness. While these methods differ in their approaches and quality of the results, they all focus on ensuring the anonymization of the data while at the same time attempt to protect the quality of the data by minimizing the loss of the information contained in the original data set. In this paper, we propose an automated k-anonymity approach that uses clustering to maximize the utility of the data while ensuring that the data privacy is maintained. Our method employs data constraint rules, which are defined by the data research expert to represent especially informative distributions in categorical attributes or inflections points in a continuous attribute. The values of the data constraints are an integral component of our utility function, which is used to maximize the utility of the anonymized dataset. Finally, we present our experimental results that show that our approach meets or exceeds existing methods that do not incorporate data constraint rules.","PeriodicalId":92138,"journal":{"name":"SHB'12 : proceedings of the 2012 ACM International Workshop on Smart Health and Wellbeing : October 29, 2012, Maui, Hawaii, USA. International Workshop on Smart Health and Wellbeing (2012 : Maui, Hawaii)","volume":"3 1","pages":"9-16"},"PeriodicalIF":0.0,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82113357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Towards Large-scale Twitter Mining for Drug-related Adverse Events. 面向药物相关不良事件的大规模Twitter挖掘。
Jiang Bian, Umit Topaloglu, Fan Yu

Drug-related adverse events pose substantial risks to patients who consume post-market or Drug-related adverse events pose substantial risks to patients who consume post-market or investigational drugs. Early detection of adverse events benefits not only the drug regulators, but also the manufacturers for pharmacovigilance. Existing methods rely on patients' "spontaneous" self-reports that attest problems. The increasing popularity of social media platforms like the Twitter presents us a new information source for finding potential adverse events. Given the high frequency of user updates, mining Twitter messages can lead us to real-time pharmacovigilance. In this paper, we describe an approach to find drug users and potential adverse events by analyzing the content of twitter messages utilizing Natural Language Processing (NLP) and to build Support Vector Machine (SVM) classifiers. Due to the size nature of the dataset (i.e., 2 billion Tweets), the experiments were conducted on a High Performance Computing (HPC) platform using MapReduce, which exhibits the trend of big data analytics. The results suggest that daily-life social networking data could help early detection of important patient safety issues.

药物相关不良事件对上市后用药的患者构成重大风险或药物相关不良事件对上市后用药或临床试验用药的患者构成重大风险。早期发现不良事件不仅有利于药品监管机构,也有利于制造商提高药物警戒。现有的方法依赖于患者“自发”的自我报告来证明问题。Twitter等社交媒体平台的日益普及为我们发现潜在不良事件提供了新的信息来源。考虑到用户更新的高频率,挖掘Twitter信息可以让我们进行实时药物警戒。在本文中,我们描述了一种方法,通过使用自然语言处理(NLP)分析twitter消息的内容来发现吸毒者和潜在的不良事件,并建立支持向量机(SVM)分类器。由于数据集的规模(即20亿条推文),实验是在高性能计算(High Performance Computing, HPC)平台上使用MapReduce进行的,这体现了大数据分析的趋势。研究结果表明,日常生活中的社交网络数据可以帮助早期发现重要的患者安全问题。
{"title":"Towards Large-scale Twitter Mining for Drug-related Adverse Events.","authors":"Jiang Bian,&nbsp;Umit Topaloglu,&nbsp;Fan Yu","doi":"10.1145/2389707.2389713","DOIUrl":"https://doi.org/10.1145/2389707.2389713","url":null,"abstract":"<p><p>Drug-related adverse events pose substantial risks to patients who consume post-market or Drug-related adverse events pose substantial risks to patients who consume post-market or investigational drugs. Early detection of adverse events benefits not only the drug regulators, but also the manufacturers for pharmacovigilance. Existing methods rely on patients' \"spontaneous\" self-reports that attest problems. The increasing popularity of social media platforms like the Twitter presents us a new information source for finding potential adverse events. Given the high frequency of user updates, mining Twitter messages can lead us to real-time pharmacovigilance. In this paper, we describe an approach to find drug users and potential adverse events by analyzing the content of twitter messages utilizing Natural Language Processing (NLP) and to build Support Vector Machine (SVM) classifiers. Due to the size nature of the dataset (i.e., 2 billion Tweets), the experiments were conducted on a High Performance Computing (HPC) platform using MapReduce, which exhibits the trend of big data analytics. The results suggest that daily-life social networking data could help early detection of important patient safety issues.</p>","PeriodicalId":92138,"journal":{"name":"SHB'12 : proceedings of the 2012 ACM International Workshop on Smart Health and Wellbeing : October 29, 2012, Maui, Hawaii, USA. International Workshop on Smart Health and Wellbeing (2012 : Maui, Hawaii)","volume":"2012 ","pages":"25-32"},"PeriodicalIF":0.0,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/2389707.2389713","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35459545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 235
Simulating prosthetic vision with disortions for retinal prosthesis design 视网膜假体设计中模拟假体视觉的畸变
M. Subramaniam, P. Chundi, A. Muthuraj, E. Margalit, Sylvie Sim
Retinal prostheses are used to restore vision to individuals with vision impairments caused by the damaged photoreceptors in their retina. Despite the early successes, designing prostheses that can restore functional vision in general, continues to be a challenging problem due to the large number of design parameters that need to be customized for individual users. Gathering data using real patients in a timely and safe manner is also difficult. To address these problems, a virtual environment for realistically and safely simulating prosthetic vision is described. Besides supporting phosphenized rendering of images at different resolutions to normal users, and eye movement tracking, the environment also supports spatial distortions that are commonly perceived by prostheses users. A procedure to automatically generate such spatial distortions is developed. User corrections if any, are logged and compared with the original distortion values to evaluate distortion perception. Experimental results obtained in using this environment to perform various visual acuity tasks are described.
视网膜假体用于恢复因视网膜感光器受损而导致的视力障碍患者的视力。尽管早期取得了成功,但由于需要为个人用户定制大量的设计参数,设计可以恢复一般功能视觉的假肢仍然是一个具有挑战性的问题。以及时和安全的方式收集真实患者的数据也很困难。为了解决这些问题,本文描述了一个真实、安全的模拟假肢视觉的虚拟环境。除了支持不同分辨率的图像磷化渲染和眼动跟踪,该环境还支持假肢用户通常感知的空间扭曲。开发了一种自动生成这种空间畸变的程序。用户修正(如果有的话)将被记录下来,并与原始失真值进行比较,以评估失真感知。本文描述了在这种环境下执行各种视觉灵敏度任务的实验结果。
{"title":"Simulating prosthetic vision with disortions for retinal prosthesis design","authors":"M. Subramaniam, P. Chundi, A. Muthuraj, E. Margalit, Sylvie Sim","doi":"10.1145/2389707.2389719","DOIUrl":"https://doi.org/10.1145/2389707.2389719","url":null,"abstract":"Retinal prostheses are used to restore vision to individuals with vision impairments caused by the damaged photoreceptors in their retina. Despite the early successes, designing prostheses that can restore functional vision in general, continues to be a challenging problem due to the large number of design parameters that need to be customized for individual users. Gathering data using real patients in a timely and safe manner is also difficult. To address these problems, a virtual environment for realistically and safely simulating prosthetic vision is described. Besides supporting phosphenized rendering of images at different resolutions to normal users, and eye movement tracking, the environment also supports spatial distortions that are commonly perceived by prostheses users. A procedure to automatically generate such spatial distortions is developed. User corrections if any, are logged and compared with the original distortion values to evaluate distortion perception. Experimental results obtained in using this environment to perform various visual acuity tasks are described.","PeriodicalId":92138,"journal":{"name":"SHB'12 : proceedings of the 2012 ACM International Workshop on Smart Health and Wellbeing : October 29, 2012, Maui, Hawaii, USA. International Workshop on Smart Health and Wellbeing (2012 : Maui, Hawaii)","volume":"23 1","pages":"57-64"},"PeriodicalIF":0.0,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76528536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Combining multi-level evidence for medical record retrieval 结合多层次证据进行病历检索
Dongqing Zhu, Ben Carterette
The increasing prevalence of electronic health records containing rich information about a patient's health and physical condition has the potential to transform research in health and medicine. In this work, we present a health record search system for finding patients matching certain inclusion criteria (specified as keyword queries) for clinical studies. In particular, our system aggregates multi-level evidence and combines proven statistical IR models, both in an innovative way, and achieves a 20% MAP (mean average precision) improvement over a strong baseline. Moreover, our cross-validation results show that the overall performance of our system is comparable to other top-performing systems on the same task.
包含有关病人健康和身体状况的丰富信息的电子健康记录日益普及,有可能改变健康和医学研究。在这项工作中,我们提出了一个健康记录搜索系统,用于查找符合临床研究的某些纳入标准(指定为关键字查询)的患者。特别是,我们的系统以创新的方式汇集了多层次的证据,并结合了经过验证的统计IR模型,并在强基线的基础上实现了20%的MAP(平均精度)提高。此外,我们的交叉验证结果表明,在相同的任务上,我们的系统的整体性能与其他性能最好的系统相当。
{"title":"Combining multi-level evidence for medical record retrieval","authors":"Dongqing Zhu, Ben Carterette","doi":"10.1145/2389707.2389717","DOIUrl":"https://doi.org/10.1145/2389707.2389717","url":null,"abstract":"The increasing prevalence of electronic health records containing rich information about a patient's health and physical condition has the potential to transform research in health and medicine. In this work, we present a health record search system for finding patients matching certain inclusion criteria (specified as keyword queries) for clinical studies. In particular, our system aggregates multi-level evidence and combines proven statistical IR models, both in an innovative way, and achieves a 20% MAP (mean average precision) improvement over a strong baseline. Moreover, our cross-validation results show that the overall performance of our system is comparable to other top-performing systems on the same task.","PeriodicalId":92138,"journal":{"name":"SHB'12 : proceedings of the 2012 ACM International Workshop on Smart Health and Wellbeing : October 29, 2012, Maui, Hawaii, USA. International Workshop on Smart Health and Wellbeing (2012 : Maui, Hawaii)","volume":"15 1","pages":"49-56"},"PeriodicalIF":0.0,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91217491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
An architecture for personalized health information retrieval 用于个性化健康信息检索的体系结构
N. Yadav, C. Poellabauer
With the rapid proliferation of the Internet, traditional Information Retrieval (IR) techniques need to address challenges that stem from information overload by filtering web documents and ranking them in an order that can be perceived to be more relevant and credible to the end-user. In the domain of health care, an increasing number of people turn to the Internet for their health and wellness concerns. The results returned by traditional search engines can therefore be overwhelming and, even worse, inaccurate. As a consequence there is a need to design more "intelligent" web services that pre-process and alter information on the user's behalf. Specifically, this paper describes the design of a personalized search engine that utilizes patient data (either stored in user-managed personal health records or in provider-managed electronic medical records) and couples this with a selective crawling of credible medical information to eliminate search results that appear irrelevant to the user (given the user's "health profile") and rank the remaining results in order of relevance based on the health conditions of users performing the searches. Toward this end, a new ranking algorithm that combines a user's search query and the user's health profile is introduced. Finally, comparisons of the search results for users with different health profiles and diverse queries are presented using this architecture.
随着Internet的快速发展,传统的信息检索(IR)技术需要通过过滤web文档并按照最终用户认为更相关和更可信的顺序对它们进行排序来解决源于信息过载的挑战。在医疗保健领域,越来越多的人转向互联网寻求他们的健康和保健问题。因此,传统搜索引擎返回的结果可能是压倒性的,甚至更糟糕的是,不准确。因此,有必要设计更“智能”的web服务来代表用户对信息进行预处理和修改。具体地说,本文描述了一种个性化搜索引擎的设计,该引擎利用患者数据(存储在用户管理的个人健康记录中或存储在提供商管理的电子医疗记录中),并将其与可靠医疗信息的选择性抓取结合起来,以消除与用户无关的搜索结果(给定用户的“健康档案”),并根据执行搜索的用户的健康状况按相关性顺序对剩余结果进行排序。为此,介绍了一种结合用户搜索查询和用户健康状况的排名算法。最后,使用该架构对具有不同健康概况和不同查询的用户的搜索结果进行了比较。
{"title":"An architecture for personalized health information retrieval","authors":"N. Yadav, C. Poellabauer","doi":"10.1145/2389707.2389716","DOIUrl":"https://doi.org/10.1145/2389707.2389716","url":null,"abstract":"With the rapid proliferation of the Internet, traditional Information Retrieval (IR) techniques need to address challenges that stem from information overload by filtering web documents and ranking them in an order that can be perceived to be more relevant and credible to the end-user. In the domain of health care, an increasing number of people turn to the Internet for their health and wellness concerns. The results returned by traditional search engines can therefore be overwhelming and, even worse, inaccurate. As a consequence there is a need to design more \"intelligent\" web services that pre-process and alter information on the user's behalf. Specifically, this paper describes the design of a personalized search engine that utilizes patient data (either stored in user-managed personal health records or in provider-managed electronic medical records) and couples this with a selective crawling of credible medical information to eliminate search results that appear irrelevant to the user (given the user's \"health profile\") and rank the remaining results in order of relevance based on the health conditions of users performing the searches. Toward this end, a new ranking algorithm that combines a user's search query and the user's health profile is introduced. Finally, comparisons of the search results for users with different health profiles and diverse queries are presented using this architecture.","PeriodicalId":92138,"journal":{"name":"SHB'12 : proceedings of the 2012 ACM International Workshop on Smart Health and Wellbeing : October 29, 2012, Maui, Hawaii, USA. International Workshop on Smart Health and Wellbeing (2012 : Maui, Hawaii)","volume":"17 1","pages":"41-48"},"PeriodicalIF":0.0,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79249320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Designing the reconciled schema for a pharmacovigilance data warehouse through a temporally-enhanced ER model 通过临时增强的ER模型设计药物警戒数据仓库的协调模式
Riccardo Lora, Alberto Sabaini, Combi Carlo, U. Moretti
Pharmacovigilance is the activity related to the collection, analysis, and prevention of adverse reactions induced by drugs. The spontaneous reporting of adverse drug reactions is a system for identifying and sending reports about unexpected reactions to the regulatory authority. In Italy the information needed for properly carrying out the pharmacovigilance activities is scattered in different databases, which often contain the same information but encoded in different and temporally evolving ways. The data contained in the mentioned archives need to be integrated with information contained in other databases. In this paper we describe the construction of a data warehousing system, called VigiSegn, for the national center of pharmacovigilance; in particular, we focus on the data sources analysis and the design of the reconciled database. The (temporal) schema of reconciled data has been designed by using the TimeER conceptual data model.
药物警戒是收集、分析和预防药物引起的不良反应的活动。药物不良反应的自发报告是一种识别并向监管机构发送意外反应报告的系统。在意大利,适当开展药物警戒活动所需的信息分散在不同的数据库中,这些数据库通常包含相同的信息,但以不同的和随时间变化的方式编码。上述档案中包含的数据需要与其他数据库中包含的信息相结合。在本文中,我们描述了一个数据仓库系统的建设,称为VigiSegn,为国家药物警戒中心;重点介绍了数据来源分析和协调数据库的设计。使用TimeER概念数据模型设计了协调数据的(时态)模式。
{"title":"Designing the reconciled schema for a pharmacovigilance data warehouse through a temporally-enhanced ER model","authors":"Riccardo Lora, Alberto Sabaini, Combi Carlo, U. Moretti","doi":"10.1145/2389707.2389711","DOIUrl":"https://doi.org/10.1145/2389707.2389711","url":null,"abstract":"Pharmacovigilance is the activity related to the collection, analysis, and prevention of adverse reactions induced by drugs. The spontaneous reporting of adverse drug reactions is a system for identifying and sending reports about unexpected reactions to the regulatory authority. In Italy the information needed for properly carrying out the pharmacovigilance activities is scattered in different databases, which often contain the same information but encoded in different and temporally evolving ways. The data contained in the mentioned archives need to be integrated with information contained in other databases. In this paper we describe the construction of a data warehousing system, called VigiSegn, for the national center of pharmacovigilance; in particular, we focus on the data sources analysis and the design of the reconciled database. The (temporal) schema of reconciled data has been designed by using the TimeER conceptual data model.","PeriodicalId":92138,"journal":{"name":"SHB'12 : proceedings of the 2012 ACM International Workshop on Smart Health and Wellbeing : October 29, 2012, Maui, Hawaii, USA. International Workshop on Smart Health and Wellbeing (2012 : Maui, Hawaii)","volume":"5 1","pages":"17-24"},"PeriodicalIF":0.0,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87379846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Moving from descriptive to causal analytics: case study of discovering knowledge from us health indicators warehouse 从描述分析到因果分析:从美国健康指标仓库发现知识的案例研究
J. Schryver, M. Shankar, Songhua Xu
The knowledge management community has introduced a multitude of methods for knowledge discovery on large datasets. In the context of public health intelligence, we integrated and incorporated some of these methods into an analyst's workflow that proceeds from the data-centric descriptive level of analysis to the model-centric causal level of reasoning. We show several case studies of the proposed analyst's workflow as applied to the US Health Indicators Warehouse (HIW), which is a medium scale, public dataset regarding community health information as collected by the US federal government. In our case studies, we demonstrate a series of visual analytics efforts targeted at the HIW, including visual analysis according to correlation matrices, multivariate outlier analysis, multiple linear regression of Medicare costs, confirmatory factor analysis, and hybrid scatterplot and heatmap visualization for distributions of a group of health indicators. We conclude by sketching a preliminary framework for examining causal dependence hypotheses for future data science research in public health.
知识管理社区已经为大型数据集的知识发现引入了大量的方法。在公共卫生情报的背景下,我们将其中的一些方法集成到分析人员的工作流程中,从以数据为中心的描述性分析级别到以模型为中心的因果推理级别。我们展示了几个应用于美国健康指标仓库(HIW)的拟议分析师工作流程的案例研究,HIW是由美国联邦政府收集的关于社区健康信息的中等规模公共数据集。在我们的案例研究中,我们展示了一系列针对HIW的可视化分析,包括根据相关矩阵的可视化分析、多变量离群分析、医疗保险成本的多元线性回归、验证性因素分析,以及一组健康指标分布的混合散点图和热图可视化。最后,我们概述了一个初步框架,用于检查未来公共卫生数据科学研究的因果关系假设。
{"title":"Moving from descriptive to causal analytics: case study of discovering knowledge from us health indicators warehouse","authors":"J. Schryver, M. Shankar, Songhua Xu","doi":"10.1145/2389707.2389709","DOIUrl":"https://doi.org/10.1145/2389707.2389709","url":null,"abstract":"The knowledge management community has introduced a multitude of methods for knowledge discovery on large datasets. In the context of public health intelligence, we integrated and incorporated some of these methods into an analyst's workflow that proceeds from the data-centric descriptive level of analysis to the model-centric causal level of reasoning. We show several case studies of the proposed analyst's workflow as applied to the US Health Indicators Warehouse (HIW), which is a medium scale, public dataset regarding community health information as collected by the US federal government. In our case studies, we demonstrate a series of visual analytics efforts targeted at the HIW, including visual analysis according to correlation matrices, multivariate outlier analysis, multiple linear regression of Medicare costs, confirmatory factor analysis, and hybrid scatterplot and heatmap visualization for distributions of a group of health indicators. We conclude by sketching a preliminary framework for examining causal dependence hypotheses for future data science research in public health.","PeriodicalId":92138,"journal":{"name":"SHB'12 : proceedings of the 2012 ACM International Workshop on Smart Health and Wellbeing : October 29, 2012, Maui, Hawaii, USA. International Workshop on Smart Health and Wellbeing (2012 : Maui, Hawaii)","volume":"8 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79091511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Social media mining for drug safety signal detection 社交媒体挖掘药物安全信号检测
Christopher C. Yang, Haodong Yang, Ling Jiang, Mi Zhang
Adverse Drug Reactions (ADRs) represent a serious problem all over the world. They may complicate a patient's medical conditions and increase the morbidity, even mortality. Drug safety currently depends heavily on post-marketing surveillance, because pre-marketing review process cannot identify all possible adverse drug reactions in that it is limited by scale and time span. However, current post-marketing surveillance is conducted through centralized volunteering reporting systems, and the reporting rate is low. Consequently, it is difficult to detect the adverse drug reactions signals in a timely manner. To solve this problem, many researchers have explored methods to detect ADRs in electronic health records. Nevertheless, we only have access to electronic health records form particular health units. Aggregating and integrating electronic health records from multiple sources is rather challenging. With the advance of Web 2.0 technologies and the popularity of social media, many health consumers are discussing and exchanging health-related information with their peers. Many of this online discussion involve adverse drug reactions. In this work, we propose to use association mining and Proportional Reporting Ratios to mine the associations between drugs and adverse reactions from the user contributed content in social media. We have conducted an experiment using ten drugs and five adverse drug reactions. The FDA alerts are used as the gold standard to test the performance of the proposed techniques. The result shows that the metrics leverage, lift, and PRR are all promising to detect the adverse drug reactions reported by FDA. However, PRR outperformed the other two metrics.
药物不良反应(adr)在世界范围内是一个严重的问题。它们可能使病人的病情复杂化,增加发病率,甚至死亡率。药物安全目前在很大程度上依赖于上市后的监督,因为上市前的审查过程不能识别所有可能的药物不良反应,因为它受到规模和时间跨度的限制。然而,目前的上市后监测是通过集中的志愿报告系统进行的,报告率很低。因此,很难及时发现药物不良反应信号。为了解决这一问题,许多研究者探索了检测电子病历中不良反应的方法。然而,我们只能从特定保健单位获得电子健康记录。聚合和集成来自多个来源的电子健康记录相当具有挑战性。随着Web 2.0技术的进步和社交媒体的普及,许多健康消费者正在与他们的同伴讨论和交换健康相关的信息。许多在线讨论涉及药物不良反应。在这项工作中,我们建议使用关联挖掘和比例报告比率来挖掘社交媒体中用户贡献内容中药物与不良反应之间的关联。我们用十种药物和五种药物不良反应进行了实验。FDA的警报被用作测试拟议技术性能的金标准。结果表明,杠杆率、提升率和PRR指标都有希望检测FDA报告的药物不良反应。然而,PRR优于其他两个指标。
{"title":"Social media mining for drug safety signal detection","authors":"Christopher C. Yang, Haodong Yang, Ling Jiang, Mi Zhang","doi":"10.1145/2389707.2389714","DOIUrl":"https://doi.org/10.1145/2389707.2389714","url":null,"abstract":"Adverse Drug Reactions (ADRs) represent a serious problem all over the world. They may complicate a patient's medical conditions and increase the morbidity, even mortality. Drug safety currently depends heavily on post-marketing surveillance, because pre-marketing review process cannot identify all possible adverse drug reactions in that it is limited by scale and time span. However, current post-marketing surveillance is conducted through centralized volunteering reporting systems, and the reporting rate is low. Consequently, it is difficult to detect the adverse drug reactions signals in a timely manner. To solve this problem, many researchers have explored methods to detect ADRs in electronic health records. Nevertheless, we only have access to electronic health records form particular health units. Aggregating and integrating electronic health records from multiple sources is rather challenging. With the advance of Web 2.0 technologies and the popularity of social media, many health consumers are discussing and exchanging health-related information with their peers. Many of this online discussion involve adverse drug reactions. In this work, we propose to use association mining and Proportional Reporting Ratios to mine the associations between drugs and adverse reactions from the user contributed content in social media. We have conducted an experiment using ten drugs and five adverse drug reactions. The FDA alerts are used as the gold standard to test the performance of the proposed techniques. The result shows that the metrics leverage, lift, and PRR are all promising to detect the adverse drug reactions reported by FDA. However, PRR outperformed the other two metrics.","PeriodicalId":92138,"journal":{"name":"SHB'12 : proceedings of the 2012 ACM International Workshop on Smart Health and Wellbeing : October 29, 2012, Maui, Hawaii, USA. International Workshop on Smart Health and Wellbeing (2012 : Maui, Hawaii)","volume":"4 1","pages":"33-40"},"PeriodicalIF":0.0,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74499237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 152
期刊
SHB'12 : proceedings of the 2012 ACM International Workshop on Smart Health and Wellbeing : October 29, 2012, Maui, Hawaii, USA. International Workshop on Smart Health and Wellbeing (2012 : Maui, Hawaii)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1