首页 > 最新文献

Proceedings. IEEE International Congress on Big Data最新文献

英文 中文
SketchyCoreSVD: SketchySVD from Random Subsampling of the Data Matrix. SketchyCoreSVD:从数据矩阵的随机子采样的SketchySVD。
Pub Date : 2019-12-01 Epub Date: 2020-02-24 DOI: 10.1109/bigdata47090.2019.9006345
Chandrajit Bajaj, Yi Wang, Tianming Wang

We present a method called SketchyCoreSVD to compute the near-optimal rank r SVD of a data matrix by building random sketches only from its subsampled columns and rows. We provide theoretical guarantees under incoherence assumptions, and validate the performance of our SketchyCoreSVD method on various large static and time-varying datasets.

我们提出了一种称为SketchyCoreSVD的方法,通过仅从其子采样的列和行构建随机草图来计算数据矩阵的近最优秩r SVD。我们在非相干假设下提供了理论保证,并在各种大型静态和时变数据集上验证了我们的SketchyCoreSVD方法的性能。
{"title":"SketchyCoreSVD: SketchySVD from Random Subsampling of the Data Matrix.","authors":"Chandrajit Bajaj,&nbsp;Yi Wang,&nbsp;Tianming Wang","doi":"10.1109/bigdata47090.2019.9006345","DOIUrl":"https://doi.org/10.1109/bigdata47090.2019.9006345","url":null,"abstract":"<p><p>We present a method called SketchyCoreSVD to compute the near-optimal rank <i>r</i> SVD of a data matrix by building random sketches only from its subsampled columns and rows. We provide theoretical guarantees under incoherence assumptions, and validate the performance of our SketchyCoreSVD method on various large static and time-varying datasets.</p>","PeriodicalId":91601,"journal":{"name":"Proceedings. IEEE International Congress on Big Data","volume":"2019 ","pages":"26-35"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/bigdata47090.2019.9006345","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37897182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Experiences with the Twitter Health Surveillance (THS) System. Twitter健康监测(THS)系统的经验。
Pub Date : 2017-06-01 Epub Date: 2017-09-11 DOI: 10.1109/BigDataCongress.2017.55
Manuel Rodríguez-Martínez

Social media has become an important platform to gauge public opinion on topics related to our daily lives. In practice, processing these posts requires big data analytics tools since the volume of data and the speed of production overwhelm single-server solutions. Building an application to capture and analyze posts from social media can be a challenge simply because it requires combining a set of complex software tools that often times are tricky to configure, tune, and maintain. In many instances, the application ends up being an assorted collection of Java/Scala programs or Python scripts that developers cobble together to generate the data products they need. In this paper, we present the Twitter Health Surveillance (THS) application framework. THS is designed as a platform to allow end-users to monitor a stream of tweets, and process the stream with a combination of built-in functionality and their own user-defined functions. We discuss the architecture of THS, and describe its implementation atop the Apache Hadoop Ecosystem. We also present several lessons learned while developing our current prototype.

社交媒体已经成为衡量公众对与我们日常生活有关的话题的意见的重要平台。实际上,处理这些帖子需要大数据分析工具,因为数据量和生产速度超过了单服务器解决方案。构建一个应用程序来捕获和分析来自社交媒体的帖子可能是一项挑战,因为它需要组合一组复杂的软件工具,而这些工具通常很难配置、调优和维护。在许多情况下,应用程序最终是Java/Scala程序或Python脚本的组合,开发人员将其拼凑在一起以生成所需的数据产品。在本文中,我们提出了Twitter健康监测(THS)应用框架。THS被设计成一个平台,允许最终用户监控tweet流,并通过内置功能和用户自定义功能的组合来处理流。我们讨论了THS的架构,并描述了它在Apache Hadoop生态系统上的实现。我们还介绍了在开发当前原型过程中获得的一些经验教训。
{"title":"Experiences with the Twitter Health Surveillance (THS) System.","authors":"Manuel Rodríguez-Martínez","doi":"10.1109/BigDataCongress.2017.55","DOIUrl":"https://doi.org/10.1109/BigDataCongress.2017.55","url":null,"abstract":"<p><p>Social media has become an important platform to gauge public opinion on topics related to our daily lives. In practice, processing these posts requires big data analytics tools since the volume of data and the speed of production overwhelm single-server solutions. Building an application to capture and analyze posts from social media can be a challenge simply because it requires combining a set of complex software tools that often times are tricky to configure, tune, and maintain. In many instances, the application ends up being an assorted collection of Java/Scala programs or Python scripts that developers cobble together to generate the data products they need. In this paper, we present the Twitter Health Surveillance (THS) application framework. THS is designed as a platform to allow end-users to monitor a stream of tweets, and process the stream with a combination of built-in functionality and their own user-defined functions. We discuss the architecture of THS, and describe its implementation atop the Apache Hadoop Ecosystem. We also present several lessons learned while developing our current prototype.</p>","PeriodicalId":91601,"journal":{"name":"Proceedings. IEEE International Congress on Big Data","volume":"2017 ","pages":"376-383"},"PeriodicalIF":0.0,"publicationDate":"2017-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/BigDataCongress.2017.55","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35968420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predictive Modeling for Comfortable Death Outcome Using Electronic Health Records. 利用电子健康记录对舒适死亡结果进行预测建模。
Pub Date : 2015-06-01 DOI: 10.1109/BigDataCongress.2015.67
Muhammad Kamran Lodhi, Rashid Ansari, Yingwei Yao, Gail M Keenan, Diana J Wilkie, Ashfaq A Khokhar

Electronic health record (EHR) systems are used in healthcare industry to observe the progress of patients. With fast growth of the data, EHR data analysis has become a big data problem. Most EHRs are sparse and multi-dimensional datasets and mining them is a challenging task due to a number of reasons. In this paper, we have used a nursing EHR system to build predictive models to determine what factors impact death anxiety, a significant problem for the dying patients. Different existing modeling techniques have been used to develop coarse-grained as well as fine-grained models to predict patient outcomes. The coarse-grained models help in predicting the outcome at the end of each hospitalization, whereas fine-grained models help in predicting the outcome at the end of each shift, therefore providing a trajectory of predicted outcomes. Based on different modeling techniques, our results show significantly accurate predictions, due to relatively noise-free data. These models can help in determining effective treatments, lowering healthcare costs, and improving the quality of end-of-life (EOL) care.

电子健康记录(EHR)系统在医疗保健行业中用于观察患者的病情进展。随着数据量的快速增长,电子病历数据分析已成为一个大数据问题。大多数电子病历都是稀疏的多维数据集,由于许多原因,挖掘它们是一项具有挑战性的任务。本文利用护理电子病历系统建立预测模型,以确定影响死亡焦虑的因素,这是临终患者的一个重要问题。不同的现有建模技术已被用于开发粗粒度和细粒度模型来预测患者的结果。粗粒度模型有助于预测每次住院结束时的结果,而细粒度模型有助于预测每次轮班结束时的结果,从而提供预测结果的轨迹。基于不同的建模技术,我们的结果显示出非常准确的预测,由于相对无噪声的数据。这些模型可以帮助确定有效的治疗方法,降低医疗保健成本,提高生命末期(EOL)护理的质量。
{"title":"Predictive Modeling for Comfortable Death Outcome Using Electronic Health Records.","authors":"Muhammad Kamran Lodhi,&nbsp;Rashid Ansari,&nbsp;Yingwei Yao,&nbsp;Gail M Keenan,&nbsp;Diana J Wilkie,&nbsp;Ashfaq A Khokhar","doi":"10.1109/BigDataCongress.2015.67","DOIUrl":"https://doi.org/10.1109/BigDataCongress.2015.67","url":null,"abstract":"<p><p>Electronic health record (EHR) systems are used in healthcare industry to observe the progress of patients. With fast growth of the data, EHR data analysis has become a big data problem. Most EHRs are sparse and multi-dimensional datasets and mining them is a challenging task due to a number of reasons. In this paper, we have used a nursing EHR system to build predictive models to determine what factors impact death anxiety, a significant problem for the dying patients. Different existing modeling techniques have been used to develop coarse-grained as well as fine-grained models to predict patient outcomes. The coarse-grained models help in predicting the outcome at the end of each hospitalization, whereas fine-grained models help in predicting the outcome at the end of each shift, therefore providing a trajectory of predicted outcomes. Based on different modeling techniques, our results show significantly accurate predictions, due to relatively noise-free data. These models can help in determining effective treatments, lowering healthcare costs, and improving the quality of end-of-life (EOL) care.</p>","PeriodicalId":91601,"journal":{"name":"Proceedings. IEEE International Congress on Big Data","volume":"2015 ","pages":"409-415"},"PeriodicalIF":0.0,"publicationDate":"2015-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/BigDataCongress.2015.67","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"34349962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
Proceedings. IEEE International Congress on Big Data
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1