{"title":"药物发现的大数据分析","authors":"Keith C. C. Chan","doi":"10.1109/BIBM.2013.6732448","DOIUrl":null,"url":null,"abstract":"Big Data refers to data sets that are so large and complex that traditional data processing tools and technologies cannot cope with. The process of examining such data to uncover hidden patterns in them is referred to as Big Data Analytics. Drug discovery is related to big data analytics as the process may require the collection, processing and analysis of extremely large volume of structured and unstructured biomedical data stemming from a wide range of experiments and surveys collected by hospitals, laboratories, pharmaceutical companies or even social media. These data may include sequencing and gene expression data, drug data including molecular data, protein and drug interaction data, clinical trial and electronic patient record data, patient behavior and self-reporting data in social media, regulatory monitoring data, and literatures where trends and drug repurposing and protein-protein interaction data may be found. To analyze such diversity of data types in large volumes for the purpose of drug discovery, we need algorithms that are simple, effective, efficient and scalable. In this talk, we discuss how we can take advantage of the recent development in big data analytics to improve the drug discovery process. We describe what have recently been done and what remain to be done to develop big data algorithms for drug discovery. We present the effort we have recently made to develop such algorithms to uncover hidden patterns in such data as unreported drug side-effect discussions in social media communications, patient record and sequencing data, regulatory monitoring and drug-protein interaction data, protein-chemical interactions data, etc., for drug side-effect prediction and how such predictions may be used to determine possible drug structures with different desirable properties. We discuss how big data analytics may contribute to better drug efficacy and safety for pharmaceutical companies and regulators.","PeriodicalId":73283,"journal":{"name":"IEEE International Conference on Bioinformatics and Biomedicine workshops. IEEE International Conference on Bioinformatics and Biomedicine","volume":"40 1","pages":"1"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Big data analytics for drug discovery\",\"authors\":\"Keith C. C. Chan\",\"doi\":\"10.1109/BIBM.2013.6732448\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Big Data refers to data sets that are so large and complex that traditional data processing tools and technologies cannot cope with. The process of examining such data to uncover hidden patterns in them is referred to as Big Data Analytics. Drug discovery is related to big data analytics as the process may require the collection, processing and analysis of extremely large volume of structured and unstructured biomedical data stemming from a wide range of experiments and surveys collected by hospitals, laboratories, pharmaceutical companies or even social media. These data may include sequencing and gene expression data, drug data including molecular data, protein and drug interaction data, clinical trial and electronic patient record data, patient behavior and self-reporting data in social media, regulatory monitoring data, and literatures where trends and drug repurposing and protein-protein interaction data may be found. To analyze such diversity of data types in large volumes for the purpose of drug discovery, we need algorithms that are simple, effective, efficient and scalable. In this talk, we discuss how we can take advantage of the recent development in big data analytics to improve the drug discovery process. We describe what have recently been done and what remain to be done to develop big data algorithms for drug discovery. We present the effort we have recently made to develop such algorithms to uncover hidden patterns in such data as unreported drug side-effect discussions in social media communications, patient record and sequencing data, regulatory monitoring and drug-protein interaction data, protein-chemical interactions data, etc., for drug side-effect prediction and how such predictions may be used to determine possible drug structures with different desirable properties. We discuss how big data analytics may contribute to better drug efficacy and safety for pharmaceutical companies and regulators.\",\"PeriodicalId\":73283,\"journal\":{\"name\":\"IEEE International Conference on Bioinformatics and Biomedicine workshops. IEEE International Conference on Bioinformatics and Biomedicine\",\"volume\":\"40 1\",\"pages\":\"1\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE International Conference on Bioinformatics and Biomedicine workshops. IEEE International Conference on Bioinformatics and Biomedicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBM.2013.6732448\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE International Conference on Bioinformatics and Biomedicine workshops. IEEE International Conference on Bioinformatics and Biomedicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2013.6732448","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

摘要

大数据是指传统的数据处理工具和技术无法处理的庞大而复杂的数据集。检查这些数据以发现其中隐藏模式的过程被称为大数据分析。药物发现与大数据分析相关,因为该过程可能需要收集、处理和分析极其大量的结构化和非结构化生物医学数据,这些数据来自医院、实验室、制药公司甚至社交媒体收集的广泛的实验和调查。这些数据可能包括测序和基因表达数据、药物数据(包括分子数据)、蛋白质和药物相互作用数据、临床试验和电子病历数据、社交媒体上的患者行为和自我报告数据、监管监测数据,以及可能发现趋势和药物再利用以及蛋白质-蛋白质相互作用数据的文献。为了分析大量数据类型的多样性以用于药物发现,我们需要简单、有效、高效和可扩展的算法。在这次演讲中,我们将讨论如何利用大数据分析的最新发展来改善药物发现过程。我们描述了最近在开发药物发现的大数据算法方面所做的工作和需要做的工作。我们介绍了我们最近所做的努力,以开发这样的算法来揭示这些数据中的隐藏模式,如社交媒体通信中未报告的药物副作用讨论,患者记录和测序数据,监管监测和药物-蛋白质相互作用数据,蛋白质-化学相互作用数据等,用于药物副作用预测以及如何使用这些预测来确定具有不同理想特性的可能药物结构。我们将讨论大数据分析如何为制药公司和监管机构提供更好的药物疗效和安全性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Big data analytics for drug discovery
Big Data refers to data sets that are so large and complex that traditional data processing tools and technologies cannot cope with. The process of examining such data to uncover hidden patterns in them is referred to as Big Data Analytics. Drug discovery is related to big data analytics as the process may require the collection, processing and analysis of extremely large volume of structured and unstructured biomedical data stemming from a wide range of experiments and surveys collected by hospitals, laboratories, pharmaceutical companies or even social media. These data may include sequencing and gene expression data, drug data including molecular data, protein and drug interaction data, clinical trial and electronic patient record data, patient behavior and self-reporting data in social media, regulatory monitoring data, and literatures where trends and drug repurposing and protein-protein interaction data may be found. To analyze such diversity of data types in large volumes for the purpose of drug discovery, we need algorithms that are simple, effective, efficient and scalable. In this talk, we discuss how we can take advantage of the recent development in big data analytics to improve the drug discovery process. We describe what have recently been done and what remain to be done to develop big data algorithms for drug discovery. We present the effort we have recently made to develop such algorithms to uncover hidden patterns in such data as unreported drug side-effect discussions in social media communications, patient record and sequencing data, regulatory monitoring and drug-protein interaction data, protein-chemical interactions data, etc., for drug side-effect prediction and how such predictions may be used to determine possible drug structures with different desirable properties. We discuss how big data analytics may contribute to better drug efficacy and safety for pharmaceutical companies and regulators.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Diurnal Pain Classification in Critically Ill Patients using Machine Learning on Accelerometry and Analgesic Data. Transmission cluster characteristics of global, regional, and lineage-specific SARS-CoV-2 phylogenies. Document-level DDI relation extraction with document-entity embedding The Network Pharmacological Mechanism of Yizhiningshen Oral Liquid in the Treatment of Tic Disorders Study on the Medication Law of Traditional Chinese medicine treating Lumbago based on TCM electronic medical record
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1