首页 > 最新文献

Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics最新文献

英文 中文
Inferring Microbial Interactions from Metagenomic Time-series Using Prior Biological Knowledge 利用先验生物学知识从宏基因组时间序列推断微生物相互作用
Chieh Lo, R. Marculescu
Due to the recent advances in modern metagenomics sequencing methods, it becomes possible to directly analyze the microbial communities within human body. To understand how microbial communities adapt, develop, and interact over time with the human body and the surrounding environment, a critical step is the inference of interactions among different microbes directly from sequencing data. However, metagenomics data is both compositional and highly dimensional in nature. Consequently, new approaches that can accurately and robustly estimate the interactions among various microbe species are needed to analyze such data. To this end, we propose a novel framework called Microbial Time-series Prior Lasso (MTPLasso) which integrates sparse linear regression with microbial co-occurrences and associations obtained from scientific literature and cross-sectional metagenomics data. We show that MTPLasso outperforms existing models in terms of precision and recall rates, as well as the accuracy in inferring the interaction types. Finally, the interaction networks we infer from human gut data demonstrate credible results when compared against real data.
由于现代宏基因组测序方法的最新进展,直接分析人体微生物群落成为可能。为了了解微生物群落是如何适应、发展并随时间与人体和周围环境相互作用的,关键的一步是直接从测序数据推断不同微生物之间的相互作用。然而,宏基因组学数据在本质上既是组成的,又是高度多维的。因此,需要新的方法来准确和稳健地估计各种微生物物种之间的相互作用来分析这些数据。为此,我们提出了一个新的框架,称为微生物时间序列先验套索(MTPLasso),它将稀疏线性回归与从科学文献和横断面宏基因组学数据中获得的微生物共现和关联相结合。我们表明,MTPLasso在精度和召回率以及推断交互类型的准确性方面优于现有模型。最后,与真实数据相比,我们从人类肠道数据推断出的相互作用网络显示出可信的结果。
{"title":"Inferring Microbial Interactions from Metagenomic Time-series Using Prior Biological Knowledge","authors":"Chieh Lo, R. Marculescu","doi":"10.1145/3107411.3107435","DOIUrl":"https://doi.org/10.1145/3107411.3107435","url":null,"abstract":"Due to the recent advances in modern metagenomics sequencing methods, it becomes possible to directly analyze the microbial communities within human body. To understand how microbial communities adapt, develop, and interact over time with the human body and the surrounding environment, a critical step is the inference of interactions among different microbes directly from sequencing data. However, metagenomics data is both compositional and highly dimensional in nature. Consequently, new approaches that can accurately and robustly estimate the interactions among various microbe species are needed to analyze such data. To this end, we propose a novel framework called Microbial Time-series Prior Lasso (MTPLasso) which integrates sparse linear regression with microbial co-occurrences and associations obtained from scientific literature and cross-sectional metagenomics data. We show that MTPLasso outperforms existing models in terms of precision and recall rates, as well as the accuracy in inferring the interaction types. Finally, the interaction networks we infer from human gut data demonstrate credible results when compared against real data.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130511934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Dynamic Phylogenetic Inference for Sequence-based Typing Data 基于序列分型数据的动态系统发育推断
Alexandre P. Francisco, M. Nascimento, Cátia Vaz
Typing methods are widely used in the surveillance of infectious diseases, outbreaks investigation and studies of the natural history of an infection. And their use is becoming standard, in particular with the introduction of High Throughput Sequencing (HTS). On the other hand, the data being generated is massive and many algorithms have been proposed for phylogenetic analysis of typing data, such as the goeBURST algorithm. These algorithms must however be run whenever new data becomes available starting from scratch. We address this issue proposing a dynamic version of goeBURST algorithm. Experimental results show that this new version is efficient on integrating new data and updating inferred evolutionary patterns, improving the update running time by at least one order of magnitude.
分型方法广泛应用于传染病监测、疫情调查和感染自然史研究。它们的使用正在成为标准,特别是随着高通量测序(HTS)的引入。另一方面,生成的数据是海量的,人们提出了许多算法来进行分型数据的系统发育分析,如goeBURST算法。然而,每当有新的数据可用时,必须从头开始运行这些算法。我们提出了一个动态版本的goeBURST算法来解决这个问题。实验结果表明,新版本在集成新数据和更新推断进化模式方面效率很高,更新运行时间至少提高了一个数量级。
{"title":"Dynamic Phylogenetic Inference for Sequence-based Typing Data","authors":"Alexandre P. Francisco, M. Nascimento, Cátia Vaz","doi":"10.1145/3107411.3108214","DOIUrl":"https://doi.org/10.1145/3107411.3108214","url":null,"abstract":"Typing methods are widely used in the surveillance of infectious diseases, outbreaks investigation and studies of the natural history of an infection. And their use is becoming standard, in particular with the introduction of High Throughput Sequencing (HTS). On the other hand, the data being generated is massive and many algorithms have been proposed for phylogenetic analysis of typing data, such as the goeBURST algorithm. These algorithms must however be run whenever new data becomes available starting from scratch. We address this issue proposing a dynamic version of goeBURST algorithm. Experimental results show that this new version is efficient on integrating new data and updating inferred evolutionary patterns, improving the update running time by at least one order of magnitude.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"25 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120856937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Cophenetic Median Trees Under the Manhattan Distance 曼哈顿距离下的隐中树
Alexey Markin, O. Eulenstein
Computing median trees from gene trees using path-difference metrics has provided several credible species tree estimates. Similar to these metrics is the cophenetic family of metrics that originates from a dendrogram comparison metric introduced more than 50 years ago. Despite the tradition and appeal of the cophenetic metrics, the problem of computing median trees under this family of metrics has not been analyzed. Like other standard median tree problems relevant in practice, as we show here, this problem is also NP-hard. NP-hard median tree problems have been successfully addressed by local search heuristics that are solving thousands of instances of a corresponding local search problem. For the local search problem under a cophenetic metric the best known (naive) algorithm has a time complexity that is typically prohibitive for effective heuristic searches. Focusing on the Manhattan norm (Manhattan cophenetic metric), we describe an efficient algorithm for this problem that improves on the naive solution by a factor of n, where n is the size of the input trees. We demonstrate the performance of our local search algorithm in a comparative study using published empirical data sets.
利用路径差度量从基因树计算中位数树提供了几个可信的物种树估计。与这些指标类似的是源于50多年前引入的树形图比较指标的相干指标族。尽管隐度量的传统和吸引力,但在这类度量下计算中值树的问题尚未得到分析。正如我们在这里展示的,与实践中相关的其他标准中值树问题一样,这个问题也是np困难的。NP-hard中值树问题已经通过局部搜索启发式方法成功解决,该方法解决了对应的局部搜索问题的数千个实例。对于隐度量下的局部搜索问题,最著名的(朴素)算法具有时间复杂度,这对于有效的启发式搜索通常是禁止的。我们将重点放在曼哈顿范数(Manhattan cophenetic metric)上,描述了一种针对该问题的有效算法,该算法在朴素解的基础上提高了n倍,其中n是输入树的大小。我们在使用已发表的经验数据集的比较研究中展示了我们的局部搜索算法的性能。
{"title":"Cophenetic Median Trees Under the Manhattan Distance","authors":"Alexey Markin, O. Eulenstein","doi":"10.1145/3107411.3107443","DOIUrl":"https://doi.org/10.1145/3107411.3107443","url":null,"abstract":"Computing median trees from gene trees using path-difference metrics has provided several credible species tree estimates. Similar to these metrics is the cophenetic family of metrics that originates from a dendrogram comparison metric introduced more than 50 years ago. Despite the tradition and appeal of the cophenetic metrics, the problem of computing median trees under this family of metrics has not been analyzed. Like other standard median tree problems relevant in practice, as we show here, this problem is also NP-hard. NP-hard median tree problems have been successfully addressed by local search heuristics that are solving thousands of instances of a corresponding local search problem. For the local search problem under a cophenetic metric the best known (naive) algorithm has a time complexity that is typically prohibitive for effective heuristic searches. Focusing on the Manhattan norm (Manhattan cophenetic metric), we describe an efficient algorithm for this problem that improves on the naive solution by a factor of n, where n is the size of the input trees. We demonstrate the performance of our local search algorithm in a comparative study using published empirical data sets.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"39 11","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120885023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Applying Bayesian Changepoint Model and Hierarchical Divisive Model for Detecting Anomalies in Clinical Decision Support Alert Firing 应用贝叶斯变点模型和层次分裂模型检测临床决策支持警报触发异常
Soumi Ray, A. Wright
Clinical Decision Support (CDS) Systems are widely used to support efficient evidence-based care and have become an important aspect of healthcare. CDS systems are complex, and sometimes malfunction or exhibit anomalous behavior. We have previously shown how anomaly detection models can be used to successfully identify malfunctions in CDS systems. We have extended this work and applied two new anomaly detection models on CDS alert firing data from a large health system.
临床决策支持(CDS)系统被广泛用于支持高效的循证护理,已成为医疗保健的一个重要方面。CDS系统很复杂,有时会发生故障或表现出异常行为。我们之前已经展示了如何使用异常检测模型成功地识别CDS系统中的故障。我们扩展了这项工作,并将两个新的异常检测模型应用于来自大型卫生系统的CDS警报发射数据。
{"title":"Applying Bayesian Changepoint Model and Hierarchical Divisive Model for Detecting Anomalies in Clinical Decision Support Alert Firing","authors":"Soumi Ray, A. Wright","doi":"10.1145/3107411.3108200","DOIUrl":"https://doi.org/10.1145/3107411.3108200","url":null,"abstract":"Clinical Decision Support (CDS) Systems are widely used to support efficient evidence-based care and have become an important aspect of healthcare. CDS systems are complex, and sometimes malfunction or exhibit anomalous behavior. We have previously shown how anomaly detection models can be used to successfully identify malfunctions in CDS systems. We have extended this work and applied two new anomaly detection models on CDS alert firing data from a large health system.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"89 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120903307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Sparse Latent Regression Approach for Integrative Analysis of Glycomic and Glycotranscriptomic Data 用于糖组学和糖转录组学数据综合分析的稀疏隐回归方法
Xuefu Wang, Sujun Li, Wenjing Peng, Y. Mechref, Haixu Tang
Glycomics and glycotranscitomics have emerged as two key high-throughput approaches to interrogating the glycome within specific cells, tissues or organisms under specific conditions. Because the glycotransciptomic analysis utilizes the same experimental protocol as the whole-transcriptome sequencing (RNA-seq) that is commonly used in the genomic research, the glycotranscriptomic information can be conveniently extracted in silico for many biological samples from which RNA-seq data have been collected and made publicly available through large-scale projects such as The Cancer Genome Atlas (TCGA) proeject. However, the glycomic data collection is constrained by specialized analytical tools that are less accessible by biological researchers. In this paper, we present a Bayesian sparse latent regression (BSLR) model for predicting quantitative glycan abundances from glycotranscriptomic data. The model is built using the matched glycomic and glycotranscriptomic data collected in a same set of samples as training sets, and is then exploited to study the common properties of the training samples and to predict these properties (e.g., the glycan abundances) in similar samples from which only glycotranscriptomc data are available. The BSLR model assumes the glycomic and the glycotranscriptomic abundances are both modulated by a small number of independent latent variables, and thus can be constructed by using only a relatively small number of training samples. When tested on simulated data, we show our approach achieves satisfactory performance using only 10-20 training samples. We also tested our model on five cancer cell lines, and showed the BSLR model can accurately predict the glycan abundances from the transcription levels of glycan synthetic genes. Furthermore, the predicted glycan abundances can distinguish the metastatic cell line specifically targeting brain from the remaining breast cancer cell lines as well as the a brain cancer cell line, with only slightly lower power than the observed glycan abundances in glycomic experiments, indicating the BSLR prediction retains the variations of glycan abundances across different groups of samples from their glycotranscriptomic data.
糖组学和糖转录组学已经成为在特定条件下研究特定细胞、组织或生物体内的糖的两种关键的高通量方法。由于糖转录组分析采用与基因组研究中常用的全转录组测序(RNA-seq)相同的实验方案,因此糖转录组信息可以方便地在计算机上提取许多生物样品,其中RNA-seq数据已通过诸如癌症基因组图谱(TCGA)项目等大型项目收集并公开。然而,糖糖数据的收集受到专门分析工具的限制,这些工具对生物学研究人员来说是不太容易获得的。在本文中,我们提出了一个贝叶斯稀疏潜回归(BSLR)模型,用于预测糖转录组数据的定量多糖丰度。该模型是使用在同一组样本中收集的与训练集相匹配的糖组学和糖转录组学数据建立的,然后用于研究训练样本的共同特性,并在只有糖转录组学数据的类似样本中预测这些特性(例如,聚糖丰度)。BSLR模型假设糖组和糖转录组丰度都受到少量独立潜在变量的调节,因此只需使用相对较少的训练样本即可构建。当在模拟数据上进行测试时,我们表明我们的方法仅使用10-20个训练样本就取得了令人满意的性能。我们还在5个癌细胞系上测试了我们的模型,结果表明BSLR模型可以准确地从聚糖合成基因的转录水平预测聚糖丰度。此外,预测的多糖丰度可以将特异性靶向脑的转移细胞系与剩余的乳腺癌细胞系以及脑癌细胞系区分开来,仅比糖组学实验中观察到的多糖丰度略低,这表明BSLR预测保留了不同组样品中糖转录组数据中多糖丰度的变化。
{"title":"A Sparse Latent Regression Approach for Integrative Analysis of Glycomic and Glycotranscriptomic Data","authors":"Xuefu Wang, Sujun Li, Wenjing Peng, Y. Mechref, Haixu Tang","doi":"10.1145/3107411.3107468","DOIUrl":"https://doi.org/10.1145/3107411.3107468","url":null,"abstract":"Glycomics and glycotranscitomics have emerged as two key high-throughput approaches to interrogating the glycome within specific cells, tissues or organisms under specific conditions. Because the glycotransciptomic analysis utilizes the same experimental protocol as the whole-transcriptome sequencing (RNA-seq) that is commonly used in the genomic research, the glycotranscriptomic information can be conveniently extracted in silico for many biological samples from which RNA-seq data have been collected and made publicly available through large-scale projects such as The Cancer Genome Atlas (TCGA) proeject. However, the glycomic data collection is constrained by specialized analytical tools that are less accessible by biological researchers. In this paper, we present a Bayesian sparse latent regression (BSLR) model for predicting quantitative glycan abundances from glycotranscriptomic data. The model is built using the matched glycomic and glycotranscriptomic data collected in a same set of samples as training sets, and is then exploited to study the common properties of the training samples and to predict these properties (e.g., the glycan abundances) in similar samples from which only glycotranscriptomc data are available. The BSLR model assumes the glycomic and the glycotranscriptomic abundances are both modulated by a small number of independent latent variables, and thus can be constructed by using only a relatively small number of training samples. When tested on simulated data, we show our approach achieves satisfactory performance using only 10-20 training samples. We also tested our model on five cancer cell lines, and showed the BSLR model can accurately predict the glycan abundances from the transcription levels of glycan synthetic genes. Furthermore, the predicted glycan abundances can distinguish the metastatic cell line specifically targeting brain from the remaining breast cancer cell lines as well as the a brain cancer cell line, with only slightly lower power than the observed glycan abundances in glycomic experiments, indicating the BSLR prediction retains the variations of glycan abundances across different groups of samples from their glycotranscriptomic data.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129952794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Graph Based Method for the Prediction of Backbone Trace from Cryo-EM Density Maps 基于图的冷电镜密度图主干迹线预测方法
P. Collins, Dong Si
Cryo-electron microscopy (Cryo-EM) is able to achieve high resolution density maps. These density maps are near atomic resolution and individual atoms can be seen as well as large secondary structure elements. However, it is challenging to extract the backbone structure information automatically and efficiently. This paper presents a novel method for creating a backbone trace and predicting locations of Cα atoms for high resolution density maps. It is a graph based method utilizing density along a backbone trace and features of secondary structure elements to find the optimal backbone trace and Cα atom locations. The method is mostly automatic requiring an initial user determined threshold value and primary secondary structure type. We tested our method on fifteen simulated maps at 3A resolution and four experimental cryo-EM density maps between 2.6-3.1A resolution. The result shows that our method is able to generate a complete Cα backbone trace when the density map is not missing data at near atomic resolution.
低温电子显微镜(Cryo-EM)能够获得高分辨率的密度图。这些密度图接近原子分辨率,可以看到单个原子以及大的二级结构元素。然而,如何自动高效地提取主干结构信息是一个难题。本文提出了一种用于高分辨率密度图建立主链轨迹和预测Cα原子位置的新方法。它是一种基于图的方法,利用沿主链的密度和二级结构元素的特征来寻找最佳的主链和Cα原子位置。该方法基本上是自动的,需要初始用户确定阈值和主次结构类型。我们在15张3A分辨率的模拟图和4张2.6-3.1A分辨率的实验低温电镜密度图上测试了我们的方法。结果表明,当密度图在近原子分辨率下不丢失数据时,我们的方法能够生成完整的Cα主链迹。
{"title":"A Graph Based Method for the Prediction of Backbone Trace from Cryo-EM Density Maps","authors":"P. Collins, Dong Si","doi":"10.1145/3107411.3107501","DOIUrl":"https://doi.org/10.1145/3107411.3107501","url":null,"abstract":"Cryo-electron microscopy (Cryo-EM) is able to achieve high resolution density maps. These density maps are near atomic resolution and individual atoms can be seen as well as large secondary structure elements. However, it is challenging to extract the backbone structure information automatically and efficiently. This paper presents a novel method for creating a backbone trace and predicting locations of Cα atoms for high resolution density maps. It is a graph based method utilizing density along a backbone trace and features of secondary structure elements to find the optimal backbone trace and Cα atom locations. The method is mostly automatic requiring an initial user determined threshold value and primary secondary structure type. We tested our method on fifteen simulated maps at 3A resolution and four experimental cryo-EM density maps between 2.6-3.1A resolution. The result shows that our method is able to generate a complete Cα backbone trace when the density map is not missing data at near atomic resolution.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134446852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Cell Neighbor Determination in the Metazoan Embryo System 后生动物胚胎系统中细胞邻居的确定
Z. Wang, Dali Wang, Husheng Li, Z. Bao
Cell neighbor determination is a significant component in the simulation of a metazoan embryo system since it influences a number of fundamental biological processes, such as cell signaling, migration, and proliferation. Traditional approaches to find the neighbors of a cell such as Voronoi diagram successfully accomplish this goal, but are too time-consuming as the number of cells grows exponentially. In this paper, we propose a learning-based algorithm that determines the neighbors of specific cells in the metazoan embryo in real-time. We decrease the computational time by four orders of magnitude, and achieve an accuracy of 99.66%. For the verification purpose, the simulation results indicate that our model successfully reproduces the neighbor relationship in C. elegans Notch signaling pathways and cell-cell squeeze force modeling of the cell division process.
细胞邻居的确定是后生动物胚胎系统模拟中的一个重要组成部分,因为它影响许多基本的生物过程,如细胞信号传导、迁移和增殖。寻找细胞邻居的传统方法(如Voronoi图)成功地实现了这一目标,但由于细胞数量呈指数增长,因此过于耗时。在本文中,我们提出了一种基于学习的算法来实时确定后生动物胚胎中特定细胞的邻居。计算时间缩短了4个数量级,准确率达到99.66%。为了验证这一点,模拟结果表明,我们的模型成功地再现了线虫Notch信号通路中的邻居关系和细胞分裂过程中的细胞挤压力模型。
{"title":"Cell Neighbor Determination in the Metazoan Embryo System","authors":"Z. Wang, Dali Wang, Husheng Li, Z. Bao","doi":"10.1145/3107411.3107465","DOIUrl":"https://doi.org/10.1145/3107411.3107465","url":null,"abstract":"Cell neighbor determination is a significant component in the simulation of a metazoan embryo system since it influences a number of fundamental biological processes, such as cell signaling, migration, and proliferation. Traditional approaches to find the neighbors of a cell such as Voronoi diagram successfully accomplish this goal, but are too time-consuming as the number of cells grows exponentially. In this paper, we propose a learning-based algorithm that determines the neighbors of specific cells in the metazoan embryo in real-time. We decrease the computational time by four orders of magnitude, and achieve an accuracy of 99.66%. For the verification purpose, the simulation results indicate that our model successfully reproduces the neighbor relationship in C. elegans Notch signaling pathways and cell-cell squeeze force modeling of the cell division process.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132272437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Model-based Transcriptome Engineering 基于模型的转录组工程
M. Brent
The ability to rationally manipulate the transcriptional states of cells would be of great use in medicine and bioengineering. We have developed an algorithm, NetSurgeon, which uses genome wide gene-regulatory networks to identify interventions that force a cell toward a desired expression state. We first validated NetSurgeon extensively on existing datasets. Next, we used Net-Surgeon to select transcription factor deletions aimed at improving ethanol production in Saccharomyces cerevisiae cultures that are catabolizing xylose. We reasoned that interventions that move the transcriptional state of cells using xylose toward that of cells producing large amounts of ethanol from glucose might improve xylose fermentation. Some of the interventions selected by NetSurgeon successfully promoted a fermentative transcriptional state in the absence of glucose, resulting in strains with a 2.7-fold increase in xylose import rates, a 4-fold improvement in xylose integration into central carbon metabolism, or a 1.3-fold increase in ethanol production rate. We conclude by presenting an integrated model of transcriptional regulation and metabolic flux that will enable future efforts aimed at improving xylose fermentation to prioritize functional regulators of central carbon metabolism.
合理操纵细胞转录状态的能力将在医学和生物工程中有很大的用途。我们已经开发了一种算法,NetSurgeon,它使用全基因组范围的基因调控网络来识别迫使细胞进入所需表达状态的干预措施。我们首先在现有数据集上广泛验证了NetSurgeon。接下来,我们使用Net-Surgeon选择转录因子缺失,旨在提高分解木糖的酿酒酵母培养物的乙醇产量。我们推断,将使用木糖的细胞的转录状态转移到从葡萄糖产生大量乙醇的细胞的转录状态可能会改善木糖发酵。NetSurgeon选择的一些干预措施成功地在没有葡萄糖的情况下促进了发酵转录状态,结果菌株木糖进口率提高了2.7倍,木糖整合到中心碳代谢的效率提高了4倍,乙醇产量提高了1.3倍。最后,我们提出了一个转录调控和代谢通量的综合模型,这将使未来的努力旨在改善木糖发酵,优先考虑中心碳代谢的功能调节因子。
{"title":"Model-based Transcriptome Engineering","authors":"M. Brent","doi":"10.1145/3107411.3107454","DOIUrl":"https://doi.org/10.1145/3107411.3107454","url":null,"abstract":"The ability to rationally manipulate the transcriptional states of cells would be of great use in medicine and bioengineering. We have developed an algorithm, NetSurgeon, which uses genome wide gene-regulatory networks to identify interventions that force a cell toward a desired expression state. We first validated NetSurgeon extensively on existing datasets. Next, we used Net-Surgeon to select transcription factor deletions aimed at improving ethanol production in Saccharomyces cerevisiae cultures that are catabolizing xylose. We reasoned that interventions that move the transcriptional state of cells using xylose toward that of cells producing large amounts of ethanol from glucose might improve xylose fermentation. Some of the interventions selected by NetSurgeon successfully promoted a fermentative transcriptional state in the absence of glucose, resulting in strains with a 2.7-fold increase in xylose import rates, a 4-fold improvement in xylose integration into central carbon metabolism, or a 1.3-fold increase in ethanol production rate. We conclude by presenting an integrated model of transcriptional regulation and metabolic flux that will enable future efforts aimed at improving xylose fermentation to prioritize functional regulators of central carbon metabolism.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129472373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genome-Wide Association Study (GWAS) for the Infliximab Responsiveness in Korean Inflammatory Bowel Disease Patients 韩国炎症性肠病患者英夫利昔单抗反应性的全基因组关联研究(GWAS
Z. Park, Ko-Woon Choi, D. Seo, S. Ryu, Jong Gu Lee, W. Oh
Inflammatory bowel disease (IBD) subdividing into Crohn's disease (CD) and ulcerative colitis (UC) is a chronic intestinal inflammatory disorder. Infliximab (IFX) as an anti-TNF-α has been prescribed for treatment of IBD patients. However, some patients show no response or a loss of response to this agent. In this study, we investigated to identify genetic variants associated with response to IFX. A total of 148 IBD patients from Yonsei University Health System who received IFX were classified according to subtypes of IBD except 12 patients unsuitable for this study. We also categorized the patients into three groups by IFX response; response (sustained response, loss of response), nonresponse. Whole exome sequencing (WES) was performed and identified on average 35,000 variants including silent, missense and nonsense mutation in each sample. We performed GWAS using the WES data to find out genetic variants associated with response to IFX. We identified only missense variants with suggestive evidence of association. In CD patients, AEBP1 (rs2537188) was associated with nonresponse and PLA2R1 (rs35771982, rs3749117) and IDO2 (rs10109853) were associated with loss of response. In UC patients, AMACR (rs10941112, rs3195676) was associated with loss of response. Furthermore, we will investigate in vitro study at the cellular level for the functional analysis of those genetic variants.
炎症性肠病(IBD)是一种慢性肠道炎症性疾病,又分为克罗恩病(CD)和溃疡性结肠炎(UC)。英夫利昔单抗(IFX)作为抗tnf -α药物已被用于治疗IBD患者。然而,一些患者对该药没有反应或失去反应。在这项研究中,我们调查了与IFX反应相关的遗传变异。除12例不适合本研究的患者外,延世大学卫生系统接受IFX治疗的IBD患者共148例,按照IBD亚型进行分类。我们还根据IFX反应将患者分为三组;反应(持续反应,失去反应),无反应。全外显子组测序(WES)在每个样本中平均鉴定出35000个变异,包括沉默突变、错义突变和无义突变。我们使用WES数据进行GWAS,以找出与IFX反应相关的遗传变异。我们只发现了带有暗示证据的错义变异。在CD患者中,AEBP1 (rs2537188)与无应答相关,PLA2R1 (rs35771982、rs3749117)和IDO2 (rs10109853)与无应答相关。在UC患者中,AMACR (rs10941112, rs3195676)与反应丧失相关。此外,我们将在细胞水平上进行体外研究,对这些遗传变异进行功能分析。
{"title":"Genome-Wide Association Study (GWAS) for the Infliximab Responsiveness in Korean Inflammatory Bowel Disease Patients","authors":"Z. Park, Ko-Woon Choi, D. Seo, S. Ryu, Jong Gu Lee, W. Oh","doi":"10.1145/3107411.3108218","DOIUrl":"https://doi.org/10.1145/3107411.3108218","url":null,"abstract":"Inflammatory bowel disease (IBD) subdividing into Crohn's disease (CD) and ulcerative colitis (UC) is a chronic intestinal inflammatory disorder. Infliximab (IFX) as an anti-TNF-α has been prescribed for treatment of IBD patients. However, some patients show no response or a loss of response to this agent. In this study, we investigated to identify genetic variants associated with response to IFX. A total of 148 IBD patients from Yonsei University Health System who received IFX were classified according to subtypes of IBD except 12 patients unsuitable for this study. We also categorized the patients into three groups by IFX response; response (sustained response, loss of response), nonresponse. Whole exome sequencing (WES) was performed and identified on average 35,000 variants including silent, missense and nonsense mutation in each sample. We performed GWAS using the WES data to find out genetic variants associated with response to IFX. We identified only missense variants with suggestive evidence of association. In CD patients, AEBP1 (rs2537188) was associated with nonresponse and PLA2R1 (rs35771982, rs3749117) and IDO2 (rs10109853) were associated with loss of response. In UC patients, AMACR (rs10941112, rs3195676) was associated with loss of response. Furthermore, we will investigate in vitro study at the cellular level for the functional analysis of those genetic variants.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132235677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Differential Compound Prioritization via Bi-Directional Selectivity Push with Power 基于功率的双向选择性推动的差分化合物优先排序
Junfeng Liu, Xia Ning
Effective in silico compound prioritization is critical to identify promising candidates in the early stages of drug discovery. Current methods typically focus on compound ranking based on one single property, for example, activity, against a single target. However, compound selectivity is also a key property that should be deliberated simultaneously so as to reduce the likelihood of undesired side effects of future drugs. In this paper, we present a novel machine learning based differential compound prioritization method dCPPP. This dCPPP method learns compound prioritization models that rank active compounds well, and meanwhile, preferably rank selective compounds higher via a bi-directional push strategy. The bidirectional push is enhanced with push powers that are determined by ranking difference of selective compounds over multiple bioassays. Our experiments demonstrate that the dCPPP achieves an overall 19.221% improvement on prioritizing selective compounds over baseline models.
有效的硅化合物优先排序对于在药物发现的早期阶段确定有希望的候选药物至关重要。当前的方法通常侧重于针对单个目标基于单个属性(例如,activity)的复合排名。然而,化合物的选择性也是一个需要同时考虑的关键特性,以减少未来药物产生不良副作用的可能性。本文提出了一种基于机器学习的差分复合优先排序方法dCPPP。该方法学习的化合物优先排序模型对活性化合物进行了较好的排序,同时通过双向推送策略对选择性化合物进行了较好的排序。双向推动是通过在多种生物测定中选择化合物的排序差异来确定的。我们的实验表明,与基线模型相比,dCPPP在优先选择化合物方面实现了19.221%的总体改进。
{"title":"Differential Compound Prioritization via Bi-Directional Selectivity Push with Power","authors":"Junfeng Liu, Xia Ning","doi":"10.1145/3107411.3107486","DOIUrl":"https://doi.org/10.1145/3107411.3107486","url":null,"abstract":"Effective in silico compound prioritization is critical to identify promising candidates in the early stages of drug discovery. Current methods typically focus on compound ranking based on one single property, for example, activity, against a single target. However, compound selectivity is also a key property that should be deliberated simultaneously so as to reduce the likelihood of undesired side effects of future drugs. In this paper, we present a novel machine learning based differential compound prioritization method dCPPP. This dCPPP method learns compound prioritization models that rank active compounds well, and meanwhile, preferably rank selective compounds higher via a bi-directional push strategy. The bidirectional push is enhanced with push powers that are determined by ranking difference of selective compounds over multiple bioassays. Our experiments demonstrate that the dCPPP achieves an overall 19.221% improvement on prioritizing selective compounds over baseline models.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133765257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
期刊
Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1