首页 > 最新文献

In Silico Biology最新文献

英文 中文
Iterative visual clustering for unstructured text mining 用于非结构化文本挖掘的迭代视觉聚类
Q2 Medicine Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722054
Qian You, S. Fang, P. Ebright
This paper proposes the iterative visual clustering (IVC) on unstructured text sequences to form and evaluate keyword clusters, based on which users can use visual analysis, domain knowledge to discover knowledge in the text. The text sequence data are broken down into a list representative keywords after textual evaluation, and the keywords are then grouped to form keyword clusters via an iterative stochastic process and are visualized as distributions over the time lines. The visual evaluation model provides shape evaluations as quantitative tools and users' interactions as qualitative tools to visually investigate the trends, patterns represented by the keyword clusters' distributions. The keyword clustering model, guided by the feedback of visual evaluations, step-wisely enumerates newer generations of keyword clusters and their patterns, therefore narrows down the search space. Then the proposed IVC is applied onto nursing narratives and is able to identify interesting keyword clusters implying hidden knowledge regarding to the working patterns and environment of registered nurses. The loop of producing next generation of keyword clusters in IVC is driven and controlled by users' perception, domain knowledge and interactions, and it is also guided by a stochastic search model. So both semantic and distribution features enable IVC to have significant applications as a text mining tool, on many other data sets, such as biomedical literatures.
本文提出了对非结构化文本序列进行迭代视觉聚类(IVC)来形成关键字聚类并对其进行评价,用户可以在此基础上利用视觉分析、领域知识来发现文本中的知识。文本序列数据经过文本评估后被分解为一个具有代表性的关键词列表,然后通过迭代随机过程将这些关键词分组形成关键字簇,并将其可视化为时间线上的分布。视觉评价模型将形状评价作为定量工具,将用户交互作为定性工具,直观地考察关键字聚类分布所代表的趋势和模式。关键词聚类模型在视觉评价反馈的指导下,逐步列举出新一代的关键词聚类及其模式,从而缩小了搜索空间。然后将所提出的IVC应用于护理叙述,并能够识别有关注册护士工作模式和环境的隐含知识的有趣关键字聚类。在IVC中产生下一代关键字聚类的循环是由用户感知、领域知识和交互驱动和控制的,并以随机搜索模型为指导。因此,语义和分布特性使IVC作为文本挖掘工具在许多其他数据集(如生物医学文献)上具有重要的应用。
{"title":"Iterative visual clustering for unstructured text mining","authors":"Qian You, S. Fang, P. Ebright","doi":"10.1145/1722024.1722054","DOIUrl":"https://doi.org/10.1145/1722024.1722054","url":null,"abstract":"This paper proposes the iterative visual clustering (IVC) on unstructured text sequences to form and evaluate keyword clusters, based on which users can use visual analysis, domain knowledge to discover knowledge in the text. The text sequence data are broken down into a list representative keywords after textual evaluation, and the keywords are then grouped to form keyword clusters via an iterative stochastic process and are visualized as distributions over the time lines. The visual evaluation model provides shape evaluations as quantitative tools and users' interactions as qualitative tools to visually investigate the trends, patterns represented by the keyword clusters' distributions. The keyword clustering model, guided by the feedback of visual evaluations, step-wisely enumerates newer generations of keyword clusters and their patterns, therefore narrows down the search space. Then the proposed IVC is applied onto nursing narratives and is able to identify interesting keyword clusters implying hidden knowledge regarding to the working patterns and environment of registered nurses. The loop of producing next generation of keyword clusters in IVC is driven and controlled by users' perception, domain knowledge and interactions, and it is also guided by a stochastic search model. So both semantic and distribution features enable IVC to have significant applications as a text mining tool, on many other data sets, such as biomedical literatures.","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"1 1","pages":"26"},"PeriodicalIF":0.0,"publicationDate":"2010-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/1722024.1722054","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64108347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
DNA sequence representation methods DNA序列表示方法
Q2 Medicine Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722073
G. Santhosh Kumar, S. Shiji
DNA sequence representation methods are used to denote a gene structure effectively and help in similarities/dissimilarities analysis of coding sequences. Many different kinds of representations have been proposed in the literature. They can be broadly classified into Numerical, Graphical, Geometrical and Hybrid representation methods. DNA structure and function analysis are made easy with graphical and geometrical representation methods since it gives visual representation of a DNA structure. In numerical method, numerical values are assigned to a sequence and digital signal processing methods are used to analyze the sequence. Hybrid approaches are also reported in the literature to analyze DNA sequences. This paper reviews the latest developments in DNA Sequence representation methods. We also present a taxonomy of various methods. A comparison of these methods where ever possible is also done.
DNA序列表示方法用于有效地表示基因结构,有助于编码序列的异同分析。文献中提出了许多不同类型的表述。它们大致可分为数值表示、图形表示、几何表示和混合表示。DNA的结构和功能分析很容易与图形和几何表示方法,因为它提供了DNA结构的视觉表示。在数值方法中,将数值赋给序列,并使用数字信号处理方法对序列进行分析。混合方法也报道在文献中分析DNA序列。本文综述了DNA序列表示方法的最新进展。我们还提出了各种方法的分类。尽可能地对这些方法进行比较。
{"title":"DNA sequence representation methods","authors":"G. Santhosh Kumar, S. Shiji","doi":"10.1145/1722024.1722073","DOIUrl":"https://doi.org/10.1145/1722024.1722073","url":null,"abstract":"DNA sequence representation methods are used to denote a gene structure effectively and help in similarities/dissimilarities analysis of coding sequences. Many different kinds of representations have been proposed in the literature. They can be broadly classified into Numerical, Graphical, Geometrical and Hybrid representation methods. DNA structure and function analysis are made easy with graphical and geometrical representation methods since it gives visual representation of a DNA structure. In numerical method, numerical values are assigned to a sequence and digital signal processing methods are used to analyze the sequence. Hybrid approaches are also reported in the literature to analyze DNA sequences. This paper reviews the latest developments in DNA Sequence representation methods. We also present a taxonomy of various methods. A comparison of these methods where ever possible is also done.","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"1 1","pages":"42"},"PeriodicalIF":0.0,"publicationDate":"2010-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/1722024.1722073","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64108531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Conserved orthology in mitochondrial genomes of distantly related nematodes 远亲线虫线粒体基因组的保守同源性
Q2 Medicine Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722075
P. Nima, A. Riju, N. Reena, S. Eapen
Identification of orthologous segments plays a very important role in comparative genomics studies. In the present study, we have identified orthologous segments shared between Radopholus similis and 15 other nematodes. Complete genomes of 16 nematodes were used for the study. OSfinder was used to find the orthologous segments shared between R. similis and other 15 nematodes. Orthologous segments were visualized with the help of GTK powered Murasaki Visualizer (GMV) programme. Extremely AT rich genome of the burrowing nematode R. similis, which has the largest mitochondrial genome, was found to have orthologous segments from start position, 4 to end position 16791 with 15 nematodes. Brugia malayi, Dirofilaria immitis, Onchocerca volvulus, and Xiphinema americanum share similar orthologous segment with that of R. similis. The mitochondrial genome analysis revealed the presence of conserved gene locations in mitochondrion and the close evolutionary relationship of nematodes belonging to different clades and different parasitic habitats. This study has many practical implications like reconstruction of ancestral genome of nematode and calculation of evolutionary time.
同源片段的鉴定在比较基因组学研究中起着非常重要的作用。在本研究中,我们鉴定了相似Radopholus与其他15种线虫共有的同源片段。本研究使用了16种线虫的全基因组。利用OSfinder查找相似圆线虫与其他15种线虫共有的同源片段。在GTK驱动的Murasaki Visualizer (GMV)程序的帮助下对同源片段进行可视化。在线粒体基因组最大的穴居线虫R. similis基因组中,发现15个线虫从起始位置4到结束位置16791都有同源片段。马来布鲁氏菌、免疫dirofilia immitis、盘尾丝虫和美洲Xiphinema americanum与相似的r.s similis有相似的同源节段。线粒体基因组分析揭示了线虫线粒体中存在保守的基因位点,揭示了线虫属于不同支系和不同寄生生境的密切进化关系。该研究对线虫祖先基因组的重建和进化时间的计算具有重要的现实意义。
{"title":"Conserved orthology in mitochondrial genomes of distantly related nematodes","authors":"P. Nima, A. Riju, N. Reena, S. Eapen","doi":"10.1145/1722024.1722075","DOIUrl":"https://doi.org/10.1145/1722024.1722075","url":null,"abstract":"Identification of orthologous segments plays a very important role in comparative genomics studies. In the present study, we have identified orthologous segments shared between Radopholus similis and 15 other nematodes. Complete genomes of 16 nematodes were used for the study. OSfinder was used to find the orthologous segments shared between R. similis and other 15 nematodes. Orthologous segments were visualized with the help of GTK powered Murasaki Visualizer (GMV) programme. Extremely AT rich genome of the burrowing nematode R. similis, which has the largest mitochondrial genome, was found to have orthologous segments from start position, 4 to end position 16791 with 15 nematodes. Brugia malayi, Dirofilaria immitis, Onchocerca volvulus, and Xiphinema americanum share similar orthologous segment with that of R. similis. The mitochondrial genome analysis revealed the presence of conserved gene locations in mitochondrion and the close evolutionary relationship of nematodes belonging to different clades and different parasitic habitats. This study has many practical implications like reconstruction of ancestral genome of nematode and calculation of evolutionary time.","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"1 1","pages":"44"},"PeriodicalIF":0.0,"publicationDate":"2010-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/1722024.1722075","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64108625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chronological order of reversal events on Rickettsia genus 立克次体属逆转事件的时间顺序
Q2 Medicine Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722026
Christian Baudet, Zanoni Dias
Traditional algorithms for sorting permutations by signed reversals output one solution while the solution space can be huge. The enumeration of traces of solutions for this problem can be a powerful tool to help the study of rear-rangement scenarios which only include reversals. Through the analysis of the permutations of six members of the Rickettsia genus in relation with their common ancestral, we were able to produce all possible scenarios and infer some chronological order over the reversal events that occurred during the evolution of these species. Our results matched with the scenario proposed in the literature.
传统的带符号反转排列排序算法输出一个解,而解空间可能非常大。这个问题的解的轨迹的列举可以是一个强大的工具,以帮助研究只包括逆转的后排情景。通过对立克次体属6个成员与其共同祖先的排列分析,我们能够产生所有可能的情况,并推断出这些物种进化过程中发生的逆转事件的一些时间顺序。我们的结果与文献中提出的情况相符。
{"title":"Chronological order of reversal events on Rickettsia genus","authors":"Christian Baudet, Zanoni Dias","doi":"10.1145/1722024.1722026","DOIUrl":"https://doi.org/10.1145/1722024.1722026","url":null,"abstract":"Traditional algorithms for sorting permutations by signed reversals output one solution while the solution space can be huge. The enumeration of traces of solutions for this problem can be a powerful tool to help the study of rear-rangement scenarios which only include reversals. Through the analysis of the permutations of six members of the Rickettsia genus in relation with their common ancestral, we were able to produce all possible scenarios and infer some chronological order over the reversal events that occurred during the evolution of these species. Our results matched with the scenario proposed in the literature.","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"1 1","pages":"1"},"PeriodicalIF":0.0,"publicationDate":"2010-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/1722024.1722026","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64107908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
In silico approach to discover multi-target-directed ligands for the treatment of Alzheimer's disease 用计算机方法发现治疗阿尔茨海默病的多靶点定向配体
Q2 Medicine Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722032
A. Tyagi, Shikha Gupta, C. G. Mohan
Multi-target directed (MTD) drugs have been found to be very effective in controlling neurodegenerative diseases. We have developed an in silico strategy to screen molecules for both AChE and BACE-1 enzyme dual inhibition. Pharmacophore model development of known AChE and BACE-1 inhibitors were used for sequential virtual screening (VS) of three different small molecule databases. Eight new MTD ligands were identified using these sequential VS techniques. Among these molecule 2 obtained from NCI database was found to be most promising hit on the basis of Gold docking score and Log-BB value, and which could be further explored for experimental analysis. Our present strategy for identification of the AChE and BACE-1 dual inhibitors might be one of the promising directions to discover better leads for the treatment of Alzheimer's disease.
多靶点定向(MTD)药物已被发现在控制神经退行性疾病方面非常有效。我们已经开发了一种硅策略来筛选AChE和BACE-1酶双重抑制的分子。利用已知AChE和BACE-1抑制剂的药效团模型,对三种不同的小分子数据库进行序贯虚拟筛选(VS)。使用这些序列VS技术鉴定了8个新的MTD配体。根据Gold对接评分和Log-BB值,从NCI数据库中获得的分子2是最有希望命中的,可以进一步进行实验分析。我们目前鉴定AChE和BACE-1双抑制剂的策略可能是发现治疗阿尔茨海默病的更好线索的有希望的方向之一。
{"title":"In silico approach to discover multi-target-directed ligands for the treatment of Alzheimer's disease","authors":"A. Tyagi, Shikha Gupta, C. G. Mohan","doi":"10.1145/1722024.1722032","DOIUrl":"https://doi.org/10.1145/1722024.1722032","url":null,"abstract":"Multi-target directed (MTD) drugs have been found to be very effective in controlling neurodegenerative diseases. We have developed an in silico strategy to screen molecules for both AChE and BACE-1 enzyme dual inhibition. Pharmacophore model development of known AChE and BACE-1 inhibitors were used for sequential virtual screening (VS) of three different small molecule databases. Eight new MTD ligands were identified using these sequential VS techniques. Among these molecule 2 obtained from NCI database was found to be most promising hit on the basis of Gold docking score and Log-BB value, and which could be further explored for experimental analysis. Our present strategy for identification of the AChE and BACE-1 dual inhibitors might be one of the promising directions to discover better leads for the treatment of Alzheimer's disease.","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"1 1","pages":"6"},"PeriodicalIF":0.0,"publicationDate":"2010-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/1722024.1722032","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64108052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An integrated multistep prediction system based on wavelet filter analysis and improved instance based learning (IIBL) 基于小波滤波和改进实例学习(IIBL)的集成多步预测系统
Q2 Medicine Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722078
M. Pushpalatha, N. Nalini
In this paper we present a novel wavelet based forecast model integrating wavelet filters for denoising and Improved Instance based learning approach. The proposed model implements a novel technique that extends the nearest neighbor algorithm to include the concept of pattern matching so as to identify similar instances thus implementing a nonparametric regression approach. A hybrid distance measure combining correlation and euclidean distance to select similar instances has been proposed. To illustrate the performance and effectiveness of the proposed model simulations using Mackey-Glass benchmark series and a real time Nord pool time series used in day-ahead forecast of electricity prices have been carried out. We apply a comprehensive set of non redundant orthogonal wavelet transforms for individual wavelet subband to denoise the signal. The analysis of simulations demonstrate that the proposed wavelet based - IIBL model results in accurate predictions and encouraging results for both the series.
本文提出了一种新的基于小波的预测模型,该模型集成了小波滤波去噪和改进的基于实例的学习方法。该模型实现了一种新的技术,将最近邻算法扩展到包含模式匹配的概念,从而识别相似的实例,从而实现非参数回归方法。提出了一种结合相关度和欧氏距离的混合距离度量方法来选择相似实例。为了说明所提出的模型模拟的性能和有效性,使用了Mackey-Glass基准序列和实时Nord pool时间序列进行了日前电价预测。我们对单个小波子带采用一套完整的无冗余正交小波变换来对信号进行降噪。仿真分析表明,所提出的基于小波的- IIBL模型对两个序列的预测结果都是准确的和令人鼓舞的。
{"title":"An integrated multistep prediction system based on wavelet filter analysis and improved instance based learning (IIBL)","authors":"M. Pushpalatha, N. Nalini","doi":"10.1145/1722024.1722078","DOIUrl":"https://doi.org/10.1145/1722024.1722078","url":null,"abstract":"In this paper we present a novel wavelet based forecast model integrating wavelet filters for denoising and Improved Instance based learning approach. The proposed model implements a novel technique that extends the nearest neighbor algorithm to include the concept of pattern matching so as to identify similar instances thus implementing a nonparametric regression approach. A hybrid distance measure combining correlation and euclidean distance to select similar instances has been proposed. To illustrate the performance and effectiveness of the proposed model simulations using Mackey-Glass benchmark series and a real time Nord pool time series used in day-ahead forecast of electricity prices have been carried out. We apply a comprehensive set of non redundant orthogonal wavelet transforms for individual wavelet subband to denoise the signal. The analysis of simulations demonstrate that the proposed wavelet based - IIBL model results in accurate predictions and encouraging results for both the series.","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"1 1","pages":"47"},"PeriodicalIF":0.0,"publicationDate":"2010-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/1722024.1722078","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64108179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inference of gene regulatory network using modified genetic algorithm 基于改进遗传算法的基因调控网络推理
Q2 Medicine Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722049
S. Seema, K. Ramanatha
The major challenge of inferring genetic network is mining the dependencies and regulating relationship among genes. The paper tries to address this problem using Genetic Algorithms to infer the transcription regulatory network. While Genetic Algorithms(GA) are able to infer smaller networks with good sensitivity and precision, several generations and much greater computation power are required to infer regulatory networks from realistic data. Here a modified GA that uses statistical techniques to narrow the search space is proposed. The system is tested on the publicly available datasets of the Hela cell cycle and Yeast cell cycle. The results have been compared with regulatory networks inferred by using second order differential equations. It is found that the sensitivity and specificity are at par with differential equation method and has a considerable improvement in comparison with the Basic GA method.
基因网络推断的主要挑战是挖掘基因间的依赖关系和调节关系。本文试图利用遗传算法来推断转录调控网络来解决这一问题。虽然遗传算法(GA)能够以良好的灵敏度和精度推断较小的网络,但从实际数据推断监管网络需要几代和更大的计算能力。本文提出了一种改进的遗传算法,利用统计技术来缩小搜索空间。该系统在Hela细胞周期和酵母细胞周期的公开数据集上进行了测试。结果与利用二阶微分方程推导的调节网络进行了比较。结果表明,该方法的灵敏度和特异度与微分方程法相当,与基本遗传算法相比有较大提高。
{"title":"Inference of gene regulatory network using modified genetic algorithm","authors":"S. Seema, K. Ramanatha","doi":"10.1145/1722024.1722049","DOIUrl":"https://doi.org/10.1145/1722024.1722049","url":null,"abstract":"The major challenge of inferring genetic network is mining the dependencies and regulating relationship among genes. The paper tries to address this problem using Genetic Algorithms to infer the transcription regulatory network. While Genetic Algorithms(GA) are able to infer smaller networks with good sensitivity and precision, several generations and much greater computation power are required to infer regulatory networks from realistic data. Here a modified GA that uses statistical techniques to narrow the search space is proposed. The system is tested on the publicly available datasets of the Hela cell cycle and Yeast cell cycle. The results have been compared with regulatory networks inferred by using second order differential equations. It is found that the sensitivity and specificity are at par with differential equation method and has a considerable improvement in comparison with the Basic GA method.","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"1 1","pages":"21"},"PeriodicalIF":0.0,"publicationDate":"2010-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/1722024.1722049","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64108255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exhaustive analysis of the modular structure of the spliceosomal assembly network: a Petri net approach. 剪接体组装网络的模块结构的详尽分析:一个Petri网方法。
Q2 Medicine Pub Date : 2010-01-01 DOI: 10.3233/ISB-2010-0419
Ralf H Bortfeldt, Stefan Schuster, Ina Koch

Spliceosomes are macro-complexes involving hundreds of proteins with many functional interactions. Spliceosome assembly belongs to the key processes that enable splicing of mRNA and modulate alternative splicing. A detailed list of factors involved in spliceosomal reactions has been assorted over the past decade, but, their functional interplay is often unknown and most of the present biological models cover only parts of the complete assembly process. It is a challenging task to build a computational model that integrates dispersed knowledge and combines a multitude of reaction schemes proposed earlier.Because for most reactions involved in spliceosome assembly kinetic parameters are not available, we propose a discrete modeling using Petri nets, through which we are enabled to get insights into the system's behavior via computation of structural and dynamic properties. In this paper, we compile and examine reactions from experimental reports that contribute to a functional spliceosome. All these reactions form a network, which describes the inventory and conditions necessary to perform the splicing process. The analysis is mainly based on system invariants. Transition invariants (T-invariants) can be interpreted as signaling routes through the network. Due to the huge number of T-invariants that arise with increasing network size and complexity, maximal common transition sets (MCTS) and T-clusters were used for further analysis. Additionally, we introduce a false color map representation, which allows a quick survey of network modules and the visual detection of single reactions or reaction sequences, which participate in more than one signaling route. We designed a structured model of spliceosome assembly, which combines the demands on a platform that i) can display involved factors and concurrent processes, ii) offers the possibility to run computational methods for knowledge extraction, and iii) is successively extendable as new insights into spliceosome function are reported by experimental reports. The network consists of 161 transitions (reactions) and 140 places (reactants). All reactions are part of at least one of the 71 T-invariants. These T-invariants define pathways, which are in good agreement with the current knowledge and known hypotheses on reaction sequences during spliceosome assembly, hence contributing to a functional spliceosome. We demonstrate that present knowledge, in particular of the initial part of the assembly process, describes parallelism and interaction of signaling routes, which indicate functional redundancy and reflect the dependency of spliceosome assembly initiation on different cellular conditions. The complexity of the network is further increased by two switches, which introduce alternative routes during A-complex formation in early spliceosome assembly and upon transition from the B-complex to the C-complex. By compiling known reactions into a complete network, the combinatorial nature of invariant

剪接体是包含数百种具有多种功能相互作用的蛋白质的宏观复合物。剪接体组装是mRNA剪接和调节选择性剪接的关键过程。在过去的十年中,剪接体反应中涉及的因素的详细列表已经分类,但是,它们的功能相互作用通常是未知的,而且大多数目前的生物学模型只涵盖了完整组装过程的一部分。建立一个集成了分散的知识并结合了先前提出的多种反应方案的计算模型是一项具有挑战性的任务。由于大多数涉及剪接体装配的反应动力学参数不可用,我们提出了一个使用Petri网的离散建模,通过该模型,我们能够通过计算结构和动态特性来深入了解系统的行为。在本文中,我们从实验报告中编译并检查了有助于功能剪接体的反应。所有这些反应形成一个网络,描述了执行拼接过程所需的清单和条件。分析主要基于系统不变量。转换不变量(t不变量)可以解释为通过网络的信令路由。由于随着网络规模和复杂性的增加而出现大量的t不变量,因此使用最大公共转移集(MCTS)和t聚类进行进一步分析。此外,我们引入了一种假彩色地图表示,它允许快速调查网络模块和视觉检测单个反应或反应序列,这些反应或反应序列参与多个信号通路。我们设计了一个剪接体组装的结构化模型,该模型结合了对平台的需求,i)可以显示相关因素和并发过程,ii)提供运行知识提取的计算方法的可能性,iii)随着实验报告对剪接体功能的新见解的报道而不断扩展。该网络由161个跃迁(反应)和140个位置(反应物)组成。所有的反应都至少属于71个t不变量中的一个。这些t不变量定义了通路,这与目前关于剪接体组装过程中反应序列的知识和已知假设很好地一致,因此有助于剪接体的功能。我们证明了目前的知识,特别是组装过程的初始部分,描述了信号通路的并行性和相互作用,这表明功能冗余,并反映了剪接体组装起始对不同细胞条件的依赖性。两个开关进一步增加了网络的复杂性,这两个开关在早期剪接体组装的a复合体形成过程中以及从b复合体过渡到c复合体时引入了可选的路线。通过将已知的反应编译成一个完整的网络,不变计算的组合性质导致了以前没有被描述为连接路线的途径,尽管它们的成分是已知的。t簇将网络划分为模块,我们将其解释为剪接体成熟的构建块。我们得出结论,大型生物网络和系统不变量的Petri网表示非常适合作为验证实验知识整合到一致模型中的手段。基于该网络模型,便于进一步的实验设计。
{"title":"Exhaustive analysis of the modular structure of the spliceosomal assembly network: a Petri net approach.","authors":"Ralf H Bortfeldt,&nbsp;Stefan Schuster,&nbsp;Ina Koch","doi":"10.3233/ISB-2010-0419","DOIUrl":"https://doi.org/10.3233/ISB-2010-0419","url":null,"abstract":"<p><p>Spliceosomes are macro-complexes involving hundreds of proteins with many functional interactions. Spliceosome assembly belongs to the key processes that enable splicing of mRNA and modulate alternative splicing. A detailed list of factors involved in spliceosomal reactions has been assorted over the past decade, but, their functional interplay is often unknown and most of the present biological models cover only parts of the complete assembly process. It is a challenging task to build a computational model that integrates dispersed knowledge and combines a multitude of reaction schemes proposed earlier.Because for most reactions involved in spliceosome assembly kinetic parameters are not available, we propose a discrete modeling using Petri nets, through which we are enabled to get insights into the system's behavior via computation of structural and dynamic properties. In this paper, we compile and examine reactions from experimental reports that contribute to a functional spliceosome. All these reactions form a network, which describes the inventory and conditions necessary to perform the splicing process. The analysis is mainly based on system invariants. Transition invariants (T-invariants) can be interpreted as signaling routes through the network. Due to the huge number of T-invariants that arise with increasing network size and complexity, maximal common transition sets (MCTS) and T-clusters were used for further analysis. Additionally, we introduce a false color map representation, which allows a quick survey of network modules and the visual detection of single reactions or reaction sequences, which participate in more than one signaling route. We designed a structured model of spliceosome assembly, which combines the demands on a platform that i) can display involved factors and concurrent processes, ii) offers the possibility to run computational methods for knowledge extraction, and iii) is successively extendable as new insights into spliceosome function are reported by experimental reports. The network consists of 161 transitions (reactions) and 140 places (reactants). All reactions are part of at least one of the 71 T-invariants. These T-invariants define pathways, which are in good agreement with the current knowledge and known hypotheses on reaction sequences during spliceosome assembly, hence contributing to a functional spliceosome. We demonstrate that present knowledge, in particular of the initial part of the assembly process, describes parallelism and interaction of signaling routes, which indicate functional redundancy and reflect the dependency of spliceosome assembly initiation on different cellular conditions. The complexity of the network is further increased by two switches, which introduce alternative routes during A-complex formation in early spliceosome assembly and upon transition from the B-complex to the C-complex. By compiling known reactions into a complete network, the combinatorial nature of invariant ","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"10 1","pages":"89-123"},"PeriodicalIF":0.0,"publicationDate":"2010-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3233/ISB-2010-0419","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30512780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Cell Illustrator 4.0: a computational platform for systems biology. Cell Illustrator 4.0:系统生物学的计算平台。
Q2 Medicine Pub Date : 2010-01-01 DOI: 10.3233/ISB-2010-0415
Masao Nagasaki, Ayumu Saito, Euna Jeong, Chen Li, Kaname Kojima, Emi Ikeda, Satoru Miyano

Cell Illustrator is a software platform for Systems Biology that uses the concept of Petri net for modeling and simulating biopathways. It is intended for biological scientists working at bench. The latest version of Cell Illustrator 4.0 uses Java Web Start technology and is enhanced with new capabilities, including: automatic graph grid layout algorithms using ontology information; tools using Cell System Markup Language (CSML) 3.0 and Cell System Ontology 3.0; parameter search module; high-performance simulation module; CSML database management system; conversion from CSML model to programming languages (FORTRAN, C, C++, Java, Python and Perl); import from SBML, CellML, and BioPAX; and, export to SVG and HTML. Cell Illustrator employs an extension of hybrid Petri net in an object-oriented style so that biopathway models can include objects such as DNA sequence, molecular density, 3D localization information, transcription with frame-shift, translation with codon table, as well as biochemical reactions.

Cell Illustrator是一个系统生物学的软件平台,它使用Petri网的概念来建模和模拟生物途径。它是为在实验室工作的生物科学家设计的。最新版本的Cell Illustrator 4.0使用Java Web Start技术,并增强了新的功能,包括:使用本体信息的自动图形网格布局算法;使用细胞系统标记语言(CSML) 3.0和细胞系统本体3.0工具;参数搜索模块;高性能仿真模块;CSML数据库管理系统;从CSML模型到编程语言(FORTRAN, C, c++, Java, Python和Perl)的转换;从SBML、CellML和BioPAX导入;导出为SVG和HTML。Cell Illustrator采用面向对象风格的混合Petri网扩展,因此生物通路模型可以包括诸如DNA序列,分子密度,3D定位信息,带帧移位的转录,带密码子表的翻译以及生化反应等对象。
{"title":"Cell Illustrator 4.0: a computational platform for systems biology.","authors":"Masao Nagasaki,&nbsp;Ayumu Saito,&nbsp;Euna Jeong,&nbsp;Chen Li,&nbsp;Kaname Kojima,&nbsp;Emi Ikeda,&nbsp;Satoru Miyano","doi":"10.3233/ISB-2010-0415","DOIUrl":"https://doi.org/10.3233/ISB-2010-0415","url":null,"abstract":"<p><p>Cell Illustrator is a software platform for Systems Biology that uses the concept of Petri net for modeling and simulating biopathways. It is intended for biological scientists working at bench. The latest version of Cell Illustrator 4.0 uses Java Web Start technology and is enhanced with new capabilities, including: automatic graph grid layout algorithms using ontology information; tools using Cell System Markup Language (CSML) 3.0 and Cell System Ontology 3.0; parameter search module; high-performance simulation module; CSML database management system; conversion from CSML model to programming languages (FORTRAN, C, C++, Java, Python and Perl); import from SBML, CellML, and BioPAX; and, export to SVG and HTML. Cell Illustrator employs an extension of hybrid Petri net in an object-oriented style so that biopathway models can include objects such as DNA sequence, molecular density, 3D localization information, transcription with frame-shift, translation with codon table, as well as biochemical reactions.</p>","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"10 1","pages":"5-26"},"PeriodicalIF":0.0,"publicationDate":"2010-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3233/ISB-2010-0415","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30513843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 98
A computational approach for identification of epitopes in dengue virus envelope protein: a step towards designing a universal dengue vaccine targeting endemic regions. 鉴定登革热病毒包膜蛋白表位的计算方法:朝着设计针对流行区域的通用登革热疫苗迈出的一步。
Q2 Medicine Pub Date : 2010-01-01 DOI: 10.3233/ISB-2010-0435
Sajib Chakraborty, Rajib Chakravorty, Musaddeque Ahmed, Atiqur Rahman, T M Zaved Waise, Faizule Hassan, Mustafizur Rahman, Sohel Shamsuzzaman

A major problem in designing vaccine for the dengue virus has been the high antigenic variability in the envelope protein of different virus strains. In this study, a computational approach was adopted to identify a multi-epitope vaccine candidate against dengue virus that may be suitable for large populations in the dengue-endemic regions. Different bioinformatics tools were exploited that helped the identification of a conserved immunological hot-spot in the dengue envelope protein. The tools also rendered the prediction of immunogenicity and population coverage to the proposed 'in silico' vaccine candidate against dengue. A peptide region, spanning 19 amino acids, was identified in the envelope protein which found to be conserved in all four types of dengue viruses. Ten proteasomal cleavage sites were identified within the 19-mer conserved peptide sequence and a total of 8 overlapping putative cytotoxic T cell (CTL) epitopes were identified. The immunogenicity of these epitopes was evaluated in terms of their binding affinities to and dissociation half-time from respective human leukocyte antigen (HLA) molecules. The HLA allele frequencies were studied among populations in the dengue endemic regions and compared with respect to HLA restriction patterns of the overlapping epitopes. The cumulative population coverage for these epitopes as vaccine candidates was high ranging from approximately 80% to 92%. Structural analysis suggested that a 9-mer epitope fitted well into the peptide-binding groove of HLA-A*0201. In conclusion, the 19-mer epitope cluster was shown to have the potential for use as a vaccine candidate against dengue.

设计登革热病毒疫苗的一个主要问题是不同病毒株包膜蛋白的高度抗原变异性。在这项研究中,采用了一种计算方法来确定一种针对登革热病毒的多表位候选疫苗,这种疫苗可能适用于登革热流行地区的大量人群。利用不同的生物信息学工具,帮助鉴定登革热包膜蛋白中的保守免疫热点。这些工具还预测了拟议的针对登革热的“硅”候选疫苗的免疫原性和人口覆盖率。在包膜蛋白中发现了一个跨越19个氨基酸的肽区,发现该区域在所有四种登革热病毒中都是保守的。在19个保守肽序列中鉴定出10个蛋白酶体切割位点,共鉴定出8个重叠的假定细胞毒性T细胞(CTL)表位。这些表位的免疫原性是根据它们与各自的人类白细胞抗原(HLA)分子的结合亲和力和解离半时间来评估的。研究了登革热流行地区人群HLA等位基因频率,并比较了重叠表位的HLA限制性模式。这些表位作为候选疫苗的累积人群覆盖率很高,约为80%至92%。结构分析表明,一个9-mer表位与HLA-A*0201的肽结合槽非常吻合。总之,该19-mer表位簇具有作为登革热候选疫苗的潜力。
{"title":"A computational approach for identification of epitopes in dengue virus envelope protein: a step towards designing a universal dengue vaccine targeting endemic regions.","authors":"Sajib Chakraborty,&nbsp;Rajib Chakravorty,&nbsp;Musaddeque Ahmed,&nbsp;Atiqur Rahman,&nbsp;T M Zaved Waise,&nbsp;Faizule Hassan,&nbsp;Mustafizur Rahman,&nbsp;Sohel Shamsuzzaman","doi":"10.3233/ISB-2010-0435","DOIUrl":"https://doi.org/10.3233/ISB-2010-0435","url":null,"abstract":"<p><p>A major problem in designing vaccine for the dengue virus has been the high antigenic variability in the envelope protein of different virus strains. In this study, a computational approach was adopted to identify a multi-epitope vaccine candidate against dengue virus that may be suitable for large populations in the dengue-endemic regions. Different bioinformatics tools were exploited that helped the identification of a conserved immunological hot-spot in the dengue envelope protein. The tools also rendered the prediction of immunogenicity and population coverage to the proposed 'in silico' vaccine candidate against dengue. A peptide region, spanning 19 amino acids, was identified in the envelope protein which found to be conserved in all four types of dengue viruses. Ten proteasomal cleavage sites were identified within the 19-mer conserved peptide sequence and a total of 8 overlapping putative cytotoxic T cell (CTL) epitopes were identified. The immunogenicity of these epitopes was evaluated in terms of their binding affinities to and dissociation half-time from respective human leukocyte antigen (HLA) molecules. The HLA allele frequencies were studied among populations in the dengue endemic regions and compared with respect to HLA restriction patterns of the overlapping epitopes. The cumulative population coverage for these epitopes as vaccine candidates was high ranging from approximately 80% to 92%. Structural analysis suggested that a 9-mer epitope fitted well into the peptide-binding groove of HLA-A*0201. In conclusion, the 19-mer epitope cluster was shown to have the potential for use as a vaccine candidate against dengue.</p>","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"10 5-6","pages":"235-46"},"PeriodicalIF":0.0,"publicationDate":"2010-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3233/ISB-2010-0435","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30512906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 56
期刊
In Silico Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1