In Silico Biology最新文献

英文中文

Iterative visual clustering for unstructured text mining 用于非结构化文本挖掘的迭代视觉聚类

Q2 Medicine

In Silico Biology

Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722054

Qian You, S. Fang, P. Ebright

This paper proposes the iterative visual clustering (IVC) on unstructured text sequences to form and evaluate keyword clusters, based on which users can use visual analysis, domain knowledge to discover knowledge in the text. The text sequence data are broken down into a list representative keywords after textual evaluation, and the keywords are then grouped to form keyword clusters via an iterative stochastic process and are visualized as distributions over the time lines. The visual evaluation model provides shape evaluations as quantitative tools and users' interactions as qualitative tools to visually investigate the trends, patterns represented by the keyword clusters' distributions. The keyword clustering model, guided by the feedback of visual evaluations, step-wisely enumerates newer generations of keyword clusters and their patterns, therefore narrows down the search space. Then the proposed IVC is applied onto nursing narratives and is able to identify interesting keyword clusters implying hidden knowledge regarding to the working patterns and environment of registered nurses. The loop of producing next generation of keyword clusters in IVC is driven and controlled by users' perception, domain knowledge and interactions, and it is also guided by a stochastic search model. So both semantic and distribution features enable IVC to have significant applications as a text mining tool, on many other data sets, such as biomedical literatures.

本文提出了对非结构化文本序列进行迭代视觉聚类(IVC)来形成关键字聚类并对其进行评价，用户可以在此基础上利用视觉分析、领域知识来发现文本中的知识。文本序列数据经过文本评估后被分解为一个具有代表性的关键词列表，然后通过迭代随机过程将这些关键词分组形成关键字簇，并将其可视化为时间线上的分布。视觉评价模型将形状评价作为定量工具，将用户交互作为定性工具，直观地考察关键字聚类分布所代表的趋势和模式。关键词聚类模型在视觉评价反馈的指导下，逐步列举出新一代的关键词聚类及其模式，从而缩小了搜索空间。然后将所提出的IVC应用于护理叙述，并能够识别有关注册护士工作模式和环境的隐含知识的有趣关键字聚类。在IVC中产生下一代关键字聚类的循环是由用户感知、领域知识和交互驱动和控制的，并以随机搜索模型为指导。因此，语义和分布特性使IVC作为文本挖掘工具在许多其他数据集(如生物医学文献)上具有重要的应用。

{"title":"Iterative visual clustering for unstructured text mining","authors":"Qian You, S. Fang, P. Ebright","doi":"10.1145/1722024.1722054","DOIUrl":"https://doi.org/10.1145/1722024.1722054","url":null,"abstract":"This paper proposes the iterative visual clustering (IVC) on unstructured text sequences to form and evaluate keyword clusters, based on which users can use visual analysis, domain knowledge to discover knowledge in the text. The text sequence data are broken down into a list representative keywords after textual evaluation, and the keywords are then grouped to form keyword clusters via an iterative stochastic process and are visualized as distributions over the time lines. The visual evaluation model provides shape evaluations as quantitative tools and users' interactions as qualitative tools to visually investigate the trends, patterns represented by the keyword clusters' distributions. The keyword clustering model, guided by the feedback of visual evaluations, step-wisely enumerates newer generations of keyword clusters and their patterns, therefore narrows down the search space. Then the proposed IVC is applied onto nursing narratives and is able to identify interesting keyword clusters implying hidden knowledge regarding to the working patterns and environment of registered nurses. The loop of producing next generation of keyword clusters in IVC is driven and controlled by users' perception, domain knowledge and interactions, and it is also guided by a stochastic search model. So both semantic and distribution features enable IVC to have significant applications as a text mining tool, on many other data sets, such as biomedical literatures.","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"1 1","pages":"26"},"PeriodicalIF":0.0,"publicationDate":"2010-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/1722024.1722054","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64108347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Mining SSR and SNP/Indel sites in expressed sequence tag libraries of Radopholus similis 类人猿表达序列标签库中SSR和SNP/Indel位点的挖掘

Q2 Medicine

In Silico Biology

Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722042

A. Riju, P. Lakshmi, P. Nima, N. Reena, S. Eapen

The objective of this study is to explore the single sequence repeats (SSRs) and single nucleotide polymorphims (SNPs) in expressed sequence tags (ESTs) of Radopholus similis. We retrieved 7380 EST sequences consisting different tissues/condition libraries from dbEST of National Centre for Biotechnology Information (NCBI). A total of 1449 SSRs were detected by MISA perl script. Hexa-nucleotide repeats (836 nos.) followed by mononucleotide repeats (207 nos.) were found to be more abundant than other types of repeats. Putative SNP/Indels were found out with the help of AutoSNP. As many as 1038 SNPs and 108 small indels (insertion/deletion) were found with a density of one SNP/191 bp and one indel/1.8 kbp. Candidate SNPs were categorized according to nucleotide substitution as either transition (C↔T or G↔A) or transversion (C↔G, A↔T, C↔A or T↔G). We observed a higher number of transversions type substitution (537) than transitions (501). However considering the individual substitutions, G↔A (281) and C↔T (220) were found to be predominant than purine to pyrimidine base substitutions. Since the SSR and SNP markers are invaluable tools for genetic analysis, the identified SSRs and SNPs of R. similis could be used in diversity analysis, genetic trait mapping, association studies and marker assisted selection.

本研究的目的是探讨相似Radopholus similis表达序列标签(est)中的单序列重复序列(SSRs)和单核苷酸多态性(snp)。我们从美国国家生物技术信息中心(NCBI)的dbEST数据库中检索了7380条EST序列，这些序列包含不同的组织/条件库。MISA perl脚本共检测到1449个SSRs。六核苷酸重复序列(836个)和单核苷酸重复序列(207个)比其他类型的重复序列更丰富。假定的SNP/Indels是在AutoSNP的帮助下发现的。共发现1038个SNP和108个小缺失(插入/缺失)，密度分别为1个SNP/191 bp和1个indel/1.8 kbp。候选snp根据核苷酸替换分为过渡(C↔T或G↔A)或转换(C↔G, A↔T, C↔A或T↔G)。我们观察到更多的翻转型取代(537)比过渡(501)。然而，考虑到单个替换，我们发现G↔A(281)和C↔T(220)比嘌呤到嘧啶基替换更占优势。由于SSR和SNP标记是遗传分析的宝贵工具，因此鉴定出的相似根SSR和SNP可用于多样性分析、遗传性状定位、关联研究和标记辅助选择。

{"title":"Mining SSR and SNP/Indel sites in expressed sequence tag libraries of Radopholus similis","authors":"A. Riju, P. Lakshmi, P. Nima, N. Reena, S. Eapen","doi":"10.1145/1722024.1722042","DOIUrl":"https://doi.org/10.1145/1722024.1722042","url":null,"abstract":"The objective of this study is to explore the single sequence repeats (SSRs) and single nucleotide polymorphims (SNPs) in expressed sequence tags (ESTs) of Radopholus similis. We retrieved 7380 EST sequences consisting different tissues/condition libraries from dbEST of National Centre for Biotechnology Information (NCBI). A total of 1449 SSRs were detected by MISA perl script. Hexa-nucleotide repeats (836 nos.) followed by mononucleotide repeats (207 nos.) were found to be more abundant than other types of repeats. Putative SNP/Indels were found out with the help of AutoSNP. As many as 1038 SNPs and 108 small indels (insertion/deletion) were found with a density of one SNP/191 bp and one indel/1.8 kbp. Candidate SNPs were categorized according to nucleotide substitution as either transition (C↔T or G↔A) or transversion (C↔G, A↔T, C↔A or T↔G). We observed a higher number of transversions type substitution (537) than transitions (501). However considering the individual substitutions, G↔A (281) and C↔T (220) were found to be predominant than purine to pyrimidine base substitutions. Since the SSR and SNP markers are invaluable tools for genetic analysis, the identified SSRs and SNPs of R. similis could be used in diversity analysis, genetic trait mapping, association studies and marker assisted selection.","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"1 1","pages":"15"},"PeriodicalIF":0.0,"publicationDate":"2010-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/1722024.1722042","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64107884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Analysis of disordered regions in protein kinase subfamilies of Homo sapiens and Coenorhabditis elegans 智人与秀丽隐杆线虫蛋白激酶亚家族紊乱区分析

Q2 Medicine

In Silico Biology

Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722028

K. Kurup, J. Natarajan

Protein kinase is a kinase enzyme that modifies other proteins by chemically adding phosphate groups to them. In this work, first the protein kinases of Coenorhabditis elegans and Homo sapiens with three or more common domain were grouped and disorder regions of protein kinases in each group were predicted. Then the similarities of the disordered regions among the organisms were found. Linear motifs present in these similar disorder regions were identified and tested for their conservation in both Homo sapiens and Coenorhabditis elegans. It is found that, though the similarities in disorder regions are high, the linear motifs are not conserved much in these distantly related organisms.

蛋白激酶是一种激酶酶，通过在其他蛋白质上添加磷酸基团来修饰它们。本研究首先对线虫和智人具有3个或3个以上共同结构域的蛋白激酶进行分组，并对每组蛋白激酶的紊乱区进行预测。然后发现了生物间无序区域的相似性。在这些相似的紊乱区域中存在线性基序，并对其在智人和秀丽隐杆线虫中的保守性进行了鉴定和测试。研究发现，尽管在这些远亲生物中，紊乱区域的相似性很高，但线性基序的保守性并不高。

引用次数: 0

Conserved orthology in mitochondrial genomes of distantly related nematodes 远亲线虫线粒体基因组的保守同源性

Q2 Medicine

In Silico Biology

Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722075

P. Nima, A. Riju, N. Reena, S. Eapen

Identification of orthologous segments plays a very important role in comparative genomics studies. In the present study, we have identified orthologous segments shared between Radopholus similis and 15 other nematodes. Complete genomes of 16 nematodes were used for the study. OSfinder was used to find the orthologous segments shared between R. similis and other 15 nematodes. Orthologous segments were visualized with the help of GTK powered Murasaki Visualizer (GMV) programme. Extremely AT rich genome of the burrowing nematode R. similis, which has the largest mitochondrial genome, was found to have orthologous segments from start position, 4 to end position 16791 with 15 nematodes. Brugia malayi, Dirofilaria immitis, Onchocerca volvulus, and Xiphinema americanum share similar orthologous segment with that of R. similis. The mitochondrial genome analysis revealed the presence of conserved gene locations in mitochondrion and the close evolutionary relationship of nematodes belonging to different clades and different parasitic habitats. This study has many practical implications like reconstruction of ancestral genome of nematode and calculation of evolutionary time.

同源片段的鉴定在比较基因组学研究中起着非常重要的作用。在本研究中，我们鉴定了相似Radopholus与其他15种线虫共有的同源片段。本研究使用了16种线虫的全基因组。利用OSfinder查找相似圆线虫与其他15种线虫共有的同源片段。在GTK驱动的Murasaki Visualizer (GMV)程序的帮助下对同源片段进行可视化。在线粒体基因组最大的穴居线虫R. similis基因组中，发现15个线虫从起始位置4到结束位置16791都有同源片段。马来布鲁氏菌、免疫dirofilia immitis、盘尾丝虫和美洲Xiphinema americanum与相似的r.s similis有相似的同源节段。线粒体基因组分析揭示了线虫线粒体中存在保守的基因位点，揭示了线虫属于不同支系和不同寄生生境的密切进化关系。该研究对线虫祖先基因组的重建和进化时间的计算具有重要的现实意义。

{"title":"Conserved orthology in mitochondrial genomes of distantly related nematodes","authors":"P. Nima, A. Riju, N. Reena, S. Eapen","doi":"10.1145/1722024.1722075","DOIUrl":"https://doi.org/10.1145/1722024.1722075","url":null,"abstract":"Identification of orthologous segments plays a very important role in comparative genomics studies. In the present study, we have identified orthologous segments shared between Radopholus similis and 15 other nematodes. Complete genomes of 16 nematodes were used for the study. OSfinder was used to find the orthologous segments shared between R. similis and other 15 nematodes. Orthologous segments were visualized with the help of GTK powered Murasaki Visualizer (GMV) programme. Extremely AT rich genome of the burrowing nematode R. similis, which has the largest mitochondrial genome, was found to have orthologous segments from start position, 4 to end position 16791 with 15 nematodes. Brugia malayi, Dirofilaria immitis, Onchocerca volvulus, and Xiphinema americanum share similar orthologous segment with that of R. similis. The mitochondrial genome analysis revealed the presence of conserved gene locations in mitochondrion and the close evolutionary relationship of nematodes belonging to different clades and different parasitic habitats. This study has many practical implications like reconstruction of ancestral genome of nematode and calculation of evolutionary time.","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"1 1","pages":"44"},"PeriodicalIF":0.0,"publicationDate":"2010-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/1722024.1722075","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64108625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Chronological order of reversal events on Rickettsia genus 立克次体属逆转事件的时间顺序

Q2 Medicine

In Silico Biology

Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722026

Christian Baudet, Zanoni Dias

Traditional algorithms for sorting permutations by signed reversals output one solution while the solution space can be huge. The enumeration of traces of solutions for this problem can be a powerful tool to help the study of rear-rangement scenarios which only include reversals. Through the analysis of the permutations of six members of the Rickettsia genus in relation with their common ancestral, we were able to produce all possible scenarios and infer some chronological order over the reversal events that occurred during the evolution of these species. Our results matched with the scenario proposed in the literature.

传统的带符号反转排列排序算法输出一个解，而解空间可能非常大。这个问题的解的轨迹的列举可以是一个强大的工具，以帮助研究只包括逆转的后排情景。通过对立克次体属6个成员与其共同祖先的排列分析，我们能够产生所有可能的情况，并推断出这些物种进化过程中发生的逆转事件的一些时间顺序。我们的结果与文献中提出的情况相符。

引用次数: 1

In silico approach to discover multi-target-directed ligands for the treatment of Alzheimer's disease 用计算机方法发现治疗阿尔茨海默病的多靶点定向配体

Q2 Medicine

In Silico Biology

Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722032

A. Tyagi, Shikha Gupta, C. G. Mohan

Multi-target directed (MTD) drugs have been found to be very effective in controlling neurodegenerative diseases. We have developed an in silico strategy to screen molecules for both AChE and BACE-1 enzyme dual inhibition. Pharmacophore model development of known AChE and BACE-1 inhibitors were used for sequential virtual screening (VS) of three different small molecule databases. Eight new MTD ligands were identified using these sequential VS techniques. Among these molecule 2 obtained from NCI database was found to be most promising hit on the basis of Gold docking score and Log-BB value, and which could be further explored for experimental analysis. Our present strategy for identification of the AChE and BACE-1 dual inhibitors might be one of the promising directions to discover better leads for the treatment of Alzheimer's disease.

多靶点定向(MTD)药物已被发现在控制神经退行性疾病方面非常有效。我们已经开发了一种硅策略来筛选AChE和BACE-1酶双重抑制的分子。利用已知AChE和BACE-1抑制剂的药效团模型，对三种不同的小分子数据库进行序贯虚拟筛选(VS)。使用这些序列VS技术鉴定了8个新的MTD配体。根据Gold对接评分和Log-BB值，从NCI数据库中获得的分子2是最有希望命中的，可以进一步进行实验分析。我们目前鉴定AChE和BACE-1双抑制剂的策略可能是发现治疗阿尔茨海默病的更好线索的有希望的方向之一。

引用次数: 0

An integrated multistep prediction system based on wavelet filter analysis and improved instance based learning (IIBL) 基于小波滤波和改进实例学习(IIBL)的集成多步预测系统

Q2 Medicine

In Silico Biology

Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722078

M. Pushpalatha, N. Nalini

In this paper we present a novel wavelet based forecast model integrating wavelet filters for denoising and Improved Instance based learning approach. The proposed model implements a novel technique that extends the nearest neighbor algorithm to include the concept of pattern matching so as to identify similar instances thus implementing a nonparametric regression approach. A hybrid distance measure combining correlation and euclidean distance to select similar instances has been proposed. To illustrate the performance and effectiveness of the proposed model simulations using Mackey-Glass benchmark series and a real time Nord pool time series used in day-ahead forecast of electricity prices have been carried out. We apply a comprehensive set of non redundant orthogonal wavelet transforms for individual wavelet subband to denoise the signal. The analysis of simulations demonstrate that the proposed wavelet based - IIBL model results in accurate predictions and encouraging results for both the series.

本文提出了一种新的基于小波的预测模型，该模型集成了小波滤波去噪和改进的基于实例的学习方法。该模型实现了一种新的技术，将最近邻算法扩展到包含模式匹配的概念，从而识别相似的实例，从而实现非参数回归方法。提出了一种结合相关度和欧氏距离的混合距离度量方法来选择相似实例。为了说明所提出的模型模拟的性能和有效性，使用了Mackey-Glass基准序列和实时Nord pool时间序列进行了日前电价预测。我们对单个小波子带采用一套完整的无冗余正交小波变换来对信号进行降噪。仿真分析表明，所提出的基于小波的- IIBL模型对两个序列的预测结果都是准确的和令人鼓舞的。

引用次数: 0

Inference of gene regulatory network using modified genetic algorithm 基于改进遗传算法的基因调控网络推理

Q2 Medicine

In Silico Biology

Pub Date : 2010-02-15 DOI: 10.1145/1722024.1722049

S. Seema, K. Ramanatha

The major challenge of inferring genetic network is mining the dependencies and regulating relationship among genes. The paper tries to address this problem using Genetic Algorithms to infer the transcription regulatory network. While Genetic Algorithms(GA) are able to infer smaller networks with good sensitivity and precision, several generations and much greater computation power are required to infer regulatory networks from realistic data. Here a modified GA that uses statistical techniques to narrow the search space is proposed. The system is tested on the publicly available datasets of the Hela cell cycle and Yeast cell cycle. The results have been compared with regulatory networks inferred by using second order differential equations. It is found that the sensitivity and specificity are at par with differential equation method and has a considerable improvement in comparison with the Basic GA method.

基因网络推断的主要挑战是挖掘基因间的依赖关系和调节关系。本文试图利用遗传算法来推断转录调控网络来解决这一问题。虽然遗传算法(GA)能够以良好的灵敏度和精度推断较小的网络，但从实际数据推断监管网络需要几代和更大的计算能力。本文提出了一种改进的遗传算法，利用统计技术来缩小搜索空间。该系统在Hela细胞周期和酵母细胞周期的公开数据集上进行了测试。结果与利用二阶微分方程推导的调节网络进行了比较。结果表明，该方法的灵敏度和特异度与微分方程法相当，与基本遗传算法相比有较大提高。

引用次数: 0

Exhaustive analysis of the modular structure of the spliceosomal assembly network: a Petri net approach. 剪接体组装网络的模块结构的详尽分析:一个Petri网方法。

Q2 Medicine

In Silico Biology

Pub Date : 2010-01-01 DOI: 10.3233/ISB-2010-0419

Ralf H Bortfeldt, Stefan Schuster, Ina Koch

Spliceosomes are macro-complexes involving hundreds of proteins with many functional interactions. Spliceosome assembly belongs to the key processes that enable splicing of mRNA and modulate alternative splicing. A detailed list of factors involved in spliceosomal reactions has been assorted over the past decade, but, their functional interplay is often unknown and most of the present biological models cover only parts of the complete assembly process. It is a challenging task to build a computational model that integrates dispersed knowledge and combines a multitude of reaction schemes proposed earlier.Because for most reactions involved in spliceosome assembly kinetic parameters are not available, we propose a discrete modeling using Petri nets, through which we are enabled to get insights into the system's behavior via computation of structural and dynamic properties. In this paper, we compile and examine reactions from experimental reports that contribute to a functional spliceosome. All these reactions form a network, which describes the inventory and conditions necessary to perform the splicing process. The analysis is mainly based on system invariants. Transition invariants (T-invariants) can be interpreted as signaling routes through the network. Due to the huge number of T-invariants that arise with increasing network size and complexity, maximal common transition sets (MCTS) and T-clusters were used for further analysis. Additionally, we introduce a false color map representation, which allows a quick survey of network modules and the visual detection of single reactions or reaction sequences, which participate in more than one signaling route. We designed a structured model of spliceosome assembly, which combines the demands on a platform that i) can display involved factors and concurrent processes, ii) offers the possibility to run computational methods for knowledge extraction, and iii) is successively extendable as new insights into spliceosome function are reported by experimental reports. The network consists of 161 transitions (reactions) and 140 places (reactants). All reactions are part of at least one of the 71 T-invariants. These T-invariants define pathways, which are in good agreement with the current knowledge and known hypotheses on reaction sequences during spliceosome assembly, hence contributing to a functional spliceosome. We demonstrate that present knowledge, in particular of the initial part of the assembly process, describes parallelism and interaction of signaling routes, which indicate functional redundancy and reflect the dependency of spliceosome assembly initiation on different cellular conditions. The complexity of the network is further increased by two switches, which introduce alternative routes during A-complex formation in early spliceosome assembly and upon transition from the B-complex to the C-complex. By compiling known reactions into a complete network, the combinatorial nature of invariant

剪接体是包含数百种具有多种功能相互作用的蛋白质的宏观复合物。剪接体组装是mRNA剪接和调节选择性剪接的关键过程。在过去的十年中，剪接体反应中涉及的因素的详细列表已经分类，但是，它们的功能相互作用通常是未知的，而且大多数目前的生物学模型只涵盖了完整组装过程的一部分。建立一个集成了分散的知识并结合了先前提出的多种反应方案的计算模型是一项具有挑战性的任务。由于大多数涉及剪接体装配的反应动力学参数不可用，我们提出了一个使用Petri网的离散建模，通过该模型，我们能够通过计算结构和动态特性来深入了解系统的行为。在本文中，我们从实验报告中编译并检查了有助于功能剪接体的反应。所有这些反应形成一个网络，描述了执行拼接过程所需的清单和条件。分析主要基于系统不变量。转换不变量(t不变量)可以解释为通过网络的信令路由。由于随着网络规模和复杂性的增加而出现大量的t不变量，因此使用最大公共转移集(MCTS)和t聚类进行进一步分析。此外，我们引入了一种假彩色地图表示，它允许快速调查网络模块和视觉检测单个反应或反应序列，这些反应或反应序列参与多个信号通路。我们设计了一个剪接体组装的结构化模型，该模型结合了对平台的需求，i)可以显示相关因素和并发过程，ii)提供运行知识提取的计算方法的可能性，iii)随着实验报告对剪接体功能的新见解的报道而不断扩展。该网络由161个跃迁(反应)和140个位置(反应物)组成。所有的反应都至少属于71个t不变量中的一个。这些t不变量定义了通路，这与目前关于剪接体组装过程中反应序列的知识和已知假设很好地一致，因此有助于剪接体的功能。我们证明了目前的知识，特别是组装过程的初始部分，描述了信号通路的并行性和相互作用，这表明功能冗余，并反映了剪接体组装起始对不同细胞条件的依赖性。两个开关进一步增加了网络的复杂性，这两个开关在早期剪接体组装的a复合体形成过程中以及从b复合体过渡到c复合体时引入了可选的路线。通过将已知的反应编译成一个完整的网络，不变计算的组合性质导致了以前没有被描述为连接路线的途径，尽管它们的成分是已知的。t簇将网络划分为模块，我们将其解释为剪接体成熟的构建块。我们得出结论，大型生物网络和系统不变量的Petri网表示非常适合作为验证实验知识整合到一致模型中的手段。基于该网络模型，便于进一步的实验设计。

{"title":"Exhaustive analysis of the modular structure of the spliceosomal assembly network: a Petri net approach.","authors":"Ralf H Bortfeldt, Stefan Schuster, Ina Koch","doi":"10.3233/ISB-2010-0419","DOIUrl":"https://doi.org/10.3233/ISB-2010-0419","url":null,"abstract":"<p><p>Spliceosomes are macro-complexes involving hundreds of proteins with many functional interactions. Spliceosome assembly belongs to the key processes that enable splicing of mRNA and modulate alternative splicing. A detailed list of factors involved in spliceosomal reactions has been assorted over the past decade, but, their functional interplay is often unknown and most of the present biological models cover only parts of the complete assembly process. It is a challenging task to build a computational model that integrates dispersed knowledge and combines a multitude of reaction schemes proposed earlier.Because for most reactions involved in spliceosome assembly kinetic parameters are not available, we propose a discrete modeling using Petri nets, through which we are enabled to get insights into the system's behavior via computation of structural and dynamic properties. In this paper, we compile and examine reactions from experimental reports that contribute to a functional spliceosome. All these reactions form a network, which describes the inventory and conditions necessary to perform the splicing process. The analysis is mainly based on system invariants. Transition invariants (T-invariants) can be interpreted as signaling routes through the network. Due to the huge number of T-invariants that arise with increasing network size and complexity, maximal common transition sets (MCTS) and T-clusters were used for further analysis. Additionally, we introduce a false color map representation, which allows a quick survey of network modules and the visual detection of single reactions or reaction sequences, which participate in more than one signaling route. We designed a structured model of spliceosome assembly, which combines the demands on a platform that i) can display involved factors and concurrent processes, ii) offers the possibility to run computational methods for knowledge extraction, and iii) is successively extendable as new insights into spliceosome function are reported by experimental reports. The network consists of 161 transitions (reactions) and 140 places (reactants). All reactions are part of at least one of the 71 T-invariants. These T-invariants define pathways, which are in good agreement with the current knowledge and known hypotheses on reaction sequences during spliceosome assembly, hence contributing to a functional spliceosome. We demonstrate that present knowledge, in particular of the initial part of the assembly process, describes parallelism and interaction of signaling routes, which indicate functional redundancy and reflect the dependency of spliceosome assembly initiation on different cellular conditions. The complexity of the network is further increased by two switches, which introduce alternative routes during A-complex formation in early spliceosome assembly and upon transition from the B-complex to the C-complex. By compiling known reactions into a complete network, the combinatorial nature of invariant ","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":"10 1","pages":"89-123"},"PeriodicalIF":0.0,"publicationDate":"2010-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3233/ISB-2010-0419","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30512780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

Cell Illustrator 4.0: a computational platform for systems biology. Cell Illustrator 4.0:系统生物学的计算平台。

Q2 Medicine

In Silico Biology

Pub Date : 2010-01-01 DOI: 10.3233/ISB-2010-0415

Masao Nagasaki, Ayumu Saito, Euna Jeong, Chen Li, Kaname Kojima, Emi Ikeda, Satoru Miyano

Cell Illustrator is a software platform for Systems Biology that uses the concept of Petri net for modeling and simulating biopathways. It is intended for biological scientists working at bench. The latest version of Cell Illustrator 4.0 uses Java Web Start technology and is enhanced with new capabilities, including: automatic graph grid layout algorithms using ontology information; tools using Cell System Markup Language (CSML) 3.0 and Cell System Ontology 3.0; parameter search module; high-performance simulation module; CSML database management system; conversion from CSML model to programming languages (FORTRAN, C, C++, Java, Python and Perl); import from SBML, CellML, and BioPAX; and, export to SVG and HTML. Cell Illustrator employs an extension of hybrid Petri net in an object-oriented style so that biopathway models can include objects such as DNA sequence, molecular density, 3D localization information, transcription with frame-shift, translation with codon table, as well as biochemical reactions.

Cell Illustrator是一个系统生物学的软件平台，它使用Petri网的概念来建模和模拟生物途径。它是为在实验室工作的生物科学家设计的。最新版本的Cell Illustrator 4.0使用Java Web Start技术，并增强了新的功能，包括:使用本体信息的自动图形网格布局算法;使用细胞系统标记语言(CSML) 3.0和细胞系统本体3.0工具;参数搜索模块;高性能仿真模块;CSML数据库管理系统;从CSML模型到编程语言(FORTRAN, C, c++， Java, Python和Perl)的转换;从SBML、CellML和BioPAX导入;导出为SVG和HTML。Cell Illustrator采用面向对象风格的混合Petri网扩展，因此生物通路模型可以包括诸如DNA序列，分子密度，3D定位信息，带帧移位的转录，带密码子表的翻译以及生化反应等对象。

引用次数: 98

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

In Silico Biology

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀