Proceedings of the ... Asia-Pacific bioinformatics conference最新文献

英文中文

Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP PAMP检测基因组反转和缺失的引物选择方法

Proceedings of the ... Asia-Pacific bioinformatics conference

Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0036

B. Dasgupta, Jin-Hunh Jun, I. Măndoiu

Primer Approximation Multiplex PCR (PAMP) is a recently introduced experimental technique for detecting large-scale cancer genome lesions such as inversions and deletions from heterogeneous samples containing a mixture of cancer and normal cells. In this paper we give integer linear programming formulations for the problem of selecting sets of PAMP primers that minimize detection failure probability. We also show that PAMP primer selection for detection of anchored deletions cannot be approximated within a factor of 2 ", and give a 2-approximation algorithm for a special case of the problem. Experimental results show that our ILP formulations can be used to optimally solve medium size instances of the inversion detection problem, and that heuristics based on iteratively solving ILP formulations for a one-sided version of the problem give near-optimal solutions for anchored deletion detection with highly scalable runtime.

引物近似多重PCR (PAMP)是最近引入的一种实验技术，用于检测大规模癌症基因组病变，例如从含有癌症和正常细胞混合物的异质样品中检测反转和缺失。本文给出了使检测失效概率最小的PAMP引物集合选择问题的整数线性规划公式。我们还表明，PAMP引物选择检测锚定缺失不能在2”的因子内近似，并给出了该问题的特殊情况下的2-近似算法。实验结果表明，我们的ILP公式可用于最优解决反转检测问题的中等大小实例，并且基于迭代求解问题的单侧版本的ILP公式的启发式方法为锚定删除检测提供了具有高度可扩展运行时的近最优解决方案。

引用次数: 3

A Novel Method for Reducing Computational Complexity of Whole Genome Sequence Alignment 一种降低全基因组序列比对计算复杂度的新方法

Proceedings of the ... Asia-Pacific bioinformatics conference

Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0013

Ryuichiro Nakato, O. Gotoh

National Institute of Advanced Industrial Science and Technology,Computational Biology Research Center,2-42 Aomi, Koto-ku, Tokyo 135-0064, JapanGenomic sequence alignment is a powerful tool for ﬁnding com mon subsequence patterns sharedby the input sequences and identifying evolutionary relationships between the species. However, therunning time and space requirement of genome alignment have often been very extensive. In thisresearch, we propose a novel algorithm called Coarse-Grained AlignmenT (CGAT) algorithm, forreducing computational complexity necessary for cross-species whole genome sequence alignment.The CGAT ﬁrst divides the input sequences into ”blocks” with a ﬁxed length and aligns these blocksto each other. The generated block-level alignment is then reﬁned at the nucleotide level. This two-stepprocedure can drastically reduce the overall computational time and space necessary for an alignment.In this paper, we show the effectiveness of the proposed algorithm by applying it to whole genomesequences of several bacteria.Keywords: Genome Alignment; Multiple Alignment; Sequence Analysis; Comparative Genomics.

基因组序列比对是寻找输入序列共享的共同子序列模式和识别物种之间进化关系的有力工具。然而，基因组比对的运行时间和空间要求往往非常广泛。在本研究中，我们提出了一种新的算法，称为粗粒度比对(CGAT)算法，以降低跨物种全基因组序列比对所需的计算复杂度。CGAT首先将输入序列划分为固定长度的“块”，并将这些块彼此对齐。生成的块级比对然后在核苷酸水平上进行细化。这两步过程可以大大减少校准所需的总体计算时间和空间。在本文中，我们通过将该算法应用于几种细菌的全基因组序列来证明该算法的有效性。关键词:基因组比对;多个对齐;序列分析;比较基因组学。

{"title":"A Novel Method for Reducing Computational Complexity of Whole Genome Sequence Alignment","authors":"Ryuichiro Nakato, O. Gotoh","doi":"10.1142/9781848161092_0013","DOIUrl":"https://doi.org/10.1142/9781848161092_0013","url":null,"abstract":"National Institute of Advanced Industrial Science and Technology,Computational Biology Research Center,2-42 Aomi, Koto-ku, Tokyo 135-0064, JapanGenomic sequence alignment is a powerful tool for ﬁnding com mon subsequence patterns sharedby the input sequences and identifying evolutionary relationships between the species. However, therunning time and space requirement of genome alignment have often been very extensive. In thisresearch, we propose a novel algorithm called Coarse-Grained AlignmenT (CGAT) algorithm, forreducing computational complexity necessary for cross-species whole genome sequence alignment.The CGAT ﬁrst divides the input sequences into ”blocks” with a ﬁxed length and aligns these blocksto each other. The generated block-level alignment is then reﬁned at the nucleotide level. This two-stepprocedure can drastically reduce the overall computational time and space necessary for an alignment.In this paper, we show the effectiveness of the proposed algorithm by applying it to whole genomesequences of several bacteria.Keywords: Genome Alignment; Multiple Alignment; Sequence Analysis; Comparative Genomics.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"26 1","pages":"101-110"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79240518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Near-sigmoid Modeling to Simultaneously Profile Genome-wide DNA Replication Timing and Efficiency in Single DNA Replication Microarray Studies 近s型模型同时分析全基因组DNA复制时间和效率在单DNA复制微阵列研究

Proceedings of the ... Asia-Pacific bioinformatics conference

Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0039

Juntao Li, M. Eshaghi, Jianhua Liu, K. R. K. Murthy

DNA replication is a key process in cell division cycle. It is initiated in coordinated manner in several species. To understand the DNA replication in a species one needs to measure the half replication timing (or replication timing) and the efficiency of replication which vary across genome in higher eukaryotes. In the previous studies, no direct assessment of replication efficiency on a genomic scale was performed while the replication timing was indirectly assessed using average DNA. In this paper, we present a first-ever-method of directly measuring both half replication timing and efficiency simultaneously from a single DNA microarray time-course data. We achieve it by fitting the so called near-sigmoid model to each locus of the DNA. We use this model apply S. pombe DNA replication microarray data and show that it is effective for genome-scale replication timing and efficiency profiling studies.

DNA复制是细胞分裂周期的关键过程。它在几个物种中以协调的方式开始。为了了解一个物种的DNA复制，需要测量半复制时间(或复制时间)和复制效率，这在高等真核生物的基因组中是不同的。在以前的研究中，没有在基因组尺度上直接评估复制效率，而是使用平均DNA间接评估复制时间。在本文中，我们提出了一种直接测量半复制时间和效率的方法，同时从单个DNA微阵列时间过程数据。我们通过将所谓的近s型模型拟合到DNA的每个位点来实现它。我们将该模型应用于S. pombe DNA复制微阵列数据，并表明它对基因组规模的复制时间和效率分析研究是有效的。

引用次数: 1

Alignment of Minisatellite Maps: A Minimum Spanning Tree-based Approach 小卫星地图的对齐:基于最小生成树的方法

Proceedings of the ... Asia-Pacific bioinformatics conference

Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0028

M. Abouelhoda, R. Giegerich, B. Behzadi, J. Steyaert

In addition to the well-known edit operations, the alignment of minisatellite maps includes duplication events. We model these duplications using a special kind of spanning trees and deduce an optimal duplication scenario by computing the respective minimum spanning tree. Based on best duplication scenarios for all substrings of the given sequences, we compute an optimal alignment of two minisatellite maps. Our algorithm improves upon the previously developed algorithms in the generality of the model, in alignment quality and in space-time efficiency. Using this algorithm, we derive evidence that there is a directional bias in the growth of minisatellites of the MSY1 dataset.

除了众所周知的编辑操作外，小卫星地图的对齐还包括复制事件。我们使用一种特殊的生成树对这些复制进行建模，并通过计算各自的最小生成树来推断出最优的复制场景。基于给定序列的所有子串的最佳复制场景，我们计算了两个小卫星图的最优对齐。该算法在模型通用性、对齐质量和时空效率等方面都有较好的改进。利用该算法，我们得出了MSY1数据集的小卫星生长存在方向性偏差的证据。

引用次数: 2

Estimation of Population Allele Frequencies from Small Samples Containing Multiple Generations 包含多代的小样本群体等位基因频率估计

Proceedings of the ... Asia-Pacific bioinformatics conference

Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0033

D. Konovalov, D. Heg

Estimations of population genetic parameters like allele frequencies, heterozygosities, inbreeding coefficients and genetic distances rely on the assumption that all sampled genotypes come from a randomly interbreeding population or sub-population. Here we show that small cross-generational samples may severely affect estimates of allele frequencies, when a small number of progenies dominate the next generation or the sample. A new estimator of allele frequencies is developed for such cases when the kin structure of the focal sample is unknown and has to be assessed simultaneously. Using Monte Carlo simulations it was demonstrated that the new estimator delivered significant improvement over the conventional allele counting estimator.

群体遗传参数的估计，如等位基因频率、杂合度、近交系数和遗传距离，依赖于所有样本基因型来自随机杂交群体或亚群体的假设。在这里，我们表明，当少数后代在下一代或样本中占主导地位时，小的跨代样本可能严重影响等位基因频率的估计。本文提出了一种新的等位基因频率估计方法，用于在病灶样品的亲缘结构未知的情况下同时进行评估。通过蒙特卡罗模拟，证明了新的估计方法比传统的等位基因计数估计方法有显著的改进。

引用次数: 10

From Text to Pathway: Corpus Annotation for Knowledge Acquisition from Biomedical Literature 从文本到路径:生物医学文献知识获取的语料库标注

Proceedings of the ... Asia-Pacific bioinformatics conference

Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0019

Jin-Dong Kim, Tomoko Ohta, Kanae Oda, Junichi Tsujii

We present a new direction of research, which deploys Text Mining technologies to construct and maintain data bases organized in the form of pathway, by associating parts of papers with relevant portions of a pathway and vice versa. In order to materialize this scenario, we present two annotated corpora. The first, Event Annotation, identifies the spans of text in which biological events are reported, while the other, Pathway Annotation, associates portions of papers with specific parts in a pathway.

我们提出了一个新的研究方向，即利用文本挖掘技术，通过将论文的部分内容与路径的相关部分相关联来构建和维护以路径形式组织的数据库，反之亦然。为了实现这一场景，我们提出了两个带注释的语料库。第一种方法是Event Annotation，它确定了报道生物事件的文本范围，而另一种方法是Pathway Annotation，它将论文的某些部分与路径中的特定部分联系起来。

引用次数: 13

A Memory Efficient Algorithm for Structural Alignment of RNAs with Embedded Simple Pseudoknots 嵌入简单伪结rna结构比对的高效记忆算法

Proceedings of the ... Asia-Pacific bioinformatics conference

Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0012

Thomas K. F. Wong, Y. Chiu, T. Lam, S. Yiu

In this paper, we consider the problem of structural alignment of a target RNA sequence of length n and a query RNA sequence of length m with known secondary structure that may contain embedded simple pseduoknots. The best known algorithm for solving this problem (Dost et al. [13]) runs in O(mn4) time with space complexity of O(mn3), which requires too much memory making it infeasible for comparing ncRNAs (non-coding RNAs) with length several hundreds or more. We propose a memory efficient algorithm to solve the same problem. We reduce the space complexity to O(mn2 + n3) while maintaining the same time complexity of Dost et al.’s algorithm. Experimental reslts show that our algorithm is feasible for comparing ncRNAs of length more than 500. Availability: The source code of our program is available upon request.

在本文中，我们考虑了长度为n的目标RNA序列和长度为m的具有已知二级结构的查询RNA序列的结构比对问题，这些二级结构可能包含嵌入的简单伪多结。解决该问题的最著名的算法(Dost等人，[13])运行时间为0 (mn4)，空间复杂度为0 (mn3)，这需要太多的内存，使得比较长度为数百或更多的ncrna(非编码rna)变得不可行。我们提出了一种高效存储算法来解决同样的问题。我们将空间复杂度降低到O(mn2 + n3)，同时保持了与Dost等人算法相同的时间复杂度。实验结果表明，该算法对于长度大于500的ncrna的比较是可行的。可用性:我们的程序的源代码是可要求的。

引用次数: 3

Structural Descriptors of Protein-Protein Binding Sites 蛋白质结合位点的结构描述符

Proceedings of the ... Asia-Pacific bioinformatics conference

Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0011

Oliver Sander, F. S. Domingues, Hongbo Zhu, Thomas Lengauer, Ingolf Sommer

Structural bioinformatics provides new tools for investigating protein-protein interactions at the molecular level. We present two types of structural descriptors for efficiently representing and comparing protein-protein binding sites and surface patches. The descriptors are based on distributions of distances between five types of functional atoms, thereby capturing the spatial arrangement of physicochemical properties in 3D space. Experiments with the method are performed on two tasks: (1) detection of binding sites with known similarity from homologous proteins, and (2) scanning of the surfaces of two non-homologous proteins for similar regions.

结构生物信息学为在分子水平上研究蛋白质-蛋白质相互作用提供了新的工具。我们提出了两种有效表示和比较蛋白质结合位点和表面斑块的结构描述符。描述符基于五种功能原子之间的距离分布，从而捕获三维空间中物理化学性质的空间排列。用该方法进行的实验有两个任务:(1)从同源蛋白中检测已知相似的结合位点;(2)扫描两个非同源蛋白的表面以寻找相似区域。

引用次数: 3

Seed Optimization Is No Easier than Optimal Golomb Ruler Design 种子优化并不比优化Golomb标尺设计更容易

Proceedings of the ... Asia-Pacific bioinformatics conference

Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0016

Bin Ma, Hongyi Yao

Spaced seed is a lter method invented to eciently identify the regions of interest in similarity searches. It is now well known that certain spaced seeds hit (detect) a randomly sampled similarity region with higher probabilities than the others. Assume each position of the similarity region is identity with probability p independently. The seed optimization problem seeks for the optimal seed achieving the highest hit probability with given length and weight. Despite that the problem was previously shown not to be NP-hard, in practice it seems dicult to solve. The only algorithm known to compute the optimal seed is still exhaustive search in exponential time. In this article we put some insight into the hardness of the seed design problem by demonstrating the relation between the seed optimization problem and the optimal Golomb ruler design problem, which is a well known dicult problem in combinatorial design.

间隔种子是为了在相似性搜索中快速识别感兴趣区域而发明的一种新方法。现在大家都知道，某些间隔的种子以比其他种子更高的概率击中(探测到)随机抽样的相似区域。假设相似区域的每一个位置都是独立的，且概率为p。种子优化问题寻求在给定长度和权重下命中概率最高的最优种子。尽管这个问题之前被证明不是np难题，但实际上它似乎很难解决。唯一已知的计算最优种子的算法仍然是指数时间内的穷举搜索。本文通过论证种子优化问题与组合设计中最优Golomb标尺设计问题之间的关系，对种子设计问题的难度进行了深入的探讨。

引用次数: 13

Protein Interactions Extracted from Genomes and Papers 从基因组和论文中提取的蛋白质相互作用

Proceedings of the ... Asia-Pacific bioinformatics conference

Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0003

A. Valencia

A compact and low profile document handling unit with short and simple document recirculation paths for recirculating a set of original document sheets to and from one side of the platen of a copier with or without inversion for selective repeated copying of one or both sides of the documents, in which the selectable document inverter in the document recirculation path includes a generally planar document sheet reversing chute which is generally parallel to and closely overlying the platen and a sheet reversing feeder adjacent the same side of the platen for feeding the document sheets into the reversing chute directly from the platen, but inverted, and for feeding them out of the reversing chute to a restacking tray at that same side of the platen without inversion.

一种紧凑而低调的文件处理装置，备有简短而简单的文件循环路径，以便将一套原始文件从复印机的单面(带反转或不带反转)来回循环，以便有选择性地重复复制文件的一面或两面。其中，在文件再循环路径中的可选文件逆变器包括通常与压板平行并紧密覆盖在压板上的一般平面的文件页换向滑槽和靠近压板同侧的纸张换向给料器，用于将文件页直接从压板倒送至换向滑槽，并将其从换向滑槽中送出至位于压板同侧的无反转的堆叠托盘。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the ... Asia-Pacific bioinformatics conference

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀