首页 > 最新文献

Proceedings of the ... Asia-Pacific bioinformatics conference最新文献

英文 中文
Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP PAMP检测基因组反转和缺失的引物选择方法
Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0036
B. Dasgupta, Jin-Hunh Jun, I. Măndoiu
Primer Approximation Multiplex PCR (PAMP) is a recently introduced experimental technique for detecting large-scale cancer genome lesions such as inversions and deletions from heterogeneous samples containing a mixture of cancer and normal cells. In this paper we give integer linear programming formulations for the problem of selecting sets of PAMP primers that minimize detection failure probability. We also show that PAMP primer selection for detection of anchored deletions cannot be approximated within a factor of 2 ", and give a 2-approximation algorithm for a special case of the problem. Experimental results show that our ILP formulations can be used to optimally solve medium size instances of the inversion detection problem, and that heuristics based on iteratively solving ILP formulations for a one-sided version of the problem give near-optimal solutions for anchored deletion detection with highly scalable runtime.
引物近似多重PCR (PAMP)是最近引入的一种实验技术,用于检测大规模癌症基因组病变,例如从含有癌症和正常细胞混合物的异质样品中检测反转和缺失。本文给出了使检测失效概率最小的PAMP引物集合选择问题的整数线性规划公式。我们还表明,PAMP引物选择检测锚定缺失不能在2”的因子内近似,并给出了该问题的特殊情况下的2-近似算法。实验结果表明,我们的ILP公式可用于最优解决反转检测问题的中等大小实例,并且基于迭代求解问题的单侧版本的ILP公式的启发式方法为锚定删除检测提供了具有高度可扩展运行时的近最优解决方案。
{"title":"Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP","authors":"B. Dasgupta, Jin-Hunh Jun, I. Măndoiu","doi":"10.1142/9781848161092_0036","DOIUrl":"https://doi.org/10.1142/9781848161092_0036","url":null,"abstract":"Primer Approximation Multiplex PCR (PAMP) is a recently introduced experimental technique for detecting large-scale cancer genome lesions such as inversions and deletions from heterogeneous samples containing a mixture of cancer and normal cells. In this paper we give integer linear programming formulations for the problem of selecting sets of PAMP primers that minimize detection failure probability. We also show that PAMP primer selection for detection of anchored deletions cannot be approximated within a factor of 2 \", and give a 2-approximation algorithm for a special case of the problem. Experimental results show that our ILP formulations can be used to optimally solve medium size instances of the inversion detection problem, and that heuristics based on iteratively solving ILP formulations for a one-sided version of the problem give near-optimal solutions for anchored deletion detection with highly scalable runtime.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"60 1","pages":"353-362"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85861917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Novel Method for Reducing Computational Complexity of Whole Genome Sequence Alignment 一种降低全基因组序列比对计算复杂度的新方法
Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0013
Ryuichiro Nakato, O. Gotoh
National Institute of Advanced Industrial Science and Technology,Computational Biology Research Center,2-42 Aomi, Koto-ku, Tokyo 135-0064, JapanGenomic sequence alignment is a powerful tool for finding com mon subsequence patterns sharedby the input sequences and identifying evolutionary relationships between the species. However, therunning time and space requirement of genome alignment have often been very extensive. In thisresearch, we propose a novel algorithm called Coarse-Grained AlignmenT (CGAT) algorithm, forreducing computational complexity necessary for cross-species whole genome sequence alignment.The CGAT first divides the input sequences into ”blocks” with a fixed length and aligns these blocksto each other. The generated block-level alignment is then refined at the nucleotide level. This two-stepprocedure can drastically reduce the overall computational time and space necessary for an alignment.In this paper, we show the effectiveness of the proposed algorithm by applying it to whole genomesequences of several bacteria.Keywords: Genome Alignment; Multiple Alignment; Sequence Analysis; Comparative Genomics.
基因组序列比对是寻找输入序列共享的共同子序列模式和识别物种之间进化关系的有力工具。然而,基因组比对的运行时间和空间要求往往非常广泛。在本研究中,我们提出了一种新的算法,称为粗粒度比对(CGAT)算法,以降低跨物种全基因组序列比对所需的计算复杂度。CGAT首先将输入序列划分为固定长度的“块”,并将这些块彼此对齐。生成的块级比对然后在核苷酸水平上进行细化。这两步过程可以大大减少校准所需的总体计算时间和空间。在本文中,我们通过将该算法应用于几种细菌的全基因组序列来证明该算法的有效性。关键词:基因组比对;多个对齐;序列分析;比较基因组学。
{"title":"A Novel Method for Reducing Computational Complexity of Whole Genome Sequence Alignment","authors":"Ryuichiro Nakato, O. Gotoh","doi":"10.1142/9781848161092_0013","DOIUrl":"https://doi.org/10.1142/9781848161092_0013","url":null,"abstract":"National Institute of Advanced Industrial Science and Technology,Computational Biology Research Center,2-42 Aomi, Koto-ku, Tokyo 135-0064, JapanGenomic sequence alignment is a powerful tool for finding com mon subsequence patterns sharedby the input sequences and identifying evolutionary relationships between the species. However, therunning time and space requirement of genome alignment have often been very extensive. In thisresearch, we propose a novel algorithm called Coarse-Grained AlignmenT (CGAT) algorithm, forreducing computational complexity necessary for cross-species whole genome sequence alignment.The CGAT first divides the input sequences into ”blocks” with a fixed length and aligns these blocksto each other. The generated block-level alignment is then refined at the nucleotide level. This two-stepprocedure can drastically reduce the overall computational time and space necessary for an alignment.In this paper, we show the effectiveness of the proposed algorithm by applying it to whole genomesequences of several bacteria.Keywords: Genome Alignment; Multiple Alignment; Sequence Analysis; Comparative Genomics.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"26 1","pages":"101-110"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79240518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Near-sigmoid Modeling to Simultaneously Profile Genome-wide DNA Replication Timing and Efficiency in Single DNA Replication Microarray Studies 近s型模型同时分析全基因组DNA复制时间和效率在单DNA复制微阵列研究
Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0039
Juntao Li, M. Eshaghi, Jianhua Liu, K. R. K. Murthy
DNA replication is a key process in cell division cycle. It is initiated in coordinated manner in several species. To understand the DNA replication in a species one needs to measure the half replication timing (or replication timing) and the efficiency of replication which vary across genome in higher eukaryotes. In the previous studies, no direct assessment of replication efficiency on a genomic scale was performed while the replication timing was indirectly assessed using average DNA. In this paper, we present a first-ever-method of directly measuring both half replication timing and efficiency simultaneously from a single DNA microarray time-course data. We achieve it by fitting the so called near-sigmoid model to each locus of the DNA. We use this model apply S. pombe DNA replication microarray data and show that it is effective for genome-scale replication timing and efficiency profiling studies.
DNA复制是细胞分裂周期的关键过程。它在几个物种中以协调的方式开始。为了了解一个物种的DNA复制,需要测量半复制时间(或复制时间)和复制效率,这在高等真核生物的基因组中是不同的。在以前的研究中,没有在基因组尺度上直接评估复制效率,而是使用平均DNA间接评估复制时间。在本文中,我们提出了一种直接测量半复制时间和效率的方法,同时从单个DNA微阵列时间过程数据。我们通过将所谓的近s型模型拟合到DNA的每个位点来实现它。我们将该模型应用于S. pombe DNA复制微阵列数据,并表明它对基因组规模的复制时间和效率分析研究是有效的。
{"title":"Near-sigmoid Modeling to Simultaneously Profile Genome-wide DNA Replication Timing and Efficiency in Single DNA Replication Microarray Studies","authors":"Juntao Li, M. Eshaghi, Jianhua Liu, K. R. K. Murthy","doi":"10.1142/9781848161092_0039","DOIUrl":"https://doi.org/10.1142/9781848161092_0039","url":null,"abstract":"DNA replication is a key process in cell division cycle. It is initiated in coordinated manner in several species. To understand the DNA replication in a species one needs to measure the half replication timing (or replication timing) and the efficiency of replication which vary across genome in higher eukaryotes. In the previous studies, no direct assessment of replication efficiency on a genomic scale was performed while the replication timing was indirectly assessed using average DNA. In this paper, we present a first-ever-method of directly measuring both half replication timing and efficiency simultaneously from a single DNA microarray time-course data. We achieve it by fitting the so called near-sigmoid model to each locus of the DNA. We use this model apply S. pombe DNA replication microarray data and show that it is effective for genome-scale replication timing and efficiency profiling studies.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"14 1","pages":"383-392"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87765619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Memory Efficient Algorithm for Structural Alignment of RNAs with Embedded Simple Pseudoknots 嵌入简单伪结rna结构比对的高效记忆算法
Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0012
Thomas K. F. Wong, Y. Chiu, T. Lam, S. Yiu
In this paper, we consider the problem of structural alignment of a target RNA sequence of length n and a query RNA sequence of length m with known secondary structure that may contain embedded simple pseduoknots. The best known algorithm for solving this problem (Dost et al. [13]) runs in O(mn4) time with space complexity of O(mn3), which requires too much memory making it infeasible for comparing ncRNAs (non-coding RNAs) with length several hundreds or more. We propose a memory efficient algorithm to solve the same problem. We reduce the space complexity to O(mn2 + n3) while maintaining the same time complexity of Dost et al.’s algorithm. Experimental reslts show that our algorithm is feasible for comparing ncRNAs of length more than 500. Availability: The source code of our program is available upon request.
在本文中,我们考虑了长度为n的目标RNA序列和长度为m的具有已知二级结构的查询RNA序列的结构比对问题,这些二级结构可能包含嵌入的简单伪多结。解决该问题的最著名的算法(Dost等人,[13])运行时间为0 (mn4),空间复杂度为0 (mn3),这需要太多的内存,使得比较长度为数百或更多的ncrna(非编码rna)变得不可行。我们提出了一种高效存储算法来解决同样的问题。我们将空间复杂度降低到O(mn2 + n3),同时保持了与Dost等人算法相同的时间复杂度。实验结果表明,该算法对于长度大于500的ncrna的比较是可行的。可用性:我们的程序的源代码是可要求的。
{"title":"A Memory Efficient Algorithm for Structural Alignment of RNAs with Embedded Simple Pseudoknots","authors":"Thomas K. F. Wong, Y. Chiu, T. Lam, S. Yiu","doi":"10.1142/9781848161092_0012","DOIUrl":"https://doi.org/10.1142/9781848161092_0012","url":null,"abstract":"In this paper, we consider the problem of structural alignment of a target RNA sequence of length n and a query RNA sequence of length m with known secondary structure that may contain embedded simple pseduoknots. The best known algorithm for solving this problem (Dost et al. [13]) runs in O(mn4) time with space complexity of O(mn3), which requires too much memory making it infeasible for comparing ncRNAs (non-coding RNAs) with length several hundreds or more. We propose a memory efficient algorithm to solve the same problem. We reduce the space complexity to O(mn2 + n3) while maintaining the same time complexity of Dost et al.’s algorithm. Experimental reslts show that our algorithm is feasible for comparing ncRNAs of length more than 500. Availability: The source code of our program is available upon request.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"29 1","pages":"89-100"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76985733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Alignment of Minisatellite Maps: A Minimum Spanning Tree-based Approach 小卫星地图的对齐:基于最小生成树的方法
Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0028
M. Abouelhoda, R. Giegerich, B. Behzadi, J. Steyaert
In addition to the well-known edit operations, the alignment of minisatellite maps includes duplication events. We model these duplications using a special kind of spanning trees and deduce an optimal duplication scenario by computing the respective minimum spanning tree. Based on best duplication scenarios for all substrings of the given sequences, we compute an optimal alignment of two minisatellite maps. Our algorithm improves upon the previously developed algorithms in the generality of the model, in alignment quality and in space-time efficiency. Using this algorithm, we derive evidence that there is a directional bias in the growth of minisatellites of the MSY1 dataset.
除了众所周知的编辑操作外,小卫星地图的对齐还包括复制事件。我们使用一种特殊的生成树对这些复制进行建模,并通过计算各自的最小生成树来推断出最优的复制场景。基于给定序列的所有子串的最佳复制场景,我们计算了两个小卫星图的最优对齐。该算法在模型通用性、对齐质量和时空效率等方面都有较好的改进。利用该算法,我们得出了MSY1数据集的小卫星生长存在方向性偏差的证据。
{"title":"Alignment of Minisatellite Maps: A Minimum Spanning Tree-based Approach","authors":"M. Abouelhoda, R. Giegerich, B. Behzadi, J. Steyaert","doi":"10.1142/9781848161092_0028","DOIUrl":"https://doi.org/10.1142/9781848161092_0028","url":null,"abstract":"In addition to the well-known edit operations, the alignment of minisatellite maps includes duplication events. We model these duplications using a special kind of spanning trees and deduce an optimal duplication scenario by computing the respective minimum spanning tree. Based on best duplication scenarios for all substrings of the given sequences, we compute an optimal alignment of two minisatellite maps. Our algorithm improves upon the previously developed algorithms in the generality of the model, in alignment quality and in space-time efficiency. Using this algorithm, we derive evidence that there is a directional bias in the growth of minisatellites of the MSY1 dataset.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"88 1","pages":"261-272"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83841317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
From Text to Pathway: Corpus Annotation for Knowledge Acquisition from Biomedical Literature 从文本到路径:生物医学文献知识获取的语料库标注
Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0019
Jin-Dong Kim, Tomoko Ohta, Kanae Oda, Junichi Tsujii
We present a new direction of research, which deploys Text Mining technologies to construct and maintain data bases organized in the form of pathway, by associating parts of papers with relevant portions of a pathway and vice versa. In order to materialize this scenario, we present two annotated corpora. The first, Event Annotation, identifies the spans of text in which biological events are reported, while the other, Pathway Annotation, associates portions of papers with specific parts in a pathway.
我们提出了一个新的研究方向,即利用文本挖掘技术,通过将论文的部分内容与路径的相关部分相关联来构建和维护以路径形式组织的数据库,反之亦然。为了实现这一场景,我们提出了两个带注释的语料库。第一种方法是Event Annotation,它确定了报道生物事件的文本范围,而另一种方法是Pathway Annotation,它将论文的某些部分与路径中的特定部分联系起来。
{"title":"From Text to Pathway: Corpus Annotation for Knowledge Acquisition from Biomedical Literature","authors":"Jin-Dong Kim, Tomoko Ohta, Kanae Oda, Junichi Tsujii","doi":"10.1142/9781848161092_0019","DOIUrl":"https://doi.org/10.1142/9781848161092_0019","url":null,"abstract":"We present a new direction of research, which deploys Text Mining technologies to construct and maintain data bases organized in the form of pathway, by associating parts of papers with relevant portions of a pathway and vice versa. In order to materialize this scenario, we present two annotated corpora. The first, Event Annotation, identifies the spans of text in which biological events are reported, while the other, Pathway Annotation, associates portions of papers with specific parts in a pathway.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"35 1","pages":"165-176"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80663458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Estimation of Population Allele Frequencies from Small Samples Containing Multiple Generations 包含多代的小样本群体等位基因频率估计
Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0033
D. Konovalov, D. Heg
Estimations of population genetic parameters like allele frequencies, heterozygosities, inbreeding coefficients and genetic distances rely on the assumption that all sampled genotypes come from a randomly interbreeding population or sub-population. Here we show that small cross-generational samples may severely affect estimates of allele frequencies, when a small number of progenies dominate the next generation or the sample. A new estimator of allele frequencies is developed for such cases when the kin structure of the focal sample is unknown and has to be assessed simultaneously. Using Monte Carlo simulations it was demonstrated that the new estimator delivered significant improvement over the conventional allele counting estimator.
群体遗传参数的估计,如等位基因频率、杂合度、近交系数和遗传距离,依赖于所有样本基因型来自随机杂交群体或亚群体的假设。在这里,我们表明,当少数后代在下一代或样本中占主导地位时,小的跨代样本可能严重影响等位基因频率的估计。本文提出了一种新的等位基因频率估计方法,用于在病灶样品的亲缘结构未知的情况下同时进行评估。通过蒙特卡罗模拟,证明了新的估计方法比传统的等位基因计数估计方法有显著的改进。
{"title":"Estimation of Population Allele Frequencies from Small Samples Containing Multiple Generations","authors":"D. Konovalov, D. Heg","doi":"10.1142/9781848161092_0033","DOIUrl":"https://doi.org/10.1142/9781848161092_0033","url":null,"abstract":"Estimations of population genetic parameters like allele frequencies, heterozygosities, inbreeding coefficients and genetic distances rely on the assumption that all sampled genotypes come from a randomly interbreeding population or sub-population. Here we show that small cross-generational samples may severely affect estimates of allele frequencies, when a small number of progenies dominate the next generation or the sample. A new estimator of allele frequencies is developed for such cases when the kin structure of the focal sample is unknown and has to be assessed simultaneously. Using Monte Carlo simulations it was demonstrated that the new estimator delivered significant improvement over the conventional allele counting estimator.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"87 1","pages":"321-332"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84068953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Structural Descriptors of Protein-Protein Binding Sites 蛋白质结合位点的结构描述符
Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0011
Oliver Sander, F. S. Domingues, Hongbo Zhu, Thomas Lengauer, Ingolf Sommer
Structural bioinformatics provides new tools for investigating protein-protein interactions at the molecular level. We present two types of structural descriptors for efficiently representing and comparing protein-protein binding sites and surface patches. The descriptors are based on distributions of distances between five types of functional atoms, thereby capturing the spatial arrangement of physicochemical properties in 3D space. Experiments with the method are performed on two tasks: (1) detection of binding sites with known similarity from homologous proteins, and (2) scanning of the surfaces of two non-homologous proteins for similar regions.
结构生物信息学为在分子水平上研究蛋白质-蛋白质相互作用提供了新的工具。我们提出了两种有效表示和比较蛋白质结合位点和表面斑块的结构描述符。描述符基于五种功能原子之间的距离分布,从而捕获三维空间中物理化学性质的空间排列。用该方法进行的实验有两个任务:(1)从同源蛋白中检测已知相似的结合位点;(2)扫描两个非同源蛋白的表面以寻找相似区域。
{"title":"Structural Descriptors of Protein-Protein Binding Sites","authors":"Oliver Sander, F. S. Domingues, Hongbo Zhu, Thomas Lengauer, Ingolf Sommer","doi":"10.1142/9781848161092_0011","DOIUrl":"https://doi.org/10.1142/9781848161092_0011","url":null,"abstract":"Structural bioinformatics provides new tools for investigating protein-protein interactions at the molecular level. We present two types of structural descriptors for efficiently representing and comparing protein-protein binding sites and surface patches. The descriptors are based on distributions of distances between five types of functional atoms, thereby capturing the spatial arrangement of physicochemical properties in 3D space. Experiments with the method are performed on two tasks: (1) detection of binding sites with known similarity from homologous proteins, and (2) scanning of the surfaces of two non-homologous proteins for similar regions.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"4 1","pages":"79-88"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74807030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Protein Interactions Extracted from Genomes and Papers 从基因组和论文中提取的蛋白质相互作用
Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0003
A. Valencia
A compact and low profile document handling unit with short and simple document recirculation paths for recirculating a set of original document sheets to and from one side of the platen of a copier with or without inversion for selective repeated copying of one or both sides of the documents, in which the selectable document inverter in the document recirculation path includes a generally planar document sheet reversing chute which is generally parallel to and closely overlying the platen and a sheet reversing feeder adjacent the same side of the platen for feeding the document sheets into the reversing chute directly from the platen, but inverted, and for feeding them out of the reversing chute to a restacking tray at that same side of the platen without inversion.
一种紧凑而低调的文件处理装置,备有简短而简单的文件循环路径,以便将一套原始文件从复印机的单面(带反转或不带反转)来回循环,以便有选择性地重复复制文件的一面或两面。其中,在文件再循环路径中的可选文件逆变器包括通常与压板平行并紧密覆盖在压板上的一般平面的文件页换向滑槽和靠近压板同侧的纸张换向给料器,用于将文件页直接从压板倒送至换向滑槽,并将其从换向滑槽中送出至位于压板同侧的无反转的堆叠托盘。
{"title":"Protein Interactions Extracted from Genomes and Papers","authors":"A. Valencia","doi":"10.1142/9781848161092_0003","DOIUrl":"https://doi.org/10.1142/9781848161092_0003","url":null,"abstract":"A compact and low profile document handling unit with short and simple document recirculation paths for recirculating a set of original document sheets to and from one side of the platen of a copier with or without inversion for selective repeated copying of one or both sides of the documents, in which the selectable document inverter in the document recirculation path includes a generally planar document sheet reversing chute which is generally parallel to and closely overlying the platen and a sheet reversing feeder adjacent the same side of the platen for feeding the document sheets into the reversing chute directly from the platen, but inverted, and for feeding them out of the reversing chute to a restacking tray at that same side of the platen without inversion.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"19 1","pages":"7"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79945046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
String Kernels with Feature Selection for SVM Protein Classification 基于特征选择的字符串核支持向量机蛋白质分类
Pub Date : 2007-12-01 DOI: 10.1142/9781848161092_0004
Wen-Yun Yang, Bao-Liang Lu
We introduce a general framework for string kernels. This framework can produce various types of kernels, including a number of existing kernels, to be used with support vector machines (SVMs). In this framework, we can select the informative subsequences to reduce the dimensionality of the feature space. We can model the mutations in biological sequences. Finally, we combine contributions of subsequences in a weighted fashion to get the target kernel. In practical computation, we develop a novel tree structure, coupled with a traversal algorithm to speed up the computation. The experimental results on a benchmark SCOP data set show that the kernels produced by our framework outperform the existing spectrum kernels, in both e‐ciency and ROC50 scores.
我们介绍了字符串内核的一般框架。该框架可以生成各种类型的内核,包括许多现有的内核,用于支持向量机(svm)。在这个框架中,我们可以选择信息子序列来降低特征空间的维数。我们可以模拟生物序列中的突变。最后,我们以加权的方式组合子序列的贡献来得到目标核。在实际计算中,我们开发了一种新的树状结构,并结合遍历算法来加快计算速度。在基准SCOP数据集上的实验结果表明,我们的框架生成的核在e - ciency和ROC50分数上都优于现有的频谱核。
{"title":"String Kernels with Feature Selection for SVM Protein Classification","authors":"Wen-Yun Yang, Bao-Liang Lu","doi":"10.1142/9781848161092_0004","DOIUrl":"https://doi.org/10.1142/9781848161092_0004","url":null,"abstract":"We introduce a general framework for string kernels. This framework can produce various types of kernels, including a number of existing kernels, to be used with support vector machines (SVMs). In this framework, we can select the informative subsequences to reduce the dimensionality of the feature space. We can model the mutations in biological sequences. Finally, we combine contributions of subsequences in a weighted fashion to get the target kernel. In practical computation, we develop a novel tree structure, coupled with a traversal algorithm to speed up the computation. The experimental results on a benchmark SCOP data set show that the kernels produced by our framework outperform the existing spectrum kernels, in both e‐ciency and ROC50 scores.","PeriodicalId":74513,"journal":{"name":"Proceedings of the ... Asia-Pacific bioinformatics conference","volume":"15 1","pages":"9-18"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78179524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the ... Asia-Pacific bioinformatics conference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1