The performance of a network server is directly influenced by its network I/O management architecture, i.e., its network I/O multiplexing mechanism. Existing benchmark tools focus either on evaluating the high-level service performance of network servers that implement specific application-layer protocols, or on evaluating the low-level communication performance of network paths; such tools are not suitable for evaluating the performance of server architectures themselves. In this study, we developed a benchmark tool for network I/O management architectures. We implemented five representative network I/O management mechanisms as modules: multi-process, multi-thread, select, poll, and epoll. This modularised implementation enabled quantitative and fair comparisons among them. Our experimental results on Linux 2.6 revealed that the select-based and poll-based servers had no performance advantage over the others, and that the multi-process and multi-thread servers achieved performance almost equal to that of the epoll-based server.
Eiji Kawai and S. Yamaguchi, "A Benchmark Tool for Network I/O Management Architectures," IPSJ Digital Courier, March 15, 2008. DOI: 10.2197/IPSJDC.4.138
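The readiness-notification model shared by select, poll, and epoll can be sketched with Python's portable `select.select` (a minimal illustration, not the paper's C-level benchmark; the `wait_readable` helper is ours):

```python
import select
import socket

def wait_readable(socks, timeout=1.0):
    """Block until at least one socket is readable and return the ready ones.
    select() rescans the whole descriptor set on every call -- the O(n) cost
    that epoll's kernel-maintained ready list avoids."""
    readable, _, _ = select.select(socks, [], [], timeout)
    return readable

# A connected socket pair: writing to one end makes the other readable.
a, b = socket.socketpair()
a.sendall(b"ping")
ready = wait_readable([b])
assert ready == [b]
print(b.recv(4).decode())  # -> ping
a.close(); b.close()
```

The same loop structure applies to `select.poll` and (on Linux) `select.epoll`; only the registration API and the scaling behavior differ, which is exactly what the paper's modules isolate.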
Peer-to-Peer multimedia streaming is expected to grow rapidly in the near future. Packet losses during transmission are a serious problem for streaming media because they degrade the quality of service (QoS). Forward Error Correction (FEC) is a promising technique for recovering lost packets and improving the QoS of streaming media. However, as the number of streaming sessions increases, FEC may degrade the QoS of all streams because its overhead adds congestion. Although streaming media can be categorized into live and on-demand contents, conventional FEC methods apply the same FEC scheme to both without distinguishing them. In this paper, we first clarify the ranges in which conventional FEC and retransmission schemes each work well. We then propose a novel FEC method that distinguishes the two types of streaming media and is applied to on-demand contents. It overcomes the adverse effect of the FEC overhead on on-demand contents during streaming and thereby reduces the packet loss caused by that overhead. As a result, the packet loss ratios of both live and on-demand contents are improved. Moreover, the method provides QoS according to users' requirements and environments by using layered FEC coding, so packet losses are recovered at each end host and do not affect next-hop streaming. Numerical analyses show that the proposed method improves the packet loss ratio considerably compared to the conventional method.
M. Hayasaka and T. Miki, "Peer-to-Peer Multimedia Streaming with Guaranteed QoS for Future Real-time Applications," IPSJ Digital Courier, March 15, 2008. DOI: 10.2197/IPSJDC.4.128
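The basic recovery mechanism of FEC can be sketched with a single XOR parity packet (a toy stand-in for the block codes the paper considers; packet contents and function names are illustrative):

```python
def xor_parity(packets):
    """Build one parity packet as the byte-wise XOR of equal-length packets."""
    parity = bytearray(len(packets[0]))
    for pkt in packets:
        for i, byte in enumerate(pkt):
            parity[i] ^= byte
    return bytes(parity)

def recover(received, parity):
    """Recover a single missing packet: XOR of the parity and all survivors."""
    return xor_parity(received + [parity])

block = [b"pkt0", b"pkt1", b"pkt2"]
parity = xor_parity(block)      # the redundancy is the FEC "overhead"
lost = block.pop(1)             # simulate losing one packet in transit
assert recover(block, parity) == lost
```

The parity packet is the overhead the paper worries about: it restores QoS per block, but every session carrying it adds load, which is why applying the same scheme uniformly to live and on-demand streams can backfire as sessions multiply.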
Skeletal parallel programming makes both the development and the parallelization of parallel programs easier. The idea is to abstract generic, recurring patterns in parallel programs as skeletons and provide them as a library whose parallel implementations are transparent to the programmer. SkeTo is a parallel skeleton library that enables programmers to write parallel programs in C++ in a sequential style. However, SkeTo's matrix skeletons assume that a matrix is dense, so they cannot efficiently handle a sparse matrix, which contains many zeros, because they duplicate computations and communications of identical values. We solve this problem by re-formalizing the matrix data type to cope with sparse matrices and by implementing a new C++ class for SkeTo with efficient sparse matrix skeletons based on this formalization. Experimental results show that the new skeletons for sparse matrices perform well compared to the existing skeletons for dense matrices.
Yuki Karasawa and H. Iwasaki, "Parallel Skeletons for Sparse Matrices in SkeTo Skeleton Library," IPSJ Digital Courier, March 15, 2008. DOI: 10.2197/IPSJDC.4.167
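Why sparse skeletons avoid redundant work can be illustrated with a toy sparse representation and a `map`-style skeleton that touches only stored nonzeros (a Python sketch under our own representation, not SkeTo's actual C++ API):

```python
def sparse_from_dense(rows):
    """Store only nonzeros as {(i, j): value} -- a toy stand-in for the
    compressed formats a sparse skeleton library would use internally."""
    return {(i, j): v for i, r in enumerate(rows)
                      for j, v in enumerate(r) if v != 0}

def sparse_map(f, m):
    """'map' skeleton: apply f to every stored (nonzero) element only,
    instead of visiting every cell of a dense matrix."""
    return {ij: f(v) for ij, v in m.items()}

def spmv(m, x):
    """Sparse matrix-vector product touching only the nonzeros."""
    y = [0] * (1 + max(i for i, _ in m))
    for (i, j), v in m.items():
        y[i] += v * x[j]
    return y

m = sparse_from_dense([[0, 2, 0], [0, 0, 3]])
assert sparse_map(lambda v: v * 10, m) == {(0, 1): 20, (1, 2): 30}
assert spmv(m, [1, 1, 1]) == [2, 3]
```

A dense skeleton would perform six multiplications here; the sparse one performs two, which is the duplicated-work problem the paper's re-formalized matrix type eliminates.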
We study the control operators “control” and “prompt,” which manage part of a continuation, that is, a delimited continuation. They are similar to the well-known control operators “shift” and “reset,” but differ in that the former pair is dynamic while the latter is static. In this paper, we introduce a static type system for “control” and “prompt” that does not use recursive types. We design our type system based on the dynamic CPS transformation recently proposed by Biernacki, Danvy and Millikin. We also introduce let-polymorphism into our type system, and show that it satisfies several important properties such as strong type soundness.
T. Yonezawa and Yukiyoshi Kameyama, "A Type System for Dynamic Delimited Continuations," IPSJ Digital Courier, March 15, 2008. DOI: 10.2197/IPSJDC.4.182
Comparative analysis of organisms via their metabolic pathways gives important information about functions within organisms. In this paper, we propose a new method for comparing metabolic pathways by means of reaction structures that include important enzymes. In this method, subgraphs of pathways that include a ‘choke point’ or a ‘load point’ are extracted as important “reaction structures,” and a “reaction structure profile,” which records whether each extracted reaction structure is observed in the metabolic pathway of another organism, is created. A distance between species, with respect to function within organisms, is then defined using the reaction structure profile. By applying the proposed method to the metabolic networks of 64 representative organisms selected from Archaea, Eubacteria and Eukaryota in the KEGG database, we succeed in reconstructing a phylogenetic tree, confirming the effectiveness of the method.
Y. Ashida, Tomonobu Ozaki and T. Ohkawa, "Reaction Structure Profile: A Comparative Analysis of Metabolic Pathways Based on Important Substructures," IPSJ Digital Courier, March 15, 2008. DOI: 10.2197/IPSJDC.4.228
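A distance between species derived from presence/absence profiles can be sketched as follows (a Jaccard-style measure over binary vectors; the paper's exact metric may differ, and the organism data here is invented for illustration):

```python
def profile_distance(p, q):
    """Distance between two presence/absence profiles of reaction structures:
    1 minus the fraction of structures shared among those present in either."""
    both = sum(1 for a, b in zip(p, q) if a and b)
    either = sum(1 for a, b in zip(p, q) if a or b)
    return 1.0 - both / either if either else 0.0

# Each position: 1 = that reaction structure is observed in the organism.
org_a = [1, 1, 0, 1]
org_b = [1, 0, 1, 1]
print(profile_distance(org_a, org_b))  # -> 0.5
```

Pairwise distances of this kind, computed over all 64 organisms, are what a standard tree-building method (e.g., neighbor joining) would consume to reconstruct the phylogeny.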
Chemical and biological activities of compounds provide valuable information for discovering new drugs. Compound fingerprints, which encode structural information, are used to screen candidate compounds by similarity. However, prediction accuracy based on structural similarity alone is limited. Although the amount of compound data is growing rapidly, the number of well-annotated compounds, e.g., those in the MDL Drug Data Report (MDDR) database, has not increased as quickly. Because compounds known to be active against a given biological target are rare in the drug discovery process, a prediction method must either remain accurate as the number of active compounds decreases or keep the false positive rate low in databases containing many un-annotated compounds and few annotated ones. In this paper, we propose a new similarity scoring method that combines the Tanimoto coefficient with the proximity measure of a random forest. The score thus captures two complementary properties, one derived from an unsupervised view of the compounds and one from a supervised view. By comparing prediction performance against the Tanimoto coefficient and the proximity measure used individually, with Linear Discriminant Analysis (LDA) as the classifier, we demonstrate that the proposed scoring method outperforms both. We estimate the prediction accuracy on compound datasets extracted from MDDR and show that the proposed method can identify active compounds in datasets that include many un-annotated compounds.
G. Kawamura, S. Seno, Y. Takenaka and H. Matsuda, "A Combination Method of the Tanimoto Coefficient and Proximity Measure of Random Forest for Compound Activity Prediction," IPSJ Digital Courier, March 15, 2008. DOI: 10.2197/IPSJDC.4.238
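The Tanimoto half of the combined score is straightforward to state in code (the random-forest proximity half requires a trained forest and is omitted; the fingerprint bit positions below are illustrative):

```python
def tanimoto(fp_a, fp_b):
    """Tanimoto coefficient of two binary fingerprints, given as sets of
    'on' bit positions: |A & B| / |A | B|."""
    a, b = set(fp_a), set(fp_b)
    union = len(a | b)
    return len(a & b) / union if union else 0.0

fp1 = {1, 4, 7, 9}   # bits set in compound 1's structural fingerprint
fp2 = {1, 4, 8}      # bits set in compound 2's structural fingerprint
print(tanimoto(fp1, fp2))  # -> 0.4  (2 shared bits / 5 bits in the union)
```

In the paper's scheme this unsupervised structural score is combined with the supervised proximity measure, so that two compounds rank as similar only when both their substructures and their learned activity neighborhoods agree.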
This paper proposes a novel clustering method based on graph theory for the analysis of biological networks. In this method, each biological network is treated as an undirected graph whose edges are weighted by the similarities of their endpoint nodes. Then, maximal components, which are defined based on edge connectivity, are computed, and the nodes are partitioned into clusters by selecting disjoint maximal components. The proposed method was applied to the clustering of protein sequences and compared with conventional clustering methods. The resulting clusters were evaluated using P-values for GO (Gene Ontology) terms. The average P-values for the proposed method were better than those for the other methods.
M. Hayashida, T. Akutsu and H. Nagamochi, "A Clustering Method for Analysis of Sequence Similarity Networks of Proteins Using Maximal Components of Graphs," IPSJ Digital Courier, March 15, 2008. DOI: 10.2197/IPSJDC.4.207
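The overall pipeline — weight edges by similarity, then partition into components — can be sketched as follows. Note this is a deliberate simplification: it keeps edges above a similarity threshold and returns ordinary connected components, whereas the paper's maximal components are defined by edge *connectivity*, a strictly stronger criterion.

```python
def cluster_by_threshold(nodes, weighted_edges, t):
    """Keep edges with similarity weight >= t and return the connected
    components as sorted lists (simplified stand-in for the paper's
    edge-connectivity-based maximal components)."""
    adj = {n: [] for n in nodes}
    for u, v, w in weighted_edges:
        if w >= t:
            adj[u].append(v)
            adj[v].append(u)
    seen, clusters = set(), []
    for n in nodes:
        if n in seen:
            continue
        stack, comp = [n], set()
        while stack:                 # iterative DFS over kept edges
            x = stack.pop()
            if x in comp:
                continue
            comp.add(x)
            stack.extend(adj[x])
        seen |= comp
        clusters.append(sorted(comp))
    return clusters

edges = [("p1", "p2", 0.9), ("p2", "p3", 0.2), ("p3", "p4", 0.8)]
print(cluster_by_threshold(["p1", "p2", "p3", "p4"], edges, 0.5))
# -> [['p1', 'p2'], ['p3', 'p4']]
```

Edge connectivity additionally requires every cluster to survive the removal of any few edges, which protects against two dense groups being merged by a single spurious high-similarity edge.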
We accelerate the time-consuming iterations for projective reconstruction, a key component of self-calibration for computing 3-D shapes from feature point tracking over a video sequence. We first summarize the algorithms of the primal and dual methods for projective reconstruction. Then, we replace the eigenvalue computation in each step by the power method. We also accelerate the power method itself. Furthermore, we introduce the SOR method for accelerating the subspace fitting involved in the iterations. Using simulated and real video images, we demonstrate that the computation sometimes becomes several thousand times faster.
H. Ackermann and K. Kanatani, "Fast Projective Reconstruction: Toward Ultimate Efficiency," IPSJ Digital Courier, March 15, 2008. DOI: 10.2197/IPSJDC.4.257
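The power method that replaces the full eigenvalue computation in each iteration can be sketched in pure Python (a minimal version for a matrix with a positive dominant eigenvalue; the paper applies it inside a larger projective-reconstruction loop):

```python
def power_method(a, iters=200):
    """Estimate the dominant eigenvalue and eigenvector of square matrix a
    by repeated multiplication and max-norm normalization."""
    n = len(a)
    v = [1.0] * n
    lam = 0.0
    for _ in range(iters):
        # w = a @ v
        w = [sum(a[i][j] * v[j] for j in range(n)) for i in range(n)]
        lam = max(abs(x) for x in w)   # eigenvalue estimate (max norm)
        v = [x / lam for x in w]       # renormalize to avoid overflow
    return lam, v

# diag(2, 1) has dominant eigenvalue 2 with eigenvector (1, 0).
lam, v = power_method([[2.0, 0.0], [0.0, 1.0]])
print(round(lam, 6))  # -> 2.0
```

Each iteration costs only a matrix-vector product, which is why substituting it for a full eigendecomposition inside the primal/dual iterations yields the large speedups the paper reports.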
As increasing clock frequency approaches its physical limits, a good approach to enhancing performance is to increase parallelism by integrating more cores as coprocessors to general-purpose processors, in order to handle the different workloads in scientific, engineering, and signal processing applications. In this paper, we propose a many-core matrix processor model consisting of a scalar unit augmented with b × b simple cores tightly connected in a 2D torus matrix unit to accelerate matrix-based kernels. Data load/store is overlapped with computation using a decoupled data access unit that moves b × b blocks of data between memory and the scalar and matrix processing units. The matrix unit mainly processes fine-grained b × b matrix multiply-add (MMA) operations. We formulate the data alignment operations, including matrix transposition and skewing, as MMA operations in order to overlap them with data load/store. Two fundamental linear algebra algorithms are designed and analytically evaluated on the proposed matrix processor: the Level-3 BLAS kernel GEMM, and LU factorization with partial pivoting, the main step in solving linear systems of equations. For the GEMM kernel, the maximum speed of computing, measured in FLOPs/cycle, is approached for different matrix sizes n and block sizes b. The speed of the LU factorization for relatively large values of n ranges from around 50–90% of the maximum speed, depending on the model parameters.
A. Zekri and S. Sedukhin, "Level-3 BLAS and LU Factorization on a Matrix Processor," IPSJ Digital Courier, March 15, 2008. DOI: 10.2197/IPSJDC.4.151
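Decomposing GEMM into b × b multiply-add block operations — the primitive the proposed matrix unit executes — can be sketched as follows (a reference sketch of the blocking scheme, not the hardware model itself):

```python
def blocked_gemm(a, b_mat, bs):
    """C = A*B computed as a sequence of bs-by-bs multiply-add (MMA) block
    operations: for each block triple, C[i0:, j0:] += A[i0:, k0:] * B[k0:, j0:]."""
    n = len(a)
    c = [[0.0] * n for _ in range(n)]
    for i0 in range(0, n, bs):
        for j0 in range(0, n, bs):
            for k0 in range(0, n, bs):
                # one fine-grained b x b MMA operation
                for i in range(i0, min(i0 + bs, n)):
                    for j in range(j0, min(j0 + bs, n)):
                        s = 0.0
                        for k in range(k0, min(k0 + bs, n)):
                            s += a[i][k] * b_mat[k][j]
                        c[i][j] += s
    return c

a = [[1, 2], [3, 4]]
b = [[5, 6], [7, 8]]
print(blocked_gemm(a, b, 1))  # -> [[19.0, 22.0], [43.0, 50.0]]
```

Because each inner MMA reads and writes only b × b tiles, the decoupled access unit can stream the next blocks from memory while the matrix unit computes on the current ones, which is the overlap the analytical model exploits.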
Protein-protein interactions play an important role in a number of biological activities. We developed two methods of predicting protein-protein interaction site residues: one uses only sequence information, and the other uses both sequence and structural information. We used a support vector machine (SVM) with a position-specific scoring matrix (PSSM) as sequence information and the accessible surface area (ASA) of polar and non-polar atoms as structural information. The SVM is used in two stages. In the first stage, an interaction residue is predicted from the PSSMs of sequentially neighboring residues, or from the PSSMs and ASAs of spatially neighboring residues. The second stage acts as a filter that refines the prediction results. The recall and precision of the predictor using both sequence and structural information are 73.6% and 50.5%, respectively. We found that using the PSSM instead of the frequency of amino acid appearance was the main factor in the improvement of our methods.
Masanori Kakuta, Shugo Nakamura and K. Shimizu, "Prediction of Protein-Protein Interaction Sites Using Only Sequence Information and Using Both Sequence and Structural Information," IPSJ Digital Courier, March 15, 2008. DOI: 10.2197/IPSJDC.4.217
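The recall and precision figures quoted above are computed per residue; the evaluation can be sketched as follows (the residue indices are invented for illustration):

```python
def precision_recall(predicted, actual):
    """Precision and recall of predicted interaction-site residues,
    given sets of residue indices flagged by the classifier vs. the truth."""
    tp = len(predicted & actual)                       # true positives
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(actual) if actual else 0.0
    return precision, recall

predicted = {3, 5, 8, 12}       # residues the SVM flags as interface sites
actual = {5, 8, 9, 12, 20}      # residues actually at the interface
p, r = precision_recall(predicted, actual)
print(p, r)  # -> 0.75 0.6
```

High recall with moderate precision, as reported (73.6% vs. 50.5%), means the predictor finds most true interface residues at the cost of some over-flagging; the second-stage filter exists precisely to trim those false positives.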