首页 > 最新文献

Match-Communications in Mathematical and in Computer Chemistry最新文献

英文 中文
Fer-COCL: A Novel Method Based on Multiple Deep Learning Algorithms for Identifying Fertility-Related Proteins ferc - cocl:一种基于多个深度学习算法的鉴定生育相关蛋白的新方法
IF 2.6 2区 化学 Q1 Mathematics Pub Date : 2023-07-01 DOI: 10.46793/match.90-3.537z
Shenmin Zhang, Xinjie Li, Hongyan Shi, Yuanyuan Jing, Yunyun Liang, Yusen Zhang
The survival of species depends on the fertility of organisms. It is also worthwhile to study the proteins that can regulate the reproductive activity of organisms. Since biological experiments are laborious to confirm proteins, it has become a priority that develop relevant computational models to predict the function of fertility-related proteins. With the development of machine learning, pertinent various algorithms can be the key to identifying fertility-related proteins. In this work, we develop a model Fer-COCL based on deep learning. The model consists of multiple features as well as multiple deep learning algorithms. First, we extract features using Amino acid composition (AAC), Dipeptide composition (DPC), CTD transition (CTDT) and deviation between the dipeptide and the expected mean (DDE). After that, the spliced features are fed into the classifier. The data processed jointly by convolutional neural network and long short-term memory is input to the fully connected layer for classification. After evaluating the model using 10-fold cross-validation, the accuracy of the two data sets reaches 97.1% and 98.3%, respectively. The results indicate that the model is efficient and accurate, facilitating biologists' research on biological fertility. In addition, a free online tool for predicting the function of fertility-related proteins is available at http://fercocl.zhanglab.site/.
物种的生存取决于生物体的繁殖力。研究能够调节生物体生殖活动的蛋白质也是值得的。由于生物实验很难确认蛋白质,因此开发相关的计算模型来预测生育相关蛋白质的功能已成为当务之急。随着机器学习的发展,相关的各种算法可以成为识别生育相关蛋白的关键。在这项工作中,我们开发了一个基于深度学习的Fer-COCL模型。该模型由多个特征和多个深度学习算法组成。首先,我们利用氨基酸组成(AAC)、二肽组成(DPC)、CTD过渡(CTDT)和二肽与预期均值之间的偏差(DDE)提取特征。之后,将拼接后的特征输入到分类器中。将卷积神经网络与长短期记忆共同处理的数据输入到全连接层进行分类。采用10倍交叉验证对模型进行评估后,两组数据集的准确率分别达到97.1%和98.3%。结果表明,该模型高效、准确,为生物学家研究生物生育力提供了方便。此外,一个预测生育相关蛋白质功能的免费在线工具可在http://fercocl.zhanglab.site/上获得。
{"title":"Fer-COCL: A Novel Method Based on Multiple Deep Learning Algorithms for Identifying Fertility-Related Proteins","authors":"Shenmin Zhang, Xinjie Li, Hongyan Shi, Yuanyuan Jing, Yunyun Liang, Yusen Zhang","doi":"10.46793/match.90-3.537z","DOIUrl":"https://doi.org/10.46793/match.90-3.537z","url":null,"abstract":"The survival of species depends on the fertility of organisms. It is also worthwhile to study the proteins that can regulate the reproductive activity of organisms. Since biological experiments are laborious to confirm proteins, it has become a priority that develop relevant computational models to predict the function of fertility-related proteins. With the development of machine learning, pertinent various algorithms can be the key to identifying fertility-related proteins. In this work, we develop a model Fer-COCL based on deep learning. The model consists of multiple features as well as multiple deep learning algorithms. First, we extract features using Amino acid composition (AAC), Dipeptide composition (DPC), CTD transition (CTDT) and deviation between the dipeptide and the expected mean (DDE). After that, the spliced features are fed into the classifier. The data processed jointly by convolutional neural network and long short-term memory is input to the fully connected layer for classification. After evaluating the model using 10-fold cross-validation, the accuracy of the two data sets reaches 97.1% and 98.3%, respectively. The results indicate that the model is efficient and accurate, facilitating biologists' research on biological fertility. In addition, a free online tool for predicting the function of fertility-related proteins is available at http://fercocl.zhanglab.site/.","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80134941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estrada and L-Estrada Indices of a Graph and Their Relationship with the Number of Spanning Trees 图的Estrada和L-Estrada指标及其与生成树数目的关系
IF 2.6 2区 化学 Q1 Mathematics Pub Date : 2023-07-01 DOI: 10.46793/match.90-3.787a
Mahsa Arabzadeh, G. Fath-Tabar, Hamid Rasoli, A. Tehranian
{"title":"Estrada and L-Estrada Indices of a Graph and Their Relationship with the Number of Spanning Trees","authors":"Mahsa Arabzadeh, G. Fath-Tabar, Hamid Rasoli, A. Tehranian","doi":"10.46793/match.90-3.787a","DOIUrl":"https://doi.org/10.46793/match.90-3.787a","url":null,"abstract":"","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79731483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Note on an Inequality Between Energy and Sombor Index of a Graph 图的能量与Sombor指数不等的一个注记
IF 2.6 2区 化学 Q1 Mathematics Pub Date : 2023-07-01 DOI: 10.46793/match.90-3.765a
S. Akbari, M. Habibi, Soheyr Rouhani
{"title":"A Note on an Inequality Between Energy and Sombor Index of a Graph","authors":"S. Akbari, M. Habibi, Soheyr Rouhani","doi":"10.46793/match.90-3.765a","DOIUrl":"https://doi.org/10.46793/match.90-3.765a","url":null,"abstract":"","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84293737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Harary Index of Graphs with Given Dissociation Number 给定解离数图的哈里索引
IF 2.6 2区 化学 Q1 Mathematics Pub Date : 2023-07-01 DOI: 10.46793/match.90-3.649h
Jing Huang, Minjie Zhang
{"title":"On the Harary Index of Graphs with Given Dissociation Number","authors":"Jing Huang, Minjie Zhang","doi":"10.46793/match.90-3.649h","DOIUrl":"https://doi.org/10.46793/match.90-3.649h","url":null,"abstract":"","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82310787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Fast Approach for Protein Classification and Evolutionary Analysis 一种新的快速蛋白质分类和进化分析方法
IF 2.6 2区 化学 Q1 Mathematics Pub Date : 2023-04-01 DOI: 10.46793/match.90-2.381a
Liang Ai, Jie Feng, Yu-Hua Yao
In this paper, we propose a new fast alignment-free method for protein sequence similarity and evolutionary analysis. First 20 natural amino acids are clustered into 6 groups based on their physicochemical properties, then a 12-dimensional vector is constructed based on the frequency and the average position of occurrence of amino acids in each reduced amino acid sequences. Finally, the Euclidean distance is used to measure the similarity and evolutionary distance between protein sequences. The test on three datasets shows that our method can cluster each protein sequence accurately, which illustrates the effective of our method.
本文提出了一种新的蛋白质序列相似性和进化分析的快速无比对方法。首先根据20种天然氨基酸的理化性质将其聚为6类,然后根据氨基酸在每个还原氨基酸序列中出现的频率和平均位置构建一个12维向量。最后,利用欧几里得距离来度量蛋白质序列之间的相似性和进化距离。在三个数据集上的测试表明,我们的方法可以准确地聚类每个蛋白质序列,说明了我们的方法的有效性。
{"title":"A Novel Fast Approach for Protein Classification and Evolutionary Analysis","authors":"Liang Ai, Jie Feng, Yu-Hua Yao","doi":"10.46793/match.90-2.381a","DOIUrl":"https://doi.org/10.46793/match.90-2.381a","url":null,"abstract":"In this paper, we propose a new fast alignment-free method for protein sequence similarity and evolutionary analysis. First 20 natural amino acids are clustered into 6 groups based on their physicochemical properties, then a 12-dimensional vector is constructed based on the frequency and the average position of occurrence of amino acids in each reduced amino acid sequences. Finally, the Euclidean distance is used to measure the similarity and evolutionary distance between protein sequences. The test on three datasets shows that our method can cluster each protein sequence accurately, which illustrates the effective of our method.","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73934735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Kashuri Fundo Decomposition Method for Solving Michaelis-Menten Nonlinear Biochemical Reaction Model 求解Michaelis-Menten非线性生化反应模型的Kashuri Fundo分解法
IF 2.6 2区 化学 Q1 Mathematics Pub Date : 2023-04-01 DOI: 10.46793/match.90-2.315p
H. Peker, Fatma Aybike Çuha, Bilge Peker
In most of the real life problems, we encounter with nonlinear differential equations. Problems are made more understandable by modeling them with these equations. In this way, it becomes easier to interpret the problems and reach the results. In 1913, the basic enzymatic reaction model introduced by Michaelis and Menten to describe enzyme processes is an example of nonlinear differential equation. This model is the one of the simplest and best-known approaches of the mechanisms used to model enzyme-catalyzed reactions and is the most studied. For most nonlinear differential equations, it is very difficult to get an analytical solution. For this reason, various studies have been carried out to find approximate solutions to such equations. Among these studies, those in which two different methods are used by blending attract attention. In this study, a blended form of the Kashuri Fundo transform method and the Adomian decomposition method, so-called the Kashuri Fundo decomposition method, is used to find a solution to the Michaelis-Menten nonlinear biochemical reaction model in this way. This method has been applied to the biochemical reaction model and an approximate solution has been obtained for this model without complex calculations. This shows that the hybrid method is an effective, reliable, simpler and time-saving method in reaching the solutions of nonlinear differential equations.
在现实生活中的大多数问题中,我们都会遇到非线性微分方程。用这些方程来模拟问题,问题就容易理解了。这样,就更容易解释问题并得出结果。1913年Michaelis和Menten提出的描述酶过程的基本酶反应模型就是一个非线性微分方程的例子。该模型是用于模拟酶催化反应的机制中最简单和最著名的方法之一,也是研究最多的。对于大多数非线性微分方程,很难得到解析解。由于这个原因,人们进行了各种各样的研究来寻找这类方程的近似解。在这些研究中,混合使用两种不同方法的研究引起了人们的注意。本研究采用Kashuri Fundo变换法和Adomian分解法的混合形式,即所谓的Kashuri Fundo分解法,以这种方式求解Michaelis-Menten非线性生化反应模型。将该方法应用于生化反应模型,无需复杂的计算即可得到该模型的近似解。结果表明,该方法是求解非线性微分方程的一种有效、可靠、简便、省时的方法。
{"title":"Kashuri Fundo Decomposition Method for Solving Michaelis-Menten Nonlinear Biochemical Reaction Model","authors":"H. Peker, Fatma Aybike Çuha, Bilge Peker","doi":"10.46793/match.90-2.315p","DOIUrl":"https://doi.org/10.46793/match.90-2.315p","url":null,"abstract":"In most of the real life problems, we encounter with nonlinear differential equations. Problems are made more understandable by modeling them with these equations. In this way, it becomes easier to interpret the problems and reach the results. In 1913, the basic enzymatic reaction model introduced by Michaelis and Menten to describe enzyme processes is an example of nonlinear differential equation. This model is the one of the simplest and best-known approaches of the mechanisms used to model enzyme-catalyzed reactions and is the most studied. For most nonlinear differential equations, it is very difficult to get an analytical solution. For this reason, various studies have been carried out to find approximate solutions to such equations. Among these studies, those in which two different methods are used by blending attract attention. In this study, a blended form of the Kashuri Fundo transform method and the Adomian decomposition method, so-called the Kashuri Fundo decomposition method, is used to find a solution to the Michaelis-Menten nonlinear biochemical reaction model in this way. This method has been applied to the biochemical reaction model and an approximate solution has been obtained for this model without complex calculations. This shows that the hybrid method is an effective, reliable, simpler and time-saving method in reaching the solutions of nonlinear differential equations.","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74466097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Persistent Subspaces of Reaction-Based Dynamical Systems 基于反应的动力系统的持久子空间
IF 2.6 2区 化学 Q1 Mathematics Pub Date : 2023-04-01 DOI: 10.46793/match.90-2.471i
B. Ibrahim, Stephan Peter
Various types of dynamical systems, such as ordinary differential equations (ODEs) or partial differential equations (PDEs), are widely applied not only in chemistry but also in many scientific disciplines to model the dynamics arising from interactions described by reactions between molecules, individuals, or species. This study provides an overview of how Chemical Organization Theory (COT) can be used to analyze such systems by identifying all potentially persistent species solely from the underlying reaction network, without the need for simulations or even knowledge of reaction constants or kinetic laws. Two minimalist examples with only three resp. four species are used to introduce all fundamental definitions including a new, naturally arising concept of persistence, and to illustrate the fore-mentioned technique without mathematical details such as proofs. Thereby, COT is shown to provide measures to analyze, compare, and construct very complex systems on an abstract level and thus to complement other powerful techniques for the analysis of complex systems such as deficiency, RAF theory, elementary modes, graph theory, Lyapunov functions, and bifurcation theory.
各种类型的动力系统,如常微分方程(ode)或偏微分方程(PDEs),不仅广泛应用于化学,而且广泛应用于许多科学学科,以模拟分子、个体或物种之间的反应所产生的相互作用的动力学。本研究概述了化学组织理论(COT)如何通过仅从潜在的反应网络中识别所有潜在的持久性物种来分析此类系统,而无需模拟甚至不需要了解反应常数或动力学定律。两个极简主义的例子,只有三个响应。使用四个种类来介绍所有基本定义,包括一个新的、自然产生的持久性概念,并在没有数学细节(如证明)的情况下说明前面提到的技术。因此,COT提供了在抽象层次上分析、比较和构建非常复杂系统的方法,从而补充了分析复杂系统的其他强大技术,如缺陷、RAF理论、基本模态、图论、Lyapunov函数和分岔理论。
{"title":"Persistent Subspaces of Reaction-Based Dynamical Systems","authors":"B. Ibrahim, Stephan Peter","doi":"10.46793/match.90-2.471i","DOIUrl":"https://doi.org/10.46793/match.90-2.471i","url":null,"abstract":"Various types of dynamical systems, such as ordinary differential equations (ODEs) or partial differential equations (PDEs), are widely applied not only in chemistry but also in many scientific disciplines to model the dynamics arising from interactions described by reactions between molecules, individuals, or species. This study provides an overview of how Chemical Organization Theory (COT) can be used to analyze such systems by identifying all potentially persistent species solely from the underlying reaction network, without the need for simulations or even knowledge of reaction constants or kinetic laws. Two minimalist examples with only three resp. four species are used to introduce all fundamental definitions including a new, naturally arising concept of persistence, and to illustrate the fore-mentioned technique without mathematical details such as proofs. Thereby, COT is shown to provide measures to analyze, compare, and construct very complex systems on an abstract level and thus to complement other powerful techniques for the analysis of complex systems such as deficiency, RAF theory, elementary modes, graph theory, Lyapunov functions, and bifurcation theory.","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82238236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Sombor Index of c-Cyclic Chemical Graphs c-循环化学图的Sombor指数
IF 2.6 2区 化学 Q1 Mathematics Pub Date : 2023-04-01 DOI: 10.46793/match.90-2.495l
Hechao Liu, L. You, Yufei Huang
,
{"title":"Sombor Index of c-Cyclic Chemical Graphs","authors":"Hechao Liu, L. You, Yufei Huang","doi":"10.46793/match.90-2.495l","DOIUrl":"https://doi.org/10.46793/match.90-2.495l","url":null,"abstract":",","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82684269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Resonance Graphs and a Binary Coding of Perfect Matchings of Outerplane Bipartite Graphs 共振图与外平面二部图完美匹配的二进制编码
IF 2.6 2区 化学 Q1 Mathematics Pub Date : 2023-04-01 DOI: 10.46793/match.90-2.453b
S. Brezovnik, Niko Tratnik, Petra Žigert Pleteršek
The aim of this paper is to investigate resonance graphs of 2- connected outerplane bipartite graphs, which include various families of molecular graphs. Firstly, we present an algorithm for a binary coding of perfect matchings of these graphs. Further, 2- connected outerplane bipartite graphs with isomorphic resonance graphs are considered. In particular, it is shown that if two 2- connected outerplane bipartite graphs are evenly homeomorphic, then its resonance graphs are isomorphic. Moreover, we prove that for any 2-connected outerplane bipartite graph G there exists a catacondensed even ring systems H such that the resonance graphs of G and H are isomorphic. We conclude with the characterization of 2-connected outerplane bipartite graphs whose resonance graphs are daisy cubes.
本文的目的是研究包括各种分子图族的2连通外平面二部图的共振图。首先,我们提出了一个对这些图的完美匹配进行二进制编码的算法。进一步研究了具有同构共振图的2连通外平面二部图。特别地,证明了如果两个2连通的外平面二部图是均匀同胚的,则其共振图是同胚的。进一步证明了对于任意2连通外平面二部图G,存在一个凝聚偶环系统H,使得G与H的共振图同构。我们得到了共振图为雏菊立方体的2连通外平面二部图的表征。
{"title":"Resonance Graphs and a Binary Coding of Perfect Matchings of Outerplane Bipartite Graphs","authors":"S. Brezovnik, Niko Tratnik, Petra Žigert Pleteršek","doi":"10.46793/match.90-2.453b","DOIUrl":"https://doi.org/10.46793/match.90-2.453b","url":null,"abstract":"The aim of this paper is to investigate resonance graphs of 2- connected outerplane bipartite graphs, which include various families of molecular graphs. Firstly, we present an algorithm for a binary coding of perfect matchings of these graphs. Further, 2- connected outerplane bipartite graphs with isomorphic resonance graphs are considered. In particular, it is shown that if two 2- connected outerplane bipartite graphs are evenly homeomorphic, then its resonance graphs are isomorphic. Moreover, we prove that for any 2-connected outerplane bipartite graph G there exists a catacondensed even ring systems H such that the resonance graphs of G and H are isomorphic. We conclude with the characterization of 2-connected outerplane bipartite graphs whose resonance graphs are daisy cubes.","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78042618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Protein Sequence Comparison Method Based on 3-ary Huffman Coding 基于3-ary Huffman编码的蛋白质序列比较方法
IF 2.6 2区 化学 Q1 Mathematics Pub Date : 2023-04-01 DOI: 10.46793/match.90-2.357q
Zhaohui Qi, Yingqiang Ning, Yinmei Huang
Based on 3-ary Huffman coding algorithm, we propose a digital mapping method of protein sequence. Firstly, a 3-ary Huffman tree is defined by the frequency characteristic of 20 amino acids in given protein sequences. The 0-2 codes of 20 amino acids constructed by the 3-ary Huffman tree can convert long protein sequences into one-to-one 0-2 digital sequences. According to the frequency characteristic and the distribution information of 0-2 codes of 20 amino acids in the 0-2 digital sequences, we design the 40-dimensional vectors to characterize the protein sequences. Next, the proposed digital mapping method is used to perform three separate applications, similarity comparison of nine ND6 proteins, evolutionary trend analysis of the 2009 pandemic Human influenza A (H1N1) viruses from January 2020 to June 2022, and the evolution analysis of 95 coronavirus genes. The results illustrate the utility of the proposed method.
基于三进位霍夫曼编码算法,提出了一种蛋白质序列的数字映射方法。首先,根据给定蛋白质序列中20个氨基酸的频率特征,定义3-ary Huffman树;由3-ary Huffman树构建的20个氨基酸的0-2编码可以将长蛋白质序列转换成一对一的0-2数字序列。根据0-2数字序列中20个氨基酸的0-2编码的频率特征和分布信息,设计了40维载体来表征蛋白质序列。接下来,利用所提出的数字作图方法对9个ND6蛋白的相似性比较、2009年甲型H1N1流感大流行病毒2020年1月至2022年6月的进化趋势分析和95个冠状病毒基因的进化分析进行了3个单独的应用。结果表明了该方法的有效性。
{"title":"Protein Sequence Comparison Method Based on 3-ary Huffman Coding","authors":"Zhaohui Qi, Yingqiang Ning, Yinmei Huang","doi":"10.46793/match.90-2.357q","DOIUrl":"https://doi.org/10.46793/match.90-2.357q","url":null,"abstract":"Based on 3-ary Huffman coding algorithm, we propose a digital mapping method of protein sequence. Firstly, a 3-ary Huffman tree is defined by the frequency characteristic of 20 amino acids in given protein sequences. The 0-2 codes of 20 amino acids constructed by the 3-ary Huffman tree can convert long protein sequences into one-to-one 0-2 digital sequences. According to the frequency characteristic and the distribution information of 0-2 codes of 20 amino acids in the 0-2 digital sequences, we design the 40-dimensional vectors to characterize the protein sequences. Next, the proposed digital mapping method is used to perform three separate applications, similarity comparison of nine ND6 proteins, evolutionary trend analysis of the 2009 pandemic Human influenza A (H1N1) viruses from January 2020 to June 2022, and the evolution analysis of 95 coronavirus genes. The results illustrate the utility of the proposed method.","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83045307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Match-Communications in Mathematical and in Computer Chemistry
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1