首页 > 最新文献

Vavilovskii Zhurnal Genetiki i Selektsii最新文献

英文 中文
Identification of proteins regulating phenotype-associated genes of M2 macrophages: a bioinformatic analysis. M2巨噬细胞表型相关基因调节蛋白的鉴定:生物信息学分析。
IF 1 Q3 AGRICULTURE, MULTIDISCIPLINARY Pub Date : 2025-12-01 DOI: 10.18699/vjgb-25-104
E A Antropova, I V Yatsyk, P S Demenkov, T V Ivanisenko, V A Ivanisenko

Macrophages are immune system cells that perform various, often opposing, functions in the organism depending on the incoming microenvironment signals. This is possible due to the plasticity of macrophages, which allows them to radically alter their phenotypic characteristics and gene expression profiles, as well as return to their original, non-activated state. Depending on the inductors acting on the cell, macrophages are activated into various functional states. There are five main phenotypes of activated macrophages: M1, M2a, M2b, M2c, and M2d. Although the amount of genome-wide transcriptomic and proteomic data showing differences between major macrophage phenotypes and non-activated macrophages (M0) is rapidly growing, questions regarding the mechanisms regulating gene and protein expression profiles in macrophages of different phenotypes still remain. We compiled lists of proteins associated with the macrophage phenotypes M1, M2a, M2b, M2c, and M2d (phenotype-associated proteins) and analyzed the data on potential mediators of macrophage polarization. Furthermore, using the computational system ANDSystem, we conducted a search and analysis of the relationships between potential regulatory proteins and the genes encoding the proteins associated with the M2 group phenotypes, obtaining estimates of the statistical significance of these relationships. The results indicate that the differences in the M2a, M2b, M2c, and M2d macrophage phenotypes may be attributed to the regulatory effects of the proteins JUN, IL8, NFAC2, CCND1, and YAP1. The expression levels of these proteins vary among the M2 group phenotypes, which in turn leads to different levels of gene expression associated with specific phenotypes.

巨噬细胞是一种免疫系统细胞,在生物体中根据传入的微环境信号执行各种(通常是相反的)功能。这是可能的,因为巨噬细胞具有可塑性,可以从根本上改变其表型特征和基因表达谱,并返回到原始的非激活状态。根据作用于细胞的诱导剂,巨噬细胞被激活到不同的功能状态。活化的巨噬细胞有五种主要表型:M1、M2a、M2b、M2c和M2d。尽管显示主要巨噬细胞表型和非活化巨噬细胞(M0)之间差异的全基因组转录组学和蛋白质组学数据正在迅速增加,但关于不同表型巨噬细胞中基因和蛋白质表达谱的调节机制仍然存在疑问。我们编制了巨噬细胞表型M1、M2a、M2b、M2c和M2d(表型相关蛋白)相关蛋白列表,并分析了巨噬细胞极化的潜在介质数据。此外,利用计算系统ANDSystem,我们对潜在调控蛋白与编码M2组表型相关蛋白的基因之间的关系进行了搜索和分析,获得了这些关系的统计显著性估计。结果表明,M2a, M2b, M2c和M2d巨噬细胞表型的差异可能归因于蛋白JUN, IL8, NFAC2, CCND1和YAP1的调节作用。这些蛋白的表达水平在M2组表型中有所不同,这反过来导致与特定表型相关的基因表达水平不同。
{"title":"Identification of proteins regulating phenotype-associated genes of M2 macrophages: a bioinformatic analysis.","authors":"E A Antropova, I V Yatsyk, P S Demenkov, T V Ivanisenko, V A Ivanisenko","doi":"10.18699/vjgb-25-104","DOIUrl":"https://doi.org/10.18699/vjgb-25-104","url":null,"abstract":"<p><p>Macrophages are immune system cells that perform various, often opposing, functions in the organism depending on the incoming microenvironment signals. This is possible due to the plasticity of macrophages, which allows them to radically alter their phenotypic characteristics and gene expression profiles, as well as return to their original, non-activated state. Depending on the inductors acting on the cell, macrophages are activated into various functional states. There are five main phenotypes of activated macrophages: M1, M2a, M2b, M2c, and M2d. Although the amount of genome-wide transcriptomic and proteomic data showing differences between major macrophage phenotypes and non-activated macrophages (M0) is rapidly growing, questions regarding the mechanisms regulating gene and protein expression profiles in macrophages of different phenotypes still remain. We compiled lists of proteins associated with the macrophage phenotypes M1, M2a, M2b, M2c, and M2d (phenotype-associated proteins) and analyzed the data on potential mediators of macrophage polarization. Furthermore, using the computational system ANDSystem, we conducted a search and analysis of the relationships between potential regulatory proteins and the genes encoding the proteins associated with the M2 group phenotypes, obtaining estimates of the statistical significance of these relationships. The results indicate that the differences in the M2a, M2b, M2c, and M2d macrophage phenotypes may be attributed to the regulatory effects of the proteins JUN, IL8, NFAC2, CCND1, and YAP1. The expression levels of these proteins vary among the M2 group phenotypes, which in turn leads to different levels of gene expression associated with specific phenotypes.</p>","PeriodicalId":44339,"journal":{"name":"Vavilovskii Zhurnal Genetiki i Selektsii","volume":"29 7","pages":"990-999"},"PeriodicalIF":1.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12800646/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145991224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OrthoML2GO: homology-based protein function prediction using orthogroups and machine learning. OrthoML2GO:基于同源的蛋白质功能预测,使用正交群和机器学习。
IF 1 Q3 AGRICULTURE, MULTIDISCIPLINARY Pub Date : 2025-12-01 DOI: 10.18699/vjgb-25-119
E V Malyugin, D A Afonnikov

In recent years, the rapid growth of sequencing data has exacerbated the problem of functional annotation of protein sequences, as traditional homology-based methods face limitations when working with distant homologs, making it difficult to accurately determine protein functions. This paper introduces the OrthoML2GO method for protein function prediction, which integrates homology searches using the USEARCH algorithm, orthogroup analysis based on OrthoDB version 12.0, and a machine learning algorithm (gradient boosting). A key feature of our approach is the use of orthogroup information to account for the evolutionary and functional similarity of proteins and the application of machine learning to refine the assigned GO terms for the target sequence. To select the optimal algorithm for protein annotation, the following approaches were applied sequentially: the k-nearest neighbors (KNN) method; a method based on the annotation of the orthogroup most represented in the k-nearest homologs (OG); a method of verifying the GO terms identified in the previous stage using machine learning algorithms. A comparison of the prediction accuracy of GO terms using the OrthoML2GO method with the Blast2GO and PANNZER2 annotation programs was performed on sequence samples from both individual organisms (humans, Arabidopsis) and a combined sample represented by different taxa. Our results demonstrate that the proposed method is comparable to, and by some evaluation metrics outperforms, these existing methods in terms of the quality of protein function prediction, especially on large and heterogeneous samples of organisms. The greatest performance improvement is achieved by combining information about the closest homologs and orthogroups with verification of terms using machine learning methods. Our approach demonstrates high performance for large-scale automatic protein annotation, and prospects for further development include optimizing machine learning model parameters for specific biological tasks and integrating additional sources of structural and functional information, which will further improve the method's accuracy and versatility. In addition, the introduction of new bioinformatics tools and the expansion of the annotated protein database will contribute to the further improvement of the proposed approach.

近年来,测序数据的快速增长加剧了蛋白质序列的功能标注问题,传统的基于同源性的方法在处理远同源物时存在局限性,难以准确确定蛋白质的功能。本文介绍了用于蛋白质功能预测的OrthoML2GO方法,该方法集成了使用USEARCH算法的同源性搜索、基于OrthoDB version 12.0的正交群分析和机器学习算法(梯度增强)。我们方法的一个关键特征是使用正群信息来解释蛋白质的进化和功能相似性,并应用机器学习来优化目标序列的GO术语。为了选择最优的蛋白质注释算法,我们依次采用了以下几种方法:k近邻(KNN)方法;基于k近邻同系物(OG)中最具代表性的正群注释的方法;一种使用机器学习算法验证在前一阶段识别的GO术语的方法。利用OrthoML2GO方法与Blast2GO和PANNZER2注释程序对来自个体生物(人类、拟南芥)和不同分类群代表的组合样本的序列样本进行了GO项预测精度的比较。我们的研究结果表明,就蛋白质功能预测的质量而言,所提出的方法与这些现有方法相当,并且通过一些评估指标优于这些方法,特别是在大型和异质生物体样本上。最大的性能改进是通过使用机器学习方法将关于最接近的同系词和正群的信息与术语验证相结合来实现的。我们的方法证明了大规模自动蛋白质注释的高性能,进一步发展的前景包括优化特定生物任务的机器学习模型参数,整合额外的结构和功能信息源,这将进一步提高方法的准确性和通用性。此外,新的生物信息学工具的引入和注释蛋白数据库的扩展将有助于进一步改进所提出的方法。
{"title":"OrthoML2GO: homology-based protein function prediction using orthogroups and machine learning.","authors":"E V Malyugin, D A Afonnikov","doi":"10.18699/vjgb-25-119","DOIUrl":"https://doi.org/10.18699/vjgb-25-119","url":null,"abstract":"<p><p>In recent years, the rapid growth of sequencing data has exacerbated the problem of functional annotation of protein sequences, as traditional homology-based methods face limitations when working with distant homologs, making it difficult to accurately determine protein functions. This paper introduces the OrthoML2GO method for protein function prediction, which integrates homology searches using the USEARCH algorithm, orthogroup analysis based on OrthoDB version 12.0, and a machine learning algorithm (gradient boosting). A key feature of our approach is the use of orthogroup information to account for the evolutionary and functional similarity of proteins and the application of machine learning to refine the assigned GO terms for the target sequence. To select the optimal algorithm for protein annotation, the following approaches were applied sequentially: the k-nearest neighbors (KNN) method; a method based on the annotation of the orthogroup most represented in the k-nearest homologs (OG); a method of verifying the GO terms identified in the previous stage using machine learning algorithms. A comparison of the prediction accuracy of GO terms using the OrthoML2GO method with the Blast2GO and PANNZER2 annotation programs was performed on sequence samples from both individual organisms (humans, Arabidopsis) and a combined sample represented by different taxa. Our results demonstrate that the proposed method is comparable to, and by some evaluation metrics outperforms, these existing methods in terms of the quality of protein function prediction, especially on large and heterogeneous samples of organisms. The greatest performance improvement is achieved by combining information about the closest homologs and orthogroups with verification of terms using machine learning methods. Our approach demonstrates high performance for large-scale automatic protein annotation, and prospects for further development include optimizing machine learning model parameters for specific biological tasks and integrating additional sources of structural and functional information, which will further improve the method's accuracy and versatility. In addition, the introduction of new bioinformatics tools and the expansion of the annotated protein database will contribute to the further improvement of the proposed approach.</p>","PeriodicalId":44339,"journal":{"name":"Vavilovskii Zhurnal Genetiki i Selektsii","volume":"29 7","pages":"1145-1154"},"PeriodicalIF":1.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12799360/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145991280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computer modeling of spatial dynamics and primary genetic divergence for a population system in a ring areal. 环形区域内种群系统空间动力学和初级遗传分化的计算机模拟。
IF 1 Q3 AGRICULTURE, MULTIDISCIPLINARY Pub Date : 2025-12-01 DOI: 10.18699/vjgb-25-115
M P Kulakov, O L Zhdanova, E Ya Frisman

One of the main goals of modern evolutionary biology is to understand the mechanisms that lead to the initial differentiation (primary divergence) of populations into groups with genetic traits. This divergence requires reproductive isolation, which prevents or hinders contact and the exchange of genetic material between populations. This study explores the potential for isolation based not on obvious geographical barriers, population distance, or ecological specialization, but rather on hereditary mechanisms, such as gene drift and flow and selection against heterozygous individuals. To this end, we propose and investigate a dynamic discrete-time model that describes the dynamics of frequencies and numbers in a system of limited populations coupled by migrations. We consider a panmictic population with Mendelian inheritance rules, one-locus selection, and density-dependent factors limiting population growth. Individuals freely mate and randomly move around a one-dimensional ring-shaped habitat. The model was verified using data from an experiment on the box population system of Drosophila melanogaster performed by Yu.P. Altukhov et al. With rather simple assumptions, the model explains some mechanisms for the emergence and preservation of significant genetic differences between subpopulations (primary genetic divergence), accompanied by heterogeneity in allele frequencies and abundances within a homogeneous area. In this scenario, several large groups of genetically homogeneous subpopulations form and independently develop. Hybridization occurs at contact sites, and polymorphism is maintained through migration from genetically homogeneous nearby sites. It was found that only disruptive selection, directed against heterozygous individuals, can sustainably maintain such a spatial distribution. Under directional selection, divergence may occur for a short time as part of the transitional evolutionary process towards the best-adapted genotype. Because of the reduced adaptability of heterozygous (hybrid) individuals and low growth rates in these sites (hybrid zones), gene flow between adjacent sites with opposite genotypes (phenotypes) is significantly impeded. As a result, the hybrid zones can become effective geographical barriers that prevent the genetic flow between coupled subpopulations.

现代进化生物学的主要目标之一是了解导致种群初始分化(初级分化)为具有遗传特征的群体的机制。这种分化需要生殖隔离,这阻止或阻碍了种群之间的接触和遗传物质的交换。这项研究探索了隔离的可能性,这种隔离不是基于明显的地理障碍、种群距离或生态专门化,而是基于遗传机制,如基因漂变、基因流动和对杂合个体的选择。为此,我们提出并研究了一个动态离散时间模型,该模型描述了由迁移耦合的有限种群系统中频率和数量的动态。我们考虑一个具有孟德尔遗传规则、单位点选择和限制种群增长的密度依赖因子的泛型种群。个体可以自由交配,并在一维环形栖息地中随意移动。利用yup对黑腹果蝇箱形种群系统的实验数据对模型进行了验证。Altukhov等。该模型以相当简单的假设,解释了亚种群之间显著遗传差异(初级遗传差异)的出现和保存的一些机制,并伴随着等位基因频率和丰度在同一区域内的异质性。在这种情况下,几个遗传上同质的大群体形成并独立发展。杂交发生在接触位点,多态通过从基因同质的附近位点迁移而维持。研究发现,只有针对杂合个体的破坏性选择才能维持这样的空间分布。在定向选择下,作为向最佳适应基因型过渡进化过程的一部分,分化可能会在短时间内发生。由于杂合(杂交)个体的适应性降低以及这些位点(杂交区)的生长速度低,具有相反基因型(表型)的相邻位点之间的基因流动明显受阻。因此,杂交带可以成为有效的地理屏障,阻止耦合亚群体之间的遗传流动。
{"title":"Computer modeling of spatial dynamics and primary genetic divergence for a population system in a ring areal.","authors":"M P Kulakov, O L Zhdanova, E Ya Frisman","doi":"10.18699/vjgb-25-115","DOIUrl":"https://doi.org/10.18699/vjgb-25-115","url":null,"abstract":"<p><p>One of the main goals of modern evolutionary biology is to understand the mechanisms that lead to the initial differentiation (primary divergence) of populations into groups with genetic traits. This divergence requires reproductive isolation, which prevents or hinders contact and the exchange of genetic material between populations. This study explores the potential for isolation based not on obvious geographical barriers, population distance, or ecological specialization, but rather on hereditary mechanisms, such as gene drift and flow and selection against heterozygous individuals. To this end, we propose and investigate a dynamic discrete-time model that describes the dynamics of frequencies and numbers in a system of limited populations coupled by migrations. We consider a panmictic population with Mendelian inheritance rules, one-locus selection, and density-dependent factors limiting population growth. Individuals freely mate and randomly move around a one-dimensional ring-shaped habitat. The model was verified using data from an experiment on the box population system of Drosophila melanogaster performed by Yu.P. Altukhov et al. With rather simple assumptions, the model explains some mechanisms for the emergence and preservation of significant genetic differences between subpopulations (primary genetic divergence), accompanied by heterogeneity in allele frequencies and abundances within a homogeneous area. In this scenario, several large groups of genetically homogeneous subpopulations form and independently develop. Hybridization occurs at contact sites, and polymorphism is maintained through migration from genetically homogeneous nearby sites. It was found that only disruptive selection, directed against heterozygous individuals, can sustainably maintain such a spatial distribution. Under directional selection, divergence may occur for a short time as part of the transitional evolutionary process towards the best-adapted genotype. Because of the reduced adaptability of heterozygous (hybrid) individuals and low growth rates in these sites (hybrid zones), gene flow between adjacent sites with opposite genotypes (phenotypes) is significantly impeded. As a result, the hybrid zones can become effective geographical barriers that prevent the genetic flow between coupled subpopulations.</p>","PeriodicalId":44339,"journal":{"name":"Vavilovskii Zhurnal Genetiki i Selektsii","volume":"29 7","pages":"1109-1121"},"PeriodicalIF":1.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12799358/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145991292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Silencing of the Nicotiana benthamiana phytoendesaturase gene by root treatment of exogenous dsRNA. 外源dsRNA根处理对烟叶植物端胞饱和酶基因的沉默作用。
IF 1 Q3 AGRICULTURE, MULTIDISCIPLINARY Pub Date : 2025-12-01 DOI: 10.18699/vjgb-25-123
Т S Golubeva, V A Cherenko, E A Filipenko, I V Zhirnov, A A Ivanov, A V Kochetov

RNA interference (RNAi) is a powerful tool for gene silencing. It has recently been used to design promising plant protection strategies against pests such as viruses, insects, etc. This generally requires modifying the plant genome to achieve in planta synthesis of the double-stranded RNA (dsRNA), which guides the cellular RNA interference machinery to silence the genes of interest. However, given Russian legislation, the approach in which dsRNA is synthesized by the plant itself remains unavailable for crop protection. The use of exogenously produced dsRNA appears to be a promising alternative, allowing researchers to avoid genetic modification of plants, making it possible to implement potential results in agriculture. Furthermore, exogenous dsRNAs are superior to chemical pesticides (fungicides, insecticides, etc.), which are widely used to control various plant diseases. The dsRNA acts through sequence-specific nucleic acid interactions, making it extremely selective and unlikely to harm off-target organisms. Thus, it seems promising to utilize RNAi technology for agricultural plant protection. In this case, questions arise regarding how to produce the required amounts of pathogen-specific exogenous dsRNA, and which delivery method will be optimal for providing sufficient protection. This work aims to utilize exogenous dsRNA to silence the Nicotiana benthamiana phytoene desaturase gene. Phytoene desaturase is a convenient model gene in gene silencing experiments, as its knockdown results in a distinct phenotypic manifestation, namely, leaf bleaching. The dsRNA synthesis for this work was performed in vivo in Escherichia coli cells, and the chosen delivery method was root treatment through watering, both techniques being as simple and accessible as possible. It is surmised that the proposed approach could be adapted for broader use of RNAi technologies in agricultural crop protection.

RNA干扰(RNAi)是基因沉默的有力工具。它最近被用于设计有前途的植物保护策略,以对抗诸如病毒、昆虫等害虫。这通常需要修改植物基因组来实现植物双链RNA (dsRNA)的合成,它引导细胞RNA干扰机制沉默感兴趣的基因。然而,鉴于俄罗斯的立法,由植物自身合成dsRNA的方法仍然无法用于作物保护。使用外源产生的dsRNA似乎是一种很有前途的替代方法,使研究人员能够避免对植物进行基因改造,从而有可能在农业中实现潜在的成果。此外,外源dsRNAs优于化学农药(杀菌剂、杀虫剂等),被广泛用于防治各种植物病害。dsRNA通过序列特异性核酸相互作用起作用,使其具有极强的选择性,不太可能伤害非目标生物。因此,RNAi技术在农业植物保护中的应用前景广阔。在这种情况下,关于如何产生所需数量的病原体特异性外源性dsRNA以及哪种递送方法将是提供充分保护的最佳方法的问题就出现了。本研究旨在利用外源dsRNA沉默本烟烯去饱和酶基因。植物烯去饱和酶是基因沉默实验中一种方便的模式基因,因为它的敲除会导致叶片白化。本研究的dsRNA合成是在大肠杆菌细胞中进行的,所选择的传递方法是通过浇水的根处理,这两种技术都是尽可能简单和容易的。据推测,该方法可以适用于RNAi技术在农业作物保护中的广泛应用。
{"title":"Silencing of the Nicotiana benthamiana phytoendesaturase gene by root treatment of exogenous dsRNA.","authors":"Т S Golubeva, V A Cherenko, E A Filipenko, I V Zhirnov, A A Ivanov, A V Kochetov","doi":"10.18699/vjgb-25-123","DOIUrl":"https://doi.org/10.18699/vjgb-25-123","url":null,"abstract":"<p><p>RNA interference (RNAi) is a powerful tool for gene silencing. It has recently been used to design promising plant protection strategies against pests such as viruses, insects, etc. This generally requires modifying the plant genome to achieve in planta synthesis of the double-stranded RNA (dsRNA), which guides the cellular RNA interference machinery to silence the genes of interest. However, given Russian legislation, the approach in which dsRNA is synthesized by the plant itself remains unavailable for crop protection. The use of exogenously produced dsRNA appears to be a promising alternative, allowing researchers to avoid genetic modification of plants, making it possible to implement potential results in agriculture. Furthermore, exogenous dsRNAs are superior to chemical pesticides (fungicides, insecticides, etc.), which are widely used to control various plant diseases. The dsRNA acts through sequence-specific nucleic acid interactions, making it extremely selective and unlikely to harm off-target organisms. Thus, it seems promising to utilize RNAi technology for agricultural plant protection. In this case, questions arise regarding how to produce the required amounts of pathogen-specific exogenous dsRNA, and which delivery method will be optimal for providing sufficient protection. This work aims to utilize exogenous dsRNA to silence the Nicotiana benthamiana phytoene desaturase gene. Phytoene desaturase is a convenient model gene in gene silencing experiments, as its knockdown results in a distinct phenotypic manifestation, namely, leaf bleaching. The dsRNA synthesis for this work was performed in vivo in Escherichia coli cells, and the chosen delivery method was root treatment through watering, both techniques being as simple and accessible as possible. It is surmised that the proposed approach could be adapted for broader use of RNAi technologies in agricultural crop protection.</p>","PeriodicalId":44339,"journal":{"name":"Vavilovskii Zhurnal Genetiki i Selektsii","volume":"29 8","pages":"1169-1175"},"PeriodicalIF":1.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12876927/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146144128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Frame-based mathematical models - a tool for the study of molecular genetic systems. 基于框架的数学模型——研究分子遗传系统的工具。
IF 1 Q3 AGRICULTURE, MULTIDISCIPLINARY Pub Date : 2025-12-01 DOI: 10.18699/vjgb-25-135
F V Kazantsev, S A Lashin, Yu G Matushkin

This paper reviews existing approaches for reconstructing frame-based mathematical models of molecular genetic systems from the level of genetic synthesis to models of metabolic networks. A frame-based mathematical model is a model in which the following terms are specified: formal structure, type of mathematical model for a particular biochemical process, reactants and their roles. Typically, such models are generated automatically on the basis of description of biological processes in terms of domain-specific languages. For molecular genetic systems, these languages use constructions familiar to a wide range of biologists in the form of a list of biochemical reactions. They rely on the concepts of elementary subsystems, where complex models are assembled from small block units ("frames"). In this paper, we have shown an example with the generation of a classical repressilator model consisting of three genes that mutually inhibit each other's synthesis. We have given it in three different versions of the graphic standard, its characteristic mathematical interpretation and variants of its numerical calculation. We have shown that even at the level of frame models it is possible to identify qualitatively new behaviour of the model through the introduction of just one gene into the model structure. This change provides a way to control the modes of behaviour of the model through changing the concentrations of reactants. The frame-based approach opens the way to generate models of cells, tissues, organs, organisms and communities through frame-based model generation tools that specify structure, roles of modelled reactants using domain-specific languages and graphical methods of model specification.

从遗传合成水平到代谢网络模型,综述了现有的基于框架的分子遗传系统数学模型重建方法。基于框架的数学模型是一种模型,其中指定了以下术语:形式结构、特定生化过程的数学模型类型、反应物及其作用。通常,这样的模型是根据领域特定语言对生物过程的描述自动生成的。对于分子遗传系统,这些语言以生物化学反应列表的形式使用广泛的生物学家所熟悉的结构。它们依赖于基本子系统的概念,其中复杂的模型由小块单元(“框架”)组装而成。在本文中,我们展示了一个由三个相互抑制合成的基因组成的经典再抑制子模型的例子。我们给出了三种不同版本的图形标准,其特征的数学解释和数值计算的变体。我们已经证明,即使在框架模型的水平上,也有可能通过将一个基因引入模型结构来定性地识别模型的新行为。这种变化提供了一种通过改变反应物浓度来控制模型行为模式的方法。基于框架的方法通过基于框架的模型生成工具开辟了生成细胞、组织、器官、生物体和群落模型的途径,这些工具使用领域特定语言和模型规范的图形方法指定结构、建模反应物的角色。
{"title":"Frame-based mathematical models - a tool for the study of molecular genetic systems.","authors":"F V Kazantsev, S A Lashin, Yu G Matushkin","doi":"10.18699/vjgb-25-135","DOIUrl":"https://doi.org/10.18699/vjgb-25-135","url":null,"abstract":"<p><p>This paper reviews existing approaches for reconstructing frame-based mathematical models of molecular genetic systems from the level of genetic synthesis to models of metabolic networks. A frame-based mathematical model is a model in which the following terms are specified: formal structure, type of mathematical model for a particular biochemical process, reactants and their roles. Typically, such models are generated automatically on the basis of description of biological processes in terms of domain-specific languages. For molecular genetic systems, these languages use constructions familiar to a wide range of biologists in the form of a list of biochemical reactions. They rely on the concepts of elementary subsystems, where complex models are assembled from small block units (\"frames\"). In this paper, we have shown an example with the generation of a classical repressilator model consisting of three genes that mutually inhibit each other's synthesis. We have given it in three different versions of the graphic standard, its characteristic mathematical interpretation and variants of its numerical calculation. We have shown that even at the level of frame models it is possible to identify qualitatively new behaviour of the model through the introduction of just one gene into the model structure. This change provides a way to control the modes of behaviour of the model through changing the concentrations of reactants. The frame-based approach opens the way to generate models of cells, tissues, organs, organisms and communities through frame-based model generation tools that specify structure, roles of modelled reactants using domain-specific languages and graphical methods of model specification.</p>","PeriodicalId":44339,"journal":{"name":"Vavilovskii Zhurnal Genetiki i Selektsii","volume":"25 8","pages":"1288-1294"},"PeriodicalIF":1.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12876924/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146144143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Study of the influence of introgression from chromosome 2 of the At subgenome of cotton Gossypium barbadense L. during backcrossing with the original lines of G. hirsutum L. 巴巴多斯棉At亚基因组2号染色体渐渗对棉原系回交影响的研究。
IF 1 Q3 AGRICULTURE, MULTIDISCIPLINARY Pub Date : 2025-12-01 DOI: 10.18699/vjgb-25-125
M F Sanamyan, Sh U Bobokhujayev, Sh S Abdukarimov, J S Uralov, A B Rustamov

The creation of chromosome substitution lines containing one pair of chromosomes from a related species is one method for introgression of alien genetic material. The frequency of substitutions in different chromosomes of the genome varies due to the selective transmission of alien chromosomes through the gametes of hybrids. The use of monosomic lines with identified univalent chromosomes and molecular genetic SSR markers at the seedling stage allowed rapid screening of the identity of the alien chromosome in backcross hybrids, significantly accelerating and facilitating the backcrossing process for the creation of new chromosome substitution cotton lines. As a result of studying the process of transmission of chromosome 2 of the At subgenome of the cotton plant G. barbadense L. during backcrossing of four original monosomic lines of G. hirsutum L. with monosomic backcross hybrids with substitution of chromosome 2 of the At subgenome, the following specific consequences of the introgression of this chromosome were revealed: decreased crossability, setting and germination of hybrid seeds; differences in the frequency and nature of transmission of chromosome 2 of the At subgenome of the cotton plant G. barbadensе; regularity of chromosome behavior in meiosis; a high meiotic index; a significant decrease in pollen fertility in backcross monosomic hybrids BC1F1; specific morphobiological characteristics of monosomic backcrossed plants, such as delayed development of vegetative and generative organs; dwarfism; reduced foliage; and poor budding and flowering during the first year of vegetation. All of these factors negatively impact the study and backcrossing of monosomic hybrids and significantly complicate and delay the creation of chromosome-substituted forms concerning chromosome 2 of the At subgenome of cotton, G. barbadense. These specific changes likely occurred as a result of hybrid genome reorganization and introgression of alien chromatin. Furthermore, the effectiveness of using molecular genetic microsatellite (SSR) markers to monitor backcrossing processes and eliminate genetic material from the Pima 3-79 donor line of G. barbadense for the selection of genotypes with alien chromosome substitutions has been demonstrated.

建立含有亲缘物种的一对染色体的染色体替代系是外来遗传物质渗入的一种方法。由于异种染色体通过杂交配子的选择性传递,基因组中不同染色体的替换频率各不相同。在苗期利用单价染色体鉴定的单体系和分子遗传SSR标记,可以快速筛选回交杂交种的异源染色体身份,大大加快和促进了回交过程,从而创造新的染色体代换棉花品系。通过对4个原棉花单染色体系与替换了At亚基因组2号染色体的单染色体回交杂交种进行回交时,巴贝多棉花At亚基因组2号染色体的传递过程进行研究,揭示了该染色体渗入的具体后果:杂种种子的亲和性、结实率和发芽率下降;棉花At亚基因组2号染色体遗传频率和性质的差异减数分裂中染色体行为的规律性减数分裂指数高;回交单体杂种BC1F1花粉育性显著降低;单体回交植物的特殊形态生物学特征,如营养器官和生殖器官发育的延迟;侏儒症;减少叶;在植被的第一年,萌芽和开花都很差。所有这些因素都对单体杂交的研究和回交产生不利影响,并使巴贝多棉花At亚基因组2号染色体取代形式的产生复杂化和延迟。这些特定的变化可能是杂交基因组重组和外来染色质渗入的结果。此外,利用分子遗传微卫星(SSR)标记监测巴巴多斯巴巴多斯Pima 3-79供体遗传物质的回交过程和遗传物质的消除,对外源染色体置换基因型的选择具有有效性。
{"title":"Study of the influence of introgression from chromosome 2 of the At subgenome of cotton Gossypium barbadense L. during backcrossing with the original lines of G. hirsutum L.","authors":"M F Sanamyan, Sh U Bobokhujayev, Sh S Abdukarimov, J S Uralov, A B Rustamov","doi":"10.18699/vjgb-25-125","DOIUrl":"https://doi.org/10.18699/vjgb-25-125","url":null,"abstract":"<p><p>The creation of chromosome substitution lines containing one pair of chromosomes from a related species is one method for introgression of alien genetic material. The frequency of substitutions in different chromosomes of the genome varies due to the selective transmission of alien chromosomes through the gametes of hybrids. The use of monosomic lines with identified univalent chromosomes and molecular genetic SSR markers at the seedling stage allowed rapid screening of the identity of the alien chromosome in backcross hybrids, significantly accelerating and facilitating the backcrossing process for the creation of new chromosome substitution cotton lines. As a result of studying the process of transmission of chromosome 2 of the At subgenome of the cotton plant G. barbadense L. during backcrossing of four original monosomic lines of G. hirsutum L. with monosomic backcross hybrids with substitution of chromosome 2 of the At subgenome, the following specific consequences of the introgression of this chromosome were revealed: decreased crossability, setting and germination of hybrid seeds; differences in the frequency and nature of transmission of chromosome 2 of the At subgenome of the cotton plant G. barbadensе; regularity of chromosome behavior in meiosis; a high meiotic index; a significant decrease in pollen fertility in backcross monosomic hybrids BC1F1; specific morphobiological characteristics of monosomic backcrossed plants, such as delayed development of vegetative and generative organs; dwarfism; reduced foliage; and poor budding and flowering during the first year of vegetation. All of these factors negatively impact the study and backcrossing of monosomic hybrids and significantly complicate and delay the creation of chromosome-substituted forms concerning chromosome 2 of the At subgenome of cotton, G. barbadense. These specific changes likely occurred as a result of hybrid genome reorganization and introgression of alien chromatin. Furthermore, the effectiveness of using molecular genetic microsatellite (SSR) markers to monitor backcrossing processes and eliminate genetic material from the Pima 3-79 donor line of G. barbadense for the selection of genotypes with alien chromosome substitutions has been demonstrated.</p>","PeriodicalId":44339,"journal":{"name":"Vavilovskii Zhurnal Genetiki i Selektsii","volume":"29 8","pages":"1184-1194"},"PeriodicalIF":1.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12876928/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146144171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-learning virtual organisms in a physics simulator: on the optimal resolution of their visual system, the architecture of the nervous system and the computational complexity of the problem. 物理模拟器中的自我学习虚拟生物体:关于其视觉系统的最佳分辨率,神经系统的结构和问题的计算复杂性。
IF 1 Q3 AGRICULTURE, MULTIDISCIPLINARY Pub Date : 2025-12-01 DOI: 10.18699/vjgb-25-110
M S Zenin, A P Devyaterikov, A Yu Palyanov

Vision plays a key role in the lives of various organisms, enabling spatial orientation, foraging, predator avoidance and social interaction. In species with relatively simple visual systems, such as insects, effective behavioral strategies are achieved through high neural specialization, adaptation to specific environmental conditions, and the use of additional sensory systems such as olfaction or hearing. Animals with more complex vision and nervous systems, such as mammals, have greater cognitive abilities and flexibility, but this comes with increased demands on the brain's energy costs and computational resources. Modeling the features of such systems in a virtual environment could allow researchers to explore the fundamental principles of sensorimotor integration and the limits of cognitive complexity, as well as test hypotheses about the interaction between perception, memory and decision-making mechanisms. In this work, we implement and investigate a model of virtual organisms with a visual system operating in a three-dimensional physical environment using the Unity ML-Agents software - one of the most high-performance simulation platforms currently available. We propose a hierarchical control architecture that separates locomotion and navigation tasks between two modules: (1) visual perception and decision-making, and (2) coordinated control of limb movement for locomotion in the physical environment. A series of numerical experiments was conducted to examine the influence of visual system parameters (e. g, resolution of the "first-person" view), environmental configuration and agent architectural features on the efficiency and outcomes of reinforcement learning (using the PPO algorithm). The results demonstrate the existence of an optimal range of resolutions that provide a trade-off between computational complexity and success in accomplishing the task, while excessive dimensionality of sensory inputs or action space leads to slower learning. We performed system performance profiling and identified key bottlenecks in large-scale simulations. The discussion considers biological parallels, highlighting cases of high behavioral efficiency in insects with relatively low-resolution visual systems, and the potential of neuroevolutionary approaches for adapting agent architectures. The proposed approach and the results obtained are of potential interest to researchers working on biologically inspired artificial agents, evolutionary modeling, and the study of cognitive processes in artificial systems.

视觉在各种生物的生活中起着关键作用,使空间定位,觅食,捕食者躲避和社会互动。在视觉系统相对简单的物种中,如昆虫,有效的行为策略是通过高度的神经特化、对特定环境条件的适应以及使用额外的感觉系统(如嗅觉或听觉)来实现的。具有更复杂的视觉和神经系统的动物,如哺乳动物,具有更强的认知能力和灵活性,但这对大脑的能量消耗和计算资源的需求也在增加。在虚拟环境中对这些系统的特征进行建模,可以让研究人员探索感觉运动整合的基本原理和认知复杂性的极限,以及测试关于感知、记忆和决策机制之间相互作用的假设。在这项工作中,我们使用Unity ML-Agents软件(目前可用的高性能仿真平台之一)实现和研究了一个具有在三维物理环境中操作的视觉系统的虚拟生物模型。我们提出了一种分层控制架构,将运动和导航任务分离为两个模块:(1)视觉感知和决策;(2)肢体运动在物理环境中的协调控制。通过一系列数值实验,研究了视觉系统参数(如“第一人称”视角的分辨率)、环境配置和智能体架构特征对强化学习(使用PPO算法)的效率和结果的影响。结果表明,存在一个最佳分辨率范围,在计算复杂性和完成任务的成功之间提供权衡,而过度的感官输入或动作空间会导致学习速度变慢。我们执行了系统性能分析,并确定了大规模模拟中的关键瓶颈。讨论考虑了生物学上的相似之处,突出了具有相对低分辨率视觉系统的昆虫的高行为效率的案例,以及适应主体结构的神经进化方法的潜力。所提出的方法和所获得的结果对从事生物学启发的人工智能体、进化建模和人工系统认知过程研究的研究人员具有潜在的兴趣。
{"title":"Self-learning virtual organisms in a physics simulator: on the optimal resolution of their visual system, the architecture of the nervous system and the computational complexity of the problem.","authors":"M S Zenin, A P Devyaterikov, A Yu Palyanov","doi":"10.18699/vjgb-25-110","DOIUrl":"https://doi.org/10.18699/vjgb-25-110","url":null,"abstract":"<p><p>Vision plays a key role in the lives of various organisms, enabling spatial orientation, foraging, predator avoidance and social interaction. In species with relatively simple visual systems, such as insects, effective behavioral strategies are achieved through high neural specialization, adaptation to specific environmental conditions, and the use of additional sensory systems such as olfaction or hearing. Animals with more complex vision and nervous systems, such as mammals, have greater cognitive abilities and flexibility, but this comes with increased demands on the brain's energy costs and computational resources. Modeling the features of such systems in a virtual environment could allow researchers to explore the fundamental principles of sensorimotor integration and the limits of cognitive complexity, as well as test hypotheses about the interaction between perception, memory and decision-making mechanisms. In this work, we implement and investigate a model of virtual organisms with a visual system operating in a three-dimensional physical environment using the Unity ML-Agents software - one of the most high-performance simulation platforms currently available. We propose a hierarchical control architecture that separates locomotion and navigation tasks between two modules: (1) visual perception and decision-making, and (2) coordinated control of limb movement for locomotion in the physical environment. A series of numerical experiments was conducted to examine the influence of visual system parameters (e. g, resolution of the \"first-person\" view), environmental configuration and agent architectural features on the efficiency and outcomes of reinforcement learning (using the PPO algorithm). The results demonstrate the existence of an optimal range of resolutions that provide a trade-off between computational complexity and success in accomplishing the task, while excessive dimensionality of sensory inputs or action space leads to slower learning. We performed system performance profiling and identified key bottlenecks in large-scale simulations. The discussion considers biological parallels, highlighting cases of high behavioral efficiency in insects with relatively low-resolution visual systems, and the potential of neuroevolutionary approaches for adapting agent architectures. The proposed approach and the results obtained are of potential interest to researchers working on biologically inspired artificial agents, evolutionary modeling, and the study of cognitive processes in artificial systems.</p>","PeriodicalId":44339,"journal":{"name":"Vavilovskii Zhurnal Genetiki i Selektsii","volume":"29 7","pages":"1051-1061"},"PeriodicalIF":1.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12795856/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145970907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Linking hierarchical classification of transcription factors by the structure of their DNA-binding domains to the variability of their binding site motifs. 通过其dna结合域的结构将转录因子的等级分类与其结合位点基序的可变性联系起来。
IF 1 Q3 AGRICULTURE, MULTIDISCIPLINARY Pub Date : 2025-12-01 DOI: 10.18699/vjgb-25-99
V G Levitsky, T Yu Vatolina, V V Raditsa

De novo motif search is the main approach for determining the nucleotide specificity of binding of the key regulators of gene transcription, transcription factors (TFs), based on data from massive genome-wide sequencing of their binding site regions in vivo, such as ChIP-seq. The number of motifs of known TF binding sites (TFBSs) has increased several times in recent years. Due to the similarity in the structure of the DNA-binding domains of TFs, many structurally cognate TFs have similar and sometimes almost indistinguishable binding site motifs. The classification of TFs by the structure of the DNA-binding domains from the TFClass database defines the top levels of the hierarchy (superclasses and classes of TFs) by the structure of these domains, and the next levels (families and subfamilies of TFs) by the alignments of amino acid sequences of domains. However, this classification does not take into account the similarity of TFBS motifs, whereas identification of valid TFs from massive sequencing data of TFBSs, such as ChIP- seq, requires working with TFBS motifs rather than TFs themselves. Therefore, in this study we extracted from the Hocomoco and Jaspar databases the TFBS motifs for human and fruit fly Drosophila melanogaster, and considered the pairwise similarity of binding site motifs of cognate TFs according to their classification from the TFClass database. We have shown that the common tree of the TF hierarchy by the structure of DNA-binding domains can be split into separate branches representing non-overlapping sets of TFs. Within each branch, the majority of TF pairs have significantly similar binding site motifs. Each branch can include one or more sister elementary units of the hierarchy and all its/their lower levels: one or more TFs of the same subfamily, or the whole subfamily, one or several subfamilies of the same family, an entire family, etc., up to the entire class. Analysis of the seven largest human and two largest Drosophila TF classes showed that the similarity of TFs in terms of TFBS motifs for different corresponding levels (classes, families) is noticeably different. Supplementing the hierarchical classification of TFs with branches combining significantly similar motifs of TFBSs can increase the efficiency of identifying involved TFs through enriched motifs detected by de novo motif search for massive sequencing data of TFBSs from the ChIP-seq technology.

De novo motif search是确定基因转录关键调控因子转录因子(transcription factors, TFs)结合核苷酸特异性的主要方法,它基于大量体内转录因子结合位点区域的全基因组测序数据,如ChIP-seq。近年来,已知TF结合位点(TFBSs)的基序数量增加了几倍。由于tf的dna结合域结构相似,许多结构同源的tf具有相似的,有时几乎无法区分的结合位点基序。根据TFClass数据库中dna结合结构域的结构对tf进行分类,根据这些结构域的结构定义了tf的上层(超类和类),根据结构域的氨基酸序列比对定义了tf的下一级(家族和亚家族)。然而,这种分类没有考虑到TFBS基序的相似性,而从大量的TFBS测序数据(如ChIP- seq)中识别有效的TFBS,需要使用TFBS基序而不是tffs本身。因此,本研究从Hocomoco和jasar数据库中提取了人类和果蝇的TFBS基序,并根据TFClass数据库中的分类考虑同源tf结合位点基序的两两相似性。我们已经证明,由dna结合域结构构成的TF层次结构的共同树可以分裂成代表非重叠TF集合的单独分支。在每个分支中,大多数TF对具有显著相似的结合位点基序。每个分支可以包括层次结构的一个或多个姐妹基本单位及其所有较低的层次:同一亚族的一个或多个tf,或整个亚族,同一家族的一个或几个亚族,整个家族,等等,直到整个类。对7个最大的人类TF类和2个最大的果蝇TF类的分析表明,不同相应水平(类、科)的TF在TFBS基序方面的相似性有显著差异。利用ChIP-seq技术对大量的TFBSs测序数据进行从头基序搜索,检测到丰富的基序,通过结合显著相似基序的分支来补充tffs的分层分类,可以提高识别相关tf的效率。
{"title":"Linking hierarchical classification of transcription factors by the structure of their DNA-binding domains to the variability of their binding site motifs.","authors":"V G Levitsky, T Yu Vatolina, V V Raditsa","doi":"10.18699/vjgb-25-99","DOIUrl":"https://doi.org/10.18699/vjgb-25-99","url":null,"abstract":"<p><p>De novo motif search is the main approach for determining the nucleotide specificity of binding of the key regulators of gene transcription, transcription factors (TFs), based on data from massive genome-wide sequencing of their binding site regions in vivo, such as ChIP-seq. The number of motifs of known TF binding sites (TFBSs) has increased several times in recent years. Due to the similarity in the structure of the DNA-binding domains of TFs, many structurally cognate TFs have similar and sometimes almost indistinguishable binding site motifs. The classification of TFs by the structure of the DNA-binding domains from the TFClass database defines the top levels of the hierarchy (superclasses and classes of TFs) by the structure of these domains, and the next levels (families and subfamilies of TFs) by the alignments of amino acid sequences of domains. However, this classification does not take into account the similarity of TFBS motifs, whereas identification of valid TFs from massive sequencing data of TFBSs, such as ChIP- seq, requires working with TFBS motifs rather than TFs themselves. Therefore, in this study we extracted from the Hocomoco and Jaspar databases the TFBS motifs for human and fruit fly Drosophila melanogaster, and considered the pairwise similarity of binding site motifs of cognate TFs according to their classification from the TFClass database. We have shown that the common tree of the TF hierarchy by the structure of DNA-binding domains can be split into separate branches representing non-overlapping sets of TFs. Within each branch, the majority of TF pairs have significantly similar binding site motifs. Each branch can include one or more sister elementary units of the hierarchy and all its/their lower levels: one or more TFs of the same subfamily, or the whole subfamily, one or several subfamilies of the same family, an entire family, etc., up to the entire class. Analysis of the seven largest human and two largest Drosophila TF classes showed that the similarity of TFs in terms of TFBS motifs for different corresponding levels (classes, families) is noticeably different. Supplementing the hierarchical classification of TFs with branches combining significantly similar motifs of TFBSs can increase the efficiency of identifying involved TFs through enriched motifs detected by de novo motif search for massive sequencing data of TFBSs from the ChIP-seq technology.</p>","PeriodicalId":44339,"journal":{"name":"Vavilovskii Zhurnal Genetiki i Selektsii","volume":"29 7","pages":"925-939"},"PeriodicalIF":1.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12795858/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145971325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of interactions between the SARS-CoV-2 ORF3a protein and small-molecule ligands using the ANDSystem cognitive platform, graph neural networks, and molecular modeling. 利用ANDSystem认知平台、图神经网络和分子模型预测SARS-CoV-2 ORF3a蛋白与小分子配体的相互作用
IF 1 Q3 AGRICULTURE, MULTIDISCIPLINARY Pub Date : 2025-12-01 DOI: 10.18699/vjgb-25-113
T V Ivanisenko, P S Demenkov, M A Kleshchev, V A Ivanisenko

In recent years, artificial intelligence methods based on the analysis of heterogeneous graphs of biomedical networks have become widely used for predicting molecular interactions. In particular, graph neural networks (GNNs) effectively identify missing edges in gene networks - such as protein-protein interaction, gene-disease, drug-target, and other networks - thereby enabling the prediction of new biological relationships. To reconstruct gene networks, cognitive systems for automatic text mining of scientific publications and databases are often employed. One such AI-driven platform, ANDSystem, is designed for automatic knowledge extraction of molecular interactions and, on this basis, the reconstruction of associative gene networks. The ANDSystem knowledge base contains information on more than 100 million interactions among diverse molecular genetic entities (genes, proteins, metabolites, drugs, etc.). The interactions span a wide range of types: regulatory relationships, physical interactions (protein-protein, protein-ligand), catalytic and chemical reactions, and associations among genes, phenotypes, diseases, and more. In the present study, we applied attention-based graph neural networks trained on the ANDSystem knowledge graph to predict new edges between proteins and ligands and to identify potential ligands for the SARS-CoV-2 ORF3a protein. The accessory protein ORF3a plays an important role in viral pathogenesis through ion-channel activity, induction of apoptosis, and the ability to modulate endolysosomal processes and the host innate immune response. Despite this broad functional spectrum, ORF3a has been explored far less as a pharmacological target than other viral proteins. Using a graph neural network, we predicted five small molecules of different origins (metabolites and a drug) that potentially interact with ORF3a: N-acetyl-D-glucosamine, 4-(benzoylamino)benzoic acid, austocystin D, bictegravirum, and L-threonine. Molecular docking and MM/GBSA affinity estimation indicate the potential ability of these compounds to form complexes with ORF3a. Localization analysis showed that the binding sites of bictegravir and 4-(benzoylamino)benzoic acid lie in a cytosolic surface pocket of the protein that is solvent-exposed; L-threonine binds within the intersubunit cleft of the dimer; and austocystin D and N-acetyl-D-glucosamine are positioned at the boundary between the cytosolic surface and the transmembrane region. The accessibility of these binding sites may be reduced by the influence of the lipid bilayer. The binding energetics for bictegravirum were more favorable than for 4-(benzoylamino)benzoic acid (docking score -7.37 kcal/mol; MM/GBSA ΔG -14.71 ± 3.12 kcal/mol), making bictegravirum a promising candidate for repurposing as an ORF3a inhibitor.

近年来,基于生物医学网络异构图分析的人工智能方法已被广泛用于预测分子相互作用。特别是,图神经网络(gnn)有效地识别基因网络中缺失的边缘-例如蛋白质-蛋白质相互作用,基因-疾病,药物靶点和其他网络-从而能够预测新的生物关系。为了重建基因网络,经常使用科学出版物和数据库的自动文本挖掘认知系统。其中一个人工智能驱动的平台ANDSystem是为分子相互作用的自动知识提取而设计的,并在此基础上重建关联基因网络。ANDSystem知识库包含不同分子遗传实体(基因、蛋白质、代谢物、药物等)之间超过1亿种相互作用的信息。相互作用跨越了广泛的类型:调节关系,物理相互作用(蛋白质-蛋白质,蛋白质-配体),催化和化学反应,以及基因,表型,疾病等之间的关联。在本研究中,我们应用在ANDSystem知识图上训练的基于注意力的图神经网络来预测蛋白质和配体之间的新边缘,并识别SARS-CoV-2 ORF3a蛋白的潜在配体。辅助蛋白ORF3a通过离子通道活性、诱导细胞凋亡以及调节内溶酶体过程和宿主先天免疫反应的能力,在病毒发病过程中发挥重要作用。尽管ORF3a具有广泛的功能谱,但与其他病毒蛋白相比,ORF3a作为药理学靶点的研究远远不够。利用图神经网络,我们预测了可能与ORF3a相互作用的5种不同来源的小分子(代谢物和药物):n -乙酰基-D-葡萄糖胺、4-(苯甲酰氨基)苯甲酸、austocystin D、bictegravirum和l-苏氨酸。分子对接和MM/GBSA亲和估计表明这些化合物具有与ORF3a形成配合物的潜在能力。定位分析表明,比替格拉韦和4-(苯甲酰氨基)苯甲酸的结合位点位于溶剂暴露的蛋白胞质表面口袋中;l -苏氨酸在二聚体的亚基间隙内结合;缩囊素D和n -乙酰-D-氨基葡萄糖位于细胞质表面和跨膜区域之间的边界。脂质双分子层的影响可能会降低这些结合位点的可及性。bictegravirum的结合能比4-(苯甲酰胺)苯甲酸更有利(对接分数-7.37 kcal/mol; MM/GBSA ΔG -14.71±3.12 kcal/mol),使bictegravirum成为ORF3a抑制剂的一个有希望的候选者。
{"title":"Prediction of interactions between the SARS-CoV-2 ORF3a protein and small-molecule ligands using the ANDSystem cognitive platform, graph neural networks, and molecular modeling.","authors":"T V Ivanisenko, P S Demenkov, M A Kleshchev, V A Ivanisenko","doi":"10.18699/vjgb-25-113","DOIUrl":"https://doi.org/10.18699/vjgb-25-113","url":null,"abstract":"<p><p>In recent years, artificial intelligence methods based on the analysis of heterogeneous graphs of biomedical networks have become widely used for predicting molecular interactions. In particular, graph neural networks (GNNs) effectively identify missing edges in gene networks - such as protein-protein interaction, gene-disease, drug-target, and other networks - thereby enabling the prediction of new biological relationships. To reconstruct gene networks, cognitive systems for automatic text mining of scientific publications and databases are often employed. One such AI-driven platform, ANDSystem, is designed for automatic knowledge extraction of molecular interactions and, on this basis, the reconstruction of associative gene networks. The ANDSystem knowledge base contains information on more than 100 million interactions among diverse molecular genetic entities (genes, proteins, metabolites, drugs, etc.). The interactions span a wide range of types: regulatory relationships, physical interactions (protein-protein, protein-ligand), catalytic and chemical reactions, and associations among genes, phenotypes, diseases, and more. In the present study, we applied attention-based graph neural networks trained on the ANDSystem knowledge graph to predict new edges between proteins and ligands and to identify potential ligands for the SARS-CoV-2 ORF3a protein. The accessory protein ORF3a plays an important role in viral pathogenesis through ion-channel activity, induction of apoptosis, and the ability to modulate endolysosomal processes and the host innate immune response. Despite this broad functional spectrum, ORF3a has been explored far less as a pharmacological target than other viral proteins. Using a graph neural network, we predicted five small molecules of different origins (metabolites and a drug) that potentially interact with ORF3a: N-acetyl-D-glucosamine, 4-(benzoylamino)benzoic acid, austocystin D, bictegravirum, and L-threonine. Molecular docking and MM/GBSA affinity estimation indicate the potential ability of these compounds to form complexes with ORF3a. Localization analysis showed that the binding sites of bictegravir and 4-(benzoylamino)benzoic acid lie in a cytosolic surface pocket of the protein that is solvent-exposed; L-threonine binds within the intersubunit cleft of the dimer; and austocystin D and N-acetyl-D-glucosamine are positioned at the boundary between the cytosolic surface and the transmembrane region. The accessibility of these binding sites may be reduced by the influence of the lipid bilayer. The binding energetics for bictegravirum were more favorable than for 4-(benzoylamino)benzoic acid (docking score -7.37 kcal/mol; MM/GBSA ΔG -14.71 ± 3.12 kcal/mol), making bictegravirum a promising candidate for repurposing as an ORF3a inhibitor.</p>","PeriodicalId":44339,"journal":{"name":"Vavilovskii Zhurnal Genetiki i Selektsii","volume":"29 7","pages":"1084-1096"},"PeriodicalIF":1.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12799363/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145991272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Searching for biological processes as targets for rheumatoid arthritis targeted therapy with ANDSystem, an integrated software and information platform. 利用ANDSystem集成软件和信息平台寻找类风湿关节炎靶向治疗的生物过程靶点。
IF 1 Q3 AGRICULTURE, MULTIDISCIPLINARY Pub Date : 2025-12-01 DOI: 10.18699/vjgb-25-107
E L Mishchenko, I V Yatsyk, P S Demenkov, A V Adamovskaya, T V Ivanisenko, M A Kleshchev, V A Ivanisenko

Rheumatoid arthritis (RA) is a systemic autoimmune disease characterized primarily by joint involvement with progressive destruction of cartilage and bone tissue. To date, RA remains an incurable disease that leads to a significant deterioration in quality of life and patient disability. Despite a wide arsenal of disease-modifying antirheumatic drugs, approximately 40 % of patients show an insufficient response to standard treatment, highlighting the urgent need to identify new pharmacological targets. The aim of this study was to search for novel biological processes that could serve as promising targets for the targeted therapy of RA. To achieve this goal, we employed an approach based on the automated extraction of knowledge from scientific publications and biomedical databases using the ANDSystem software. This approach involved the reconstruction and subsequent analysis of two types of associative gene networks: a) gene networks describing genes and proteins associated with the development of RA, and b) gene networks describing genes and proteins involved in the functional responses to drugs used for the disease's therapy. The analysis of the reconstructed networks identified 11 biological processes that play a significant role in the pathogenesis of RA but are not yet direct targets of existing disease-modifying antirheumatic drugs. The most promising of these, described by Gene Ontology terms, include: a) the Toll-like receptor signaling pathway; b) neutrophil activation; c) regulation of osteoblast differentiation; d) regulation of osteoclast differentiation; e) the prostaglandin biosynthetic process, and f) the canonical Wnt signaling pathway. The identified biological processes and their key regulators represent promising targets for the development of new drugs capable of improving the efficacy of RA therapy, particularly in patients resistant to existing treatments. The developed approach can also be successfully applied to the search for new targeted therapy targets for other diseases.

类风湿性关节炎(RA)是一种系统性自身免疫性疾病,其主要特征是关节累及软骨和骨组织的进行性破坏。迄今为止,类风湿性关节炎仍然是一种无法治愈的疾病,它会导致生活质量的严重恶化和患者的残疾。尽管有广泛的疾病改善抗风湿药物,但大约40%的患者对标准治疗反应不足,这突出了迫切需要确定新的药理靶点。本研究的目的是寻找新的生物过程,可以作为RA靶向治疗的有希望的靶点。为了实现这一目标,我们采用了一种基于ANDSystem软件从科学出版物和生物医学数据库中自动提取知识的方法。该方法涉及两种类型的相关基因网络的重建和后续分析:a)描述与RA发展相关的基因和蛋白质的基因网络,b)描述与用于疾病治疗的药物的功能反应相关的基因和蛋白质的基因网络。对重建网络的分析确定了11个在RA发病机制中起重要作用的生物过程,但它们尚未成为现有疾病改善抗风湿药物的直接靶点。其中最有希望的,用基因本体术语描述,包括:a) toll样受体信号通路;B)中性粒细胞活化;C)成骨细胞分化的调控;D)破骨细胞分化的调控;e)前列腺素生物合成过程,f)典型的Wnt信号通路。已确定的生物过程及其关键调节因子为开发能够提高RA治疗疗效的新药提供了有希望的靶点,特别是在对现有治疗产生耐药性的患者中。所开发的方法也可以成功地应用于寻找其他疾病的新的靶向治疗靶点。
{"title":"Searching for biological processes as targets for rheumatoid arthritis targeted therapy with ANDSystem, an integrated software and information platform.","authors":"E L Mishchenko, I V Yatsyk, P S Demenkov, A V Adamovskaya, T V Ivanisenko, M A Kleshchev, V A Ivanisenko","doi":"10.18699/vjgb-25-107","DOIUrl":"https://doi.org/10.18699/vjgb-25-107","url":null,"abstract":"<p><p>Rheumatoid arthritis (RA) is a systemic autoimmune disease characterized primarily by joint involvement with progressive destruction of cartilage and bone tissue. To date, RA remains an incurable disease that leads to a significant deterioration in quality of life and patient disability. Despite a wide arsenal of disease-modifying antirheumatic drugs, approximately 40 % of patients show an insufficient response to standard treatment, highlighting the urgent need to identify new pharmacological targets. The aim of this study was to search for novel biological processes that could serve as promising targets for the targeted therapy of RA. To achieve this goal, we employed an approach based on the automated extraction of knowledge from scientific publications and biomedical databases using the ANDSystem software. This approach involved the reconstruction and subsequent analysis of two types of associative gene networks: a) gene networks describing genes and proteins associated with the development of RA, and b) gene networks describing genes and proteins involved in the functional responses to drugs used for the disease's therapy. The analysis of the reconstructed networks identified 11 biological processes that play a significant role in the pathogenesis of RA but are not yet direct targets of existing disease-modifying antirheumatic drugs. The most promising of these, described by Gene Ontology terms, include: a) the Toll-like receptor signaling pathway; b) neutrophil activation; c) regulation of osteoblast differentiation; d) regulation of osteoclast differentiation; e) the prostaglandin biosynthetic process, and f) the canonical Wnt signaling pathway. The identified biological processes and their key regulators represent promising targets for the development of new drugs capable of improving the efficacy of RA therapy, particularly in patients resistant to existing treatments. The developed approach can also be successfully applied to the search for new targeted therapy targets for other diseases.</p>","PeriodicalId":44339,"journal":{"name":"Vavilovskii Zhurnal Genetiki i Selektsii","volume":"29 7","pages":"1020-1030"},"PeriodicalIF":1.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12795834/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145970931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Vavilovskii Zhurnal Genetiki i Selektsii
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1