首页 > 最新文献

Journal of Computational Biology最新文献

英文 中文
A Comparative Study of Gene Co-Expression Thresholding Algorithms. 基因共表达阈值算法比较研究
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-06-01 Epub Date: 2024-05-23 DOI: 10.1089/cmb.2024.0509
Carissa Bleker, Stephen K Grady, Michael A Langston

The thresholding problem is studied in the context of graph theoretical analysis of gene co-expression data. A number of thresholding methodologies are described, implemented, and tested over a large collection of graphs derived from real high-throughput biological data. Comparative results are presented and discussed.

在对基因共表达数据进行图论分析的背景下研究了阈值问题。介绍、实施并测试了从真实高通量生物数据中提取的大量图谱中的一些阈值计算方法。比较结果将被展示和讨论。
{"title":"A Comparative Study of Gene Co-Expression Thresholding Algorithms.","authors":"Carissa Bleker, Stephen K Grady, Michael A Langston","doi":"10.1089/cmb.2024.0509","DOIUrl":"10.1089/cmb.2024.0509","url":null,"abstract":"<p><p>The thresholding problem is studied in the context of graph theoretical analysis of gene co-expression data. A number of thresholding methodologies are described, implemented, and tested over a large collection of graphs derived from real high-throughput biological data. Comparative results are presented and discussed.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141087641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fusion Learning Model Based on Deep Learning for Single-Cell RNA Sequencing Data Clustering. 基于深度学习的单细胞 RNA 测序数据聚类融合学习模型。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-06-01 Epub Date: 2024-05-20 DOI: 10.1089/cmb.2024.0512
Tian-Jing Qiao, Feng Li, Sha-Sha Yuan, Ling-Yun Dai, Juan Wang

Single-cell RNA sequencing (scRNA-seq) technology provides a means for studying biology from a cellular perspective. The fundamental goal of scRNA-seq data analysis is to discriminate single-cell types using unsupervised clustering. Few single-cell clustering algorithms have taken into account both deep and surface information, despite the recent slew of suggestions. Consequently, this article constructs a fusion learning framework based on deep learning, namely scGASI. For learning a clustering similarity matrix, scGASI integrates data affinity recovery and deep feature embedding in a unified scheme based on various top feature sets. Next, scGASI learns the low-dimensional latent representation underlying the data using a graph autoencoder to mine the hidden information residing in the data. To efficiently merge the surface information from raw area and the deeper potential information from underlying area, we then construct a fusion learning model based on self-expression. scGASI uses this fusion learning model to learn the similarity matrix of an individual feature set as well as the clustering similarity matrix of all feature sets. Lastly, gene marker identification, visualization, and clustering are accomplished using the clustering similarity matrix. Extensive verification on actual data sets demonstrates that scGASI outperforms many widely used clustering techniques in terms of clustering accuracy.

单细胞 RNA 测序(scRNA-seq)技术为从细胞角度研究生物学提供了一种手段。scRNA-seq 数据分析的基本目标是利用无监督聚类来区分单细胞类型。尽管最近有很多建议,但很少有单细胞聚类算法同时考虑到深层和表层信息。因此,本文构建了一个基于深度学习的融合学习框架,即 scGASI。为了学习聚类相似性矩阵,scGASI 基于各种顶级特征集,将数据亲和性恢复和深度特征嵌入整合到一个统一的方案中。接下来,scGASI 利用图自动编码器学习数据底层的低维潜在表示,挖掘数据中的隐藏信息。为了有效融合原始区域的表层信息和底层区域的深层潜在信息,我们构建了一个基于自我表达的融合学习模型。scGASI 利用该融合学习模型学习单个特征集的相似性矩阵以及所有特征集的聚类相似性矩阵。最后,利用聚类相似性矩阵完成基因标记的识别、可视化和聚类。在实际数据集上的广泛验证表明,scGASI 在聚类精度方面优于许多广泛使用的聚类技术。
{"title":"A Fusion Learning Model Based on Deep Learning for Single-Cell RNA Sequencing Data Clustering.","authors":"Tian-Jing Qiao, Feng Li, Sha-Sha Yuan, Ling-Yun Dai, Juan Wang","doi":"10.1089/cmb.2024.0512","DOIUrl":"10.1089/cmb.2024.0512","url":null,"abstract":"<p><p>\u0000 <b>Single-cell RNA sequencing (scRNA-seq) technology provides a means for studying biology from a cellular perspective. The fundamental goal of scRNA-seq data analysis is to discriminate single-cell types using unsupervised clustering. Few single-cell clustering algorithms have taken into account both deep and surface information, despite the recent slew of suggestions. Consequently, this article constructs a fusion learning framework based on deep learning, namely scGASI. For learning a clustering similarity matrix, scGASI integrates data affinity recovery and deep feature embedding in a unified scheme based on various top feature sets. Next, scGASI learns the low-dimensional latent representation underlying the data using a graph autoencoder to mine the hidden information residing in the data. To efficiently merge the surface information from raw area and the deeper potential information from underlying area, we then construct a fusion learning model based on self-expression. scGASI uses this fusion learning model to learn the similarity matrix of an individual feature set as well as the clustering similarity matrix of all feature sets. Lastly, gene marker identification, visualization, and clustering are accomplished using the clustering similarity matrix. Extensive verification on actual data sets demonstrates that scGASI outperforms many widely used clustering techniques in terms of clustering accuracy.</b>\u0000 </p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140957730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian Change Point Model for Dynamic Alternative Transcription Start Site Usage During Cellular Differentiation. 细胞分化过程中动态替代转录起始位点使用的贝叶斯变化点模型
IF 1.7 4区 生物学 Q2 Mathematics Pub Date : 2024-05-01 Epub Date: 2024-05-14 DOI: 10.1089/cmb.2023.0174
Juan Xia, Yuxia Li, Haotian Zhu, Feiyang Xue, Feng Shi, Nana Li

ABSTRACT An alternative transcription start site (ATSS) is a major driving force for increasing the complexity of transcripts in human tissues. As a transcriptional regulatory mechanism, ATSS has biological significance. Many studies have confirmed that ATSS plays an important role in diseases and cell development and differentiation. However, exploration of its dynamic mechanisms remains insufficient. Identifying ATSS change points during cell differentiation is critical for elucidating potential dynamic mechanisms. For relative ATSS usage as percentage data, the existing methods lack sensitivity to detect the change point for ATSS longitudinal data. In addition, some methods have strict requirements for data distribution and cannot be applied to deal with this problem. In this study, the Bayesian change point detection model was first constructed using reparameterization techniques for two parameters of a beta distribution for the percentage data type, and the posterior distributions of parameters and change points were obtained using Markov Chain Monte Carlo (MCMC) sampling. With comprehensive simulation studies, the performance of the Bayesian change point detection model is found to be consistently powerful and robust across most scenarios with different sample sizes and beta distributions. Second, differential ATSS events in the real data, whose change points were identified using our method, were clustered according to their change points. Last, for each change point, pathway and transcription factor motif analyses were performed on its differential ATSS events. The results of our analyses demonstrated the effectiveness of the Bayesian change point detection model and provided biological insights into cell differentiation.

摘要 替代转录起始位点(ATSS)是增加人体组织中转录本复杂性的主要驱动力。作为一种转录调控机制,ATSS 具有重要的生物学意义。许多研究证实,ATSS 在疾病、细胞发育和分化中发挥着重要作用。然而,对其动态机制的探索仍然不足。确定细胞分化过程中 ATSS 的变化点对于阐明潜在的动态机制至关重要。对于 ATSS 的相对使用百分比数据,现有方法缺乏灵敏度,无法检测 ATSS 纵向数据的变化点。此外,一些方法对数据分布有严格要求,无法应用于解决这一问题。本研究首先针对百分比数据类型的贝塔分布的两个参数,利用重参数化技术构建了贝叶斯变化点检测模型,并利用马尔可夫链蒙特卡罗(MCMC)采样方法得到了参数和变化点的后验分布。通过全面的模拟研究发现,贝叶斯变化点检测模型的性能在不同样本量和贝塔分布的大多数情况下始终保持强大和稳健。其次,根据变化点对真实数据中的差异 ATSS 事件进行聚类,并使用我们的方法确定其变化点。最后,针对每个变化点,对其差异 ATSS 事件进行通路和转录因子主题分析。我们的分析结果证明了贝叶斯变化点检测模型的有效性,并为细胞分化提供了生物学启示。
{"title":"A Bayesian Change Point Model for Dynamic Alternative Transcription Start Site Usage During Cellular Differentiation.","authors":"Juan Xia, Yuxia Li, Haotian Zhu, Feiyang Xue, Feng Shi, Nana Li","doi":"10.1089/cmb.2023.0174","DOIUrl":"10.1089/cmb.2023.0174","url":null,"abstract":"<p><p><b>ABSTRACT</b> <b>An alternative transcription start site (ATSS) is a major driving force for increasing the complexity of transcripts in human tissues. As a transcriptional regulatory mechanism, ATSS has biological significance. Many studies have confirmed that ATSS plays an important role in diseases and cell development and differentiation. However, exploration of its dynamic mechanisms remains insufficient. Identifying ATSS change points during cell differentiation is critical for elucidating potential dynamic mechanisms. For relative ATSS usage as percentage data, the existing methods lack sensitivity to detect the change point for ATSS longitudinal data. In addition, some methods have strict requirements for data distribution and cannot be applied to deal with this problem. In this study, the Bayesian change point detection model was first constructed using reparameterization techniques for two parameters of a beta distribution for the percentage data type, and the posterior distributions of parameters and change points were obtained using Markov Chain Monte Carlo (MCMC) sampling. With comprehensive simulation studies, the performance of the Bayesian change point detection model is found to be consistently powerful and robust across most scenarios with different sample sizes and beta distributions. Second, differential ATSS events in the real data, whose change points were identified using our method, were clustered according to their change points. Last, for each change point, pathway and transcription factor motif analyses were performed on its differential ATSS events. The results of our analyses demonstrated the effectiveness of the Bayesian change point detection model and provided biological insights into cell differentiation</b>.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140943652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enforcing Temporal Consistency in Migration History Inference. 在迁移历史推断中强化时间一致性。
IF 1.7 4区 生物学 Q2 Mathematics Pub Date : 2024-05-01 Epub Date: 2024-05-16 DOI: 10.1089/cmb.2023.0352
Mrinmoy Saha Roddur, Sagi Snir, Mohammed El-Kebir

In addition to undergoing evolution, members of biological populations may also migrate between locations. Examples include the spread of tumor cells from the primary tumor to distant metastases or the spread of pathogens from one host to another. One may represent migration histories by assigning a location label to each vertex of a given phylogenetic tree such that an edge connecting vertices with distinct locations represents a migration. Some biological populations undergo comigration, a phenomenon where multiple taxa from distinct lineages simultaneously comigrate from one location to another. In this work, we show that a previous problem statement for inferring migration histories that are parsimonious in terms of migrations and comigrations may lead to temporally inconsistent solutions. To remedy this deficiency, we introduce precise definitions of temporal consistency of comigrations in a phylogenetic tree, leading to three successive problems. First, we formulate the temporally consistent comigration problem to check if a set of comigrations is temporally consistent and provide a linear time algorithm for solving this problem. Second, we formulate the parsimonious consistent comigrations (PCC) problem, which aims to find comigrations given a location labeling of a phylogenetic tree. We show that PCC is NP-hard. Third, we formulate the parsimonious consistent comigration history (PCCH) problem, which infers the migration history given a phylogenetic tree and locations of its extant vertices only. We show that PCCH is NP-hard as well. On the positive side, we propose integer linear programming models to solve the PCC and PCCH problems. We demonstrate our algorithms on simulated and real data.

除了经历进化,生物种群的成员还可能在不同地点之间迁移。例如,肿瘤细胞从原发肿瘤扩散到远处的转移灶,或者病原体从一个宿主扩散到另一个宿主。我们可以通过给定系统发生树的每个顶点分配一个位置标签来表示迁移历史,这样,连接具有不同位置的顶点的边就代表了一次迁移。有些生物种群会发生会聚迁移,即来自不同品系的多个类群同时从一个地点迁移到另一个地点。在这项研究中,我们发现以前的一个问题陈述,即从迁徙和合并的角度推断迁徙历史的合理性,可能会导致时间上不一致的解决方案。为了弥补这一不足,我们引入了系统发育树中汇聚的时间一致性的精确定义,从而引出了三个连续的问题。首先,我们提出了时间上一致的组合问题,以检查一组组合是否在时间上一致,并提供了解决该问题的线性时间算法。其次,我们提出了简约一致的组合(PCC)问题,其目的是在给定系统发育树位置标签的情况下找到组合。我们证明,PCC 是 NP 难问题。第三,我们提出了准一致迁徙历史(PCCH)问题,该问题仅根据系统发育树及其现存顶点的位置推断迁徙历史。我们证明 PCCH 也是 NP-困难的。从积极的方面看,我们提出了解决 PCC 和 PCCH 问题的整数线性规划模型。我们在模拟数据和真实数据上演示了我们的算法。
{"title":"Enforcing Temporal Consistency in Migration History Inference.","authors":"Mrinmoy Saha Roddur, Sagi Snir, Mohammed El-Kebir","doi":"10.1089/cmb.2023.0352","DOIUrl":"10.1089/cmb.2023.0352","url":null,"abstract":"<p><p>\u0000 <b>In addition to undergoing evolution, members of biological populations may also migrate between locations. Examples include the spread of tumor cells from the primary tumor to distant metastases or the spread of pathogens from one host to another. One may represent migration histories by assigning a location label to each vertex of a given phylogenetic tree such that an edge connecting vertices with distinct locations represents a migration. Some biological populations undergo comigration, a phenomenon where multiple taxa from distinct lineages simultaneously comigrate from one location to another. In this work, we show that a previous problem statement for inferring migration histories that are parsimonious in terms of migrations and comigrations may lead to temporally inconsistent solutions. To remedy this deficiency, we introduce precise definitions of temporal consistency of comigrations in a phylogenetic tree, leading to three successive problems. First, we formulate the temporally consistent comigration problem to check if a set of comigrations is temporally consistent and provide a linear time algorithm for solving this problem. Second, we formulate the parsimonious consistent comigrations (PCC) problem, which aims to find comigrations given a location labeling of a phylogenetic tree. We show that PCC is NP-hard. Third, we formulate the parsimonious consistent comigration history (PCCH) problem, which infers the migration history given a phylogenetic tree and locations of its extant vertices only. We show that PCCH is NP-hard as well. On the positive side, we propose integer linear programming models to solve the PCC and PCCH problems. We demonstrate our algorithms on simulated and real data.</b>\u0000 </p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140957697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Integer Linear Programming Model to Optimize Coding DNA Sequences By Joint Control of Transcript Indicators. 通过联合控制转录指标优化 DNA 编码序列的整数线性规划模型。
IF 1.7 4区 生物学 Q2 Mathematics Pub Date : 2024-05-01 Epub Date: 2024-04-30 DOI: 10.1089/cmb.2023.0166
Claudio Arbib, Andrea D'ascenzo, Fabrizio Rossi, Daniele Santoni

A Coding DNA Sequence (CDS) is a fraction of DNA whose nucleotides are grouped into consecutive triplets called codons, each one encoding an amino acid. Because most amino acids can be encoded by more than one codon, the same amino acid chain can be obtained by a very large number of different CDSs. These synonymous CDSs show different features that, also depending on the organism the transcript is expressed in, could affect translational efficiency and yield. The identification of optimal CDSs with respect to given transcript indicators is in general a challenging task, but it has been observed in recent literature that integer linear programming (ILP) can be a very flexible and efficient way to achieve it. In this article, we add evidence to this observation by proposing a new ILP model that simultaneously optimizes different well-grounded indicators. With this model, we efficiently find solutions that dominate those returned by six existing codon optimization heuristics.

编码 DNA 序列(CDS)是 DNA 的一部分,其核苷酸被组合成连续的三联体,称为密码子,每个密码子编码一种氨基酸。由于大多数氨基酸可由多个密码子编码,因此同一氨基酸链可由大量不同的 CDS 获得。这些同义 CDS 表现出不同的特征,这些特征也取决于转录本表达的生物体,可能会影响翻译效率和产量。一般来说,根据给定的转录本指标确定最佳 CDS 是一项具有挑战性的任务,但最近的文献表明,整数线性规划(ILP)是一种非常灵活和有效的方法。在本文中,我们提出了一个新的 ILP 模型,该模型可同时优化不同的基础指标,从而为这一观点提供了证据。有了这个模型,我们就能高效地找到解决方案,这些解决方案优于现有的六种密码子优化启发式方法。
{"title":"An Integer Linear Programming Model to Optimize Coding DNA Sequences By Joint Control of Transcript Indicators.","authors":"Claudio Arbib, Andrea D'ascenzo, Fabrizio Rossi, Daniele Santoni","doi":"10.1089/cmb.2023.0166","DOIUrl":"10.1089/cmb.2023.0166","url":null,"abstract":"<p><p>\u0000 <b>A <i>Coding DNA Sequence</i> (CDS) is a fraction of DNA whose nucleotides are grouped into consecutive triplets called codons, each one encoding an amino acid. Because most amino acids can be encoded by more than one codon, the same amino acid chain can be obtained by a very large number of different CDSs. These synonymous CDSs show different features that, also depending on the organism the transcript is expressed in, could affect translational efficiency and yield. The identification of optimal CDSs with respect to given transcript indicators is in general a challenging task, but it has been observed in recent literature that integer linear programming (ILP) can be a very flexible and efficient way to achieve it. In this article, we add evidence to this observation by proposing a new ILP model that simultaneously optimizes different well-grounded indicators. With this model, we efficiently find solutions that dominate those returned by six existing codon optimization heuristics.</b>\u0000 </p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140851945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On Minimizers and Convolutional Filters: Theoretical Connections and Applications to Genome Analysis. 论最小化和卷积滤波器:基因组分析的理论联系与应用》。
IF 1.7 4区 生物学 Q2 Mathematics Pub Date : 2024-05-01 Epub Date: 2024-04-30 DOI: 10.1089/cmb.2024.0483
Yun William Yu

Minimizers and convolutional neural networks (CNNs) are two quite distinct popular techniques that have both been employed to analyze categorical biological sequences. At face value, the methods seem entirely dissimilar. Minimizers use min-wise hashing on a rolling window to extract a single important k-mer feature per window. CNNs start with a wide array of randomly initialized convolutional filters, paired with a pooling operation, and then multiple additional neural layers to learn both the filters themselves and how they can be used to classify the sequence. In this study, our main result is a careful mathematical analysis of hash function properties showing that for sequences over a categorical alphabet, random Gaussian initialization of convolutional filters with max-pooling is equivalent to choosing a minimizer ordering such that selected k-mers are (in Hamming distance) far from the k-mers within the sequence but close to other minimizers. In empirical experiments, we find that this property manifests as decreased density in repetitive regions, both in simulation and on real human telomeres. We additionally train from scratch a CNN embedding of synthetic short-reads from the SARS-CoV-2 genome into 3D Euclidean space that locally recapitulates the linear sequence distance of the read origins, a modest step toward building a deep learning assembler, although it is at present too slow to be practical. In total, this article provides a partial explanation for the effectiveness of CNNs in categorical sequence analysis.

最小化器和卷积神经网络(CNN)是两种截然不同的流行技术,都被用于分析分类生物序列。从表面上看,这两种方法似乎完全不同。最小化器在滚动窗口上使用最小哈希算法,每个窗口提取一个重要的 k-mer 特征。而 CNN 从一系列随机初始化的卷积滤波器开始,配以池化操作,然后通过多个附加神经层来学习滤波器本身以及如何使用它们对序列进行分类。在这项研究中,我们的主要成果是对哈希函数特性进行了细致的数学分析,结果表明,对于分类字母表上的序列,卷积滤波器的随机高斯初始化与最大池化等同于选择最小化排序,从而使所选的 k-mers 与序列中的 k-mers 相距较远(以汉明距离计算),但与其他最小化排序相近。在经验实验中,我们发现无论是在模拟还是在真实的人类端粒上,这一特性都表现为重复区域的密度降低。此外,我们还从头开始训练将来自 SARS-CoV-2 基因组的合成短读数嵌入到三维欧几里得空间的 CNN 嵌入,这种嵌入能在局部再现读数起源的线性序列距离,这是向构建深度学习装配器迈出的微不足道的一步,尽管它目前的速度太慢而不实用。总之,本文为 CNN 在分类序列分析中的有效性提供了部分解释。
{"title":"On Minimizers and Convolutional Filters: Theoretical Connections and Applications to Genome Analysis.","authors":"Yun William Yu","doi":"10.1089/cmb.2024.0483","DOIUrl":"10.1089/cmb.2024.0483","url":null,"abstract":"<p><p>\u0000 <b>Minimizers and convolutional neural networks (CNNs) are two quite distinct popular techniques that have both been employed to analyze categorical biological sequences. At face value, the methods seem entirely dissimilar. Minimizers use min-wise hashing on a rolling window to extract a single important k-mer feature per window. CNNs start with a wide array of randomly initialized convolutional filters, paired with a pooling operation, and then multiple additional neural layers to learn both the filters themselves and how they can be used to classify the sequence. In this study, our main result is a careful mathematical analysis of hash function properties showing that for sequences over a categorical alphabet, random Gaussian initialization of convolutional filters with max-pooling is equivalent to choosing a minimizer ordering such that selected k-mers are (in Hamming distance) far from the k-mers within the sequence but close to other minimizers. In empirical experiments, we find that this property manifests as decreased density in repetitive regions, both in simulation and on real human telomeres. We additionally train from scratch a CNN embedding of synthetic short-reads from the SARS-CoV-2 genome into 3D Euclidean space that locally recapitulates the linear sequence distance of the read origins, a modest step toward building a deep learning assembler, although it is at present too slow to be practical. In total, this article provides a partial explanation for the effectiveness of CNNs in categorical sequence analysis.<sup></sup></b>\u0000 </p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140870311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Singular Value Decomposition-Based Penalized Multinomial Regression for Classifying Imbalanced Medulloblastoma Subgroups Using Methylation Data. 基于奇异值分解的惩罚性多项式回归利用甲基化数据对不平衡髓母细胞瘤亚组进行分类
IF 1.7 4区 生物学 Q2 Mathematics Pub Date : 2024-05-01 Epub Date: 2024-05-14 DOI: 10.1089/cmb.2023.0198
Isra Mohammed, Murtada K Elbashir, Areeg S Faggad

Medulloblastoma (MB) is a molecularly heterogeneous brain malignancy with large differences in clinical presentation. According to genomic studies, there are at least four distinct molecular subgroups of MB: sonic hedgehog (SHH), wingless/INT (WNT), Group 3, and Group 4. The treatment and outcomes depend on appropriate classification. It is difficult for the classification algorithms to identify these subgroups from an imbalanced MB genomic data set, where the distribution of samples among the MB subgroups may not be equal. To overcome this problem, we used singular value decomposition (SVD) and group lasso techniques to find DNA methylation probe features that maximize the separation between the different imbalanced MB subgroups. We used multinomial regression as a classification method to classify the four different molecular subgroups of MB using the reduced DNA methylation data. Coordinate descent is used to solve our loss function associated with the group lasso, which promotes sparsity. By using SVD, we were able to reduce the 321,174 probe features to just 200 features. Less than 40 features were successfully selected after applying the group lasso, which we then used as predictors for our classification models. Our proposed method achieved an average overall accuracy of 99% based on fivefold cross-validation technique. Our approach produces improved classification performance compared with the state-of-the-art methods for classifying MB molecular subgroups.

髓母细胞瘤(MB)是一种分子异质性脑恶性肿瘤,临床表现差异很大。根据基因组研究,髓母细胞瘤至少有四个不同的分子亚组:声刺猬(SHH)、无翅/INT(WNT)、第 3 组和第 4 组。 治疗和预后取决于适当的分类。在不平衡的 MB 基因组数据集中,MB 亚组之间的样本分布可能不均等,因此分类算法很难识别这些亚组。为了解决这个问题,我们使用奇异值分解(SVD)和组套索技术来寻找 DNA 甲基化探针特征,以最大限度地分离不同的不平衡 MB 亚组。我们使用多项式回归作为分类方法,利用还原的 DNA 甲基化数据对 MB 的四个不同分子亚组进行分类。我们使用坐标下降法来解决与组套索相关的损失函数,从而提高了稀疏性。通过使用 SVD,我们将 321,174 个探针特征减少到了 200 个。应用分组套索后,我们成功选出了不到 40 个特征,并将其用作分类模型的预测因子。基于五重交叉验证技术,我们提出的方法达到了 99% 的平均整体准确率。与最先进的 MB 分子亚群分类方法相比,我们的方法提高了分类性能。
{"title":"Singular Value Decomposition-Based Penalized Multinomial Regression for Classifying Imbalanced Medulloblastoma Subgroups Using Methylation Data.","authors":"Isra Mohammed, Murtada K Elbashir, Areeg S Faggad","doi":"10.1089/cmb.2023.0198","DOIUrl":"10.1089/cmb.2023.0198","url":null,"abstract":"<p><p><b>Medulloblastoma (MB) is a molecularly heterogeneous brain malignancy with large differences in clinical presentation. According to genomic studies, there are at least four distinct molecular subgroups of MB: sonic hedgehog (SHH), wingless/INT (WNT), Group 3, and Group 4. The treatment and outcomes depend on appropriate classification. It is difficult for the classification algorithms to identify these subgroups from an imbalanced MB genomic data set, where the distribution of samples among the MB subgroups may not be equal. To overcome this problem, we used singular value decomposition (SVD) and group lasso techniques to find DNA methylation probe features that maximize the separation between the different imbalanced MB subgroups. We used multinomial regression as a classification method to classify the four different molecular subgroups of MB using the reduced DNA methylation data. Coordinate descent is used to solve our loss function associated with the group lasso, which promotes sparsity. By using SVD, we were able to reduce the 321,174 probe features to just 200 features. Less than 40 features were successfully selected after applying the group lasso, which we then used as predictors for our classification models. Our proposed method achieved an average overall accuracy of 99% based on fivefold cross-validation technique. Our approach produces improved classification performance compared with the state-of-the-art methods for classifying MB molecular subgroups</b>.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140943774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
More Is Faster: Why Population Size Matters in Biological Search. 越多越快:为什么生物搜索中种群数量很重要》(Why Population Size Matters in Biological Search.
IF 1.7 4区 生物学 Q2 Mathematics Pub Date : 2024-05-01 Epub Date: 2024-05-16 DOI: 10.1089/cmb.2023.0296
Jannatul Ferdous, George Matthew Fricke, Melanie E Moses

Many biological scenarios have multiple cooperating searchers, and the timing of the initial first contact between any one of those searchers and its target is critically important. However, we are unaware of biological models that predict how long it takes for the first of many searchers to discover a target. We present a novel mathematical model that predicts initial first contact times between searchers and targets distributed at random in a volume. We compare this model with the extreme first passage time approach in physics that assumes an infinite number of searchers all initially positioned at the same location. We explore how the number of searchers, the distribution of searchers and targets, and the initial distances between searchers and targets affect initial first contact times. Given a constant density of uniformly distributed searchers and targets, the initial first contact time decreases linearly with both search volume and the number of searchers. However, given only a single target and searchers placed at the same starting location, the relationship between the initial first contact time and the number of searchers shifts from a linear decrease to a logarithmic decrease as the number of searchers grows very large. More generally, we show that initial first contact times can be dramatically faster than the average first contact times and that the initial first contact times decrease with the number of searchers, while the average search times are independent of the number of searchers. We suggest that this is an underappreciated phenomenon in biology and other collective search problems.

许多生物场景都有多个合作搜索者,其中任何一个搜索者与其目标之间最初的首次接触时间都至关重要。然而,我们还不知道有什么生物模型可以预测众多搜索者中的第一个发现目标需要多长时间。我们提出了一个新颖的数学模型,它可以预测搜索者与随机分布在一个体积中的目标之间的初始首次接触时间。我们将该模型与物理学中的极端首次通过时间方法进行了比较,后者假定有无限多的搜索者最初都位于同一位置。我们探讨了搜索者的数量、搜索者和目标的分布以及搜索者和目标之间的初始距离如何影响初始首次接触时间。在搜索者和目标均匀分布的密度不变的情况下,初始首次接触时间随搜索量和搜索者数量的增加而线性减少。然而,如果只有一个目标,且搜索者位于同一起始位置,那么随着搜索者数量的增加,初始首次接触时间与搜索者数量之间的关系就会从线性下降转变为对数下降。更广泛地说,我们发现初始首次接触时间可能比平均首次接触时间快得多,而且初始首次接触时间随搜索者数量的增加而减少,而平均搜索时间则与搜索者数量无关。我们认为,这是生物学和其他集体搜索问题中一个未被充分重视的现象。
{"title":"More Is Faster: Why Population Size Matters in Biological Search.","authors":"Jannatul Ferdous, George Matthew Fricke, Melanie E Moses","doi":"10.1089/cmb.2023.0296","DOIUrl":"10.1089/cmb.2023.0296","url":null,"abstract":"<p><p>\u0000 <b>Many biological scenarios have multiple cooperating searchers, and the timing of the initial first contact between any one of those searchers and its target is critically important. However, we are unaware of biological models that predict how long it takes for the first of many searchers to discover a target. We present a novel mathematical model that predicts initial first contact times between searchers and targets distributed at random in a volume. We compare this model with the extreme first passage time approach in physics that assumes an infinite number of searchers all initially positioned at the same location. We explore how the number of searchers, the distribution of searchers and targets, and the initial distances between searchers and targets affect initial first contact times. Given a constant density of uniformly distributed searchers and targets, the initial first contact time decreases linearly with both search volume and the number of searchers. However, given only a single target and searchers placed at the same starting location, the relationship between the initial first contact time and the number of searchers shifts from a linear decrease to a logarithmic decrease as the number of searchers grows very large. More generally, we show that initial first contact times can be dramatically faster than the average first contact times and that the initial first contact times decrease with the number of searchers, while the average search times are independent of the number of searchers. We suggest that this is an underappreciated phenomenon in biology and other collective search problems.</b>\u0000 </p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140957699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DNASCANNER v2: A Web-Based Tool to Analyze the Characteristic Properties of Nucleotide Sequences. DNASCANNER v2:基于网络的核苷酸序列特性分析工具。
IF 1.7 4区 生物学 Q2 Mathematics Pub Date : 2024-04-25 DOI: 10.1089/cmb.2023.0227
Preeti P, Azeen Riyaz, Alakto Choudhury, Priyanka Ray Choudhury, Nischal Pradhan, Abhishek Singh, Mihir Nakul, Chhavi Dudeja, Abhijeet Yadav, Swarsat Kaushik Nath, Vrinda Khanna, Trapti Sharma, Gayatri Pradhan, Simran Takkar, Kamal Rawal

Throughout the process of evolution, DNA undergoes the accumulation of distinct mutations, which can often result in highly organized patterns that serve various essential biological functions. These patterns encompass various genomic elements and provide valuable insights into the regulatory and functional aspects of DNA. The physicochemical, mechanical, thermodynamic, and structural properties of DNA sequences play a crucial role in the formation of specific patterns. These properties contribute to the three-dimensional structure of DNA and influence their interactions with proteins, regulatory elements, and other molecules. In this study, we introduce DNASCANNER v2, an advanced version of our previously published algorithm DNASCANNER for analyzing DNA properties. The current tool is built using the FLASK framework in Python language. Featuring a user-friendly interface tailored for nonspecialized researchers, it offers an extensive analysis of 158 DNA properties, including mono/di/trinucleotide frequencies, structural, physicochemical, thermodynamics, and mechanical properties of DNA sequences. The tool provides downloadable results and offers interactive plots for easy interpretation and comparison between different features. We also demonstrate the utility of DNASCANNER v2 in analyzing splice-site junctions, casposon insertion sequences, and transposon insertion sites (TIS) within the bacterial and human genomes, respectively. We also developed a deep learning module for the prediction of potential TIS in a given nucleotide sequence. In the future, we aim to optimize the performance of this prediction model through extensive training on larger data sets.

在整个进化过程中,DNA 会经历不同突变的积累,这些突变通常会形成高度组织化的模式,发挥各种基本的生物功能。这些模式包括各种基因组元素,为 DNA 的调控和功能方面提供了宝贵的见解。DNA 序列的物理化学、机械、热力学和结构特性对特定模式的形成起着至关重要的作用。这些特性有助于形成 DNA 的三维结构,并影响它们与蛋白质、调控元件和其他分子的相互作用。在本研究中,我们介绍了 DNASCANNER v2,它是我们之前发布的用于分析 DNA 特性的算法 DNASCANNER 的高级版本。目前的工具是使用 Python 语言的 FLASK 框架构建的。该工具具有专为非专业研究人员定制的友好用户界面,可广泛分析 158 种 DNA 特性,包括 DNA 序列的单/双/三核苷酸频率、结构、物理化学、热力学和机械特性。该工具提供可下载的结果,并提供交互式图表,便于解释和比较不同的特征。我们还展示了 DNASCANNER v2 在分析细菌和人类基因组中的剪接位点连接、casposon 插入序列和转座子插入位点(TIS)方面的实用性。我们还开发了一个深度学习模块,用于预测给定核苷酸序列中潜在的 TIS。未来,我们的目标是通过在更大的数据集上进行广泛的训练来优化这一预测模型的性能。
{"title":"DNASCANNER v2: A Web-Based Tool to Analyze the Characteristic Properties of Nucleotide Sequences.","authors":"Preeti P, Azeen Riyaz, Alakto Choudhury, Priyanka Ray Choudhury, Nischal Pradhan, Abhishek Singh, Mihir Nakul, Chhavi Dudeja, Abhijeet Yadav, Swarsat Kaushik Nath, Vrinda Khanna, Trapti Sharma, Gayatri Pradhan, Simran Takkar, Kamal Rawal","doi":"10.1089/cmb.2023.0227","DOIUrl":"https://doi.org/10.1089/cmb.2023.0227","url":null,"abstract":"<p><p>\u0000 <b>Throughout the process of evolution, DNA undergoes the accumulation of distinct mutations, which can often result in highly organized patterns that serve various essential biological functions. These patterns encompass various genomic elements and provide valuable insights into the regulatory and functional aspects of DNA. The physicochemical, mechanical, thermodynamic, and structural properties of DNA sequences play a crucial role in the formation of specific patterns. These properties contribute to the three-dimensional structure of DNA and influence their interactions with proteins, regulatory elements, and other molecules. In this study, we introduce DNASCANNER v2, an advanced version of our previously published algorithm DNASCANNER for analyzing DNA properties. The current tool is built using the FLASK framework in Python language. Featuring a user-friendly interface tailored for nonspecialized researchers, it offers an extensive analysis of 158 DNA properties, including mono/di/trinucleotide frequencies, structural, physicochemical, thermodynamics, and mechanical properties of DNA sequences. The tool provides downloadable results and offers interactive plots for easy interpretation and comparison between different features. We also demonstrate the utility of DNASCANNER v2 in analyzing splice-site junctions, casposon insertion sequences, and transposon insertion sites (TIS) within the bacterial and human genomes, respectively. We also developed a deep learning module for the prediction of potential TIS in a given nucleotide sequence. In the future, we aim to optimize the performance of this prediction model through extensive training on larger data sets.</b>\u0000 </p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140849825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Floor Is Lava: Halving Natural Genomes with Viaducts, Piers, and Pontoons. 地板是熔岩:用高架桥、桥墩和浮桥将自然基因组减半。
IF 1.7 4区 生物学 Q2 Mathematics Pub Date : 2024-04-01 Epub Date: 2024-04-15 DOI: 10.1089/cmb.2023.0330
Leonard Bohnenkämper

Whole Genome Duplications (WGDs) are events that double the content and structure of a genome. In some organisms, multiple WGD events have been observed while loss of genetic material is a typical occurrence following a WGD event. The requirement of classic rearrangement models that every genetic marker has to occur exactly two times in a given problem instance, therefore, poses a serious restriction in this context. The Double-Cut and Join (DCJ) model is a simple and powerful model for the analysis of large structural rearrangements. After being extended to the DCJ-Indel model, capable of handling gains and losses of genetic material, research has shifted in recent years toward enabling it to handle natural genomes, for which no assumption about the distribution of markers has to be made. The traditional theoretical framework for studying WGD events is the Genome Halving Problem (GHP). While the GHP is solved for the DCJ model for genomes without losses, there are currently no exact algorithms utilizing the DCJ-Indel model that are able to handle natural genomes. In this work, we present a general view on the DCJ-Indel model that we apply to derive an exact polynomial time and space solution for the GHP on genomes with at most two genes per family before generalizing the problem to an integer linear program solution for natural genomes.

全基因组重复(WGD)是基因组内容和结构加倍的事件。在某些生物中,已经观察到多次 WGD 事件,而遗传物质的丢失是 WGD 事件后的典型现象。因此,经典重排模型要求每个遗传标记必须在给定的问题实例中准确出现两次,这在此情况下构成了严重的限制。双切和连接(DCJ)模型是一种简单而强大的模型,可用于分析大型结构重排。DCJ-Indel 模型能够处理遗传物质的增减,在扩展到 DCJ-Indel 模型后,近年来的研究转向使其能够处理天然基因组,因为天然基因组无需假设标记的分布。研究 WGD 事件的传统理论框架是基因组减半问题(GHP)。虽然 GHP 在无损失基因组的 DCJ 模型中可以求解,但目前还没有利用 DCJ-Indel 模型处理自然基因组的精确算法。在这项研究中,我们提出了 DCJ-Indel 模型的一般观点,并将其应用于推导 GHP 的精确多项式时间和空间解决方案,该方案适用于每个族最多有两个基因的基因组,然后再将该问题推广为自然基因组的整数线性规划解决方案。
{"title":"The Floor Is Lava: Halving Natural Genomes with Viaducts, Piers, and Pontoons.","authors":"Leonard Bohnenkämper","doi":"10.1089/cmb.2023.0330","DOIUrl":"10.1089/cmb.2023.0330","url":null,"abstract":"<p><p><b>Whole Genome Duplications (WGDs) are events that double the content and structure of a genome. In some organisms, multiple WGD events have been observed while loss of genetic material is a typical occurrence following a WGD event. The requirement of classic rearrangement models that every genetic marker has to occur exactly two times in a given problem instance, therefore, poses a serious restriction in this context. The Double</b>-<b>Cut and Join (DCJ) model is a simple and powerful model for the analysis of large structural rearrangements. After being extended to the DCJ-Indel model, capable of handling gains and losses of genetic material, research has shifted in recent years toward enabling it to handle natural genomes, for which no assumption about the distribution of markers has to be made. The traditional theoretical framework for studying WGD events is the Genome Halving Problem (GHP). While the GHP is solved for the DCJ model for genomes without losses, there are currently no exact algorithms utilizing the DCJ-Indel model that are able to handle natural genomes. In this work, we present a general view on the DCJ-Indel model that we apply to derive an exact polynomial time and space solution for the GHP on genomes with at most two genes per family before generalizing the problem to an integer linear program solution for natural genomes.</b></p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11057688/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140848856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Computational Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1