首页 > 最新文献

Journal of Computational Biology最新文献

英文 中文
The Statistics of Parametrized Syncmers in a Simple Mutation Process Without Spurious Matches. 无假匹配的简单突变过程中参数化同步器的统计。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-12-01 Epub Date: 2024-11-12 DOI: 10.1089/cmb.2024.0508
John L Spouge, Pijush Das, Ye Chen, Martin Frith

Introduction: Often, bioinformatics uses summary sketches to analyze next-generation sequencing data, but most sketches are not well understood statistically. Under a simple mutation model, Blanca et al. analyzed complete sketches, that is, the complete set of unassembled k-mers, from two closely related sequences. The analysis extracted a point mutation parameter θ quantifying the evolutionary distance between the two sequences. Methods: We extend the results of Blanca et al. for complete sketches to parametrized syncmer sketches with downsampling. A syncmer sketch can sample k-mers much more sparsely than a complete sketch. Consider the following simple mutation model disallowing insertions or deletions. Consider a reference sequence A (e.g., a subsequence from a reference genome), and mutate each nucleotide in it independently with probability θ to produce a mutated sequence B (corresponding to, e.g., a set of reads or draft assembly of a related genome). Then, syncmer counts alone yield an approximate Gaussian distribution for estimating θ. The assumption disallowing insertions and deletions motivates a check on the lengths of A and B. The syncmer count from B yields an approximate Gaussian distribution for its length, and a p-value can test the length of B against the length of A using syncmer counts alone. Results: The Gaussian distributions permit syncmer counts alone to estimate θ and mutated sequence length with a known sampling error. Under some circumstances, the results provide the sampling error for the Mash containment index when applied to syncmer counts. Conclusions: The approximate Gaussian distributions provide hypothesis tests and confidence intervals for phylogenetic distance and sequence length. Our methods are likely to generalize to sketches other than syncmers and may be useful in assembling reads and related applications.

简介生物信息学通常使用摘要草图来分析新一代测序数据,但大多数草图在统计学上并不十分清楚。在一个简单的突变模型下,Blanca 等人分析了两个密切相关序列的完整草图,即未组装 k-mers 的完整集合。该分析提取了一个点突变参数θ,量化了两个序列之间的进化距离。方法我们将 Blanca 等人对完整草图的研究结果扩展到了参数化同步草图与下采样。与完整草图相比,同步草图对 k-mers 的采样要稀疏得多。考虑以下不允许插入或删除的简单突变模型。考虑一个参考序列 A(例如参考基因组的一个子序列),并以概率 θ 对其中的每个核苷酸进行独立突变,以产生一个突变序列 B(例如对应于一组读数或相关基因组的组装草案)。由于假设不允许插入和删除,因此需要对 A 和 B 的长度进行检验。B 的突变计数可得出其长度的近似高斯分布,通过 p 值可以检验 B 的长度与仅使用突变计数的 A 的长度是否一致。结果:高斯分布允许在已知抽样误差的情况下,仅用同步器计数来估计θ和变异序列长度。在某些情况下,结果提供了应用于同步器计数的马什包含指数的抽样误差。结论近似高斯分布为系统发育距离和序列长度提供了假设检验和置信区间。我们的方法很可能适用于同步器以外的草图,并可能在组装读数和相关应用中有用。
{"title":"The Statistics of Parametrized Syncmers in a Simple Mutation Process Without Spurious Matches.","authors":"John L Spouge, Pijush Das, Ye Chen, Martin Frith","doi":"10.1089/cmb.2024.0508","DOIUrl":"10.1089/cmb.2024.0508","url":null,"abstract":"<p><p><b><i>Introduction:</i></b> Often, bioinformatics uses summary sketches to analyze next-generation sequencing data, but most sketches are not well understood statistically. Under a simple mutation model, Blanca et al. analyzed complete sketches, that is, the complete set of unassembled <i>k</i>-mers, from two closely related sequences. The analysis extracted a point mutation parameter θ quantifying the evolutionary distance between the two sequences. <b><i>Methods:</i></b> We extend the results of Blanca et al. for complete sketches to parametrized syncmer sketches with downsampling. A syncmer sketch can sample <i>k</i>-mers much more sparsely than a complete sketch. Consider the following simple mutation model disallowing insertions or deletions. Consider a reference sequence <i>A</i> (e.g., a subsequence from a reference genome), and mutate each nucleotide in it independently with probability θ to produce a mutated sequence <i>B</i> (corresponding to, e.g., a set of reads or draft assembly of a related genome). Then, syncmer counts alone yield an approximate Gaussian distribution for estimating θ. The assumption disallowing insertions and deletions motivates a check on the lengths of <i>A</i> and <i>B</i>. The syncmer count from <i>B</i> yields an approximate Gaussian distribution for its length, and a <i>p</i>-value can test the length of <i>B</i> against the length of <i>A</i> using syncmer counts alone. <b><i>Results:</i></b> The Gaussian distributions permit syncmer counts alone to estimate θ and mutated sequence length with a known sampling error. Under some circumstances, the results provide the sampling error for the Mash containment index when applied to syncmer counts. <b><i>Conclusions:</i></b> The approximate Gaussian distributions provide hypothesis tests and confidence intervals for phylogenetic distance and sequence length. Our methods are likely to generalize to sketches other than syncmers and may be useful in assembling reads and related applications.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"1195-1210"},"PeriodicalIF":1.4,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11698668/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142621433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stochastic Analysis for the Dual Virus Parallel Transmission Model with Immunity Delay. 带免疫延迟的双病毒平行传播模型的随机分析
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-12-01 Epub Date: 2024-10-18 DOI: 10.1089/cmb.2024.0662
Jing Yang, Shaojuan Ma, Juan Ma, Jinhua Ran, Xinyu Bai

In this article, the qualitative properties of a stochastic dual virus parallel transmission model with immunity delay are analyzed. First, we use Lyapunov theory to study the existence and uniqueness of the global positive solution of the proposed model. Second, the threshold values of the persistence and extinction of two viruses were obtained. Finally, the numerical simulation verifies the theoretical results. The results show that the immunity delay and the intensity of noise have important effects on the two diseases spreading in parallel.

本文分析了具有免疫延迟的随机双病毒并行传播模型的定性特性。首先,我们利用李雅普诺夫理论研究了所提模型全局正解的存在性和唯一性。其次,得到了两种病毒持续和消亡的临界值。最后,数值模拟验证了理论结果。结果表明,免疫延迟和噪声强度对两种疾病的并行传播有重要影响。
{"title":"Stochastic Analysis for the Dual Virus Parallel Transmission Model with Immunity Delay.","authors":"Jing Yang, Shaojuan Ma, Juan Ma, Jinhua Ran, Xinyu Bai","doi":"10.1089/cmb.2024.0662","DOIUrl":"10.1089/cmb.2024.0662","url":null,"abstract":"<p><p>In this article, the qualitative properties of a stochastic dual virus parallel transmission model with immunity delay are analyzed. First, we use Lyapunov theory to study the existence and uniqueness of the global positive solution of the proposed model. Second, the threshold values of the persistence and extinction of two viruses were obtained. Finally, the numerical simulation verifies the theoretical results. The results show that the immunity delay and the intensity of noise have important effects on the two diseases spreading in parallel.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"1291-1304"},"PeriodicalIF":1.4,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142466642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An R Package for Nonparametric Inference on Dynamic Populations with Infinitely Many Types. 无限多类型动态种群的非参数推断 R 软件包
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-12-01 Epub Date: 2024-10-22 DOI: 10.1089/cmb.2024.0600
Filippo Ascolani, Stefano Damato, Matteo Ruggiero

Fleming-Viot diffusions are widely used stochastic models for population dynamics that extend the celebrated Wright-Fisher diffusions. They describe the temporal evolution of the relative frequencies of the allelic types in an ideally infinite panmictic population, whose individuals undergo random genetic drift and at birth can mutate to a new allelic type drawn from a possibly infinite potential pool, independently of their parent. Recently, Bayesian nonparametric inference has been considered for this model when a finite sample of individuals is drawn from the population at several discrete time points. Previous works have fully described the relevant estimators for this problem, but current software is available only for the Wright-Fisher finite-dimensional case. Here, we provide software for the general case, overcoming some nontrivial computational challenges posed by this setting. The R package FVDDPpkg efficiently approximates the filtering and smoothing distribution for Fleming-Viot diffusions, given finite samples of individuals collected at different times. A suitable Monte Carlo approximation is also introduced in order to reduce the computational cost.

弗莱明-维奥特扩散是广泛应用的种群动态随机模型,它是著名的赖特-费舍扩散的延伸。该模型描述了一个理想的无限泛型种群中等位基因类型相对频率的时间演化,该种群中的个体会发生随机遗传漂移,并在出生时突变为一种新的等位基因类型,这种新的等位基因类型可能来自一个无限的潜在资源库,与它们的亲本无关。最近,贝叶斯非参数推断法被考虑用于在多个离散时间点从种群中抽取有限个体样本的模型。以前的研究已经全面描述了这个问题的相关估计方法,但目前的软件只适用于 Wright-Fisher 有限维情况。在此,我们提供了一般情况下的软件,克服了这一设置带来的一些非难计算的挑战。R 软件包 FVDDPpkg 可以高效地近似弗莱明-维奥特扩散的滤波和平滑分布,并给出在不同时间采集的有限个体样本。为了降低计算成本,我们还引入了一种合适的蒙特卡罗近似方法。
{"title":"An R Package for Nonparametric Inference on Dynamic Populations with Infinitely Many Types.","authors":"Filippo Ascolani, Stefano Damato, Matteo Ruggiero","doi":"10.1089/cmb.2024.0600","DOIUrl":"10.1089/cmb.2024.0600","url":null,"abstract":"<p><p>Fleming-Viot diffusions are widely used stochastic models for population dynamics that extend the celebrated Wright-Fisher diffusions. They describe the temporal evolution of the relative frequencies of the allelic types in an ideally infinite panmictic population, whose individuals undergo random genetic drift and at birth can mutate to a new allelic type drawn from a possibly infinite potential pool, independently of their parent. Recently, Bayesian nonparametric inference has been considered for this model when a finite sample of individuals is drawn from the population at several discrete time points. Previous works have fully described the relevant estimators for this problem, but current software is available only for the Wright-Fisher finite-dimensional case. Here, we provide software for the general case, overcoming some nontrivial computational challenges posed by this setting. The R package FVDDPpkg efficiently approximates the filtering and smoothing distribution for Fleming-Viot diffusions, given finite samples of individuals collected at different times. A suitable Monte Carlo approximation is also introduced in order to reduce the computational cost.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"1305-1311"},"PeriodicalIF":1.4,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142466624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Positivity and Boundedness Preserving Numerical Scheme for a Stochastic Multigroup Susceptible-Infected-Recovering Epidemic Model with Age Structure. 具有年龄结构的随机多群体易感-感染-恢复流行病模型的正向性和有界性保留数值方案。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-12-01 Epub Date: 2024-09-27 DOI: 10.1089/cmb.2023.0443
Han Ma, Yanyan Du, Zong Wang, Qimin Zhang

Since the stochastic age-structured multigroup susceptible-infected-recovering (SIR) epidemic model is nonlinear, the solution of this model is hard to be explicitly represented. It is necessary to construct effective numerical methods so as to predict the number of infections. In addition, the stochastic age-structured multigroup SIR model has features of positivity and boundedness of the solution. Therefore, in this article, in order to ensure that the numerical and analytical solutions must have the same properties, by modifying the classical Euler-Maruyama (EM) scheme, we generate a positivity and boundedness preserving EM (PBPEM) method on temporal space for stochastic age-structured multigroup SIR model, which is proved to have a strong convergence to the true solution over finite time intervals. Moreover, by combining the standard finite element method and the PBPEM method, we propose a full-discrete scheme to show the numerical solutions, as well as analyze the error estimations. Finally, the full-discrete scheme is applied to a general stochastic two-group SIR model and the Chlamydia epidemic model, which shows the superiority of the numerical method.

由于随机年龄结构的多群体易感-感染-恢复(SIR)流行病模型是非线性的,因此该模型的解很难明确表示。因此有必要构建有效的数值方法来预测感染数量。此外,随机年龄结构多群体 SIR 模型的解具有正向性和有界性的特点。因此,在本文中,为了确保数值解和分析解必须具有相同的性质,我们通过修改经典的 Euler-Maruyama (EM) 方案,产生了一种时间空间上的正性和有界性保留 EM (PBPEM) 方法,用于随机年龄结构多组 SIR 模型,并证明了该方法在有限时间间隔内对真解具有很强的收敛性。此外,通过结合标准有限元方法和 PBPEM 方法,我们提出了一种全离散方案来显示数值解,并分析了误差估计。最后,将全离散方案应用于一般随机两组 SIR 模型和衣原体流行模型,显示了数值方法的优越性。
{"title":"Positivity and Boundedness Preserving Numerical Scheme for a Stochastic Multigroup Susceptible-Infected-Recovering Epidemic Model with Age Structure.","authors":"Han Ma, Yanyan Du, Zong Wang, Qimin Zhang","doi":"10.1089/cmb.2023.0443","DOIUrl":"10.1089/cmb.2023.0443","url":null,"abstract":"<p><p>Since the stochastic age-structured multigroup susceptible-infected-recovering (SIR) epidemic model is nonlinear, the solution of this model is hard to be explicitly represented. It is necessary to construct effective numerical methods so as to predict the number of infections. In addition, the stochastic age-structured multigroup SIR model has features of positivity and boundedness of the solution. Therefore, in this article, in order to ensure that the numerical and analytical solutions must have the same properties, by modifying the classical Euler-Maruyama (EM) scheme, we generate a positivity and boundedness preserving EM (PBPEM) method on temporal space for stochastic age-structured multigroup SIR model, which is proved to have a strong convergence to the true solution over finite time intervals. Moreover, by combining the standard finite element method and the PBPEM method, we propose a full-discrete scheme to show the numerical solutions, as well as analyze the error estimations. Finally, the full-discrete scheme is applied to a general stochastic two-group SIR model and the Chlamydia epidemic model, which shows the superiority of the numerical method.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"1259-1290"},"PeriodicalIF":1.4,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142347649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Arithmetic Coding-Based Encoding Method Toward High-Density DNA Storage. 基于自适应算术编码的编码方法,迈向高密度 DNA 存储。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-15 DOI: 10.1089/cmb.2024.0697
Yingxin Hu, Yanjun Liu, Yuefei Yang

With the rapid advancement of big data and artificial intelligence technologies, the limitations inherent in traditional storage media for accommodating vast amounts of data have become increasingly evident. DNA storage is an innovative approach harnessing DNA and other biomolecules as storage mediums, endowed with superior characteristics including expansive capacity, remarkable density, minimal energy requirements, and unparalleled longevity. Central to the efficient DNA storage is the process of DNA coding, whereby digital information is converted into sequences of DNA bases. A novel encoding method based on adaptive arithmetic coding (AAC) has been introduced, delineating the encoding process into three distinct phases: compression, error correction, and mapping. Prediction by Partial Matching (PPM)-based AAC in the compression phase serves to compress data and enhance storage density. Subsequently, the error correction phase relies on octal Hamming code to rectify errors and safeguard data integrity. The mapping phase employs a "3-2 code" mapping relationship to ensure adherence to biochemical constraints. The proposed method was verified by encoding different formats of files such as text, pictures, and audio. The results indicated that the average coding density of bases can be up to 3.25 per nucleotide, the GC content (which includes guanine [G] and cytosine [C]) can be stabilized at 50% and the homopolymer length is restricted to no more than 2. Simulation experimental results corroborate the method's efficacy in preserving data integrity during both reading and writing operations, augmenting storage density, and exhibiting robust error correction capabilities.

随着大数据和人工智能技术的快速发展,传统存储介质在容纳海量数据方面的局限性日益明显。DNA 存储是一种利用 DNA 和其他生物大分子作为存储介质的创新方法,具有容量大、密度高、能耗低和寿命长等优越特性。高效 DNA 存储的核心是 DNA 编码过程,即把数字信息转换成 DNA 碱基序列的过程。一种基于自适应算术编码(AAC)的新型编码方法已经问世,它将编码过程划分为三个不同的阶段:压缩、纠错和映射。在压缩阶段,基于部分匹配预测(PPM)的自适应算术编码可压缩数据并提高存储密度。随后,纠错阶段依靠八进制汉明码来纠正错误并保护数据完整性。映射阶段采用 "3-2 码 "映射关系,以确保遵守生化约束。通过对文本、图片和音频等不同格式的文件进行编码,对所提出的方法进行了验证。模拟实验结果表明,该方法能在读写操作中保持数据的完整性,提高存储密度,并具有强大的纠错能力。
{"title":"Adaptive Arithmetic Coding-Based Encoding Method Toward High-Density DNA Storage.","authors":"Yingxin Hu, Yanjun Liu, Yuefei Yang","doi":"10.1089/cmb.2024.0697","DOIUrl":"10.1089/cmb.2024.0697","url":null,"abstract":"<p><p>With the rapid advancement of big data and artificial intelligence technologies, the limitations inherent in traditional storage media for accommodating vast amounts of data have become increasingly evident. DNA storage is an innovative approach harnessing DNA and other biomolecules as storage mediums, endowed with superior characteristics including expansive capacity, remarkable density, minimal energy requirements, and unparalleled longevity. Central to the efficient DNA storage is the process of DNA coding, whereby digital information is converted into sequences of DNA bases. A novel encoding method based on adaptive arithmetic coding (AAC) has been introduced, delineating the encoding process into three distinct phases: compression, error correction, and mapping. Prediction by Partial Matching (PPM)-based AAC in the compression phase serves to compress data and enhance storage density. Subsequently, the error correction phase relies on octal Hamming code to rectify errors and safeguard data integrity. The mapping phase employs a \"3-2 code\" mapping relationship to ensure adherence to biochemical constraints. The proposed method was verified by encoding different formats of files such as text, pictures, and audio. The results indicated that the average coding density of bases can be up to 3.25 per nucleotide, the GC content (which includes guanine [G] and cytosine [C]) can be stabilized at 50% and the homopolymer length is restricted to no more than 2. Simulation experimental results corroborate the method's efficacy in preserving data integrity during both reading and writing operations, augmenting storage density, and exhibiting robust error correction capabilities.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142621431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Policy to Prediction: Assessing Forecasting Accuracy in an Integrated Framework with Machine Learning and Disease Models. 从政策到预测:利用机器学习和疾病模型评估综合框架中的预测准确性。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-01 Epub Date: 2024-08-02 DOI: 10.1089/cmb.2023.0377
Amit K Chakraborty, Hao Wang, Pouria Ramazi

To improve the forecasting accuracy of the spread of infectious diseases, a hybrid model was recently introduced where the commonly assumed constant disease transmission rate was actively estimated from enforced mitigating policy data by a machine learning (ML) model and then fed to an extended susceptible-infected-recovered model to forecast the number of infected cases. Testing only one ML model, that is, gradient boosting model (GBM), the work left open whether other ML models would perform better. Here, we compared GBMs, linear regressions, k-nearest neighbors, and Bayesian networks (BNs) in forecasting the number of COVID-19-infected cases in the United States and Canadian provinces based on policy indices of future 35 days. There was no significant difference in the mean absolute percentage errors of these ML models over the combined dataset [H(3)=3.10,p=0.38]. In two provinces, a significant difference was observed [H(3)=8.77,H(3)=8.07,p<0.05], yet posthoc tests revealed no significant difference in pairwise comparisons. Nevertheless, BNs significantly outperformed the other models in most of the training datasets. The results put forward that the ML models have equal forecasting power overall, and BNs are best for data-fitting applications.

为了提高传染病传播预测的准确性,最近推出了一种混合模型,即通过机器学习(ML)模型从强制减灾政策数据中主动估计通常假定的恒定疾病传播率,然后将其输入扩展的易感-感染-恢复模型,以预测感染病例的数量。这项工作只测试了一种 ML 模型,即梯度提升模型(GBM),其他 ML 模型是否会有更好的表现尚无定论。在此,我们根据未来 35 天的政策指数,比较了 GBM、线性回归、k 最近邻和贝叶斯网络 (BN) 在预测美国和加拿大各省 COVID-19 感染病例数方面的表现。在综合数据集上,这些 ML 模型的平均绝对百分比误差没有明显差异[H(3)=3.10,p=0.38]。在两个省份,观察到了显著差异[H(3)=8.77,H(3)=8.07,p0.05],但事后检验显示配对比较无显著差异。不过,在大多数训练数据集中,BNs 的表现明显优于其他模型。结果表明,ML 模型总体上具有相同的预测能力,而 BN 最适合数据拟合应用。
{"title":"From Policy to Prediction: Assessing Forecasting Accuracy in an Integrated Framework with Machine Learning and Disease Models.","authors":"Amit K Chakraborty, Hao Wang, Pouria Ramazi","doi":"10.1089/cmb.2023.0377","DOIUrl":"10.1089/cmb.2023.0377","url":null,"abstract":"<p><p>To improve the forecasting accuracy of the spread of infectious diseases, a hybrid model was recently introduced where the commonly assumed constant disease transmission rate was actively estimated from enforced mitigating policy data by a machine learning (ML) model and then fed to an extended susceptible-infected-recovered model to forecast the number of infected cases. Testing only one ML model, that is, gradient boosting model (GBM), the work left open whether other ML models would perform better. Here, we compared GBMs, linear regressions, k-nearest neighbors, and Bayesian networks (BNs) in forecasting the number of COVID-19-infected cases in the United States and Canadian provinces based on policy indices of future 35 days. There was no significant difference in the mean absolute percentage errors of these ML models over the combined dataset [<math><mrow><mi>H</mi><mo>(</mo><mn>3</mn><mo>)</mo><mo>=</mo><mn>3.10</mn><mo>,</mo><mi>p</mi><mo>=</mo><mn>0.38</mn></mrow></math>]. In two provinces, a significant difference was observed [<math><mrow><mi>H</mi><mo>(</mo><mn>3</mn><mo>)</mo><mo>=</mo><mn>8.77</mn><mo>,</mo><mi>H</mi><mo>(</mo><mn>3</mn><mo>)</mo><mo>=</mo><mn>8.07</mn><mo>,</mo><mi>p</mi><mo><</mo><mn>0.05</mn></mrow></math>], yet posthoc tests revealed no significant difference in pairwise comparisons. Nevertheless, BNs significantly outperformed the other models in most of the training datasets. The results put forward that the ML models have equal forecasting power overall, and BNs are best for data-fitting applications.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"1104-1117"},"PeriodicalIF":1.4,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141874974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Noise to Knowledge: Diffusion Probabilistic Model-Based Neural Inference of Gene Regulatory Networks. 从噪声到知识:基于扩散概率模型的基因调控网络神经推断。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-01 Epub Date: 2024-10-10 DOI: 10.1089/cmb.2024.0607
Hao Zhu, Donna Slonim

Understanding gene regulatory networks (GRNs) is crucial for elucidating cellular mechanisms and advancing therapeutic interventions. Original methods for GRN inference from bulk expression data often struggled with the high dimensionality and inherent noise in the data. Here we introduce RegDiffusion, a new class of Denoising Diffusion Probabilistic Models focusing on the regulatory effects among feature variables. RegDiffusion introduces Gaussian noise to the input data following a diffusion schedule and uses a neural network with a parameterized adjacency matrix to predict the added noise. We show that using this process, GRNs can be learned effectively with a surprisingly simple model architecture. In our benchmark experiments, RegDiffusion shows superior performance compared to several baseline methods in multiple datasets. We also demonstrate that RegDiffusion can infer biologically meaningful regulatory networks from real-world single-cell data sets with over 15,000 genes in under 5 minutes. This work not only introduces a fresh perspective on GRN inference but also highlights the promising capacity of diffusion-based models in the area of single-cell analysis. The RegDiffusion software package and experiment data are available at https://github.com/TuftsBCB/RegDiffusion.

了解基因调控网络(GRN)对于阐明细胞机制和推进治疗干预至关重要。从大量表达数据中推断基因调控网络的原始方法往往难以应对数据的高维度和固有噪声。在这里,我们引入了 RegDiffusion,这是一类新的去噪扩散概率模型,侧重于特征变量之间的调控效应。RegDiffusion 按照扩散时间表向输入数据引入高斯噪声,并使用带有参数化邻接矩阵的神经网络来预测添加的噪声。我们的研究表明,利用这一过程,GRN 可以通过令人惊讶的简单模型架构进行有效学习。在我们的基准实验中,RegDiffusion 在多个数据集上的表现优于几种基准方法。我们还证明,RegDiffusion 能在 5 分钟内从包含 15000 多个基因的实际单细胞数据集中推断出具有生物学意义的调控网络。这项工作不仅为GRN推断引入了一个全新的视角,而且凸显了基于扩散的模型在单细胞分析领域大有可为的能力。RegDiffusion 软件包和实验数据见 https://github.com/TuftsBCB/RegDiffusion。
{"title":"From Noise to Knowledge: Diffusion Probabilistic Model-Based Neural Inference of Gene Regulatory Networks.","authors":"Hao Zhu, Donna Slonim","doi":"10.1089/cmb.2024.0607","DOIUrl":"10.1089/cmb.2024.0607","url":null,"abstract":"<p><p>Understanding gene regulatory networks (GRNs) is crucial for elucidating cellular mechanisms and advancing therapeutic interventions. Original methods for GRN inference from bulk expression data often struggled with the high dimensionality and inherent noise in the data. Here we introduce RegDiffusion, a new class of Denoising Diffusion Probabilistic Models focusing on the regulatory effects among feature variables. RegDiffusion introduces Gaussian noise to the input data following a diffusion schedule and uses a neural network with a parameterized adjacency matrix to predict the added noise. We show that using this process, GRNs can be learned effectively with a surprisingly simple model architecture. In our benchmark experiments, RegDiffusion shows superior performance compared to several baseline methods in multiple datasets. We also demonstrate that RegDiffusion can infer biologically meaningful regulatory networks from real-world single-cell data sets with over 15,000 genes in under 5 minutes. This work not only introduces a fresh perspective on GRN inference but also highlights the promising capacity of diffusion-based models in the area of single-cell analysis. The RegDiffusion software package and experiment data are available at https://github.com/TuftsBCB/RegDiffusion.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"1087-1103"},"PeriodicalIF":1.4,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11698671/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142466640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network-Constrained Eigen-Single-Cell Profile Estimation for Uncovering Crucial Immunogene Regulatory Systems in Human Bone Marrow. 网络约束特征单细胞轮廓估计法揭示人类骨髓中关键的免疫基因调控系统
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-01 Epub Date: 2024-09-06 DOI: 10.1089/cmb.2024.0539
Heewon Park, Satoru Miyano

We focus on characterizing cell lines from young and aged-healthy and -AML (acute myeloid leukemia) cell lines, and our goal is to identify the key markers associated with the progression of AML. To characterize the age-related phenotypes in AML cell lines, we consider eigenCell analysis that effectively encapsulates the primary expression level patterns across the cell lines. However, earlier investigations utilizing eigenGenes and eigenCells analysis were based on linear combination of all features, leading to the disturbance from noise features. Moreover, the analysis based on a fully dense loading matrix makes it challenging to interpret the results of eigenCells analysis. In order to address these challenges, we develop a novel computational approach termed network-constrained eigenCells profile estimation, which employs a sparse learning strategy. The proposed method estimates eigenCell based on not only the lasso but also network constrained penalization. The use of the network-constrained penalization enables us to simultaneously select neighborhood genes. Furthermore, the hub genes and their regulator/target genes are easily selected as crucial markers for eigenCells estimation. That is, our method can incorporate insights from network biology into the process of sparse loading estimation. Through our methodology, we estimate sparse eigenCells profiles, where only critical markers exhibit expression levels. This allows us to identify the key markers associated with a specific phenotype. Monte Carlo simulations demonstrate the efficacy of our method in reconstructing the sparse structure of eigenCells profiles. We employed our approach to unveil the regulatory system of immunogenes in both young/aged-healthy and -AML cell lines. The markers we have identified for the age-related phenotype in both healthy and AML cell lines have garnered strong support from previous studies. Specifically, our findings, in conjunction with the existing literature, indicate that the activities within this subnetwork of CD79A could be pivotal in elucidating the mechanism driving AML progression, particularly noting the significant role played by the diminished activities in the CD79A subnetwork. We expect that the proposed method will be a useful tool for characterizing disease-related subsets of cell lines, encompassing phenotypes and clones.

我们的研究重点是表征年轻细胞系和老年健康细胞系以及急性髓性白血病(AML)细胞系,我们的目标是找出与急性髓性白血病进展相关的关键标记物。为了描述急性髓细胞白血病细胞系中与年龄相关的表型,我们考虑采用 eigenCell 分析方法,它能有效概括各细胞系的主要表达水平模式。然而,早期利用特征基因和特征细胞分析的研究是基于所有特征的线性组合,从而导致噪声特征的干扰。此外,基于全密集载荷矩阵的分析也给解释特征细胞分析结果带来了挑战。为了应对这些挑战,我们开发了一种新颖的计算方法,称为网络约束特征细胞轮廓估计,它采用了稀疏学习策略。所提出的方法不仅基于套索,还基于网络约束惩罚来估计特征细胞。网络约束惩罚的使用使我们能够同时选择邻近基因。此外,中枢基因及其调控/目标基因也很容易被选中,作为估计特征细胞的关键标记。也就是说,我们的方法可以将网络生物学的见解融入稀疏载荷估计的过程中。通过我们的方法,我们可以估算出稀疏的特征细胞图谱,其中只有关键标记表现出表达水平。这样,我们就能确定与特定表型相关的关键标记物。蒙特卡罗模拟证明了我们的方法在重建稀疏的特征细胞图谱结构方面的功效。我们采用我们的方法揭示了年轻/高龄-健康细胞系和-AML 细胞系中免疫原的调控系统。我们在健康和急性髓细胞白血病细胞系中发现的年龄相关表型标记物得到了以往研究的有力支持。具体来说,我们的研究结果与现有文献相结合,表明 CD79A 子网络内的活动可能在阐明急性髓细胞性白血病进展的驱动机制方面起着关键作用,尤其是 CD79A 子网络内活动减少所起的重要作用。我们希望所提出的方法将成为表征与疾病相关的细胞系亚群(包括表型和克隆)的有用工具。
{"title":"Network-Constrained Eigen-Single-Cell Profile Estimation for Uncovering Crucial Immunogene Regulatory Systems in Human Bone Marrow.","authors":"Heewon Park, Satoru Miyano","doi":"10.1089/cmb.2024.0539","DOIUrl":"10.1089/cmb.2024.0539","url":null,"abstract":"<p><p>We focus on characterizing cell lines from young and aged-healthy and -AML (acute myeloid leukemia) cell lines, and our goal is to identify the key markers associated with the progression of AML. To characterize the age-related phenotypes in AML cell lines, we consider eigenCell analysis that effectively encapsulates the primary expression level patterns across the cell lines. However, earlier investigations utilizing eigenGenes and eigenCells analysis were based on linear combination of all features, leading to the disturbance from noise features. Moreover, the analysis based on a fully dense loading matrix makes it challenging to interpret the results of eigenCells analysis. In order to address these challenges, we develop a novel computational approach termed network-constrained eigenCells profile estimation, which employs a sparse learning strategy. The proposed method estimates eigenCell based on not only the lasso but also network constrained penalization. The use of the network-constrained penalization enables us to simultaneously select neighborhood genes. Furthermore, the hub genes and their regulator/target genes are easily selected as crucial markers for eigenCells estimation. That is, our method can incorporate insights from network biology into the process of sparse loading estimation. Through our methodology, we estimate sparse eigenCells profiles, where only critical markers exhibit expression levels. This allows us to identify the key markers associated with a specific phenotype. Monte Carlo simulations demonstrate the efficacy of our method in reconstructing the sparse structure of eigenCells profiles. We employed our approach to unveil the regulatory system of immunogenes in both young/aged-healthy and -AML cell lines. The markers we have identified for the age-related phenotype in both healthy and AML cell lines have garnered strong support from previous studies. Specifically, our findings, in conjunction with the existing literature, indicate that the activities within this subnetwork of CD79A could be pivotal in elucidating the mechanism driving AML progression, particularly noting the significant role played by the diminished activities in the CD79A subnetwork. We expect that the proposed method will be a useful tool for characterizing disease-related subsets of cell lines, encompassing phenotypes and clones.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"1158-1178"},"PeriodicalIF":1.4,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142140187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Hybrid GNN Approach for Improved Molecular Property Prediction. 改进分子特性预测的混合 GNN 方法。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-01 Epub Date: 2024-07-31 DOI: 10.1089/cmb.2023.0452
Pedro Quesado, Luis H M Torres, Bernardete Ribeiro, Joel P Arrais

The development of new drugs is a vital effort that has the potential to improve human health, well-being and life expectancy. Molecular property prediction is a crucial step in drug discovery, as it helps to identify potential therapeutic compounds. However, experimental methods for drug development can often be time-consuming and resource-intensive, with a low probability of success. To address such limitations, deep learning (DL) methods have emerged as a viable alternative due to their ability to identify high-discriminating patterns in molecular data. In particular, graph neural networks (GNNs) operate on graph-structured data to identify promising drug candidates with desirable molecular properties. These methods represent molecules as a set of node (atoms) and edge (chemical bonds) features to aggregate local information for molecular graph representation learning. Despite the availability of several GNN frameworks, each approach has its own shortcomings. Although, some GNNs may excel in certain tasks, they may not perform as well in others. In this work, we propose a hybrid approach that incorporates different graph-based methods to combine their strengths and mitigate their limitations to accurately predict molecular properties. The proposed approach consists in a multi-layered hybrid GNN architecture that integrates multiple GNN frameworks to compute graph embeddings for molecular property prediction. Furthermore, we conduct extensive experiments on multiple benchmark datasets to demonstrate that our hybrid approach significantly outperforms the state-of-the-art graph-based models. The data and code scripts to reproduce the results are available in the repository, https://github.com/pedro-quesado/HybridGNN.

新药研发是一项至关重要的工作,有可能改善人类的健康、福祉和预期寿命。分子特性预测是药物发现的关键步骤,因为它有助于确定潜在的治疗化合物。然而,药物开发的实验方法往往耗费时间和资源,而且成功概率较低。为了解决这些局限性,深度学习(DL)方法因其能够识别分子数据中的高区分度模式而成为一种可行的替代方法。特别是,图神经网络(GNN)可在图结构数据上运行,以识别具有理想分子特性的候选药物。这些方法将分子表示为一组节点(原子)和边缘(化学键)特征,以聚合用于分子图表示学习的局部信息。尽管有多种 GNN 框架,但每种方法都有其自身的缺点。虽然某些 GNN 在某些任务中表现出色,但在其他任务中可能表现不佳。在这项工作中,我们提出了一种混合方法,该方法结合了不同的基于图的方法,综合了它们的优势,并减少了它们在准确预测分子特性方面的局限性。所提出的方法包括一个多层混合 GNN 架构,该架构集成了多个 GNN 框架,用于计算分子性质预测的图嵌入。此外,我们还在多个基准数据集上进行了大量实验,证明我们的混合方法明显优于最先进的基于图的模型。用于重现结果的数据和代码脚本可在 https://github.com/pedro-quesado/HybridGNN 存储库中获取。
{"title":"A Hybrid GNN Approach for Improved Molecular Property Prediction.","authors":"Pedro Quesado, Luis H M Torres, Bernardete Ribeiro, Joel P Arrais","doi":"10.1089/cmb.2023.0452","DOIUrl":"10.1089/cmb.2023.0452","url":null,"abstract":"<p><p>The development of new drugs is a vital effort that has the potential to improve human health, well-being and life expectancy. Molecular property prediction is a crucial step in drug discovery, as it helps to identify potential therapeutic compounds. However, experimental methods for drug development can often be time-consuming and resource-intensive, with a low probability of success. To address such limitations, deep learning (DL) methods have emerged as a viable alternative due to their ability to identify high-discriminating patterns in molecular data. In particular, graph neural networks (GNNs) operate on graph-structured data to identify promising drug candidates with desirable molecular properties. These methods represent molecules as a set of node (atoms) and edge (chemical bonds) features to aggregate local information for molecular graph representation learning. Despite the availability of several GNN frameworks, each approach has its own shortcomings. Although, some GNNs may excel in certain tasks, they may not perform as well in others. In this work, we propose a hybrid approach that incorporates different graph-based methods to combine their strengths and mitigate their limitations to accurately predict molecular properties. The proposed approach consists in a multi-layered hybrid GNN architecture that integrates multiple GNN frameworks to compute graph embeddings for molecular property prediction. Furthermore, we conduct extensive experiments on multiple benchmark datasets to demonstrate that our hybrid approach significantly outperforms the state-of-the-art graph-based models. The data and code scripts to reproduce the results are available in the repository, https://github.com/pedro-quesado/HybridGNN.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"1146-1157"},"PeriodicalIF":1.4,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141855704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative AI Models for the Protein Scaffold Filling Problem. 蛋白质支架填充问题的人工智能生成模型。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-10-23 DOI: 10.1089/cmb.2024.0510
Letu Qingge, Kushal Badal, Richard Annan, Jordan Sturtz, Xiaowen Liu, Binhai Zhu

De novo protein sequencing is an important problem in proteomics, playing a crucial role in understanding protein functions, drug discovery, design and evolutionary studies, etc. Top-down and bottom-up tandem mass spectrometry are popular approaches used in the field of mass spectrometry to analyze and sequence proteins. However, these approaches often produce incomplete protein sequences with gaps, namely scaffolds. The protein scaffold filling problem refers to filling the missing amino acids in the gaps of a scaffold to infer the complete protein sequence. In this article, we tackle the protein scaffold filling problem based on generative AI techniques, such as convolutional denoising autoencoder, transformer, and generative pretrained transformer (GPT) models, to complete the protein sequences and compare our results with recently developed convolutional long short-term memory-based sequence model. We evaluate the model performance both on a real dataset and generated datasets. All proposed models show outstanding prediction accuracy. Notably, the GPT-2 model achieves 100% gap-filling accuracy and 100% full sequence accuracy on the MabCampth protein scaffold, which outperforms the other models.

全新蛋白质测序是蛋白质组学中的一个重要问题,在了解蛋白质功能、药物发现、设计和进化研究等方面发挥着至关重要的作用。自上而下和自下而上的串联质谱法是质谱分析和蛋白质测序领域常用的方法。然而,这些方法往往会产生不完整的蛋白质序列,其中存在缺口,即 "支架"。蛋白质支架填充问题是指填补支架间隙中缺失的氨基酸,从而推断出完整的蛋白质序列。本文基于生成式人工智能技术,如卷积去噪自动编码器、变换器和生成式预训练变换器(GPT)模型,来解决蛋白质支架填充问题,以完成蛋白质序列,并将我们的结果与最近开发的基于卷积长短期记忆的序列模型进行比较。我们在真实数据集和生成数据集上对模型性能进行了评估。所有提出的模型都显示出了出色的预测准确性。值得注意的是,GPT-2 模型在 MabCampth 蛋白支架上实现了 100% 的缺口填补准确率和 100% 的全序列准确率,优于其他模型。
{"title":"Generative AI Models for the Protein Scaffold Filling Problem.","authors":"Letu Qingge, Kushal Badal, Richard Annan, Jordan Sturtz, Xiaowen Liu, Binhai Zhu","doi":"10.1089/cmb.2024.0510","DOIUrl":"https://doi.org/10.1089/cmb.2024.0510","url":null,"abstract":"<p><p>De novo protein sequencing is an important problem in proteomics, playing a crucial role in understanding protein functions, drug discovery, design and evolutionary studies, etc. Top-down and bottom-up tandem mass spectrometry are popular approaches used in the field of mass spectrometry to analyze and sequence proteins. However, these approaches often produce incomplete protein sequences with gaps, namely scaffolds. The protein scaffold filling problem refers to filling the missing amino acids in the gaps of a scaffold to infer the complete protein sequence. In this article, we tackle the protein scaffold filling problem based on generative AI techniques, such as convolutional denoising autoencoder, transformer, and generative pretrained transformer (GPT) models, to complete the protein sequences and compare our results with recently developed convolutional long short-term memory-based sequence model. We evaluate the model performance both on a real dataset and generated datasets. All proposed models show outstanding prediction accuracy. Notably, the GPT-2 model achieves 100% gap-filling accuracy and 100% full sequence accuracy on the MabCampth protein scaffold, which outperforms the other models.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142501311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Computational Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1