首页 > 最新文献

Bioinformatics advances最新文献

英文 中文
Changes in total charge on spike protein of SARS-CoV-2 in emerging lineages. SARS-CoV-2 新血统尖峰蛋白总电荷的变化。
Pub Date : 2024-04-08 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae053
Anže Božič, Rudolf Podgornik

Motivation: Charged amino acid residues on the spike protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) have been shown to influence its binding to different cell surface receptors, its non-specific electrostatic interactions with the environment, and its structural stability and conformation. It is therefore important to obtain a good understanding of amino acid mutations that affect the total charge on the spike protein which have arisen across different SARS-CoV-2 lineages during the course of the virus' evolution.

Results: We analyse the change in the number of ionizable amino acids and the corresponding total charge on the spike proteins of almost 2200 SARS-CoV-2 lineages that have emerged over the span of the pandemic. Our results show that the previously observed trend toward an increase in the positive charge on the spike protein of SARS-CoV-2 variants of concern has essentially stopped with the emergence of the early omicron variants. Furthermore, recently emerged lineages show a greater diversity in terms of their composition of ionizable amino acids. We also demonstrate that the patterns of change in the number of ionizable amino acids on the spike protein are characteristic of related lineages within the broader clade division of the SARS-CoV-2 phylogenetic tree. Due to the ubiquity of electrostatic interactions in the biological environment, our findings are relevant for a broad range of studies dealing with the structural stability of SARS-CoV-2 and its interactions with the environment.

Availability and implementation: The data underlying the article are available in the Supplementary material.

研究动机严重急性呼吸系统综合征冠状病毒2(SARS-CoV-2)尖峰蛋白上的带电荷氨基酸残基已被证明会影响其与不同细胞表面受体的结合、与环境的非特异性静电相互作用以及其结构稳定性和构象。因此,充分了解在病毒进化过程中不同 SARS-CoV-2 世系中出现的影响尖峰蛋白总电荷的氨基酸突变非常重要:我们分析了病毒大流行期间出现的近 2200 个 SARS-CoV-2 株系中可电离氨基酸数量的变化以及尖峰蛋白上相应的总电荷。我们的研究结果表明,以前观察到的 SARS-CoV-2 变异株尖峰蛋白正电荷增加的趋势随着早期奥米克龙变异株的出现而基本停止。此外,最近出现的变种在可离子化氨基酸的组成方面表现出更大的多样性。我们还证明,尖峰蛋白上可电离氨基酸数量的变化规律是 SARS-CoV-2 系统发生树更广泛支系划分中相关支系的特征。由于静电相互作用在生物环境中无处不在,我们的发现对涉及 SARS-CoV-2 结构稳定性及其与环境相互作用的广泛研究具有重要意义:文章的基础数据见补充材料。
{"title":"Changes in total charge on spike protein of SARS-CoV-2 in emerging lineages.","authors":"Anže Božič, Rudolf Podgornik","doi":"10.1093/bioadv/vbae053","DOIUrl":"https://doi.org/10.1093/bioadv/vbae053","url":null,"abstract":"<p><strong>Motivation: </strong>Charged amino acid residues on the spike protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) have been shown to influence its binding to different cell surface receptors, its non-specific electrostatic interactions with the environment, and its structural stability and conformation. It is therefore important to obtain a good understanding of amino acid mutations that affect the total charge on the spike protein which have arisen across different SARS-CoV-2 lineages during the course of the virus' evolution.</p><p><strong>Results: </strong>We analyse the change in the number of ionizable amino acids and the corresponding total charge on the spike proteins of almost 2200 SARS-CoV-2 lineages that have emerged over the span of the pandemic. Our results show that the previously observed trend toward an increase in the positive charge on the spike protein of SARS-CoV-2 variants of concern has essentially stopped with the emergence of the early omicron variants. Furthermore, recently emerged lineages show a greater diversity in terms of their composition of ionizable amino acids. We also demonstrate that the patterns of change in the number of ionizable amino acids on the spike protein are characteristic of related lineages within the broader clade division of the SARS-CoV-2 phylogenetic tree. Due to the ubiquity of electrostatic interactions in the biological environment, our findings are relevant for a broad range of studies dealing with the structural stability of SARS-CoV-2 and its interactions with the environment.</p><p><strong>Availability and implementation: </strong>The data underlying the article are available in the Supplementary material.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11031363/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140874999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
WAS IT A MATch I SAW? Approximate palindromes lead to overstated false match rates in benchmarks using reversed sequences. 我看到的是匹配吗?在使用反向序列的基准测试中,近似回文导致高估的错误匹配率。
Pub Date : 2024-04-08 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae052
George Glidden-Handgis, Travis J Wheeler

Background: Software for labeling biological sequences typically produces a theory-based statistic for each match (the E-value) that indicates the likelihood of seeing that match's score by chance. E-values accurately predict false match rate for comparisons of random (shuffled) sequences, and thus provide a reasoned mechanism for setting score thresholds that enable high sensitivity with low expected false match rate. This threshold-setting strategy is challenged by real biological sequences, which contain regions of local repetition and low sequence complexity that cause excess matches between non-homologous sequences. Knowing this, tool developers often develop benchmarks that use realistic-seeming decoy sequences to explore empirical tradeoffs between sensitivity and false match rate. A recent trend has been to employ reversed biological sequences as realistic decoys, because these preserve the distribution of letters and the existence of local repeats, while disrupting the original sequence's functional properties. However, we and others have observed that sequences appear to produce high scoring alignments to their reversals with surprising frequency, leading to overstatement of false match risk that may negatively affect downstream analysis.

Results: We demonstrate that an alignment between a sequence S and its (possibly mutated) reversal tends to produce higher scores than alignment between truly unrelated sequences, even when S is a shuffled string with no notable repetitive or low-complexity regions. This phenomenon is due to the unintuitive fact that (even randomly shuffled) sequences contain palindromes that are on average longer than the longest common substrings (LCS) shared between permuted variants of the same sequence. Though the expected palindrome length is only slightly larger than the expected LCS, the distribution of alignment scores involving reversed sequences is strongly right-shifted, leading to greatly increased frequency of high-scoring alignments to reversed sequences.

Impact: Overestimates of false match risk can motivate unnecessarily high score thresholds, leading to potentially reduced true match sensitivity. Also, when tool sensitivity is only reported up to the score of the first matched decoy sequence, a large decoy set consisting of reversed sequences can obscure sensitivity differences between tools. As a result of these observations, we advise that reversed biological sequences be used as decoys only when care is taken to remove positive matches in the original (un-reversed) sequences, or when overstatement of false labeling is not a concern. Though the primary focus of the analysis is on sequence annotation, we also demonstrate that the prevalence of internal palindromes may lead to an overstatement of the rate of false labels in protein identification with mass spectrometry.

背景:用于标记生物序列的软件通常会为每个匹配序列生成一个基于理论的统计量(E 值),该统计量表示偶然看到该匹配序列得分的可能性。E 值可以准确预测随机(洗牌)序列比较的错误匹配率,从而为设置得分阈值提供了合理的机制,使其能够以较低的预期错误匹配率获得较高的灵敏度。这种阈值设置策略受到了真实生物序列的挑战,因为真实生物序列包含局部重复和低序列复杂性区域,这些区域会导致非同源序列之间的过度匹配。了解到这一点后,工具开发人员通常会开发一些基准,使用看似真实的诱饵序列来探索灵敏度和错误匹配率之间的经验权衡。最近的一个趋势是使用反向生物序列作为现实诱饵,因为这些序列保留了字母的分布和局部重复的存在,同时破坏了原始序列的功能特性。然而,我们和其他人观察到,序列似乎以惊人的频率与其反向序列产生高分比对,导致虚假匹配风险被夸大,可能对下游分析产生负面影响:我们证明,序列 S 与其(可能变异的)反向序列之间的比对往往比真正不相关的序列之间的比对产生更高的得分,即使 S 是一个没有明显重复或低复杂性区域的洗牌字符串。这种现象是由于一个不直观的事实,即(即使是随机洗牌的)序列包含的回文平均长度比同一序列的排列变体之间共享的最长公共子串(LCS)要长。虽然预期的回文长度只比预期的最长公共子串稍大,但涉及反转序列的配准得分分布却强烈右移,导致反转序列的高分配准频率大大增加:高估错误匹配风险会导致不必要的高分阈值,从而可能降低真正的匹配灵敏度。此外,当工具灵敏度只报告到第一个匹配诱饵序列的得分时,由反向序列组成的大型诱饵集可能会掩盖工具之间的灵敏度差异。根据上述观察结果,我们建议只有在注意去除原始(未反转)序列中的阳性匹配,或不担心虚假标记的夸大时,才使用反转生物序列作为诱饵。虽然分析的主要重点是序列注释,但我们也证明了内部回文的普遍存在可能会导致质谱法蛋白质鉴定中错误标记率的夸大。
{"title":"WAS IT A MATch I SAW? Approximate palindromes lead to overstated false match rates in benchmarks using reversed sequences.","authors":"George Glidden-Handgis, Travis J Wheeler","doi":"10.1093/bioadv/vbae052","DOIUrl":"10.1093/bioadv/vbae052","url":null,"abstract":"<p><strong>Background: </strong>Software for labeling biological sequences typically produces a theory-based statistic for each match (the E-value) that indicates the likelihood of seeing that match's score by chance. E-values accurately predict false match rate for comparisons of random (shuffled) sequences, and thus provide a reasoned mechanism for setting score thresholds that enable high sensitivity with low expected false match rate. This threshold-setting strategy is challenged by real biological sequences, which contain regions of local repetition and low sequence complexity that cause excess matches between non-homologous sequences. Knowing this, tool developers often develop benchmarks that use realistic-seeming decoy sequences to explore empirical tradeoffs between sensitivity and false match rate. A recent trend has been to employ reversed biological sequences as realistic decoys, because these preserve the distribution of letters and the existence of local repeats, while disrupting the original sequence's functional properties. However, we and others have observed that sequences appear to produce high scoring alignments to their reversals with surprising frequency, leading to overstatement of false match risk that may negatively affect downstream analysis.</p><p><strong>Results: </strong>We demonstrate that an alignment between a sequence S and its (possibly mutated) reversal tends to produce higher scores than alignment between truly unrelated sequences, even when S is a shuffled string with no notable repetitive or low-complexity regions. This phenomenon is due to the unintuitive fact that (even randomly shuffled) sequences contain palindromes that are on average longer than the longest common substrings (LCS) shared between permuted variants of the same sequence. Though the expected palindrome length is only slightly larger than the expected LCS, the distribution of alignment scores involving reversed sequences is strongly right-shifted, leading to greatly increased frequency of high-scoring alignments to reversed sequences.</p><p><strong>Impact: </strong>Overestimates of false match risk can motivate unnecessarily high score thresholds, leading to potentially reduced true match sensitivity. Also, when tool sensitivity is only reported up to the score of the first matched decoy sequence, a large decoy set consisting of reversed sequences can obscure sensitivity differences between tools. As a result of these observations, we advise that reversed biological sequences be used as decoys only when care is taken to remove positive matches in the original (un-reversed) sequences, or when overstatement of false labeling is not a concern. Though the primary focus of the analysis is on sequence annotation, we also demonstrate that the prevalence of internal palindromes may lead to an overstatement of the rate of false labels in protein identification with mass spectrometry.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11099658/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141066149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Single-cell type annotation with deep learning in 265 cell types for humans. 利用深度学习对人类 265 种细胞类型进行单细胞类型注释。
Pub Date : 2024-04-08 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae054
Sherry Dong, Kaiwen Deng, Xiuzhen Huang

Motivation: Annotating cell types is a challenging yet essential task in analyzing single-cell RNA sequencing data. However, due to the lack of a gold standard, it is difficult to evaluate the algorithms fairly and an overfitting algorithm may be favored in benchmarks. To address this challenge, we developed a deep learning-based single-cell type prediction tool that assigns the cell type to 265 different cell types for humans, based on data from approximately five million cells.

Results: We achieved a median area under the ROC curve (AUC) of 0.93 when evaluated across datasets. We found that inconsistent labeling in the existing database generated by different labs contributed to the mistakes of the model. Therefore, we used cell ontology to correct the annotations and retrained the model, which resulted in 0.971 median AUC. Our study reveals a limiting factor of the accuracy one may achieve with the current database annotation and points to the solutions towards an algorithm-based correction of the gold standard for future automated cell annotation approaches.

Availability and implementation: The code is available at: https://github.com/SherrySDong/Hierarchical-Correction-Improves-Automated-Single-cell-Type-Annotation. Data used in this study are listed in Supplementary Table S1 and are retrievable at the CZI database.

动机在分析单细胞 RNA 测序数据时,标注细胞类型是一项具有挑战性但又必不可少的任务。然而,由于缺乏黄金标准,很难对算法进行公平的评估,而且在基准测试中,过拟合算法可能会受到青睐。为了应对这一挑战,我们开发了一种基于深度学习的单细胞类型预测工具,根据来自约 500 万个细胞的数据,为人类的 265 种不同细胞类型分配细胞类型:在跨数据集评估时,我们的 ROC 曲线下面积(AUC)中位数达到了 0.93。我们发现,不同实验室生成的现有数据库中不一致的标记导致了模型的错误。因此,我们使用细胞本体来校正注释并重新训练模型,结果 AUC 中位数达到了 0.971。我们的研究揭示了当前数据库注释所能达到的精确度的限制因素,并为未来的自动细胞注释方法指出了基于算法的金标准校正的解决方案:代码见:https://github.com/SherrySDong/Hierarchical-Correction-Improves-Automated-Single-cell-Type-Annotation。本研究使用的数据列于补充表 S1,可在 CZI 数据库中检索。
{"title":"Single-cell type annotation with deep learning in 265 cell types for humans.","authors":"Sherry Dong, Kaiwen Deng, Xiuzhen Huang","doi":"10.1093/bioadv/vbae054","DOIUrl":"https://doi.org/10.1093/bioadv/vbae054","url":null,"abstract":"<p><strong>Motivation: </strong>Annotating cell types is a challenging yet essential task in analyzing single-cell RNA sequencing data. However, due to the lack of a gold standard, it is difficult to evaluate the algorithms fairly and an overfitting algorithm may be favored in benchmarks. To address this challenge, we developed a deep learning-based single-cell type prediction tool that assigns the cell type to 265 different cell types for humans, based on data from approximately five million cells.</p><p><strong>Results: </strong>We achieved a median area under the ROC curve (AUC) of 0.93 when evaluated across datasets. We found that inconsistent labeling in the existing database generated by different labs contributed to the mistakes of the model. Therefore, we used cell ontology to correct the annotations and retrained the model, which resulted in 0.971 median AUC. Our study reveals a limiting factor of the accuracy one may achieve with the current database annotation and points to the solutions towards an algorithm-based correction of the gold standard for future automated cell annotation approaches.</p><p><strong>Availability and implementation: </strong>The code is available at: https://github.com/SherrySDong/Hierarchical-Correction-Improves-Automated-Single-cell-Type-Annotation. Data used in this study are listed in Supplementary Table S1 and are retrievable at the CZI database.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11031354/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140869341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extending BioMASS to construct mathematical models from external knowledge 扩展 BioMASS,利用外部知识构建数学模型
Pub Date : 2024-04-04 DOI: 10.1093/bioadv/vbae042
Kiwamu Arakane, Hiroaki Imoto, Fabian Ormersbach, Mariko Okada
Abstract Motivation Mechanistic modeling based on ordinary differential equations has led to numerous findings in systems biology by integrating prior knowledge and experimental data. However, the manual curation of knowledge necessary when constructing models poses a bottleneck. As the speed of knowledge accumulation continues to grow, there is a demand for a scalable means of constructing executable models. Results We previously introduced BioMASS—an open-source, Python-based framework–to construct, simulate, and analyze mechanistic models of signaling networks. With one of its features, Text2Model, BioMASS allows users to define models in a natural language-like format, thereby facilitating the construction of large-scale models. We demonstrate that Text2Model can serve as a tool for integrating external knowledge for mathematical modeling by generating Text2Model files from a pathway database or through the use of a large language model, and simulating its dynamics through BioMASS. Our findings reveal the tool's capabilities to encourage exploration from prior knowledge and pave the way for a fully data-driven approach to constructing mathematical models. Availability and implementation The code and documentation for BioMASS are available at https://github.com/biomass-dev/biomass and https://biomass-core.readthedocs.io, respectively. The code used in this article are available at https://github.com/okadalabipr/text2model-from-knowledge.
摘要 基于常微分方程的机理建模通过整合先验知识和实验数据,在系统生物学领域取得了众多发现。然而,在构建模型时必须对知识进行人工整理,这是一个瓶颈。随着知识积累速度的不断加快,人们需要一种可扩展的方法来构建可执行模型。结果 我们之前介绍了 BioMASS--一个基于 Python 的开源框架,用于构建、模拟和分析信号网络的机理模型。BioMASS 的功能之一是 Text2Model,它允许用户以类似自然语言的格式定义模型,从而促进了大规模模型的构建。通过从通路数据库或使用大型语言模型生成 Text2Model 文件,并通过 BioMASS 模拟其动态,我们证明了 Text2Model 可以作为整合外部知识以建立数学模型的工具。我们的研究结果揭示了该工具鼓励从已有知识中进行探索的能力,并为采用完全数据驱动的方法构建数学模型铺平了道路。可用性和实施 BioMASS 的代码和文档可分别在 https://github.com/biomass-dev/biomass 和 https://biomass-core.readthedocs.io 上获取。本文使用的代码可在 https://github.com/okadalabipr/text2model-from-knowledge 上获取。
{"title":"Extending BioMASS to construct mathematical models from external knowledge","authors":"Kiwamu Arakane, Hiroaki Imoto, Fabian Ormersbach, Mariko Okada","doi":"10.1093/bioadv/vbae042","DOIUrl":"https://doi.org/10.1093/bioadv/vbae042","url":null,"abstract":"Abstract Motivation Mechanistic modeling based on ordinary differential equations has led to numerous findings in systems biology by integrating prior knowledge and experimental data. However, the manual curation of knowledge necessary when constructing models poses a bottleneck. As the speed of knowledge accumulation continues to grow, there is a demand for a scalable means of constructing executable models. Results We previously introduced BioMASS—an open-source, Python-based framework–to construct, simulate, and analyze mechanistic models of signaling networks. With one of its features, Text2Model, BioMASS allows users to define models in a natural language-like format, thereby facilitating the construction of large-scale models. We demonstrate that Text2Model can serve as a tool for integrating external knowledge for mathematical modeling by generating Text2Model files from a pathway database or through the use of a large language model, and simulating its dynamics through BioMASS. Our findings reveal the tool's capabilities to encourage exploration from prior knowledge and pave the way for a fully data-driven approach to constructing mathematical models. Availability and implementation The code and documentation for BioMASS are available at https://github.com/biomass-dev/biomass and https://biomass-core.readthedocs.io, respectively. The code used in this article are available at https://github.com/okadalabipr/text2model-from-knowledge.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140744108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adjusting for covariates and assessing modeling fitness in machine learning using MUVR2 使用 MUVR2 在机器学习中调整协变量和评估建模适配性
Pub Date : 2024-04-04 DOI: 10.1093/bioadv/vbae051
Yingxiao Yan, T. Schillemans, Viktor Skantze, Carl Brunius
Abstract Motivation Machine learning (ML) methods are frequently used in Omics research to examine associations between molecular data and for example exposures and health conditions. ML is also used for feature selection to facilitate biological interpretation. Our previous MUVR algorithm was shown to generate predictions and variable selections at state-of-the-art performance. However, a general framework for assessing modeling fitness is still lacking. In addition, enabling to adjust for covariates is a highly desired, but largely lacking trait in ML. We aimed to address these issues in the new MUVR2 framework. Results The MUVR2 algorithm was developed to include the regularized regression framework elastic net in addition to partial least squares and random forest modeling. Compared with other cross-validation strategies, MUVR2 consistently showed state-of-the-art performance, including variable selection, while minimizing overfitting. Testing on simulated and real-world data, we also showed that MUVR2 allows for the adjustment for covariates using elastic net modeling, but not using partial least squares or random forest. Availability and implementation Algorithms, data, scripts, and a tutorial are open source under GPL-3 license and available in the MUVR2 R package at https://github.com/MetaboComp/MUVR2.
摘要 动机 机器学习(ML)方法经常被用于 Omics 研究,以检查分子数据与暴露和健康状况等之间的关联。机器学习还用于特征选择,以促进生物学解释。我们之前的 MUVR 算法已证明能以最先进的性能生成预测和变量选择。然而,目前仍缺乏评估建模适配性的通用框架。此外,能够根据协变量进行调整也是 ML 非常需要但又非常缺乏的特性。我们的目标是在新的 MUVR2 框架中解决这些问题。结果 除了偏最小二乘法和随机森林建模外,MUVR2 算法还包括正则回归框架 elastic net。与其他交叉验证策略相比,MUVR2 始终显示出最先进的性能,包括变量选择,同时最大限度地减少了过拟合。在模拟数据和实际数据的测试中,我们还发现 MUVR2 可以使用弹性网建模调整协变量,而不能使用偏最小二乘法或随机森林建模。可用性与实现 算法、数据、脚本和教程在 GPL-3 许可下开源,可在 https://github.com/MetaboComp/MUVR2 的 MUVR2 R 软件包中获取。
{"title":"Adjusting for covariates and assessing modeling fitness in machine learning using MUVR2","authors":"Yingxiao Yan, T. Schillemans, Viktor Skantze, Carl Brunius","doi":"10.1093/bioadv/vbae051","DOIUrl":"https://doi.org/10.1093/bioadv/vbae051","url":null,"abstract":"Abstract Motivation Machine learning (ML) methods are frequently used in Omics research to examine associations between molecular data and for example exposures and health conditions. ML is also used for feature selection to facilitate biological interpretation. Our previous MUVR algorithm was shown to generate predictions and variable selections at state-of-the-art performance. However, a general framework for assessing modeling fitness is still lacking. In addition, enabling to adjust for covariates is a highly desired, but largely lacking trait in ML. We aimed to address these issues in the new MUVR2 framework. Results The MUVR2 algorithm was developed to include the regularized regression framework elastic net in addition to partial least squares and random forest modeling. Compared with other cross-validation strategies, MUVR2 consistently showed state-of-the-art performance, including variable selection, while minimizing overfitting. Testing on simulated and real-world data, we also showed that MUVR2 allows for the adjustment for covariates using elastic net modeling, but not using partial least squares or random forest. Availability and implementation Algorithms, data, scripts, and a tutorial are open source under GPL-3 license and available in the MUVR2 R package at https://github.com/MetaboComp/MUVR2.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140745516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adapting beyond borders: Insights from the 19th Student Council Symposium (SCS2023), the first hybrid ISCB Student Council global event 超越国界的适应:第 19 届学生会专题讨论会(SCS2023)的启示--国际学生会理事会学生会的首次全球混合活动
Pub Date : 2024-04-03 DOI: 10.1093/bioadv/vbae028
S. M. Al Sium, Estefania Torrejón, S. Chowdhury, Rubaiat Ahmed, Aakriti Jain, Mirko Treccani, Laura Veschetti, Arsalan Riaz, Pradeep Eranti, Gabriel J Olguín-Orellana
Abstract Summary The 19th ISCB Student Council Symposium (SCS2023) organized by ISCB-SC adopted a hybrid format for the first time, allowing participants to engage in-person in Lyon, France, and virtually via an interactive online platform. The symposium prioritized inclusivity, featuring on-site sessions, poster presentations, and social activities for in-person attendees, while virtual participants accessed live sessions, interactive Q&A, and a virtual exhibit hall. Attendee statistics revealed a global reach, with Europe as the major contributor. SCS2023’s success in bridging in-person and virtual experiences sets a precedent for future events in Computational Biology and Bioinformatics. Availability and Implementation The details of the symposium, speaker information, schedules, and accepted abstracts, are available in the program booklet (https://doi.org/10.5281/zenodo.8173977). For organizers interested in adopting a similar hybrid model, it would be beneficial to have access to details regarding the online platform used, the types of sessions offered, and the challenges faced. Future iterations of SCS can address these aspects to further enhance accessibility and inclusivity.
摘要 ISCB-SC组织的第19届ISCB学生会研讨会(SCS2023)首次采用了混合形式,与会者可以在法国里昂亲身参与,也可以通过互动在线平台进行虚拟参与。研讨会将包容性放在首位,为现场与会者提供了现场会议、海报展示和社交活动,而虚拟与会者则可以参加现场会议、互动问答和虚拟展厅。与会者统计数据显示,与会者遍布全球,其中欧洲是主要贡献者。SCS2023 成功地将现场和虚拟体验结合起来,为今后的计算生物学和生物信息学活动开创了先例。可用性和实施 研讨会的详细信息、演讲者信息、日程安排和录用摘要可在程序手册(https://doi.org/10.5281/zenodo.8173977)中查阅。对于有意采用类似混合模式的组织者来说,了解所使用的在线平台、提供的会议类型以及面临的挑战等方面的详细信息将大有裨益。未来的 "科学委员会 "迭代可以解决这些方面的问题,以进一步提高可访问性和包容性。
{"title":"Adapting beyond borders: Insights from the 19th Student Council Symposium (SCS2023), the first hybrid ISCB Student Council global event","authors":"S. M. Al Sium, Estefania Torrejón, S. Chowdhury, Rubaiat Ahmed, Aakriti Jain, Mirko Treccani, Laura Veschetti, Arsalan Riaz, Pradeep Eranti, Gabriel J Olguín-Orellana","doi":"10.1093/bioadv/vbae028","DOIUrl":"https://doi.org/10.1093/bioadv/vbae028","url":null,"abstract":"Abstract Summary The 19th ISCB Student Council Symposium (SCS2023) organized by ISCB-SC adopted a hybrid format for the first time, allowing participants to engage in-person in Lyon, France, and virtually via an interactive online platform. The symposium prioritized inclusivity, featuring on-site sessions, poster presentations, and social activities for in-person attendees, while virtual participants accessed live sessions, interactive Q&A, and a virtual exhibit hall. Attendee statistics revealed a global reach, with Europe as the major contributor. SCS2023’s success in bridging in-person and virtual experiences sets a precedent for future events in Computational Biology and Bioinformatics. Availability and Implementation The details of the symposium, speaker information, schedules, and accepted abstracts, are available in the program booklet (https://doi.org/10.5281/zenodo.8173977). For organizers interested in adopting a similar hybrid model, it would be beneficial to have access to details regarding the online platform used, the types of sessions offered, and the challenges faced. Future iterations of SCS can address these aspects to further enhance accessibility and inclusivity.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140750855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ExplaineR: an R package to explain machine learning models. ExplaineR: 用于解释机器学习模型的 R 软件包。
Pub Date : 2024-03-26 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae049
Ramtin Zargari Marandi

Summary: SHapley Additive exPlanations (SHAP) is a widely used method for model interpretation. However, its full potential often remains untapped due to the absence of dedicated software tools. In response, ExplaineR, an R package to facilitate interpretation of binary classification and regression models based on clustering functionality for SHAP analysis is introduced here. It additionally offers user-interactive elements in visualizations for evaluating model performance, fairness analysis, decision-curve analysis, and a diverse range of SHAP plots. It facilitates in-depth post-prediction analysis of models, enabling users to pinpoint potentially significant patterns in SHAP plots and subsequently trace them back to instances through SHAP clustering. This functionality is particularly valuable for identifying patient subgroups in clinical cohorts, thus enhancing its role as a robust profiling tool. ExplaineR empowers users to generate comprehensive reports on machine learning outcomes, ensuring consistent and thorough documentation of model performance and interpretations.

Availability and implementation: ExplaineR 1.0.0 is available on GitHub (https://persimune.github.io/explainer/) and CRAN (https://cran.r-project.org/web/packages/explainer/index.html).

摘要:SHapley Additive exPlanations(SHAP)是一种广泛使用的模型解释方法。然而,由于缺乏专用软件工具,它的全部潜力往往仍未得到开发。为此,本文介绍了一个基于聚类功能的 R 软件包 ExplaineR,以方便解释二元分类和回归模型的 SHAP 分析。此外,它还在可视化方面提供了用户互动元素,用于评估模型性能、公平性分析、决策曲线分析和各种 SHAP 图。它有助于对模型进行深入的预测后分析,使用户能够在 SHAP 图中找出潜在的重要模式,并随后通过 SHAP 聚类追溯到实例。这一功能对于识别临床队列中的患者亚群尤为重要,从而增强了其作为强大的剖析工具的作用。ExplaineR 使用户能够生成有关机器学习结果的综合报告,确保对模型性能和解释进行一致而全面的记录:ExplaineR 1.0.0 可在 GitHub (https://persimune.github.io/explainer/) 和 CRAN (https://cran.r-project.org/web/packages/explainer/index.html) 上获取。
{"title":"ExplaineR: an R package to explain machine learning models.","authors":"Ramtin Zargari Marandi","doi":"10.1093/bioadv/vbae049","DOIUrl":"https://doi.org/10.1093/bioadv/vbae049","url":null,"abstract":"<p><strong>Summary: </strong>SHapley Additive exPlanations (SHAP) is a widely used method for model interpretation. However, its full potential often remains untapped due to the absence of dedicated software tools. In response, <i>ExplaineR</i>, an R package to facilitate interpretation of binary classification and regression models based on clustering functionality for SHAP analysis is introduced here. It additionally offers user-interactive elements in visualizations for evaluating model performance, fairness analysis, decision-curve analysis, and a diverse range of SHAP plots. It facilitates in-depth post-prediction analysis of models, enabling users to pinpoint potentially significant patterns in SHAP plots and subsequently trace them back to instances through SHAP clustering. This functionality is particularly valuable for identifying patient subgroups in clinical cohorts, thus enhancing its role as a robust profiling tool. <i>ExplaineR</i> empowers users to generate comprehensive reports on machine learning outcomes, ensuring consistent and thorough documentation of model performance and interpretations.</p><p><strong>Availability and implementation: </strong><i>ExplaineR</i> 1.0.0 is available on GitHub (https://persimune.github.io/explainer/) and CRAN (https://cran.r-project.org/web/packages/explainer/index.html).</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10994716/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140859282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Text-mining-based feature selection for anticancer drug response prediction. 基于文本挖掘的抗癌药物反应预测特征选择。
Pub Date : 2024-03-26 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae047
Grace Wu, Arvin Zaker, Amirhosein Ebrahimi, Shivanshi Tripathi, Arvind Singh Mer

Motivation: Predicting anticancer treatment response from baseline genomic data is a critical obstacle in personalized medicine. Machine learning methods are commonly used for predicting drug response from gene expression data. In the process of constructing these machine learning models, one of the most significant challenges is identifying appropriate features among a massive number of genes.

Results: In this study, we utilize features (genes) extracted using the text-mining of scientific literatures. Using two independent cancer pharmacogenomic datasets, we demonstrate that text-mining-based features outperform traditional feature selection techniques in machine learning tasks. In addition, our analysis reveals that text-mining feature-based machine learning models trained on in vitro data also perform well when predicting the response of in vivo cancer models. Our results demonstrate that text-mining-based feature selection is an easy to implement approach that is suitable for building machine learning models for anticancer drug response prediction.

Availability and implementation: https://github.com/merlab/text_features.

动机:从基线基因组数据预测抗癌治疗反应是个性化医疗的一个关键障碍。机器学习方法通常用于从基因表达数据预测药物反应。在构建这些机器学习模型的过程中,最重要的挑战之一是从大量基因中识别出合适的特征:在这项研究中,我们利用了从科学文献文本挖掘中提取的特征(基因)。利用两个独立的癌症药物基因组数据集,我们证明了在机器学习任务中,基于文本挖掘的特征优于传统的特征选择技术。此外,我们的分析表明,在体外数据上训练的基于文本挖掘特征的机器学习模型在预测体内癌症模型的反应时也表现良好。我们的研究结果表明,基于文本挖掘的特征选择是一种易于实现的方法,适合用于建立抗癌药物反应预测的机器学习模型。可用性和实现:https://github.com/merlab/text_features。
{"title":"Text-mining-based feature selection for anticancer drug response prediction.","authors":"Grace Wu, Arvin Zaker, Amirhosein Ebrahimi, Shivanshi Tripathi, Arvind Singh Mer","doi":"10.1093/bioadv/vbae047","DOIUrl":"https://doi.org/10.1093/bioadv/vbae047","url":null,"abstract":"<p><strong>Motivation: </strong>Predicting anticancer treatment response from baseline genomic data is a critical obstacle in personalized medicine. Machine learning methods are commonly used for predicting drug response from gene expression data. In the process of constructing these machine learning models, one of the most significant challenges is identifying appropriate features among a massive number of genes.</p><p><strong>Results: </strong>In this study, we utilize features (genes) extracted using the text-mining of scientific literatures. Using two independent cancer pharmacogenomic datasets, we demonstrate that text-mining-based features outperform traditional feature selection techniques in machine learning tasks. In addition, our analysis reveals that text-mining feature-based machine learning models trained on <i>in vitro</i> data also perform well when predicting the response of <i>in vivo</i> cancer models. Our results demonstrate that text-mining-based feature selection is an easy to implement approach that is suitable for building machine learning models for anticancer drug response prediction.</p><p><strong>Availability and implementation: </strong>https://github.com/merlab/text_features.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11009020/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140869478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CATD: a reproducible pipeline for selecting cell-type deconvolution methods across tissues. CATD:用于选择跨组织细胞类型解卷积方法的可重复管道。
Pub Date : 2024-03-23 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae048
Anna Vathrakokoili Pournara, Zhichao Miao, Ozgur Yilimaz Beker, Nadja Nolte, Alvis Brazma, Irene Papatheodorou

Motivation: Cell-type deconvolution methods aim to infer cell composition from bulk transcriptomic data. The proliferation of developed methods coupled with inconsistent results obtained in many cases, highlights the pressing need for guidance in the selection of appropriate methods. Additionally, the growing accessibility of single-cell RNA sequencing datasets, often accompanied by bulk expression from related samples enable the benchmark of existing methods.

Results: In this study, we conduct a comprehensive assessment of 31 methods, utilizing single-cell RNA-sequencing data from diverse human and mouse tissues. Employing various simulation scenarios, we reveal the efficacy of regression-based deconvolution methods, highlighting their sensitivity to reference choices. We investigate the impact of bulk-reference differences, incorporating variables such as sample, study and technology. We provide validation using a gold standard dataset from mononuclear cells and suggest a consensus prediction of proportions when ground truth is not available. We validated the consensus method on data from the stomach and studied its spillover effect. Importantly, we propose the use of the critical assessment of transcriptomic deconvolution (CATD) pipeline which encompasses functionalities for generating references and pseudo-bulks and running implemented deconvolution methods. CATD streamlines simultaneous deconvolution of numerous bulk samples, providing a practical solution for speeding up the evaluation of newly developed methods.

Availability and implementation: https://github.com/Papatheodorou-Group/CATD_snakemake.

动机细胞类型解卷积方法旨在从大量转录组数据中推断细胞组成。已开发的方法层出不穷,但在许多情况下得到的结果并不一致,这突出表明在选择适当方法时迫切需要指导。此外,单细胞 RNA 测序数据集的可获取性越来越高,通常还伴随着相关样本的大量表达,这使得现有方法的基准得以确立:在这项研究中,我们利用来自不同人类和小鼠组织的单细胞 RNA 测序数据,对 31 种方法进行了全面评估。通过各种模拟场景,我们揭示了基于回归的去卷积方法的功效,并强调了这些方法对参照物选择的敏感性。我们结合样本、研究和技术等变量,研究了批量参考差异的影响。我们使用来自单核细胞的金标准数据集进行了验证,并提出了在无法获得地面实况时的比例共识预测方法。我们在胃部数据上验证了共识方法,并研究了其溢出效应。重要的是,我们建议使用转录组去卷积关键评估(CATD)管道,该管道包含生成参考和伪大量以及运行已实施的去卷积方法的功能。CATD 简化了大量样本的同步解卷积,为加快评估新开发的方法提供了实用的解决方案。可用性和实施:https://github.com/Papatheodorou-Group/CATD_snakemake。
{"title":"CATD: a reproducible pipeline for selecting cell-type deconvolution methods across tissues.","authors":"Anna Vathrakokoili Pournara, Zhichao Miao, Ozgur Yilimaz Beker, Nadja Nolte, Alvis Brazma, Irene Papatheodorou","doi":"10.1093/bioadv/vbae048","DOIUrl":"https://doi.org/10.1093/bioadv/vbae048","url":null,"abstract":"<p><strong>Motivation: </strong>Cell-type deconvolution methods aim to infer cell composition from bulk transcriptomic data. The proliferation of developed methods coupled with inconsistent results obtained in many cases, highlights the pressing need for guidance in the selection of appropriate methods. Additionally, the growing accessibility of single-cell RNA sequencing datasets, often accompanied by bulk expression from related samples enable the benchmark of existing methods.</p><p><strong>Results: </strong>In this study, we conduct a comprehensive assessment of 31 methods, utilizing single-cell RNA-sequencing data from diverse human and mouse tissues. Employing various simulation scenarios, we reveal the efficacy of regression-based deconvolution methods, highlighting their sensitivity to reference choices. We investigate the impact of bulk-reference differences, incorporating variables such as sample, study and technology. We provide validation using a gold standard dataset from mononuclear cells and suggest a consensus prediction of proportions when ground truth is not available. We validated the consensus method on data from the stomach and studied its spillover effect. Importantly, we propose the use of the critical assessment of transcriptomic deconvolution (CATD) pipeline which encompasses functionalities for generating references and pseudo-bulks and running implemented deconvolution methods. CATD streamlines simultaneous deconvolution of numerous bulk samples, providing a practical solution for speeding up the evaluation of newly developed methods.</p><p><strong>Availability and implementation: </strong>https://github.com/Papatheodorou-Group/CATD_snakemake.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11023940/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140866913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
dingo: a Python package for metabolic flux sampling. dingo:用于代谢通量采样的 Python 软件包。
Pub Date : 2024-03-22 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae037
Apostolos Chalkis, Vissarion Fisikopoulos, Elias Tsigaridas, Haris Zafeiropoulos

We present dingo, a Python package that supports a variety of methods to sample from the flux space of metabolic models, based on state-of-the-art random walks and rounding methods. For uniform sampling, dingo's sampling methods provide significant speed-ups and outperform existing software. Indicatively, dingo can sample from the flux space of the largest metabolic model up to now (Recon3D) in less than a day using a personal computer, under several statistical guarantees; this computation is out of reach for other similar software. In addition, dingo supports common analysis methods, such as flux balance analysis and flux variability analysis, and visualization components. dingo contributes to the arsenal of tools in metabolic modelling by enabling flux sampling in high dimensions (in the order of thousands).

Availability and implementation: The dingo Python library is available in GitHub at https://github.com/GeomScale/dingo and the data underlying this article are available in https://doi.org/10.5281/zenodo.10423335.

我们介绍的 dingo 是一个 Python 软件包,它基于最先进的随机游走和舍入方法,支持从代谢模型的通量空间进行采样的多种方法。对于均匀采样,dingo 的采样方法能显著提高速度并优于现有软件。具体来说,dingo 可以在多种统计保证下,使用个人电脑在不到一天的时间内对迄今为止最大的代谢模型(Recon3D)的通量空间进行采样;这是其他类似软件无法达到的计算速度。此外,dingo 还支持通量平衡分析和通量变异性分析等常用分析方法以及可视化组件。dingo 支持高维度(数千维)通量采样,为代谢模型工具库做出了贡献:dingo Python 库可在 GitHub https://github.com/GeomScale/dingo 上获取,本文的基础数据可在 https://doi.org/10.5281/zenodo.10423335 上获取。
{"title":"dingo: a Python package for metabolic flux sampling.","authors":"Apostolos Chalkis, Vissarion Fisikopoulos, Elias Tsigaridas, Haris Zafeiropoulos","doi":"10.1093/bioadv/vbae037","DOIUrl":"https://doi.org/10.1093/bioadv/vbae037","url":null,"abstract":"<p><p>We present dingo, a Python package that supports a variety of methods to sample from the flux space of metabolic models, based on state-of-the-art random walks and rounding methods. For uniform sampling, dingo's sampling methods provide significant speed-ups and outperform existing software. Indicatively, dingo can sample from the flux space of the largest metabolic model up to now (Recon3D) in less than a day using a personal computer, under several statistical guarantees; this computation is out of reach for other similar software. In addition, dingo supports common analysis methods, such as flux balance analysis and flux variability analysis, and visualization components. dingo contributes to the arsenal of tools in metabolic modelling by enabling flux sampling in high dimensions (in the order of thousands).</p><p><strong>Availability and implementation: </strong>The dingo Python library is available in GitHub at https://github.com/GeomScale/dingo and the data underlying this article are available in https://doi.org/10.5281/zenodo.10423335.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10997433/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140871553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Bioinformatics advances
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1