首页 > 最新文献

Nature Protocols最新文献

英文 中文
Open-vessel polymerization of N-carboxyanhydride (NCA) for polypeptide synthesis. 用于多肽合成的 N-羧基酸酐(NCA)开孔聚合。
IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-10-08 DOI: 10.1038/s41596-024-01062-3
Yueming Wu, Kang Chen, Jiangzhou Wang, Wenhui Dai, Haowen Yu, Xinyi Xie, Minzhang Chen, Runhui Liu

Synthetic polypeptides, also known as poly(α-amino acids), have the same polyamide backbone structures as natural proteins and peptides. As an important class of biomaterials, polypeptides have been widely used because of their biocompatibility, bioactivity and biodegradability. Ring-opening polymerization of N-carboxyanhydride (NCA) is a classical and widely used method for the synthesis of polypeptides. The dominantly used primary amine-initiated NCA polymerization can yield well-defined polymers and complex macromolecular architectures, but the reaction is slow and sensitive to moisture, making it necessary to use anhydrous solvents and a glovebox. One solution is to use lithium hexamethyldisilazide (LiHMDS) as the initiator, as described in this protocol. LiHMDS-initiated NCA polymerization is less sensitive to moisture and can be carried out in an open vessel outside the glovebox. It is also very fast; the reaction can be complete within 5 min to produce 30-mer polypeptides. In this protocol, poly(γ-benzyl-L-glutamate) is prepared as an example, but the protocol can easily be adapted to the synthesis of other polypeptides by generating NCAs from different amino acids, making it particularly suitable for the efficient parallel synthesis of polypeptide libraries. We provide detailed procedures for NCA synthesis and purification, the method of polymer end-group modification and measurement of polymerization kinetics and reactivity ratio. The procedure for synthesis of monomers and polymerization to form polypeptides requires <1 d. The superfast and open-vessel NCA polymerization method described here will probably enable a wide range of applications in the synthesis and functional study of polypeptide biomaterials.

合成多肽又称聚(α-氨基酸),与天然蛋白质和肽具有相同的聚酰胺骨架结构。作为一类重要的生物材料,多肽因其生物相容性、生物活性和生物可降解性而得到广泛应用。N- 羧基酸酐(NCA)开环聚合法是合成多肽的一种经典且广泛使用的方法。主要使用的伯胺引发的 NCA 聚合反应可以产生定义明确的聚合物和复杂的大分子结构,但反应速度缓慢,对水分敏感,因此必须使用无水溶剂和手套箱。一种解决方案是使用六甲基二硅氮化锂(LiHMDS)作为引发剂,如本方案所述。LiHMDS 引发的 NCA 聚合对湿气的敏感性较低,可以在手套箱外的开放式容器中进行。而且反应速度非常快,5 分钟内即可完成反应,生成 30 个聚合物的多肽。在本方案中,以制备聚(γ-苄基-L-谷氨酸)为例,但该方案可以很容易地通过从不同氨基酸生成 NCA 来合成其他多肽,因此特别适用于多肽库的高效平行合成。我们提供了 NCA 合成和纯化的详细步骤、聚合物端基修饰方法以及聚合动力学和反应率的测量方法。合成单体和聚合形成多肽的程序要求
{"title":"Open-vessel polymerization of N-carboxyanhydride (NCA) for polypeptide synthesis.","authors":"Yueming Wu, Kang Chen, Jiangzhou Wang, Wenhui Dai, Haowen Yu, Xinyi Xie, Minzhang Chen, Runhui Liu","doi":"10.1038/s41596-024-01062-3","DOIUrl":"10.1038/s41596-024-01062-3","url":null,"abstract":"<p><p>Synthetic polypeptides, also known as poly(α-amino acids), have the same polyamide backbone structures as natural proteins and peptides. As an important class of biomaterials, polypeptides have been widely used because of their biocompatibility, bioactivity and biodegradability. Ring-opening polymerization of N-carboxyanhydride (NCA) is a classical and widely used method for the synthesis of polypeptides. The dominantly used primary amine-initiated NCA polymerization can yield well-defined polymers and complex macromolecular architectures, but the reaction is slow and sensitive to moisture, making it necessary to use anhydrous solvents and a glovebox. One solution is to use lithium hexamethyldisilazide (LiHMDS) as the initiator, as described in this protocol. LiHMDS-initiated NCA polymerization is less sensitive to moisture and can be carried out in an open vessel outside the glovebox. It is also very fast; the reaction can be complete within 5 min to produce 30-mer polypeptides. In this protocol, poly(γ-benzyl-L-glutamate) is prepared as an example, but the protocol can easily be adapted to the synthesis of other polypeptides by generating NCAs from different amino acids, making it particularly suitable for the efficient parallel synthesis of polypeptide libraries. We provide detailed procedures for NCA synthesis and purification, the method of polymer end-group modification and measurement of polymerization kinetics and reactivity ratio. The procedure for synthesis of monomers and polymerization to form polypeptides requires <1 d. The superfast and open-vessel NCA polymerization method described here will probably enable a wide range of applications in the synthesis and functional study of polypeptide biomaterials.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142391835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How to isolate channel-forming membrane proteins using the E. coli expression system. 如何利用大肠杆菌表达系统分离通道形成膜蛋白。
IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-10-04 DOI: 10.1038/s41596-024-01055-2
Claudio Piselli

The recombinant expression, isolation and characterization of pore-forming proteins is one of the most commonly used strategies for understanding the permeability properties of the biological membrane into which they are embedded. This protocol describes how to quantify the expression of your protein of interest and use this information to optimize its production using the Escherichia coli strain BL21Gold(de3)ΔABCF. It explains with a step-by-step approach how to separate the bacterial compartments according to their solubility and how to extract your protein of interest in its native conformation using detergent solutions. Finally, it describes how to improve its purity via ion-exchange chromatography and insert the purified porins into outer membrane vesicles, from which they can be copurified. The protocol is simpler and less empirical than those described for most channel-forming membrane proteins and also provides a solid foundation for the isolation of soluble proteins. Several parameters can be optimized on a case-by-case basis: expression time and temperature, concentration of the inducer, nature and concentration of the detergent, incubation time and temperature, pH and ionic strength of the purification buffers. This protocol is effective with prokaryotic channel-forming membrane proteins and can be employed for the production of pore-forming proteins from chloroplasts, mitochondria or eukaryotes in general. With minor optimization, this protocol can be adapted for the isolation of receptors, carrier, pumps or any other membrane-active proteins.

孔形成蛋白的重组表达、分离和表征是了解其嵌入生物膜的渗透特性的最常用策略之一。本方案介绍了如何量化您感兴趣的蛋白质的表达,并利用这些信息来优化大肠杆菌 BL21Gold(de3)ΔABCF 菌株的生产。该书以循序渐进的方式解释了如何根据溶解度分离细菌区系,以及如何使用去污剂溶液提取原生构象的相关蛋白质。最后,它介绍了如何通过离子交换色谱法提高纯度,并将纯化的孔蛋白插入外膜囊泡中,然后再从囊泡中进行共聚。与大多数形成通道的膜蛋白相比,该方案更简单,经验性更少,也为分离可溶性蛋白奠定了坚实的基础。可根据具体情况优化几个参数:表达时间和温度、诱导剂的浓度、去垢剂的性质和浓度、孵育时间和温度、纯化缓冲液的 pH 值和离子强度。该方案对原核通道形成膜蛋白有效,也可用于生产叶绿体、线粒体或一般真核生物的孔道形成蛋白。稍加优化后,该方案还可用于分离受体、载体、泵或任何其他膜活性蛋白。
{"title":"How to isolate channel-forming membrane proteins using the E. coli expression system.","authors":"Claudio Piselli","doi":"10.1038/s41596-024-01055-2","DOIUrl":"https://doi.org/10.1038/s41596-024-01055-2","url":null,"abstract":"<p><p>The recombinant expression, isolation and characterization of pore-forming proteins is one of the most commonly used strategies for understanding the permeability properties of the biological membrane into which they are embedded. This protocol describes how to quantify the expression of your protein of interest and use this information to optimize its production using the Escherichia coli strain BL21Gold(de3)ΔABCF. It explains with a step-by-step approach how to separate the bacterial compartments according to their solubility and how to extract your protein of interest in its native conformation using detergent solutions. Finally, it describes how to improve its purity via ion-exchange chromatography and insert the purified porins into outer membrane vesicles, from which they can be copurified. The protocol is simpler and less empirical than those described for most channel-forming membrane proteins and also provides a solid foundation for the isolation of soluble proteins. Several parameters can be optimized on a case-by-case basis: expression time and temperature, concentration of the inducer, nature and concentration of the detergent, incubation time and temperature, pH and ionic strength of the purification buffers. This protocol is effective with prokaryotic channel-forming membrane proteins and can be employed for the production of pore-forming proteins from chloroplasts, mitochondria or eukaryotes in general. With minor optimization, this protocol can be adapted for the isolation of receptors, carrier, pumps or any other membrane-active proteins.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142375652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-confidence and high-throughput quantification of synapse engulfment by oligodendrocyte precursor cells. 少突胶质前体细胞对突触吞噬的高置信度和高通量定量分析
IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-10-03 DOI: 10.1038/s41596-024-01048-1
Jessica A Kahng, Andre M Xavier, Austin Ferro, Samantha X Tang, Yohan S S Auguste, Lucas Cheadle

Oligodendrocyte precursor cells (OPCs) sculpt neural circuits through the phagocytic engulfment of synapses during development and adulthood. However, existing techniques for analyzing synapse engulfment by OPCs have limited accuracy. Here we describe the quantification of synapse engulfment by OPCs via a two-pronged cell biological approach that combines high-confidence and high-throughput methodologies. Firstly, an adeno-associated virus encoding a pH-sensitive, fluorescently tagged synaptic marker is expressed in neurons in vivo to differentially label presynaptic inputs, depending upon whether they are outside of or within acidic phagolysosomal compartments. When paired with immunostaining for OPC markers in lightly fixed tissue, this approach quantifies the engulfment of synapses by around 30-50 OPCs in each experiment. The second method uses OPCs isolated from dissociated brain tissue that are then fixed, incubated with fluorescent antibodies against presynaptic proteins, and analyzed by flow cytometry, enabling the quantification of presynaptic material within tens of thousands of OPCs in <1 week. The integration of both methods extends the current imaging-based assays, originally designed to quantify synaptic phagocytosis by other brain cells such as microglia and astrocytes, by enabling the quantification of synaptic engulfment by OPCs at individual and populational levels. With minor modifications, these approaches can be adapted to study synaptic phagocytosis by numerous glial cell types in the brain. The protocol is suitable for users with expertise in both confocal microscopy and flow cytometry. The imaging-based and flow cytometry-based protocols require 5 weeks and 2 d to complete, respectively.

少突胶质细胞前体细胞(OPC)在发育和成年期通过吞噬突触来构建神经回路。然而,现有的 OPCs 分析突触吞噬的技术准确性有限。在这里,我们介绍了一种双管齐下的细胞生物学方法,该方法结合了高置信度和高通量方法,可定量分析 OPCs 对突触的吞噬作用。首先,在体内神经元中表达一种编码 pH 敏感荧光标记突触标记物的腺相关病毒,根据突触前输入是在酸性吞噬体区块外还是内,对突触前输入进行不同标记。这种方法与轻度固定组织中的 OPC 标记免疫染色法相配合,可在每次实验中量化约 30-50 个 OPC 对突触的吞噬情况。第二种方法是从离体脑组织中分离出 OPC,然后将其固定,与突触前蛋白的荧光抗体一起孵育,并通过流式细胞术进行分析。
{"title":"High-confidence and high-throughput quantification of synapse engulfment by oligodendrocyte precursor cells.","authors":"Jessica A Kahng, Andre M Xavier, Austin Ferro, Samantha X Tang, Yohan S S Auguste, Lucas Cheadle","doi":"10.1038/s41596-024-01048-1","DOIUrl":"10.1038/s41596-024-01048-1","url":null,"abstract":"<p><p>Oligodendrocyte precursor cells (OPCs) sculpt neural circuits through the phagocytic engulfment of synapses during development and adulthood. However, existing techniques for analyzing synapse engulfment by OPCs have limited accuracy. Here we describe the quantification of synapse engulfment by OPCs via a two-pronged cell biological approach that combines high-confidence and high-throughput methodologies. Firstly, an adeno-associated virus encoding a pH-sensitive, fluorescently tagged synaptic marker is expressed in neurons in vivo to differentially label presynaptic inputs, depending upon whether they are outside of or within acidic phagolysosomal compartments. When paired with immunostaining for OPC markers in lightly fixed tissue, this approach quantifies the engulfment of synapses by around 30-50 OPCs in each experiment. The second method uses OPCs isolated from dissociated brain tissue that are then fixed, incubated with fluorescent antibodies against presynaptic proteins, and analyzed by flow cytometry, enabling the quantification of presynaptic material within tens of thousands of OPCs in <1 week. The integration of both methods extends the current imaging-based assays, originally designed to quantify synaptic phagocytosis by other brain cells such as microglia and astrocytes, by enabling the quantification of synaptic engulfment by OPCs at individual and populational levels. With minor modifications, these approaches can be adapted to study synaptic phagocytosis by numerous glial cell types in the brain. The protocol is suitable for users with expertise in both confocal microscopy and flow cytometry. The imaging-based and flow cytometry-based protocols require 5 weeks and 2 d to complete, respectively.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiplexed chromatin immunoprecipitation sequencing for quantitative study of histone modifications and chromatin factors. 用于组蛋白修饰和染色质因子定量研究的多重染色质免疫沉淀测序。
IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-10-03 DOI: 10.1038/s41596-024-01058-z
Banushree Kumar, Carmen Navarro, Philip Yuk Kwong Yung, Jing Lyu, Angelo Salazar Mantero, Anna-Maria Katsori, Hannah Schwämmle, Marcel Martin, Simon J Elsässer

ChIP-seq is a widely used technique for studying histone post-translational modifications and DNA-binding proteins. DNA fragments associated with a specific protein or histone modification epitope are captured by using antibodies, sequenced and mapped to a reference genome. Albeit versatile and popular, performing many parallel ChIP-seq experiments to compare different conditions, replicates and epitopes is laborious, is prone to experimental variation and does not allow quantitative comparisons unless adequate spike-in chromatin is included. We present a detailed protocol for performing and analyzing a multiplexed quantitative chromatin immunoprecipitation-sequencing experiment (MINUTE-ChIP), in which multiple samples are profiled against multiple epitopes in a single workflow. Multiplexing not only dramatically increases the throughput of ChIP-seq experiments (e.g., profiling 12 samples against multiple histone modifications or DNA-binding proteins in a single experiment), but also enables accurate quantitative comparisons. The protocol consists of four parts: sample preparation (i.e., lysis, chromatin fragmentation and barcoding of native or formaldehyde-fixed material), pooling and splitting of the barcoded chromatin into parallel immunoprecipitation reactions, preparation of next-generation sequencing libraries from input and immunoprecipitated DNA and data analysis using our dedicated analysis pipeline. This pipeline autonomously generates quantitatively scaled ChIP-seq tracks for downstream analysis and visualization, alongside necessary quality control indicators. The entire workflow requires basic knowledge in molecular biology and bioinformatics and can be completed in 1 week. MINUTE-ChIP empowers biologists to perform every ChIP-seq experiment with an appropriate number of replicates and control conditions, delivering more statistically robust, exquisitely quantitative and biologically meaningful results.

ChIP-seq 是一种广泛用于研究组蛋白翻译后修饰和 DNA 结合蛋白的技术。使用抗体捕获与特定蛋白质或组蛋白修饰表位相关的DNA片段,进行测序并映射到参考基因组。尽管ChIP-seq实验用途广泛且广受欢迎,但进行许多平行的ChIP-seq实验以比较不同的条件、复制和表位却很费力,而且容易出现实验变异,除非加入足够的染色质尖峰,否则无法进行定量比较。我们介绍了执行和分析多重染色质免疫沉淀-测序定量实验(MINUTE-ChIP)的详细方案,在该方案中,多个样品在一个工作流程中针对多个表位进行分析。多重化不仅能显著提高 ChIP-seq 实验的通量(例如,在一次实验中针对多种组蛋白修饰或 DNA 结合蛋白对 12 个样本进行分析),还能进行精确的定量比较。该方案由四部分组成:样品制备(即原生或甲醛固定材料的裂解、染色质片段化和条形码)、将条形码染色质汇集并分割成平行的免疫沉淀反应、从输入和免疫沉淀 DNA 中制备下一代测序文库,以及使用我们的专用分析管道进行数据分析。该流水线可自主生成定量的 ChIP-seq 轨道,用于下游分析和可视化,并提供必要的质量控制指标。整个工作流程需要具备分子生物学和生物信息学的基本知识,可在一周内完成。MINUTE-ChIP 使生物学家能够以适当的重复次数和控制条件进行每一次 ChIP-seq 实验,从而获得统计上更可靠、定量上更精确、生物学上更有意义的结果。
{"title":"Multiplexed chromatin immunoprecipitation sequencing for quantitative study of histone modifications and chromatin factors.","authors":"Banushree Kumar, Carmen Navarro, Philip Yuk Kwong Yung, Jing Lyu, Angelo Salazar Mantero, Anna-Maria Katsori, Hannah Schwämmle, Marcel Martin, Simon J Elsässer","doi":"10.1038/s41596-024-01058-z","DOIUrl":"https://doi.org/10.1038/s41596-024-01058-z","url":null,"abstract":"<p><p>ChIP-seq is a widely used technique for studying histone post-translational modifications and DNA-binding proteins. DNA fragments associated with a specific protein or histone modification epitope are captured by using antibodies, sequenced and mapped to a reference genome. Albeit versatile and popular, performing many parallel ChIP-seq experiments to compare different conditions, replicates and epitopes is laborious, is prone to experimental variation and does not allow quantitative comparisons unless adequate spike-in chromatin is included. We present a detailed protocol for performing and analyzing a multiplexed quantitative chromatin immunoprecipitation-sequencing experiment (MINUTE-ChIP), in which multiple samples are profiled against multiple epitopes in a single workflow. Multiplexing not only dramatically increases the throughput of ChIP-seq experiments (e.g., profiling 12 samples against multiple histone modifications or DNA-binding proteins in a single experiment), but also enables accurate quantitative comparisons. The protocol consists of four parts: sample preparation (i.e., lysis, chromatin fragmentation and barcoding of native or formaldehyde-fixed material), pooling and splitting of the barcoded chromatin into parallel immunoprecipitation reactions, preparation of next-generation sequencing libraries from input and immunoprecipitated DNA and data analysis using our dedicated analysis pipeline. This pipeline autonomously generates quantitatively scaled ChIP-seq tracks for downstream analysis and visualization, alongside necessary quality control indicators. The entire workflow requires basic knowledge in molecular biology and bioinformatics and can be completed in 1 week. MINUTE-ChIP empowers biologists to perform every ChIP-seq experiment with an appropriate number of replicates and control conditions, delivering more statistically robust, exquisitely quantitative and biologically meaningful results.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Plant sperm cell sequencing for genome phasing and determination of meiotic crossover points. 用于基因组分期和确定减数分裂交叉点的植物精子细胞测序。
IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-10-02 DOI: 10.1038/s41596-024-01063-2
Weiyi Zhang, Arslan Tariq, Xinxin Jia, Jianbing Yan, Alisdair R Fernie, Björn Usadel, Weiwei Wen

Haplotype phasing represents a pivotal procedure in genome analysis, entailing the identification of specific genetic variant combinations on each chromosome. Achieving chromosome-level genome phasing constitutes a considerable challenge, particularly in organisms with large and complex genomes. To address this challenge, we have developed a robust, gamete cell-based phasing pipeline, including wet-laboratory processes for plant sperm cell isolation, short-read sequencing and a bioinformatics workflow to generate chromosome-level phasing. The bioinformatics workflow is applicable for both plant and other sperm cells, for example, those of mammals. Our pipeline ensures high-quality single-nucleotide polymorphism (SNP) calling for each sperm cell and the subsequent construction of a high-density genetic map. The genetic map facilitates accurate chromosome-level genome phasing, enables crossover event detection and could be used to correct potential assembly errors. Our bioinformatics pipeline runs on a Linux system and most of its steps can be executed in parallel, expediting the analysis process. The entire workflow can be performed over the course of 1 d. We provide a practical example from our previous research using this protocol and provide the whole bioinformatics pipeline as a Docker image to ensure its easy adaptability to other studies.

单体型分期是基因组分析中的一个关键步骤,需要识别每条染色体上的特定遗传变异组合。实现染色体水平的基因组分型是一项相当大的挑战,尤其是在基因组庞大而复杂的生物体中。为了应对这一挑战,我们开发了一套基于配子细胞的强大的相位分析流水线,包括植物精子细胞分离的湿实验室流程、短线程测序以及生成染色体级相位分析的生物信息学工作流程。生物信息学工作流程既适用于植物精子细胞,也适用于其他精子细胞,例如哺乳动物的精子细胞。我们的工作流程可确保对每个精子细胞进行高质量的单核苷酸多态性(SNP)调用,并随后构建高密度遗传图谱。遗传图谱有助于进行准确的染色体级基因组分期,实现交叉事件检测,并可用于纠正潜在的组装错误。我们的生物信息学流水线在 Linux 系统上运行,大部分步骤可以并行执行,从而加快了分析过程。整个工作流程可在 1 d 内完成。我们提供了一个使用该协议的实际例子,并将整个生物信息学管道作为 Docker 镜像提供,以确保其易于适应其他研究。
{"title":"Plant sperm cell sequencing for genome phasing and determination of meiotic crossover points.","authors":"Weiyi Zhang, Arslan Tariq, Xinxin Jia, Jianbing Yan, Alisdair R Fernie, Björn Usadel, Weiwei Wen","doi":"10.1038/s41596-024-01063-2","DOIUrl":"https://doi.org/10.1038/s41596-024-01063-2","url":null,"abstract":"<p><p>Haplotype phasing represents a pivotal procedure in genome analysis, entailing the identification of specific genetic variant combinations on each chromosome. Achieving chromosome-level genome phasing constitutes a considerable challenge, particularly in organisms with large and complex genomes. To address this challenge, we have developed a robust, gamete cell-based phasing pipeline, including wet-laboratory processes for plant sperm cell isolation, short-read sequencing and a bioinformatics workflow to generate chromosome-level phasing. The bioinformatics workflow is applicable for both plant and other sperm cells, for example, those of mammals. Our pipeline ensures high-quality single-nucleotide polymorphism (SNP) calling for each sperm cell and the subsequent construction of a high-density genetic map. The genetic map facilitates accurate chromosome-level genome phasing, enables crossover event detection and could be used to correct potential assembly errors. Our bioinformatics pipeline runs on a Linux system and most of its steps can be executed in parallel, expediting the analysis process. The entire workflow can be performed over the course of 1 d. We provide a practical example from our previous research using this protocol and provide the whole bioinformatics pipeline as a Docker image to ensure its easy adaptability to other studies.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142365856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Use of synthetic circular RNA spike-ins (SynCRS) for normalization of circular RNA sequencing data. 使用合成环状 RNA 穗状插入物 (SynCRS) 对环状 RNA 测序数据进行归一化处理。
IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-09-26 DOI: 10.1038/s41596-024-01053-4
Vanessa M Conn, Ryan Liu, Marta Gabryelska, Simon J Conn

High-throughput RNA sequencing enables the quantification of transcript abundance and the identification of novel transcripts in biological samples. These include circular RNAs (circRNAs), a family of alternatively spliced RNA molecules that form a continuous loop. However, quantification and comparison of circRNAs between RNA sequencing libraries remain challenging due to confounding errors introduced during exonuclease digestion, library preparation and RNA sequencing itself. Here we describe a set of synthetic circRNA spike-ins-termed 'SynCRS'-that can be added directly to purified RNA samples before exonuclease digestion and library preparation. SynCRS, introduced either individually or in combinations of varying size and abundance, can be integrated into all next-generation sequencing workflows and, critically, facilitate the quantitative calibration of circRNA transcript abundance between samples, tissue types, species and laboratories. Our step-by-step protocol details the generation of SynCRS and guides users on the stoichiometry of SynCRS spike-in to RNA samples, followed by the bioinformatic steps required to facilitate quantitative comparisons of circRNAs between libraries. The laboratory steps to produce the SynCRS require an additional 3 d on top of the high throughput circRNA sequencing and bioinformatics. The protocol is suitable for users with basic experience in molecular biology and bioinformatics.

高通量 RNA 测序可对生物样本中的转录本丰度进行量化,并识别新型转录本。这些转录本包括环状 RNA(circRNA),环状 RNA 是一种形成连续环路的交替剪接 RNA 分子。然而,由于外切酶消化、文库制备和 RNA 测序过程中引入的混杂误差,在 RNA 测序文库之间进行 circRNAs 定量和比较仍具有挑战性。在这里,我们描述了一组合成 circRNA 穗状插入物--称为 "SynCRS"--可在外切酶消化和文库制备之前直接添加到纯化的 RNA 样品中。SynCRS可单独或以不同大小和丰度的组合形式加入,可集成到所有下一代测序工作流程中,关键是可促进样本、组织类型、物种和实验室之间circRNA转录本丰度的定量校准。我们的分步方案详细介绍了 SynCRS 的生成过程,并指导用户如何将 SynCRS 加入到 RNA 样品中,然后进行必要的生物信息学步骤,以便对不同文库中的 circRNA 进行定量比较。除了高通量 circRNA 测序和生物信息学步骤外,生成 SynCRS 的实验室步骤还需要 3 天时间。该方案适合具有分子生物学和生物信息学基本经验的用户。
{"title":"Use of synthetic circular RNA spike-ins (SynCRS) for normalization of circular RNA sequencing data.","authors":"Vanessa M Conn, Ryan Liu, Marta Gabryelska, Simon J Conn","doi":"10.1038/s41596-024-01053-4","DOIUrl":"https://doi.org/10.1038/s41596-024-01053-4","url":null,"abstract":"<p><p>High-throughput RNA sequencing enables the quantification of transcript abundance and the identification of novel transcripts in biological samples. These include circular RNAs (circRNAs), a family of alternatively spliced RNA molecules that form a continuous loop. However, quantification and comparison of circRNAs between RNA sequencing libraries remain challenging due to confounding errors introduced during exonuclease digestion, library preparation and RNA sequencing itself. Here we describe a set of synthetic circRNA spike-ins-termed 'SynCRS'-that can be added directly to purified RNA samples before exonuclease digestion and library preparation. SynCRS, introduced either individually or in combinations of varying size and abundance, can be integrated into all next-generation sequencing workflows and, critically, facilitate the quantitative calibration of circRNA transcript abundance between samples, tissue types, species and laboratories. Our step-by-step protocol details the generation of SynCRS and guides users on the stoichiometry of SynCRS spike-in to RNA samples, followed by the bioinformatic steps required to facilitate quantitative comparisons of circRNAs between libraries. The laboratory steps to produce the SynCRS require an additional 3 d on top of the high throughput circRNA sequencing and bioinformatics. The protocol is suitable for users with basic experience in molecular biology and bioinformatics.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142350444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Droplet-based functional CRISPR screening of cell-cell interactions by SPEAC-seq. 通过 SPEAC-seq 对细胞-细胞相互作用进行基于液滴的 CRISPR 功能筛选。
IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-09-26 DOI: 10.1038/s41596-024-01056-1
Camilo Faust Akl, Mathias Linnerbauer, Zhaorong Li, Hong-Gyun Lee, Iain C Clark, Michael A Wheeler, Francisco J Quintana

Cell-cell interactions are essential for the function and contextual regulation of biological tissues. We present a platform for high-throughput microfluidics-supported genetic screening of functional regulators of cell-cell interactions. Systematic perturbation of encapsulated associated cells followed by sequencing (SPEAC-seq) combines genome-wide CRISPR libraries, cell coculture in droplets and microfluidic droplet sorting based on functional read-outs determined by fluorescent reporter circuits to enable the unbiased discovery of interaction regulators. This technique overcomes limitations of traditional methods for characterization of cell-cell communication, which require a priori knowledge of cellular interactions, are highly engineered and lack functional read-outs. As an example of this technique, we describe the investigation of neuroinflammatory intercellular communication between microglia and astrocytes, using genome-wide CRISPR-Cas9 inactivation libraries and fluorescent reporters of NF-κB activation. This approach enabled the discovery of thousands of microglial regulators of astrocyte NF-κB activation important for the control of central nervous system inflammation. Importantly, SPEAC-seq can be adapted to different cell types, screening modalities, cell functions and physiological contexts, only limited by the ability to fluorescently report cell functions and by droplet cultivation conditions. Performing genome-wide screening takes less than 2 weeks and requires microfluidics capabilities. Thus, SPEAC-seq enables the large-scale investigation of cell-cell interactions.

细胞-细胞相互作用对生物组织的功能和环境调控至关重要。我们提出了一个用于高通量微流控芯片支持的细胞-细胞相互作用功能调控因子基因筛选平台。通过测序对封装关联细胞进行系统扰动(SPEAC-seq),将全基因组 CRISPR 文库、液滴中的细胞共培养和基于荧光报告电路确定的功能读出的微流控液滴分选结合起来,实现了对相互作用调节因子的无偏发现。这种技术克服了表征细胞-细胞通讯的传统方法的局限性,因为这些方法需要细胞相互作用的先验知识、高度工程化和缺乏功能读出。作为该技术的一个例子,我们介绍了利用全基因组 CRISPR-Cas9 失活文库和 NF-κB 激活荧光报告物研究小胶质细胞和星形胶质细胞之间的神经炎症细胞间通讯。这种方法发现了数以千计对控制中枢神经系统炎症非常重要的星形胶质细胞 NF-κB 激活的小胶质细胞调节因子。重要的是,SPEAC-seq 可适用于不同的细胞类型、筛选模式、细胞功能和生理环境,仅受荧光报告细胞功能的能力和液滴培养条件的限制。进行全基因组筛选只需不到两周的时间,而且需要微流体技术能力。因此,SPEAC-seq 能够对细胞-细胞相互作用进行大规模研究。
{"title":"Droplet-based functional CRISPR screening of cell-cell interactions by SPEAC-seq.","authors":"Camilo Faust Akl, Mathias Linnerbauer, Zhaorong Li, Hong-Gyun Lee, Iain C Clark, Michael A Wheeler, Francisco J Quintana","doi":"10.1038/s41596-024-01056-1","DOIUrl":"10.1038/s41596-024-01056-1","url":null,"abstract":"<p><p>Cell-cell interactions are essential for the function and contextual regulation of biological tissues. We present a platform for high-throughput microfluidics-supported genetic screening of functional regulators of cell-cell interactions. Systematic perturbation of encapsulated associated cells followed by sequencing (SPEAC-seq) combines genome-wide CRISPR libraries, cell coculture in droplets and microfluidic droplet sorting based on functional read-outs determined by fluorescent reporter circuits to enable the unbiased discovery of interaction regulators. This technique overcomes limitations of traditional methods for characterization of cell-cell communication, which require a priori knowledge of cellular interactions, are highly engineered and lack functional read-outs. As an example of this technique, we describe the investigation of neuroinflammatory intercellular communication between microglia and astrocytes, using genome-wide CRISPR-Cas9 inactivation libraries and fluorescent reporters of NF-κB activation. This approach enabled the discovery of thousands of microglial regulators of astrocyte NF-κB activation important for the control of central nervous system inflammation. Importantly, SPEAC-seq can be adapted to different cell types, screening modalities, cell functions and physiological contexts, only limited by the ability to fluorescently report cell functions and by droplet cultivation conditions. Performing genome-wide screening takes less than 2 weeks and requires microfluidics capabilities. Thus, SPEAC-seq enables the large-scale investigation of cell-cell interactions.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142350443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Divergent synthesis of amino acid-linked O-GalNAc glycan core structures. 氨基酸连接的 O-GalNAc 聚糖核心结构的不同合成。
IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-09-26 DOI: 10.1038/s41596-024-01051-6
Madhusudhan Reddy Gadi, Jinghua Han, Tangliang Shen, Shuquan Fan, Zhongying Xiao, Lei Li

O-GalNAc glycans, also known as mucin-type O-glycans, are primary constituents of mucins on various mucosal sites of the body and also ubiquitously expressed on cell surface and secreted proteins. They have crucial roles in a wide range of physiological and pathological processes, including tumor growth and progression. In addition, altered expression of O-GalNAc glycans is frequently observed during different disease states. Research dedicated to unraveling the structure-function relationships of O-GalNAc glycans has led to the discovery of disease biomarkers and diagnostic tools and the development of O-glycopeptide-based cancer vaccines. Many of these efforts require amino acid-linked O-GalNAc core structures as building blocks to assemble complex O-glycans and glycopeptides. There are eight core structures (cores one to eight), from which all mucin-type O-glycans are derived. In this protocol, we describe the first divergent synthesis of all eight cores from a versatile precursor in practical scales. The protocol involves (i) chemical synthesis of the orthogonally protected precursor (3 days) from commercially available materials, (ii) chemical synthesis of five unique glycosyl donors (1-2 days for each donor) and (iii) selective deprotection of the precursor and assembly of the eight cores (2-4 days for each core). The procedure can be adopted to prepare O-GalNAc cores linked to serine, threonine and tyrosine, which can then be utilized directly for solid-phase glycopeptide synthesis or chemoenzymatic synthesis of complex O-glycans. The procedure empowers researchers with fundamental organic chemistry skills to prepare gram scales of any desired O-GalNAc core(s) or all eight cores concurrently.

O-GalNAcans 又称粘蛋白型 O-聚糖,是人体各粘膜部位粘蛋白的主要成分,也广泛表达于细胞表面和分泌蛋白中。它们在包括肿瘤生长和进展在内的各种生理和病理过程中发挥着至关重要的作用。此外,在不同的疾病状态下,经常会观察到 O-GalNA 聚糖表达的改变。致力于揭示 O-GalNAc 聚糖的结构-功能关系的研究已经发现了疾病生物标志物和诊断工具,并开发出了基于 O-糖肽的癌症疫苗。其中许多研究工作都需要氨基酸连接的 O-GalNAc 核心结构作为构建模块来组装复杂的 O-聚糖和糖肽。有八种核心结构(核心一至八),所有粘蛋白型 O 型聚糖都是从这些结构中衍生出来的。在本方案中,我们首次描述了在实际应用中从一种多功能前体中分化合成所有八种核心结构的方法。该方案包括:(i) 从市售材料中化学合成正交保护的前体(3 天);(ii) 化学合成五个独特的糖基供体(每个供体 1-2 天);(iii) 选择性地对前体进行脱保护并组装八个核心(每个核心 2-4 天)。该程序可用于制备与丝氨酸、苏氨酸和酪氨酸连接的 O-GalNAc 核心,然后可直接用于固相糖肽合成或化学合成复杂的 O-聚糖。该程序使具有基本有机化学技能的研究人员能够同时制备任何所需的 O-GalNAc 核心或全部八个核心的克级产品。
{"title":"Divergent synthesis of amino acid-linked O-GalNAc glycan core structures.","authors":"Madhusudhan Reddy Gadi, Jinghua Han, Tangliang Shen, Shuquan Fan, Zhongying Xiao, Lei Li","doi":"10.1038/s41596-024-01051-6","DOIUrl":"https://doi.org/10.1038/s41596-024-01051-6","url":null,"abstract":"<p><p>O-GalNAc glycans, also known as mucin-type O-glycans, are primary constituents of mucins on various mucosal sites of the body and also ubiquitously expressed on cell surface and secreted proteins. They have crucial roles in a wide range of physiological and pathological processes, including tumor growth and progression. In addition, altered expression of O-GalNAc glycans is frequently observed during different disease states. Research dedicated to unraveling the structure-function relationships of O-GalNAc glycans has led to the discovery of disease biomarkers and diagnostic tools and the development of O-glycopeptide-based cancer vaccines. Many of these efforts require amino acid-linked O-GalNAc core structures as building blocks to assemble complex O-glycans and glycopeptides. There are eight core structures (cores one to eight), from which all mucin-type O-glycans are derived. In this protocol, we describe the first divergent synthesis of all eight cores from a versatile precursor in practical scales. The protocol involves (i) chemical synthesis of the orthogonally protected precursor (3 days) from commercially available materials, (ii) chemical synthesis of five unique glycosyl donors (1-2 days for each donor) and (iii) selective deprotection of the precursor and assembly of the eight cores (2-4 days for each core). The procedure can be adopted to prepare O-GalNAc cores linked to serine, threonine and tyrosine, which can then be utilized directly for solid-phase glycopeptide synthesis or chemoenzymatic synthesis of complex O-glycans. The procedure empowers researchers with fundamental organic chemistry skills to prepare gram scales of any desired O-GalNAc core(s) or all eight cores concurrently.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142350442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rapid generation of homozygous fluorescent knock-in human cells using CRISPR-Cas9 genome editing and validation by automated imaging and digital PCR screening. 利用 CRISPR-Cas9 基因组编辑技术快速生成同源荧光基因敲入人体细胞,并通过自动成像和数字 PCR 筛查进行验证。
IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-09-20 DOI: 10.1038/s41596-024-01043-6
Andrea Callegari, Moritz Kueblbeck, Natalia Rosalía Morero, Beatriz Serrano-Solano, Jan Ellenberg

We previously described a protocol for genome engineering of mammalian cultured cells with clustered regularly interspaced short palindromic repeats and associated protein 9 (CRISPR-Cas9) to generate homozygous knock-ins of fluorescent tags into endogenous genes. Here we are updating this former protocol to reflect major improvements in the workflow regarding efficiency and throughput. In brief, we have improved our method by combining high-efficiency electroporation of optimized CRISPR-Cas9 reagents, screening of single cell-derived clones by automated bright-field and fluorescence imaging, rapidly assessing the number of tagged alleles and potential off-targets using digital polymerase chain reaction (PCR) and automated data analysis. Compared with the original protocol, our current procedure (1) substantially increases the efficiency of tag integration, (2) automates the identification of clones derived from single cells with correct subcellular localization of the tagged protein and (3) provides a quantitative and high throughput assay to measure the number of on- and off-target integrations with digital PCR. The increased efficiency of the new procedure reduces the number of clones that need to be analyzed in-depth by more than tenfold and yields to more than 26% of homozygous clones in polyploid cancer cell lines in a single genome engineering round. Overall, we were able to dramatically reduce the hands-on time from 30 d to 10 d during the overall ~10 week procedure, allowing a single person to process up to five genes in parallel, assuming that validated reagents-for example, PCR primers, digital PCR assays and western blot antibodies-are available.

我们之前介绍了一种哺乳动物培养细胞基因组工程的方案,利用聚类规则间隔短回文重复序列和相关蛋白 9(CRISPR-Cas9)将荧光标签同源敲入内源基因。在此,我们更新了这一原方案,以反映工作流程在效率和通量方面的重大改进。简而言之,我们将优化 CRISPR-Cas9 试剂的高效电穿孔、通过自动明视野和荧光成像筛选单细胞克隆、使用数字聚合酶链式反应(PCR)快速评估标记等位基因和潜在非靶点的数量以及自动数据分析结合起来,改进了我们的方法。与原来的方案相比,我们目前的程序(1)大大提高了标签整合的效率;(2)自动识别来自单细胞的克隆,并正确定位标签蛋白的亚细胞位置;(3)提供了一种定量的高通量检测方法,利用数字聚合酶链反应测量靶上和脱靶整合的数量。新程序提高了效率,将需要深入分析的克隆数量减少了十倍以上,在一轮基因组工程中,多倍体癌细胞系中的同源克隆数量可达到 26% 以上。总之,在整个约 10 周的过程中,我们能够将动手时间从 30 天大幅缩短到 10 天,假定有经过验证的试剂(如 PCR 引物、数字 PCR 检测试剂和 Western 印迹抗体),一个人就能同时处理多达五个基因。
{"title":"Rapid generation of homozygous fluorescent knock-in human cells using CRISPR-Cas9 genome editing and validation by automated imaging and digital PCR screening.","authors":"Andrea Callegari, Moritz Kueblbeck, Natalia Rosalía Morero, Beatriz Serrano-Solano, Jan Ellenberg","doi":"10.1038/s41596-024-01043-6","DOIUrl":"https://doi.org/10.1038/s41596-024-01043-6","url":null,"abstract":"<p><p>We previously described a protocol for genome engineering of mammalian cultured cells with clustered regularly interspaced short palindromic repeats and associated protein 9 (CRISPR-Cas9) to generate homozygous knock-ins of fluorescent tags into endogenous genes. Here we are updating this former protocol to reflect major improvements in the workflow regarding efficiency and throughput. In brief, we have improved our method by combining high-efficiency electroporation of optimized CRISPR-Cas9 reagents, screening of single cell-derived clones by automated bright-field and fluorescence imaging, rapidly assessing the number of tagged alleles and potential off-targets using digital polymerase chain reaction (PCR) and automated data analysis. Compared with the original protocol, our current procedure (1) substantially increases the efficiency of tag integration, (2) automates the identification of clones derived from single cells with correct subcellular localization of the tagged protein and (3) provides a quantitative and high throughput assay to measure the number of on- and off-target integrations with digital PCR. The increased efficiency of the new procedure reduces the number of clones that need to be analyzed in-depth by more than tenfold and yields to more than 26% of homozygous clones in polyploid cancer cell lines in a single genome engineering round. Overall, we were able to dramatically reduce the hands-on time from 30 d to 10 d during the overall ~10 week procedure, allowing a single person to process up to five genes in parallel, assuming that validated reagents-for example, PCR primers, digital PCR assays and western blot antibodies-are available.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142291582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical analysis of feature-based molecular networking results from non-targeted metabolomics data. 对非靶向代谢组学数据中基于特征的分子网络结果进行统计分析。
IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-09-20 DOI: 10.1038/s41596-024-01046-3
Abzer K Pakkir Shah, Axel Walter, Filip Ottosson, Francesco Russo, Marcelo Navarro-Diaz, Judith Boldt, Jarmo-Charles J Kalinski, Eftychia Eva Kontou, James Elofson, Alexandros Polyzois, Carolina González-Marín, Shane Farrell, Marie R Aggerbeck, Thapanee Pruksatrakul, Nathan Chan, Yunshu Wang, Magdalena Pöchhacker, Corinna Brungs, Beatriz Cámara, Andrés Mauricio Caraballo-Rodríguez, Andres Cumsille, Fernanda de Oliveira, Kai Dührkop, Yasin El Abiead, Christian Geibel, Lana G Graves, Martin Hansen, Steffen Heuckeroth, Simon Knoblauch, Anastasiia Kostenko, Mirte C M Kuijpers, Kevin Mildau, Stilianos Papadopoulos Lambidis, Paulo Wender Portal Gomes, Tilman Schramm, Karoline Steuer-Lodd, Paolo Stincone, Sibgha Tayyab, Giovanni Andrea Vitale, Berenike C Wagner, Shipei Xing, Marquis T Yazzie, Simone Zuffa, Martinus de Kruijff, Christine Beemelmanns, Hannes Link, Christoph Mayer, Justin J J van der Hooft, Tito Damiani, Tomáš Pluskal, Pieter Dorrestein, Jan Stanstrup, Robin Schmid, Mingxun Wang, Allegra Aron, Madeleine Ernst, Daniel Petras

Feature-based molecular networking (FBMN) is a popular analysis approach for liquid chromatography-tandem mass spectrometry-based non-targeted metabolomics data. While processing liquid chromatography-tandem mass spectrometry data through FBMN is fairly streamlined, downstream data handling and statistical interrogation are often a key bottleneck. Especially users new to statistical analysis struggle to effectively handle and analyze complex data matrices. Here we provide a comprehensive guide for the statistical analysis of FBMN results, focusing on the downstream analysis of the FBMN output table. We explain the data structure and principles of data cleanup and normalization, as well as uni- and multivariate statistical analysis of FBMN results. We provide explanations and code in two scripting languages (R and Python) as well as the QIIME2 framework for all protocol steps, from data clean-up to statistical analysis. All code is shared in the form of Jupyter Notebooks ( https://github.com/Functional-Metabolomics-Lab/FBMN-STATS ). Additionally, the protocol is accompanied by a web application with a graphical user interface ( https://fbmn-statsguide.gnps2.org/ ) to lower the barrier of entry for new users and for educational purposes. Finally, we also show users how to integrate their statistical results into the molecular network using the Cytoscape visualization tool. Throughout the protocol, we use a previously published environmental metabolomics dataset for demonstration purposes. Together, the protocol, code and web application provide a complete guide and toolbox for FBMN data integration, cleanup and advanced statistical analysis, enabling new users to uncover molecular insights from their non-targeted metabolomics data. Our protocol is tailored for the seamless analysis of FBMN results from Global Natural Products Social Molecular Networking and can be easily adapted to other mass spectrometry feature detection, annotation and networking tools.

基于特征的分子网络(FBMN)是一种流行的分析方法,适用于基于液相色谱-串联质谱的非靶向代谢组学数据。虽然通过 FBMN 处理液相色谱-串联质谱数据相当简便,但下游数据处理和统计查询往往是一个关键瓶颈。尤其是统计分析新手,很难有效处理和分析复杂的数据矩阵。在此,我们提供一份全面的 FBMN 结果统计分析指南,重点介绍 FBMN 输出表的下游分析。我们解释了数据结构、数据清理和归一化原则,以及 FBMN 结果的单变量和多变量统计分析。我们用两种脚本语言(R 和 Python)以及 QIIME2 框架为从数据清理到统计分析的所有协议步骤提供解释和代码。所有代码都以 Jupyter Notebooks ( https://github.com/Functional-Metabolomics-Lab/FBMN-STATS ) 的形式共享。此外,该协议还附有一个图形用户界面的网络应用程序 ( https://fbmn-statsguide.gnps2.org/ ),以降低新用户的入门门槛并用于教育目的。最后,我们还向用户展示了如何使用 Cytoscape 可视化工具将统计结果整合到分子网络中。在整个协议中,我们使用了之前发布的环境代谢组学数据集进行演示。协议、代码和网络应用程序共同为 FBMN 数据整合、清理和高级统计分析提供了完整的指南和工具箱,使新用户能够从他们的非靶向代谢组学数据中发现分子洞察力。我们的协议专为无缝分析来自全球天然产品社会分子网络的 FBMN 结果而定制,可轻松适用于其他质谱特征检测、注释和网络工具。
{"title":"Statistical analysis of feature-based molecular networking results from non-targeted metabolomics data.","authors":"Abzer K Pakkir Shah, Axel Walter, Filip Ottosson, Francesco Russo, Marcelo Navarro-Diaz, Judith Boldt, Jarmo-Charles J Kalinski, Eftychia Eva Kontou, James Elofson, Alexandros Polyzois, Carolina González-Marín, Shane Farrell, Marie R Aggerbeck, Thapanee Pruksatrakul, Nathan Chan, Yunshu Wang, Magdalena Pöchhacker, Corinna Brungs, Beatriz Cámara, Andrés Mauricio Caraballo-Rodríguez, Andres Cumsille, Fernanda de Oliveira, Kai Dührkop, Yasin El Abiead, Christian Geibel, Lana G Graves, Martin Hansen, Steffen Heuckeroth, Simon Knoblauch, Anastasiia Kostenko, Mirte C M Kuijpers, Kevin Mildau, Stilianos Papadopoulos Lambidis, Paulo Wender Portal Gomes, Tilman Schramm, Karoline Steuer-Lodd, Paolo Stincone, Sibgha Tayyab, Giovanni Andrea Vitale, Berenike C Wagner, Shipei Xing, Marquis T Yazzie, Simone Zuffa, Martinus de Kruijff, Christine Beemelmanns, Hannes Link, Christoph Mayer, Justin J J van der Hooft, Tito Damiani, Tomáš Pluskal, Pieter Dorrestein, Jan Stanstrup, Robin Schmid, Mingxun Wang, Allegra Aron, Madeleine Ernst, Daniel Petras","doi":"10.1038/s41596-024-01046-3","DOIUrl":"https://doi.org/10.1038/s41596-024-01046-3","url":null,"abstract":"<p><p>Feature-based molecular networking (FBMN) is a popular analysis approach for liquid chromatography-tandem mass spectrometry-based non-targeted metabolomics data. While processing liquid chromatography-tandem mass spectrometry data through FBMN is fairly streamlined, downstream data handling and statistical interrogation are often a key bottleneck. Especially users new to statistical analysis struggle to effectively handle and analyze complex data matrices. Here we provide a comprehensive guide for the statistical analysis of FBMN results, focusing on the downstream analysis of the FBMN output table. We explain the data structure and principles of data cleanup and normalization, as well as uni- and multivariate statistical analysis of FBMN results. We provide explanations and code in two scripting languages (R and Python) as well as the QIIME2 framework for all protocol steps, from data clean-up to statistical analysis. All code is shared in the form of Jupyter Notebooks ( https://github.com/Functional-Metabolomics-Lab/FBMN-STATS ). Additionally, the protocol is accompanied by a web application with a graphical user interface ( https://fbmn-statsguide.gnps2.org/ ) to lower the barrier of entry for new users and for educational purposes. Finally, we also show users how to integrate their statistical results into the molecular network using the Cytoscape visualization tool. Throughout the protocol, we use a previously published environmental metabolomics dataset for demonstration purposes. Together, the protocol, code and web application provide a complete guide and toolbox for FBMN data integration, cleanup and advanced statistical analysis, enabling new users to uncover molecular insights from their non-targeted metabolomics data. Our protocol is tailored for the seamless analysis of FBMN results from Global Natural Products Social Molecular Networking and can be easily adapted to other mass spectrometry feature detection, annotation and networking tools.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142291583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Nature Protocols
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1