Maria Llambrich, Frans M van der Kloet, Lluc Sementé, Anaïs Rodrigues, Saer Samanipour, Pierre-Hugues Stefanuto, Johan A Westerhuis, Raquel Cumeras, Jesús Brezmes
{"title":"GcDUO: an open-source software for GC × GC-MS data analysis.","authors":"Maria Llambrich, Frans M van der Kloet, Lluc Sementé, Anaïs Rodrigues, Saer Samanipour, Pierre-Hugues Stefanuto, Johan A Westerhuis, Raquel Cumeras, Jesús Brezmes","doi":"10.1093/bib/bbaf080","DOIUrl":null,"url":null,"abstract":"<p><p>Comprehensive 2D gas chromatography coupled with mass spectrometry (GC × GC-MS) is a powerful analytical technique. However, the complexity and volume of data generated pose significant challenges for data processing and interpretation, limiting a broader adoption. Chemometric approaches, particularly multiway models like Parallel Factor Analysis (PARAFAC), have proven effective in addressing these challenges by enabling the extraction of meaningful chemical information from multi-dimensional datasets. However, traditional PARAFAC is constrained by its assumption of data tri-linearity, which may not be valid in all cases, leading to potential inaccuracies. To overcome these limitations, we present GcDUO, an open-source software implemented in R, designed specifically for the processing and analysis of GC × GC-MS data. GcDUO integrates advanced chemometric methods, including both PARAFAC and PARAFAC2, for a more accurate and comprehensive analysis. PARAFAC is particularly useful for deconvoluting overlapping peaks and extracting pure chemical signals, while PARAFAC2 relaxes de tri-linearity constraint, allowing the alignment between samples. The software is structured into six modules-data import, region of interest (ROI) selection, deconvolution, peak annotation, data integration, and visualization-facilitating comprehensive and flexible data processing. GcDUO was validated against the gold-standard software for comprehensive GC, demonstrating a high correlation (R2 = 0.9) in peak area measurements, confirming its effectiveness and reliability. GcDUO provides a valuable, open-source platform for researchers in metabolomics and related fields, enabling more accessible and customizable GC × GC-MS data analysis.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 2","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11879434/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbaf080","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Comprehensive 2D gas chromatography coupled with mass spectrometry (GC × GC-MS) is a powerful analytical technique. However, the complexity and volume of data generated pose significant challenges for data processing and interpretation, limiting a broader adoption. Chemometric approaches, particularly multiway models like Parallel Factor Analysis (PARAFAC), have proven effective in addressing these challenges by enabling the extraction of meaningful chemical information from multi-dimensional datasets. However, traditional PARAFAC is constrained by its assumption of data tri-linearity, which may not be valid in all cases, leading to potential inaccuracies. To overcome these limitations, we present GcDUO, an open-source software implemented in R, designed specifically for the processing and analysis of GC × GC-MS data. GcDUO integrates advanced chemometric methods, including both PARAFAC and PARAFAC2, for a more accurate and comprehensive analysis. PARAFAC is particularly useful for deconvoluting overlapping peaks and extracting pure chemical signals, while PARAFAC2 relaxes de tri-linearity constraint, allowing the alignment between samples. The software is structured into six modules-data import, region of interest (ROI) selection, deconvolution, peak annotation, data integration, and visualization-facilitating comprehensive and flexible data processing. GcDUO was validated against the gold-standard software for comprehensive GC, demonstrating a high correlation (R2 = 0.9) in peak area measurements, confirming its effectiveness and reliability. GcDUO provides a valuable, open-source platform for researchers in metabolomics and related fields, enabling more accessible and customizable GC × GC-MS data analysis.
期刊介绍:
Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data.
The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.