{"title":"Chroma: A MATLAB package and open-source platform for biomarker data processing and automatic index calculations","authors":"Julian Traphagan, Guangsheng Zhuang","doi":"10.1016/j.cageo.2024.105675","DOIUrl":null,"url":null,"abstract":"<div><p>The molecular ratio indices of biological markers (biomarkers), such as the Carbon Preference Index (CPI) or <em>P</em><sub>aq</sub>, are frequently used as proxies for paleoclimatic and palaeoecological conditions. These indices are regularly extracted from the relative abundances of target molecules detected by a Gas Chromatography analyzer with a Flame Ionization Detector (GC-FID). Despite their use in biogeochemical studies for over a half-century, it remains common procedure to quantify the abundance of individual compounds by manual integration of chromatogram peaks (i.e., interpret baselines visually and characterize peaks by hand), which is time consuming and can lead to inconsistent results. Here, we introduce a new MATLAB package (Chroma) for the automatic detection and integration of standard-referenced biomarker abundances and the calculation of a variety of established hydrocarbon indices commonly reported in the published literature. The algorithm identifies the detector response timing of specific target peaks in a sample chromatogram by cross-referencing to a standard (e.g., Mix-A6, Schimmelmann, Indiana University Bloomington), then calculates the peak areas for an approximation of molecular abundance. This new toolkit for automatic and rapid integration of GC-acquired data provides a consistent and reproducible approach for the calculation of hydrocarbon indices and offers a standardized inter-laboratory platform for data comparisons and exchange. We validate the utility of the Chroma package with the chromatograms of plant wax <em>n</em>-alkanes, a widely used proxy for ecology and hydrology, from six stratigraphic sections in the Tibetan Plateau. Chroma is an effective tool for efficient data processing and will continuously evolve to accommodate extended uses in related areas of biomarker research beyond <em>n</em>-alkanes.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"191 ","pages":"Article 105675"},"PeriodicalIF":4.2000,"publicationDate":"2024-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Geosciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098300424001584","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
The molecular ratio indices of biological markers (biomarkers), such as the Carbon Preference Index (CPI) or Paq, are frequently used as proxies for paleoclimatic and palaeoecological conditions. These indices are regularly extracted from the relative abundances of target molecules detected by a Gas Chromatography analyzer with a Flame Ionization Detector (GC-FID). Despite their use in biogeochemical studies for over a half-century, it remains common procedure to quantify the abundance of individual compounds by manual integration of chromatogram peaks (i.e., interpret baselines visually and characterize peaks by hand), which is time consuming and can lead to inconsistent results. Here, we introduce a new MATLAB package (Chroma) for the automatic detection and integration of standard-referenced biomarker abundances and the calculation of a variety of established hydrocarbon indices commonly reported in the published literature. The algorithm identifies the detector response timing of specific target peaks in a sample chromatogram by cross-referencing to a standard (e.g., Mix-A6, Schimmelmann, Indiana University Bloomington), then calculates the peak areas for an approximation of molecular abundance. This new toolkit for automatic and rapid integration of GC-acquired data provides a consistent and reproducible approach for the calculation of hydrocarbon indices and offers a standardized inter-laboratory platform for data comparisons and exchange. We validate the utility of the Chroma package with the chromatograms of plant wax n-alkanes, a widely used proxy for ecology and hydrology, from six stratigraphic sections in the Tibetan Plateau. Chroma is an effective tool for efficient data processing and will continuously evolve to accommodate extended uses in related areas of biomarker research beyond n-alkanes.
期刊介绍:
Computers & Geosciences publishes high impact, original research at the interface between Computer Sciences and Geosciences. Publications should apply modern computer science paradigms, whether computational or informatics-based, to address problems in the geosciences.