J. Goldschmidt, Elisabeth Moser, L. Nitzsche, Rudolf Bierl, J. Wöllenstein
{"title":"Improving the performance of artificial neural networks trained on synthetic data in gas spectroscopy – a study on two sensing approaches","authors":"J. Goldschmidt, Elisabeth Moser, L. Nitzsche, Rudolf Bierl, J. Wöllenstein","doi":"10.1515/teme-2023-0051","DOIUrl":null,"url":null,"abstract":"Abstract Artificial neural networks (ANNs) are used in quantitative infrared gas spectroscopy to predict concentrations on multi-component absorption spectra. Training of ANNs requires vast amounts of labelled training data which may be elaborate and time consuming to obtain. Additional data can be gained by the utilization of synthetically generated spectra, but at the cost of systematic deviations to measured data. Here, we present two approaches to train ANNs with a combination of comparatively small, measured data sets and synthetically generated data. For the first approach a neural network is trained hybridly with synthetically generated infrared absorption spectra of mixtures of N2O and CO and measured zero-gas spectra, taken with a mid-infrared dual comb spectrometer. This improves the mean absolute error (MAE) of the network predictions from 0.46 to 0.01 ppmV and 0.24 to 0.01 ppmV for the concentration predictions of N2O and CO respectively for zero-gas measurements which was previously observed for training with purely synthetic data. At the same time a similar performance on spectra from gas mixtures of 0–100 ppmV N2O and 0 to 60 ppmV CO was achieved. For the second approach an ANN pre-trained on synthetic infrared spectra of mixtures of acetone and ethanol is retrained on a small dataset consisting of 26 spectra taken with a mid-infrared photoacoustic spectrometer. In this case the MAE for the concentration predictions of ethanol and acetone are improved by 45 % and 20 % in comparison to purely synthetic training. This shows the capability of using synthetically generated data to train ANNs in combination with small amounts of measured data to further improve neural networks for gas sensing and the transferability between different sensing approaches.","PeriodicalId":56086,"journal":{"name":"Tm-Technisches Messen","volume":"78 1","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2023-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tm-Technisches Messen","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1515/teme-2023-0051","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract Artificial neural networks (ANNs) are used in quantitative infrared gas spectroscopy to predict concentrations on multi-component absorption spectra. Training of ANNs requires vast amounts of labelled training data which may be elaborate and time consuming to obtain. Additional data can be gained by the utilization of synthetically generated spectra, but at the cost of systematic deviations to measured data. Here, we present two approaches to train ANNs with a combination of comparatively small, measured data sets and synthetically generated data. For the first approach a neural network is trained hybridly with synthetically generated infrared absorption spectra of mixtures of N2O and CO and measured zero-gas spectra, taken with a mid-infrared dual comb spectrometer. This improves the mean absolute error (MAE) of the network predictions from 0.46 to 0.01 ppmV and 0.24 to 0.01 ppmV for the concentration predictions of N2O and CO respectively for zero-gas measurements which was previously observed for training with purely synthetic data. At the same time a similar performance on spectra from gas mixtures of 0–100 ppmV N2O and 0 to 60 ppmV CO was achieved. For the second approach an ANN pre-trained on synthetic infrared spectra of mixtures of acetone and ethanol is retrained on a small dataset consisting of 26 spectra taken with a mid-infrared photoacoustic spectrometer. In this case the MAE for the concentration predictions of ethanol and acetone are improved by 45 % and 20 % in comparison to purely synthetic training. This shows the capability of using synthetically generated data to train ANNs in combination with small amounts of measured data to further improve neural networks for gas sensing and the transferability between different sensing approaches.
期刊介绍:
The journal promotes dialogue between the developers of application-oriented sensors, measurement systems, and measurement methods and the manufacturers and measurement technologists who use them.
Topics
The manufacture and characteristics of new sensors for measurement technology in the industrial sector
New measurement methods
Hardware and software based processing and analysis of measurement signals to obtain measurement values
The outcomes of employing new measurement systems and methods.