Pub Date : 2024-01-17DOI: 10.3103/S0027134923070159
M. A. Krinitskiy, V. A. Golikov, N. N. Anikin, A. I. Suslov, A. V. Gavrikov, N. D. Tilinina
Marine radars are vital for safe navigation at sea, detecting vessels and obstacles. Sea clutter, caused by Bragg scattering, is usually filtered out as noise. It becomes detectable in unfiltered radar images, acquired using SeaVision hardware package, when wind speed and wave height exceed certain thresholds. The parameters of wind-induced ocean waves can be determined using these images; however, traditional spectral methods for obtaining wave characteristics face limitations in improving accuracy. Deep learning techniques offer advantages in image processing tasks, being more robust and able to handle noisier data, yet delivering the results without Fourier transformations and not necessarily requiring long series of radar imagery. In our study, we present the method exploiting convolutional neural networks (CNNs) for estimating wave characteristics from shipborne radar data captured using SeaVision package. In particular, we train our CNN to infer significant wave height using estimates provided by the Spotter buoy as ground truth. Our CNN-based method has an advantage over the classical methods due to the low requirements for radar image data since we process just one SeaVision snapshot, whereas classical method requires more than 20 min of radar images.
{"title":"Estimating Significant Wave Height from X-Band Navigation Radar Using Convolutional Neural Networks","authors":"M. A. Krinitskiy, V. A. Golikov, N. N. Anikin, A. I. Suslov, A. V. Gavrikov, N. D. Tilinina","doi":"10.3103/S0027134923070159","DOIUrl":"10.3103/S0027134923070159","url":null,"abstract":"<p>Marine radars are vital for safe navigation at sea, detecting vessels and obstacles. Sea clutter, caused by Bragg scattering, is usually filtered out as noise. It becomes detectable in unfiltered radar images, acquired using SeaVision hardware package, when wind speed and wave height exceed certain thresholds. The parameters of wind-induced ocean waves can be determined using these images; however, traditional spectral methods for obtaining wave characteristics face limitations in improving accuracy. Deep learning techniques offer advantages in image processing tasks, being more robust and able to handle noisier data, yet delivering the results without Fourier transformations and not necessarily requiring long series of radar imagery. In our study, we present the method exploiting convolutional neural networks (CNNs) for estimating wave characteristics from shipborne radar data captured using SeaVision package. In particular, we train our CNN to infer significant wave height using estimates provided by the Spotter buoy as ground truth. Our CNN-based method has an advantage over the classical methods due to the low requirements for radar image data since we process just one SeaVision snapshot, whereas classical method requires more than 20 min of radar images.</p>","PeriodicalId":711,"journal":{"name":"Moscow University Physics Bulletin","volume":"78 1 supplement","pages":"S128 - S137"},"PeriodicalIF":0.4,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140889238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-17DOI: 10.3103/S0027134923070275
V. Yu. Rezvov, M. A. Krinitskiy, V. A. Golikov, N. D. Tilinina
Marine X-band radar is an important navigational tool that records signals reflected from the sea surface. Theoretical studies show that the initial unfiltered signal contains information about the sea surface state, including wind wave parameters. Physical laws describing the intensity of the signal reflected from the rough surface are the basis of the classical approaches for significant wave height (SWH) estimation. Nevertheless, the latest research claims the possibility of SWH approximation using machine learning models. Both classical and AI-based approaches require in situ data collected during expensive sea expeditions or with wave monitoring systems. An alternative to real data is generation of synthetic radar images with certain wind wave parameters. This Fourier-based approach is capable of modelling the sea clutter images for wind waves of any given height. Assuming a fully-developed sea, we generate synthetic images from the Pierson–Moskowitz wave spectrum. After that, we apply an unsupervised learning using synthetic radar images to train the convolutional part of the neural network as the encoding part of the autoencoder. In this study, we demonstrate how the accuracy of SWH estimation based on radar images changes when the neural network is pretrained on synthetic data.
摘要 海洋 X 波段雷达是记录海面反射信号的重要导航工具。理论研究表明,未滤波的初始信号包含海面状态信息,包括风浪参数。描述粗糙表面反射信号强度的物理定律是估算显著波高(SWH)的经典方法的基础。不过,最新的研究表明,可以使用机器学习模型来逼近 SWH。无论是传统方法还是基于人工智能的方法,都需要通过昂贵的海上考察或海浪监测系统收集现场数据。替代真实数据的方法是生成具有特定风浪参数的合成雷达图像。这种基于傅立叶的方法能够为任何给定高度的风浪的海杂波图像建模。假设海面完全展开,我们根据皮尔森-莫斯考维兹波谱生成合成图像。然后,我们使用合成雷达图像进行无监督学习,训练神经网络的卷积部分,作为自动编码器的编码部分。在这项研究中,我们展示了当神经网络在合成数据上进行预训练时,基于雷达图像的 SWH 估计精度是如何变化的。
{"title":"Improvement of the AI-Based Estimation of Significant Wave Height Based on Preliminary Training on Synthetic X-Band Radar Sea Clutter Images","authors":"V. Yu. Rezvov, M. A. Krinitskiy, V. A. Golikov, N. D. Tilinina","doi":"10.3103/S0027134923070275","DOIUrl":"10.3103/S0027134923070275","url":null,"abstract":"<p>Marine X-band radar is an important navigational tool that records signals reflected from the sea surface. Theoretical studies show that the initial unfiltered signal contains information about the sea surface state, including wind wave parameters. Physical laws describing the intensity of the signal reflected from the rough surface are the basis of the classical approaches for significant wave height (SWH) estimation. Nevertheless, the latest research claims the possibility of SWH approximation using machine learning models. Both classical and AI-based approaches require in situ data collected during expensive sea expeditions or with wave monitoring systems. An alternative to real data is generation of synthetic radar images with certain wind wave parameters. This Fourier-based approach is capable of modelling the sea clutter images for wind waves of any given height. Assuming a fully-developed sea, we generate synthetic images from the Pierson–Moskowitz wave spectrum. After that, we apply an unsupervised learning using synthetic radar images to train the convolutional part of the neural network as the encoding part of the autoencoder. In this study, we demonstrate how the accuracy of SWH estimation based on radar images changes when the neural network is pretrained on synthetic data.</p>","PeriodicalId":711,"journal":{"name":"Moscow University Physics Bulletin","volume":"78 1 supplement","pages":"S188 - S201"},"PeriodicalIF":0.4,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140889302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-17DOI: 10.3103/S0027134923070184
A. P. Kryukov, S. P. Polyakov, A. A. Vlaskina, E. O. Gres, A. P. Demichev, Yu. Yu. Dubenskaya, D. P. Zhurov
Imaging atmospheric Cherenkov telescopes are used to record images of extensive area showers caused by high-energy particles colliding with the upper atmosphere. The images are analyzed to determine events’ physical parameters, such as the type and the energy of the primary particles. The distributions of some of the physical parameters can be used as well, for example, to determine the properties of a gamma ray source. The key problem of any experiment is the calibration of experimental data. For this purpose, Monte Carlo simulated data with known values of the physical parameters are used. The main disadvantage of this method is its extremely high requirements for computing resources and the large amount of time spent on modelling. In this paper, we use an alternative approach: Cherenkov telescope images are simulated with conditional variational autoencoders. We compare the characteristics of both the individual images and their Hillas parameter distributions with those of the images generated by the Monte Carlo method.
{"title":"The Use of Conditional Variational Autoencoders for Simulation of EAS Images from IACTs","authors":"A. P. Kryukov, S. P. Polyakov, A. A. Vlaskina, E. O. Gres, A. P. Demichev, Yu. Yu. Dubenskaya, D. P. Zhurov","doi":"10.3103/S0027134923070184","DOIUrl":"10.3103/S0027134923070184","url":null,"abstract":"<p>Imaging atmospheric Cherenkov telescopes are used to record images of extensive area showers caused by high-energy particles colliding with the upper atmosphere. The images are analyzed to determine events’ physical parameters, such as the type and the energy of the primary particles. The distributions of some of the physical parameters can be used as well, for example, to determine the properties of a gamma ray source. The key problem of any experiment is the calibration of experimental data. For this purpose, Monte Carlo simulated data with known values of the physical parameters are used. The main disadvantage of this method is its extremely high requirements for computing resources and the large amount of time spent on modelling. In this paper, we use an alternative approach: Cherenkov telescope images are simulated with conditional variational autoencoders. We compare the characteristics of both the individual images and their Hillas parameter distributions with those of the images generated by the Monte Carlo method.</p>","PeriodicalId":711,"journal":{"name":"Moscow University Physics Bulletin","volume":"78 1 supplement","pages":"S37 - S44"},"PeriodicalIF":0.4,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139500418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-17DOI: 10.3103/S0027134923070123
I. V. Isaev, I. E. Obornev, E. A. Obornev, E. A. Rodionov, M. I. Shimelevich, S. A. Dolenko
Exploration geophysics requires solving specific inverse problems — reconstructing the spatial distribution of the medium properties in the thickness of the earth from the geophysical fields measured on its surface. We consider inverse problems of gravimetry, magnetometry, magnetotelluric sounding, and their integration, which means simultaneous use of various geophysical fields to reconstruct the desired distribution. Integration requires the determined parameters for all the methods to be the same. This may be achieved by the spatial statement of the problem, in which the task is to determine the boundaries of geophysical objects. In our previous studies, we considered the parameterization scheme where the inverse problem was to determine the lower boundary of several geological layers. Each layer was characterized by variable values of the depth of the lower boundary along the section, and by fixed values of density, magnetization, and resistivity, both for the layer and over the entire dataset. It was demonstrated that the integration of geophysical methods provides significantly better results than the use of each of the methods separately. The present study considers an extended and more realistic model of data—a parameterization scheme with variable properties of the medium, both along each layer and over the dataset.
{"title":"Study of the Integration of Physical Methods in Neural Network Solution of the Inverse Problem of Exploration Geophysics with Variable Physical Properties of the Medium","authors":"I. V. Isaev, I. E. Obornev, E. A. Obornev, E. A. Rodionov, M. I. Shimelevich, S. A. Dolenko","doi":"10.3103/S0027134923070123","DOIUrl":"10.3103/S0027134923070123","url":null,"abstract":"<p>Exploration geophysics requires solving specific inverse problems — reconstructing the spatial distribution of the medium properties in the thickness of the earth from the geophysical fields measured on its surface. We consider inverse problems of gravimetry, magnetometry, magnetotelluric sounding, and their integration, which means simultaneous use of various geophysical fields to reconstruct the desired distribution. Integration requires the determined parameters for all the methods to be the same. This may be achieved by the spatial statement of the problem, in which the task is to determine the boundaries of geophysical objects. In our previous studies, we considered the parameterization scheme where the inverse problem was to determine the lower boundary of several geological layers. Each layer was characterized by variable values of the depth of the lower boundary along the section, and by fixed values of density, magnetization, and resistivity, both for the layer and over the entire dataset. It was demonstrated that the integration of geophysical methods provides significantly better results than the use of each of the methods separately. The present study considers an extended and more realistic model of data—a parameterization scheme with variable properties of the medium, both along each layer and over the dataset.</p>","PeriodicalId":711,"journal":{"name":"Moscow University Physics Bulletin","volume":"78 1 supplement","pages":"S122 - S127"},"PeriodicalIF":0.4,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139500471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-17DOI: 10.3103/S0027134923070135
A. J. Kasatkin, M. A. Krinitskiy
Fluxes of greenhouse gases (GHG) may be assessed in situ using the eddy covariance method through processing high-frequency measurements of gas concentration and wind speed acquired at certain sites, e.g., carbon measurement test areas of the pilot project of the Ministry of Education and Science of Russia. The measurements commonly come with noise, anomalies, and gaps of various natures. These anomalies result in biased GHG flux estimates. There are a number of empirical and heuristic approaches for filtering noise and anomalies, as well as for gap-filling. These approaches are characterized by many tuning parameters that are commonly adjusted by an expert, which is a limiting factor for large-scale deployment of GHG monitoring stations. In this study, we propose an alternative approach for anomaly detection in high-frequency measurements of GHG concentration and wind speed. Our approach is based on machine learning techniques. This approach is characterized by a lower number of tuning parameters. The goal of our study is to develop a fully automated data preprocessing routine based on machine learning algorithms. We collected the dataset of high-frequency GHG concentration and wind speed measurements from one of the carbon measurement test areas. In order to compare anomaly detection algorithms, we labeled anomalies in a subset of this dataset. We present two approaches for anomaly detection, namely: (a) identification of outliers based on the error magnitude in time series statistical forecasts performed by a machine learning (ML) algorithm; and (b) classification of anomalies using an ML model trained on the labeled dataset of outliers we mentioned above. We compared the approaches and algorithms based on the F1-score metric assessed with respect to an expert-labeled subset of anomalies in GHG concentration and wind speed time series. Within the forecast-error based approach, we trained several ML models: the ARIMA autoregression method, the CatBoost model for autoregression, the CatBoost model for forecasting employing additional features, and the LSTM artificial neural network. Within the supervised classification approach, we tested the CatBoost classification model. We demonstrate that ML models for forecasting deliver a high quality of time series prediction within the autoregression approach. We also show that the anomaly identification method based on the autoregression approach delivers the best quality with the F1-score reaching (0.812).
摘要利用涡度协方差法,通过处理在某些地点(如俄罗斯教育和科学部试点项目的碳测量试验区)获得的气体浓度和风速的高频测量数据,可以对温室气体(GHG)流量进行现场评估。这些测量结果通常带有噪音、异常和各种性质的间隙。这些异常现象会导致温室气体通量估计值出现偏差。有许多经验性和启发式方法可用于过滤噪声和异常,以及填补空白。这些方法的特点是有许多调整参数,通常由专家进行调整,这是大规模部署温室气体监测站的一个限制因素。在本研究中,我们提出了一种在温室气体浓度和风速的高频测量中进行异常检测的替代方法。我们的方法基于机器学习技术。这种方法的特点是调整参数数量较少。我们的研究目标是开发一种基于机器学习算法的全自动数据预处理程序。我们从一个碳测量测试区收集了高频温室气体浓度和风速测量数据集。为了比较异常检测算法,我们对该数据集中的一个子集进行了异常标注。我们提出了两种异常检测方法,即:(a) 根据机器学习(ML)算法在时间序列统计预测中的误差大小识别异常值;(b) 使用在上述异常值标注数据集上训练的 ML 模型对异常值进行分类。我们根据对专家标注的温室气体浓度和风速时间序列异常子集评估的 F1 分数指标,对各种方法和算法进行了比较。在基于预测误差的方法中,我们训练了多个 ML 模型:ARIMA 自回归方法、用于自回归的 CatBoost 模型、用于预测附加特征的 CatBoost 模型以及 LSTM 人工神经网络。在监督分类方法中,我们测试了 CatBoost 分类模型。我们证明,在自回归方法中,用于预测的 ML 模型可提供高质量的时间序列预测。我们还表明,基于自回归方法的异常识别方法质量最好,F1-分数达到了(0.812)。
{"title":"Machine Learning Techniques for Anomaly Detection in High-Frequency Time Series of Wind Speed and Greenhouse Gas Concentration Measurements","authors":"A. J. Kasatkin, M. A. Krinitskiy","doi":"10.3103/S0027134923070135","DOIUrl":"10.3103/S0027134923070135","url":null,"abstract":"<p>Fluxes of greenhouse gases (GHG) may be assessed in situ using the eddy covariance method through processing high-frequency measurements of gas concentration and wind speed acquired at certain sites, e.g., carbon measurement test areas of the pilot project of the Ministry of Education and Science of Russia. The measurements commonly come with noise, anomalies, and gaps of various natures. These anomalies result in biased GHG flux estimates. There are a number of empirical and heuristic approaches for filtering noise and anomalies, as well as for gap-filling. These approaches are characterized by many tuning parameters that are commonly adjusted by an expert, which is a limiting factor for large-scale deployment of GHG monitoring stations. In this study, we propose an alternative approach for anomaly detection in high-frequency measurements of GHG concentration and wind speed. Our approach is based on machine learning techniques. This approach is characterized by a lower number of tuning parameters. The goal of our study is to develop a fully automated data preprocessing routine based on machine learning algorithms. We collected the dataset of high-frequency GHG concentration and wind speed measurements from one of the carbon measurement test areas. In order to compare anomaly detection algorithms, we labeled anomalies in a subset of this dataset. We present two approaches for anomaly detection, namely: (a) identification of outliers based on the error magnitude in time series statistical forecasts performed by a machine learning (ML) algorithm; and (b) classification of anomalies using an ML model trained on the labeled dataset of outliers we mentioned above. We compared the approaches and algorithms based on the F1-score metric assessed with respect to an expert-labeled subset of anomalies in GHG concentration and wind speed time series. Within the forecast-error based approach, we trained several ML models: the ARIMA autoregression method, the CatBoost model for autoregression, the CatBoost model for forecasting employing additional features, and the LSTM artificial neural network. Within the supervised classification approach, we tested the CatBoost classification model. We demonstrate that ML models for forecasting deliver a high quality of time series prediction within the autoregression approach. We also show that the anomaly identification method based on the autoregression approach delivers the best quality with the F1-score reaching <span>(0.812)</span>.</p>","PeriodicalId":711,"journal":{"name":"Moscow University Physics Bulletin","volume":"78 1 supplement","pages":"S138 - S148"},"PeriodicalIF":0.4,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139500977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-17DOI: 10.3103/S0027134923070068
R. R. Fitagdinov, I. V. Kharuk
We report on the development of neural networks for generating readings from Telescope Array’s surface detectors with the largest registered integral signal. To achieve this goal, we implemented generative Wasserstein adversarial networks with the gradient penalty. The data used to train the model was generated using the Monte Carlo method. We obtained visually similar data which are consistent with the physics of the underlying processes. The anomaly search method can be employed to identify discrepancies between real and simulated data, as well as to introduce a quantitative measure of similarity between the real detector readings and those generated by the neural network’s readings.
{"title":"Generation of the Ground Detector Readings of the Telescope Array Experiment and the Search for Anomalies Using Neural Networks","authors":"R. R. Fitagdinov, I. V. Kharuk","doi":"10.3103/S0027134923070068","DOIUrl":"10.3103/S0027134923070068","url":null,"abstract":"<p>We report on the development of neural networks for generating readings from Telescope Array’s surface detectors with the largest registered integral signal. To achieve this goal, we implemented generative Wasserstein adversarial networks with the gradient penalty. The data used to train the model was generated using the Monte Carlo method. We obtained visually similar data which are consistent with the physics of the underlying processes. The anomaly search method can be employed to identify discrepancies between real and simulated data, as well as to introduce a quantitative measure of similarity between the real detector readings and those generated by the neural network’s readings.</p>","PeriodicalId":711,"journal":{"name":"Moscow University Physics Bulletin","volume":"78 1 supplement","pages":"S59 - S63"},"PeriodicalIF":0.4,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139501511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-17DOI: 10.3103/S0027134923070226
A. V. Matseiko, I. V. Kharuk
Baikal-GVD is a large ((sim)1 km({}^{3})) underwater neutrino telescope located in Lake Baikal, Russia. In this report, we present two machine learning techniques developed for its data analysis. First, we introduce a neural network for an efficient rejection of noise hits, emerging due to natural water luminescence. Second, we develop a neural network for distinguishing muon- and neutrino-induced events. By choosing an appropriate classification threshold, we preserve (90%) of neutrino-induced events, while muon-induced events are suppressed by a factor of (10^{-6}). Both of the developed neural networks employ the causal structure of events and surpass the precision of standard algorithmic approaches.
{"title":"Application of Machine Learning Methods in Baikal-GVD: Background Noise Rejection and Selection of Neutrino-Induced Events","authors":"A. V. Matseiko, I. V. Kharuk","doi":"10.3103/S0027134923070226","DOIUrl":"10.3103/S0027134923070226","url":null,"abstract":"<p>Baikal-GVD is a large (<span>(sim)</span>1 km<span>({}^{3})</span>) underwater neutrino telescope located in Lake Baikal, Russia. In this report, we present two machine learning techniques developed for its data analysis. First, we introduce a neural network for an efficient rejection of noise hits, emerging due to natural water luminescence. Second, we develop a neural network for distinguishing muon- and neutrino-induced events. By choosing an appropriate classification threshold, we preserve <span>(90%)</span> of neutrino-induced events, while muon-induced events are suppressed by a factor of <span>(10^{-6})</span>. Both of the developed neural networks employ the causal structure of events and surpass the precision of standard algorithmic approaches.</p>","PeriodicalId":711,"journal":{"name":"Moscow University Physics Bulletin","volume":"78 1 supplement","pages":"S71 - S79"},"PeriodicalIF":0.4,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139500423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-17DOI: 10.3103/S0027134923070251
I. S. Lazukhin, M. I. Petrovskiy, I. V. Mashechkin
The digital twin of a technological process is a complex of mathematical models that allow the determination of qualitative and quantitative dependencies between process parameters. It can predict the values of controlled and observed variables dynamically, depending on the process state and control actions. Additionally, it can identify hidden dependencies, states, and factors affecting the technological process and implement the selection of optimal control actions based on goals, technological limitations, or financial constraints. When building such models using data-driven methods, the input data consist of multidimensional time series from the production system’s sensor readings. The aim of this work is to develop, implement, and evaluate a subset of the digital twin functionality as applied to the oil cracking process. The key components of the proposed methods include data preprocessing, which encompasses the challenge of selecting stable operational periods for the plant, and feature selection, represented by a gradient boosting approach. We also focus on the construction of differentiable predictive models, which use modern deep learning methods to predict controlled parameter values based on dynamic system states and control. Moreover, we apply differentiable neural network models as constraints, objective functions, and state equations to solve the optimal control problem using a classical optimal control approach.
{"title":"Deep Learning Methods for Tasks of Creating Digital Twins for Technological Processes","authors":"I. S. Lazukhin, M. I. Petrovskiy, I. V. Mashechkin","doi":"10.3103/S0027134923070251","DOIUrl":"10.3103/S0027134923070251","url":null,"abstract":"<p>The digital twin of a technological process is a complex of mathematical models that allow the determination of qualitative and quantitative dependencies between process parameters. It can predict the values of controlled and observed variables dynamically, depending on the process state and control actions. Additionally, it can identify hidden dependencies, states, and factors affecting the technological process and implement the selection of optimal control actions based on goals, technological limitations, or financial constraints. When building such models using data-driven methods, the input data consist of multidimensional time series from the production system’s sensor readings. The aim of this work is to develop, implement, and evaluate a subset of the digital twin functionality as applied to the oil cracking process. The key components of the proposed methods include data preprocessing, which encompasses the challenge of selecting stable operational periods for the plant, and feature selection, represented by a gradient boosting approach. We also focus on the construction of differentiable predictive models, which use modern deep learning methods to predict controlled parameter values based on dynamic system states and control. Moreover, we apply differentiable neural network models as constraints, objective functions, and state equations to solve the optimal control problem using a classical optimal control approach.</p>","PeriodicalId":711,"journal":{"name":"Moscow University Physics Bulletin","volume":"78 1 supplement","pages":"S3 - S15"},"PeriodicalIF":0.4,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139500427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-17DOI: 10.3103/S0027134923070044
G. A. Kupriyanov, I. V. Isaev, I. V. Plastinin, T. A. Dolenko, S. A. Dolenko
One of the methods for the analysis of complex spectral bands (especially for spectra of liquid objects) is their decomposition into a limited number of spectral curves with physically reasonable shapes (Gaussian, Lorentzian, Voigt, etc.). Subsequent analysis of the dependences of the parameters of these contours on some external conditions in which the spectra are obtained may reveal some regularities that bear information about the physical processes taking place in the object. The problem with the required decomposition is that such a decomposition in the presence of noise in spectra is an incorrect inverse problem. Therefore, this problem is often solved by advanced optimization methods that are less likely to become stuck in local minima, such as genetic algorithms (GA). In the conventional version of GA, all individuals are similar regarding the probabilities and implementation of the main genetic operators (crossover and mutation) and the procedure of selection. In their preceding studies, the authors tested the gender GA (GGA), where the individuals of the two genders differ in terms of the mutation probability (higher for males) and the selection procedures for crossover (with the number of crossovers limited for females). In this study, we introduce additional differences between the genders in the procedures of selection and mutation. The improved modification of GGA is tested by comparing the efficiency of the conventional GA, GGA, and three versions of GGA with and without subsequent gradient descent in solving the problems of decomposition of the Raman valence band of liquid water into Gaussian contours.
{"title":"Decomposition of Spectral Band into Gaussian Contours Using an Improved Modification of the Gender Genetic Algorithm","authors":"G. A. Kupriyanov, I. V. Isaev, I. V. Plastinin, T. A. Dolenko, S. A. Dolenko","doi":"10.3103/S0027134923070044","DOIUrl":"10.3103/S0027134923070044","url":null,"abstract":"<p>One of the methods for the analysis of complex spectral bands (especially for spectra of liquid objects) is their decomposition into a limited number of spectral curves with physically reasonable shapes (Gaussian, Lorentzian, Voigt, etc.). Subsequent analysis of the dependences of the parameters of these contours on some external conditions in which the spectra are obtained may reveal some regularities that bear information about the physical processes taking place in the object. The problem with the required decomposition is that such a decomposition in the presence of noise in spectra is an incorrect inverse problem. Therefore, this problem is often solved by advanced optimization methods that are less likely to become stuck in local minima, such as genetic algorithms (GA). In the conventional version of GA, all individuals are similar regarding the probabilities and implementation of the main genetic operators (crossover and mutation) and the procedure of selection. In their preceding studies, the authors tested the gender GA (GGA), where the individuals of the two genders differ in terms of the mutation probability (higher for males) and the selection procedures for crossover (with the number of crossovers limited for females). In this study, we introduce additional differences between the genders in the procedures of selection and mutation. The improved modification of GGA is tested by comparing the efficiency of the conventional GA, GGA, and three versions of GGA with and without subsequent gradient descent in solving the problems of decomposition of the Raman valence band of liquid water into Gaussian contours.</p>","PeriodicalId":711,"journal":{"name":"Moscow University Physics Bulletin","volume":"78 1 supplement","pages":"S236 - S242"},"PeriodicalIF":0.4,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139500428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-17DOI: 10.3103/S0027134923070299
A. S. Savin, M. A. Krinitskiy, A. A. Osadchiev
Sea surface salinity (SSS) is a key physicochemical characteristic of the ocean that plays a significant role in describing the climate. Routine SSS retrieval algorithms exploiting remote sensing data have been developed and validated with high precision for typical regions of the World Ocean. Their effectiveness is worse in the Arctic though. To address this limitation, in this study, we employ machine learning (ML) techniques to enhance the quality of standard algorithms. We evaluate a few ML models, ranging from classical methods that process vector features, provided by standard Soil Moisture Active Passive (SMAP) satellite salinity algorithms, to deep artificial neural networks that combine vector features with two-dimensional fields extracted from the ERA5 reanalysis. We validate these models using in situ the data collected by the Shirshov Institute of Oceanology RAS during the expeditions to the Barents, Kara, Laptev, and East Siberian seas from 2015 to 2021. The results of the study indicate that the SMAP sea surface salinity standard product is improved in these regions. The ML models developed in this study make it possible to further study the Arctic region using enhanced sea surface salinity maps.
摘要海洋表面盐度(SSS)是海洋的一个关键物理化学特征,在描述气候方面发挥着重要作用。利用遥感数据开发的常规 SSS 检索算法已在世界海洋典型区域得到高精度验证。但它们在北极地区的效果较差。为了解决这一局限性,我们在本研究中采用了机器学习(ML)技术来提高标准算法的质量。我们对一些 ML 模型进行了评估,包括处理由标准土壤水分主动被动(SMAP)卫星盐度算法提供的矢量特征的经典方法,以及将矢量特征与从ERA5 再分析中提取的二维场相结合的深度人工神经网络。我们利用俄罗斯科学院希尔绍夫海洋研究所在 2015 至 2021 年期间对巴伦支海、喀拉海、拉普捷夫海和东西伯利亚海进行考察时收集的现场数据对这些模型进行了验证。研究结果表明,SMAP 海洋表面盐度标准产品在这些地区得到了改进。本研究开发的 ML 模型使得利用增强型海面盐度图进一步研究北极地区成为可能。
{"title":"SMAP Sea Surface Salinity Improvement in the Arctic Region Using Machine Learning Approaches","authors":"A. S. Savin, M. A. Krinitskiy, A. A. Osadchiev","doi":"10.3103/S0027134923070299","DOIUrl":"10.3103/S0027134923070299","url":null,"abstract":"<p>Sea surface salinity (SSS) is a key physicochemical characteristic of the ocean that plays a significant role in describing the climate. Routine SSS retrieval algorithms exploiting remote sensing data have been developed and validated with high precision for typical regions of the World Ocean. Their effectiveness is worse in the Arctic though. To address this limitation, in this study, we employ machine learning (ML) techniques to enhance the quality of standard algorithms. We evaluate a few ML models, ranging from classical methods that process vector features, provided by standard Soil Moisture Active Passive (SMAP) satellite salinity algorithms, to deep artificial neural networks that combine vector features with two-dimensional fields extracted from the ERA5 reanalysis. We validate these models using in situ the data collected by the Shirshov Institute of Oceanology RAS during the expeditions to the Barents, Kara, Laptev, and East Siberian seas from 2015 to 2021. The results of the study indicate that the SMAP sea surface salinity standard product is improved in these regions. The ML models developed in this study make it possible to further study the Arctic region using enhanced sea surface salinity maps.</p>","PeriodicalId":711,"journal":{"name":"Moscow University Physics Bulletin","volume":"78 1 supplement","pages":"S210 - S216"},"PeriodicalIF":0.4,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139500463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}