Abhirup Chaudhuri, S Venkateshwara Rao, Ankit Singh, M Pradeep Kumar, Debasis Atta
{"title":"A systematic approach to tuning a neural network model and its application in estimating layer parameters from VES Schlumberger data","authors":"Abhirup Chaudhuri, S Venkateshwara Rao, Ankit Singh, M Pradeep Kumar, Debasis Atta","doi":"10.1007/s12040-024-02349-5","DOIUrl":null,"url":null,"abstract":"<p>Interpretation of vertical electrical sounding (VES) data is inherently difficult due to its ambiguity and non-linear characteristics. Conventional least square-based methods rely on a well-defined <i>apriori</i> model, constrained by ground information (available borehole logs and field observations of exposed lithological sections) for meaningful interpretations. In this work, a back-propagation neural network-based model was developed to estimate layer resistivities and thickness from given apparent resistivities. The model was trained and validated on noise-infused synthetic datasets. Since the effectiveness and generalisation of any model depend on its hyperparameter settings, we investigated effective methods for estimating hyperparameters such as learning rate, momentum, model architecture and learning rate scheduling parameters. It is well known that the optimal values of hyperparameters are not entirely independent of each other. Thus, any change in one hyperparameter changes the optimal range of all other hyperparameters, and thus, tuning any hyperparameter individually is futile. This warrants a joint hyperparameter tuning along with network architecture, which was carried out using a modified version of meta-heuristic black hole algorithm. The modifications include randomly flipping one or more coordinates of the population stars (solutions) whose cost function was above a threshold value decided by a mutation rate parameter. This helped in boosting the exploration capability of the algorithm and prune trajectories with higher cost functions. It is demonstrated that with a finely tuned neural network model, reasonable resistivity model parameters which interpret the ground conditions fairly well could be obtained. The model was tested on resistivity-sounding data with associated borehole lithologs and was found to be giving reasonable results. The same model was used to estimate the overburden thickness consisting of topsoil and deposited silts in Bhadradri–Kothagudem district, India. The layer thickness was consistent with those seen in cutbanks in the area. The methods of optimal hyperparameter set estimation are not exclusive to this model and can be used to train models accomplishing other geophysical tasks.</p>","PeriodicalId":15609,"journal":{"name":"Journal of Earth System Science","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Earth System Science","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s12040-024-02349-5","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Interpretation of vertical electrical sounding (VES) data is inherently difficult due to its ambiguity and non-linear characteristics. Conventional least square-based methods rely on a well-defined apriori model, constrained by ground information (available borehole logs and field observations of exposed lithological sections) for meaningful interpretations. In this work, a back-propagation neural network-based model was developed to estimate layer resistivities and thickness from given apparent resistivities. The model was trained and validated on noise-infused synthetic datasets. Since the effectiveness and generalisation of any model depend on its hyperparameter settings, we investigated effective methods for estimating hyperparameters such as learning rate, momentum, model architecture and learning rate scheduling parameters. It is well known that the optimal values of hyperparameters are not entirely independent of each other. Thus, any change in one hyperparameter changes the optimal range of all other hyperparameters, and thus, tuning any hyperparameter individually is futile. This warrants a joint hyperparameter tuning along with network architecture, which was carried out using a modified version of meta-heuristic black hole algorithm. The modifications include randomly flipping one or more coordinates of the population stars (solutions) whose cost function was above a threshold value decided by a mutation rate parameter. This helped in boosting the exploration capability of the algorithm and prune trajectories with higher cost functions. It is demonstrated that with a finely tuned neural network model, reasonable resistivity model parameters which interpret the ground conditions fairly well could be obtained. The model was tested on resistivity-sounding data with associated borehole lithologs and was found to be giving reasonable results. The same model was used to estimate the overburden thickness consisting of topsoil and deposited silts in Bhadradri–Kothagudem district, India. The layer thickness was consistent with those seen in cutbanks in the area. The methods of optimal hyperparameter set estimation are not exclusive to this model and can be used to train models accomplishing other geophysical tasks.
期刊介绍:
The Journal of Earth System Science, an International Journal, was earlier a part of the Proceedings of the Indian Academy of Sciences – Section A begun in 1934, and later split in 1978 into theme journals. This journal was published as Proceedings – Earth and Planetary Sciences since 1978, and in 2005 was renamed ‘Journal of Earth System Science’.
The journal is highly inter-disciplinary and publishes scholarly research – new data, ideas, and conceptual advances – in Earth System Science. The focus is on the evolution of the Earth as a system: manuscripts describing changes of anthropogenic origin in a limited region are not considered unless they go beyond describing the changes to include an analysis of earth-system processes. The journal''s scope includes the solid earth (geosphere), the atmosphere, the hydrosphere (including cryosphere), and the biosphere; it also addresses related aspects of planetary and space sciences. Contributions pertaining to the Indian sub- continent and the surrounding Indian-Ocean region are particularly welcome. Given that a large number of manuscripts report either observations or model results for a limited domain, manuscripts intended for publication in JESS are expected to fulfill at least one of the following three criteria.
The data should be of relevance and should be of statistically significant size and from a region from where such data are sparse. If the data are from a well-sampled region, the data size should be considerable and advance our knowledge of the region.
A model study is carried out to explain observations reported either in the same manuscript or in the literature.
The analysis, whether of data or with models, is novel and the inferences advance the current knowledge.