Behzad Dastjerdy, A. Saeidi, Shahriyar Heidarzadeh
{"title":"地质力学数据异常值检测方法综述","authors":"Behzad Dastjerdy, A. Saeidi, Shahriyar Heidarzadeh","doi":"10.3390/geotechnics3020022","DOIUrl":null,"url":null,"abstract":"The reliability of geomechanical models and engineering designs depend heavily on high-quality data. In geomechanical projects, collecting and analyzing laboratory data is crucial in characterizing the mechanical properties of soils and rocks. However, insufficient lab data or underestimating data treatment can lead to unreliable data being used in the design stage, causing safety hazards, delays, or failures. Hence, detecting outliers or extreme values is significant for ensuring accurate geomechanical analysis. This study reviews and categorizes applicable outlier detection methods for geomechanical data into fence labeling methods and statistical tests. Using real geomechanical data, the applicability of these methods was examined based on four elements: data distribution, sensitivity to extreme values, sample size, and data skewness. The results indicated that statistical tests were less effective than fence labeling methods in detecting outliers in geomechanical data due to limitations in handling skewed data and small sample sizes. Thus, the best outlier detection method should consider this matter. Fence labeling methods, specifically, the medcouple boxplot and semi-interquartile range rule, were identified as the most accurate outlier detection methods for geomechanical data but may necessitate more advanced statistical techniques. Moreover, Tukey’s boxplot was found unsuitable for geomechanical data due to negative confidence intervals that conflicted with geomechanical principles.","PeriodicalId":11823,"journal":{"name":"Environmental geotechnics","volume":"29 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2023-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Review of Applicable Outlier Detection Methods to Treat Geomechanical Data\",\"authors\":\"Behzad Dastjerdy, A. Saeidi, Shahriyar Heidarzadeh\",\"doi\":\"10.3390/geotechnics3020022\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The reliability of geomechanical models and engineering designs depend heavily on high-quality data. In geomechanical projects, collecting and analyzing laboratory data is crucial in characterizing the mechanical properties of soils and rocks. However, insufficient lab data or underestimating data treatment can lead to unreliable data being used in the design stage, causing safety hazards, delays, or failures. Hence, detecting outliers or extreme values is significant for ensuring accurate geomechanical analysis. This study reviews and categorizes applicable outlier detection methods for geomechanical data into fence labeling methods and statistical tests. Using real geomechanical data, the applicability of these methods was examined based on four elements: data distribution, sensitivity to extreme values, sample size, and data skewness. The results indicated that statistical tests were less effective than fence labeling methods in detecting outliers in geomechanical data due to limitations in handling skewed data and small sample sizes. Thus, the best outlier detection method should consider this matter. Fence labeling methods, specifically, the medcouple boxplot and semi-interquartile range rule, were identified as the most accurate outlier detection methods for geomechanical data but may necessitate more advanced statistical techniques. Moreover, Tukey’s boxplot was found unsuitable for geomechanical data due to negative confidence intervals that conflicted with geomechanical principles.\",\"PeriodicalId\":11823,\"journal\":{\"name\":\"Environmental geotechnics\",\"volume\":\"29 1\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2023-05-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental geotechnics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.3390/geotechnics3020022\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, GEOLOGICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental geotechnics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3390/geotechnics3020022","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, GEOLOGICAL","Score":null,"Total":0}
Review of Applicable Outlier Detection Methods to Treat Geomechanical Data
The reliability of geomechanical models and engineering designs depend heavily on high-quality data. In geomechanical projects, collecting and analyzing laboratory data is crucial in characterizing the mechanical properties of soils and rocks. However, insufficient lab data or underestimating data treatment can lead to unreliable data being used in the design stage, causing safety hazards, delays, or failures. Hence, detecting outliers or extreme values is significant for ensuring accurate geomechanical analysis. This study reviews and categorizes applicable outlier detection methods for geomechanical data into fence labeling methods and statistical tests. Using real geomechanical data, the applicability of these methods was examined based on four elements: data distribution, sensitivity to extreme values, sample size, and data skewness. The results indicated that statistical tests were less effective than fence labeling methods in detecting outliers in geomechanical data due to limitations in handling skewed data and small sample sizes. Thus, the best outlier detection method should consider this matter. Fence labeling methods, specifically, the medcouple boxplot and semi-interquartile range rule, were identified as the most accurate outlier detection methods for geomechanical data but may necessitate more advanced statistical techniques. Moreover, Tukey’s boxplot was found unsuitable for geomechanical data due to negative confidence intervals that conflicted with geomechanical principles.
期刊介绍:
In 21st century living, engineers and researchers need to deal with growing problems related to climate change, oil and water storage, handling, storage and disposal of toxic and hazardous wastes, remediation of contaminated sites, sustainable development and energy derived from the ground.
Environmental Geotechnics aims to disseminate knowledge and provides a fresh perspective regarding the basic concepts, theory, techniques and field applicability of innovative testing and analysis methodologies and engineering practices in geoenvironmental engineering.
The journal''s Editor in Chief is a Member of the Committee on Publication Ethics.
All relevant papers are carefully considered, vetted by a distinguished team of international experts and rapidly published. Full research papers, short communications and comprehensive review articles are published under the following broad subject categories:
geochemistry and geohydrology,
soil and rock physics, biological processes in soil, soil-atmosphere interaction,
electrical, electromagnetic and thermal characteristics of porous media,
waste management, utilization of wastes, multiphase science, landslide wasting,
soil and water conservation,
sensor development and applications,
the impact of climatic changes on geoenvironmental, geothermal/ground-source energy, carbon sequestration, oil and gas extraction techniques,
uncertainty, reliability and risk, monitoring and forensic geotechnics.