{"title":"Solutions to small datasets in defects detection based on Markov chain Monte Carlo simulations","authors":"Xin Wang, Hua Fan","doi":"10.1117/12.2679161","DOIUrl":null,"url":null,"abstract":"In the processes of industrial defects detections, deep neural networks have received much favor for its high efficiency and low labor cost. As is known, a big amount of dataset is the precondition to train any network. However, the datasets of industrial defects tended to be small and hard to obtain. To solve the problem above, we raise a thought of simulation based on MCMC aiming to sample from the posterior distribution of the defect’s images, which is viewed as the limit distribution of certain Markov Chain. Once we capture the posterior distribution, the law of the defects is revealed. 1Therefore, generating pseudo samples by the law is quite reasonable. It has been proved that the algorithm is practicable by advanced devices and economical in the long term. The significance of this work is to raise a practicable thought towards the problem of mini-batch industrial datasets. Code is available at https://github.com/MrSmallWang/solutions-tosmall- datasets.","PeriodicalId":301595,"journal":{"name":"Conference on Pure, Applied, and Computational Mathematics","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Pure, Applied, and Computational Mathematics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2679161","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In the processes of industrial defects detections, deep neural networks have received much favor for its high efficiency and low labor cost. As is known, a big amount of dataset is the precondition to train any network. However, the datasets of industrial defects tended to be small and hard to obtain. To solve the problem above, we raise a thought of simulation based on MCMC aiming to sample from the posterior distribution of the defect’s images, which is viewed as the limit distribution of certain Markov Chain. Once we capture the posterior distribution, the law of the defects is revealed. 1Therefore, generating pseudo samples by the law is quite reasonable. It has been proved that the algorithm is practicable by advanced devices and economical in the long term. The significance of this work is to raise a practicable thought towards the problem of mini-batch industrial datasets. Code is available at https://github.com/MrSmallWang/solutions-tosmall- datasets.