Haonan Wang , Quanzhi Sun , Jun Wu , Xuxia Zhang , Weipeng Liu , Tao Peng , Renzhong Tang
{"title":"基于自训练和改进 XGBoost 的铝合金铸件质量预测方法","authors":"Haonan Wang , Quanzhi Sun , Jun Wu , Xuxia Zhang , Weipeng Liu , Tao Peng , Renzhong Tang","doi":"10.1016/j.rcim.2024.102890","DOIUrl":null,"url":null,"abstract":"<div><div>The revolutionary advances in integrated components in current automotive industry have led to a sharply rising demand for aluminum alloy castings. Targeted quality inspection is thus proposed for components manufacturers to achieve high responsiveness and low operational cost. This suggests casting machine manufacturers to integrate advanced quality prediction functions into the next generation of intelligent casting machines. However, acquiring ample quality inspection data is essential for implementing such functions, which is often challenging, if not infeasible, due to practical issues such as data proprietorship or privacy. Self-training is a good candidate for dealing with scarce labeled data, and XGBoost is commonly used as the base classifier. However, misclassification of unlabeled data happens using XGBoost, which could lead to incorrect pseudo-label assignments, eventually resulting in higher misclassification rate. To address this challenge, a self-training and improved XGBoost-based aluminum alloy casting quality prediction approach is proposed. This approach integrates the classification loss of unlabeled data in the objective function as a new regularization term and considers first and second partial derivatives of the classification loss function for unlabeled data in the leaf node's weight score. The proposed approach penalizes those classification models that misclassify unlabeled data, thereby improves quality prediction performance. To evaluate the effectiveness of our approach, a casting machine manufacturer was collaborated to conduct a case study. The results on three-type casting quality prediction demonstrate that our approach could achieve an accuracy, precision, recall and F1 score of 93.2 %, 90 %, 64.2 %, and 0.75, respectively, outperforming all compared approaches. The approach supports casting machine manufacturers to pre-train a casting quality prediction models with scarce labeled data, enabling swift deployment and customization for targeted quality inspection.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"92 ","pages":"Article 102890"},"PeriodicalIF":9.1000,"publicationDate":"2024-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Self-training-based approach with improved XGBoost for aluminum alloy casting quality prediction\",\"authors\":\"Haonan Wang , Quanzhi Sun , Jun Wu , Xuxia Zhang , Weipeng Liu , Tao Peng , Renzhong Tang\",\"doi\":\"10.1016/j.rcim.2024.102890\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The revolutionary advances in integrated components in current automotive industry have led to a sharply rising demand for aluminum alloy castings. Targeted quality inspection is thus proposed for components manufacturers to achieve high responsiveness and low operational cost. This suggests casting machine manufacturers to integrate advanced quality prediction functions into the next generation of intelligent casting machines. However, acquiring ample quality inspection data is essential for implementing such functions, which is often challenging, if not infeasible, due to practical issues such as data proprietorship or privacy. Self-training is a good candidate for dealing with scarce labeled data, and XGBoost is commonly used as the base classifier. However, misclassification of unlabeled data happens using XGBoost, which could lead to incorrect pseudo-label assignments, eventually resulting in higher misclassification rate. To address this challenge, a self-training and improved XGBoost-based aluminum alloy casting quality prediction approach is proposed. This approach integrates the classification loss of unlabeled data in the objective function as a new regularization term and considers first and second partial derivatives of the classification loss function for unlabeled data in the leaf node's weight score. The proposed approach penalizes those classification models that misclassify unlabeled data, thereby improves quality prediction performance. To evaluate the effectiveness of our approach, a casting machine manufacturer was collaborated to conduct a case study. The results on three-type casting quality prediction demonstrate that our approach could achieve an accuracy, precision, recall and F1 score of 93.2 %, 90 %, 64.2 %, and 0.75, respectively, outperforming all compared approaches. The approach supports casting machine manufacturers to pre-train a casting quality prediction models with scarce labeled data, enabling swift deployment and customization for targeted quality inspection.</div></div>\",\"PeriodicalId\":21452,\"journal\":{\"name\":\"Robotics and Computer-integrated Manufacturing\",\"volume\":\"92 \",\"pages\":\"Article 102890\"},\"PeriodicalIF\":9.1000,\"publicationDate\":\"2024-11-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Robotics and Computer-integrated Manufacturing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0736584524001777\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Computer-integrated Manufacturing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0736584524001777","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Self-training-based approach with improved XGBoost for aluminum alloy casting quality prediction
The revolutionary advances in integrated components in current automotive industry have led to a sharply rising demand for aluminum alloy castings. Targeted quality inspection is thus proposed for components manufacturers to achieve high responsiveness and low operational cost. This suggests casting machine manufacturers to integrate advanced quality prediction functions into the next generation of intelligent casting machines. However, acquiring ample quality inspection data is essential for implementing such functions, which is often challenging, if not infeasible, due to practical issues such as data proprietorship or privacy. Self-training is a good candidate for dealing with scarce labeled data, and XGBoost is commonly used as the base classifier. However, misclassification of unlabeled data happens using XGBoost, which could lead to incorrect pseudo-label assignments, eventually resulting in higher misclassification rate. To address this challenge, a self-training and improved XGBoost-based aluminum alloy casting quality prediction approach is proposed. This approach integrates the classification loss of unlabeled data in the objective function as a new regularization term and considers first and second partial derivatives of the classification loss function for unlabeled data in the leaf node's weight score. The proposed approach penalizes those classification models that misclassify unlabeled data, thereby improves quality prediction performance. To evaluate the effectiveness of our approach, a casting machine manufacturer was collaborated to conduct a case study. The results on three-type casting quality prediction demonstrate that our approach could achieve an accuracy, precision, recall and F1 score of 93.2 %, 90 %, 64.2 %, and 0.75, respectively, outperforming all compared approaches. The approach supports casting machine manufacturers to pre-train a casting quality prediction models with scarce labeled data, enabling swift deployment and customization for targeted quality inspection.
期刊介绍:
The journal, Robotics and Computer-Integrated Manufacturing, focuses on sharing research applications that contribute to the development of new or enhanced robotics, manufacturing technologies, and innovative manufacturing strategies that are relevant to industry. Papers that combine theory and experimental validation are preferred, while review papers on current robotics and manufacturing issues are also considered. However, papers on traditional machining processes, modeling and simulation, supply chain management, and resource optimization are generally not within the scope of the journal, as there are more appropriate journals for these topics. Similarly, papers that are overly theoretical or mathematical will be directed to other suitable journals. The journal welcomes original papers in areas such as industrial robotics, human-robot collaboration in manufacturing, cloud-based manufacturing, cyber-physical production systems, big data analytics in manufacturing, smart mechatronics, machine learning, adaptive and sustainable manufacturing, and other fields involving unique manufacturing technologies.