Enhancing manufacturing operations with synthetic data: a systematic framework for data generation, accuracy, and utility

Vishnupriya Buggineni, Cheng Chen, Jaime Camelio
{"title":"Enhancing manufacturing operations with synthetic data: a systematic framework for data generation, accuracy, and utility","authors":"Vishnupriya Buggineni, Cheng Chen, Jaime Camelio","doi":"10.3389/fmtec.2024.1320166","DOIUrl":null,"url":null,"abstract":"Addressing the challenges of data scarcity and privacy, synthetic data generation offers an innovative solution that advances manufacturing assembly operations and data analytics. Serving as a viable alternative, it enables manufacturers to leverage a broader and more diverse range of machine learning models by incorporating the creation of artificial data points for training and evaluation. Current methods lack generalizable framework for researchers to follow and solve these issues. The development of synthetic data sets, however, can make up for missing samples and enable researchers to understand existing issues within the manufacturing process and create data-driven tools for reducing manufacturing costs. This paper systematically reviews both discrete and continuous manufacturing process data types with their applicable synthetic generation techniques. The proposed framework entails four main stages: Data collection, pre-processing, synthetic data generation, and evaluation. To validate the framework’s efficacy, a case study leveraging synthetic data enabled an exploration of complex defect classification challenges in the packaging process. The results show enhanced prediction accuracy and provide a detailed comparative analysis of various synthetic data strategies. This paper concludes by highlighting our framework’s transformative potential for researchers, educators, and practitioners and provides scalable guidance to solve the data challenges in the current manufacturing sector.","PeriodicalId":330401,"journal":{"name":"Frontiers in Manufacturing Technology","volume":"59 10","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Manufacturing Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fmtec.2024.1320166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Addressing the challenges of data scarcity and privacy, synthetic data generation offers an innovative solution that advances manufacturing assembly operations and data analytics. Serving as a viable alternative, it enables manufacturers to leverage a broader and more diverse range of machine learning models by incorporating the creation of artificial data points for training and evaluation. Current methods lack generalizable framework for researchers to follow and solve these issues. The development of synthetic data sets, however, can make up for missing samples and enable researchers to understand existing issues within the manufacturing process and create data-driven tools for reducing manufacturing costs. This paper systematically reviews both discrete and continuous manufacturing process data types with their applicable synthetic generation techniques. The proposed framework entails four main stages: Data collection, pre-processing, synthetic data generation, and evaluation. To validate the framework’s efficacy, a case study leveraging synthetic data enabled an exploration of complex defect classification challenges in the packaging process. The results show enhanced prediction accuracy and provide a detailed comparative analysis of various synthetic data strategies. This paper concludes by highlighting our framework’s transformative potential for researchers, educators, and practitioners and provides scalable guidance to solve the data challenges in the current manufacturing sector.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用合成数据改进制造操作:数据生成、准确性和实用性的系统框架
为应对数据稀缺和隐私方面的挑战,合成数据生成提供了一种创新的解决方案,可促进制造装配操作和数据分析。作为一种可行的替代方法,它通过创建人工数据点进行训练和评估,使制造商能够利用更广泛、更多样的机器学习模型。目前的方法缺乏通用框架,研究人员无法遵循并解决这些问题。然而,合成数据集的开发可以弥补样本的缺失,使研究人员能够了解制造过程中的现有问题,并创建数据驱动的工具来降低制造成本。本文系统回顾了离散和连续制造过程数据类型及其适用的合成生成技术。建议的框架包括四个主要阶段:数据收集、预处理、合成数据生成和评估。为了验证该框架的有效性,利用合成数据进行了一项案例研究,探索了包装过程中复杂的缺陷分类难题。结果表明,预测准确性得到了提高,并对各种合成数据策略进行了详细的比较分析。本文最后强调了我们的框架对研究人员、教育工作者和从业人员的变革潜力,并为解决当前制造业的数据挑战提供了可扩展的指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Enhancing manufacturing operations with synthetic data: a systematic framework for data generation, accuracy, and utility Enhancing manufacturing operations with synthetic data: a systematic framework for data generation, accuracy, and utility Imaging systems and techniques for fusion-based metal additive manufacturing: a review Leveraging I4.0 smart methodologies for developing solutions for harvesting produce Editorial: Horizons in manufacturing technology
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1