首页 > 最新文献

Journal of Quality Technology最新文献

英文 中文
Deep multistage multi-task learning for quality prediction of multistage manufacturing systems 面向多阶段制造系统质量预测的深度多阶段多任务学习
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-04-20 DOI: 10.1080/00224065.2021.1903822
Hao Yan, Nurrettin Dorukhan Sergin, William A. Brenneman, Steve J. Lange, Shan Ba
Abstract In multistage manufacturing systems, modeling multiple quality indices based on the process sensing variables is important. However, the classic modeling technique predicts each quality variable one at a time, which fails to consider the correlation within or between stages. We propose a deep multistage multi-task learning framework to jointly predict all output sensing variables in a unified end-to-end learning framework according to the sequential system architecture in the MMS. Our numerical studies and real case study have shown that the new model has a superior performance compared to many benchmark methods as well as great interpretability through developed variable selection techniques.
在多阶段制造系统中,基于过程感知变量的多质量指标建模非常重要。然而,经典的建模技术每次只预测一个质量变量,而没有考虑阶段内或阶段之间的相关性。我们提出了一个深度多阶段多任务学习框架,根据MMS中的顺序系统架构,在统一的端到端学习框架中联合预测所有输出感知变量。我们的数值研究和实际案例研究表明,与许多基准方法相比,新模型具有优越的性能,并且通过开发的变量选择技术具有很强的可解释性。
{"title":"Deep multistage multi-task learning for quality prediction of multistage manufacturing systems","authors":"Hao Yan, Nurrettin Dorukhan Sergin, William A. Brenneman, Steve J. Lange, Shan Ba","doi":"10.1080/00224065.2021.1903822","DOIUrl":"https://doi.org/10.1080/00224065.2021.1903822","url":null,"abstract":"Abstract In multistage manufacturing systems, modeling multiple quality indices based on the process sensing variables is important. However, the classic modeling technique predicts each quality variable one at a time, which fails to consider the correlation within or between stages. We propose a deep multistage multi-task learning framework to jointly predict all output sensing variables in a unified end-to-end learning framework according to the sequential system architecture in the MMS. Our numerical studies and real case study have shown that the new model has a superior performance compared to many benchmark methods as well as great interpretability through developed variable selection techniques.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"1 1","pages":"526 - 544"},"PeriodicalIF":2.5,"publicationDate":"2021-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79996947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Monitoring proportions with two components of common cause variation 监测比例与两个组成部分的共同原因变化
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-04-14 DOI: 10.1080/00224065.2021.1903823
R. Goedhart, W. Woodall
Abstract We propose a method for monitoring proportions when the in-control proportion and the sample sizes vary over time. Our approach is able to overcome some of the performance issues of other commonly used methods, as we demonstrate in this paper using analytical and numerical methods. The derivations and results are shown mainly for monitoring proportions, but we show how the method can be extended to the monitoring of count data.
摘要本文提出了一种在控制比例和样本量随时间变化时监测比例的方法。我们的方法能够克服其他常用方法的一些性能问题,正如我们在本文中使用解析和数值方法所证明的那样。推导和结果主要用于监测比例,但我们展示了如何将该方法扩展到计数数据的监测。
{"title":"Monitoring proportions with two components of common cause variation","authors":"R. Goedhart, W. Woodall","doi":"10.1080/00224065.2021.1903823","DOIUrl":"https://doi.org/10.1080/00224065.2021.1903823","url":null,"abstract":"Abstract We propose a method for monitoring proportions when the in-control proportion and the sample sizes vary over time. Our approach is able to overcome some of the performance issues of other commonly used methods, as we demonstrate in this paper using analytical and numerical methods. The derivations and results are shown mainly for monitoring proportions, but we show how the method can be extended to the monitoring of count data.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"23 1","pages":"324 - 337"},"PeriodicalIF":2.5,"publicationDate":"2021-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82059945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Change detection in parametric multivariate dynamic data streams using the ARMAX-GARCH model 使用ARMAX-GARCH模型的参数化多元动态数据流的变化检测
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-04-06 DOI: 10.1080/00224065.2021.1903820
Miaomiao Yu, Chunjie Wu, F. Tsung
Abstract Dynamic data detection is one of the main concerns in the statistical process control (SPC) field. Here we focus on monitoring parametric multivariate dynamic data streams using the ARMAX-GARCH model, which reflects both the influence of exogenous variables on the mean vector and the heterogeneity of the covariance matrix. A quasi maximum likelihood estimator is used to estimate the parameter vector of a dynamic process, and a top-r control scheme is proposed to monitor the parameters of multi-dimensional data streams. Finally, a real-data example of monitoring landslide illustrates the superiorities of the proposed scheme.
动态数据检测是统计过程控制(SPC)领域的主要问题之一。本文重点研究了使用ARMAX-GARCH模型监测参数化多元动态数据流,该模型既反映了外生变量对均值向量的影响,也反映了协方差矩阵的异质性。利用拟极大似然估计估计动态过程的参数向量,提出了一种top-r控制方案来监测多维数据流的参数。最后,以滑坡监测的实际数据为例,说明了该方案的优越性。
{"title":"Change detection in parametric multivariate dynamic data streams using the ARMAX-GARCH model","authors":"Miaomiao Yu, Chunjie Wu, F. Tsung","doi":"10.1080/00224065.2021.1903820","DOIUrl":"https://doi.org/10.1080/00224065.2021.1903820","url":null,"abstract":"Abstract Dynamic data detection is one of the main concerns in the statistical process control (SPC) field. Here we focus on monitoring parametric multivariate dynamic data streams using the ARMAX-GARCH model, which reflects both the influence of exogenous variables on the mean vector and the heterogeneity of the covariance matrix. A quasi maximum likelihood estimator is used to estimate the parameter vector of a dynamic process, and a top-r control scheme is proposed to monitor the parameters of multi-dimensional data streams. Finally, a real-data example of monitoring landslide illustrates the superiorities of the proposed scheme.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"148 1","pages":"303 - 323"},"PeriodicalIF":2.5,"publicationDate":"2021-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77777799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
The Statistical Analysis of Multivariate Failure Time Data: A Marginal Modeling Approach 多变量失效时间数据的统计分析:一种边际建模方法
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-04-01 DOI: 10.1080/00224065.2021.1903824
Guanqi Fang
This book written by Prentice and Zhao brings advances in the specialized field of failure time data analysis. In the existing literature, an extensive study of statistical methods for univariate failure time analysis has been performed. These methods include Kaplan-Meier (KM) estimator, Cox regression, and censored data rank test, etc. However, to my best knowledge, the effort on multivariate failure time data analysis is insufficient. Multivariate failure time data arise when failure times for individuals in a study cohort have a dependent feature, which exists in a number of situations, including epidemiologic studies and clinical trials, etc. The development of statistical methods for multivariate data deserves more research attention. Even though there are several books tackling the problem, they devote the analysis either for select types of multivariate data or have an emphasis on a specific method. Compared with these works, this book makes a summary of the latest innovative research results both deeply and extensively. Overall, the logic of this book is very clear. Chapter 1 gives an overview of the subsequent chapters. It covers a brief introduction to the models and tools and also provides some good application settings. Readers who are not familiar with the topic may read this chapter to grasp the motivation of the study quickly. Chapter 2 describes some core methods that are used to model univariate failure time data. It serves as a solid foundation for the extension to multivariate data analysis; therefore, readers need to pay much attention to this chapter. Chapters 3 and 4 provide tools for analyzing bivariate failure time data from the nonparametric and regression perspectives, respectively. In Chapters 5 and 6, the aforementioned models and tools are extended to cover the scenario of three or more failure time variates. Chapter 7 further considers the case of recurrent event data. Finally, the book concludes with Chapter 8, which discusses approaches to handling more general assumptions, such as dependent censorship and mismeasured covariate data. As implied by the title, the marginal modeling approach is the most important and unique feature of this book. This approach has been described in detail by Sections 4.6, 5.4, and 6.5. Under this approach, a Cox-type model for the marginal double or triple or multiple failure hazard rates is utilized to explain the effects of time-dependent covariates. Some strengths provided by this approach make it distinct from the three conventional approaches: 1) the frailty approach, 2) the copula approach, and 3) the counting process intensity modeling. For example, the copula approach imposes a strong assumption on the dependencies among failure times and doesn’t allow such dependencies to depend on covariates. In contrast, the marginal approach provides robustness by conducting semiparametric estimates of the dependency. In short, the contributions of this book consist of
这本由Prentice和Zhao撰写的书在故障时间数据分析的专业领域取得了进展。在现有文献中,对单变量失效时间分析的统计方法进行了广泛的研究。这些方法包括Kaplan-Meier (KM)估计、Cox回归和删减数据秩检验等。然而,据我所知,对多元失效时间数据分析的努力是不够的。当研究队列中个体的失败时间具有依赖特征时,就会出现多变量失败时间数据,这种情况存在于许多情况下,包括流行病学研究和临床试验等。多元数据统计方法的发展值得更多的研究。尽管有几本书解决了这个问题,但它们的分析要么是针对选择的多变量数据类型,要么是强调一种特定的方法。与这些著作相比,本书对最新的创新研究成果进行了深入而广泛的总结。总的来说,这本书的逻辑非常清晰。第一章概述了后面各章的内容。它简要介绍了模型和工具,并提供了一些很好的应用程序设置。不熟悉主题的读者可以阅读本章,以便快速掌握学习的动机。第2章描述了一些用于单变量失效时间数据建模的核心方法。它为扩展到多变量数据分析奠定了坚实的基础;因此,读者需要对这一章多加关注。第3章和第4章分别从非参数和回归的角度提供了分析二元失效时间数据的工具。在第5章和第6章中,上述模型和工具被扩展到涵盖三个或更多故障时间变量的场景。第7章进一步考虑了重复事件数据的情况。最后,本书以第8章结束,其中讨论了处理更一般假设的方法,例如依赖审查和误测协变量数据。正如书名所暗示的那样,边际建模方法是本书最重要也是最独特的特点。第4.6、5.4和6.5节详细描述了这种方法。在这种方法下,利用边际双重、三重或多重失效危险率的cox型模型来解释时变协变量的影响。该方法提供的一些优势使其与三种传统方法不同:1)脆弱性方法,2)copula方法和3)计数过程强度建模。例如,copula方法对故障时间之间的依赖关系施加了很强的假设,并且不允许这种依赖关系依赖于协变量。相反,边际方法通过对相关性进行半参数估计来提供鲁棒性。简而言之,本书的贡献包括
{"title":"The Statistical Analysis of Multivariate Failure Time Data: A Marginal Modeling Approach","authors":"Guanqi Fang","doi":"10.1080/00224065.2021.1903824","DOIUrl":"https://doi.org/10.1080/00224065.2021.1903824","url":null,"abstract":"This book written by Prentice and Zhao brings advances in the specialized field of failure time data analysis. In the existing literature, an extensive study of statistical methods for univariate failure time analysis has been performed. These methods include Kaplan-Meier (KM) estimator, Cox regression, and censored data rank test, etc. However, to my best knowledge, the effort on multivariate failure time data analysis is insufficient. Multivariate failure time data arise when failure times for individuals in a study cohort have a dependent feature, which exists in a number of situations, including epidemiologic studies and clinical trials, etc. The development of statistical methods for multivariate data deserves more research attention. Even though there are several books tackling the problem, they devote the analysis either for select types of multivariate data or have an emphasis on a specific method. Compared with these works, this book makes a summary of the latest innovative research results both deeply and extensively. Overall, the logic of this book is very clear. Chapter 1 gives an overview of the subsequent chapters. It covers a brief introduction to the models and tools and also provides some good application settings. Readers who are not familiar with the topic may read this chapter to grasp the motivation of the study quickly. Chapter 2 describes some core methods that are used to model univariate failure time data. It serves as a solid foundation for the extension to multivariate data analysis; therefore, readers need to pay much attention to this chapter. Chapters 3 and 4 provide tools for analyzing bivariate failure time data from the nonparametric and regression perspectives, respectively. In Chapters 5 and 6, the aforementioned models and tools are extended to cover the scenario of three or more failure time variates. Chapter 7 further considers the case of recurrent event data. Finally, the book concludes with Chapter 8, which discusses approaches to handling more general assumptions, such as dependent censorship and mismeasured covariate data. As implied by the title, the marginal modeling approach is the most important and unique feature of this book. This approach has been described in detail by Sections 4.6, 5.4, and 6.5. Under this approach, a Cox-type model for the marginal double or triple or multiple failure hazard rates is utilized to explain the effects of time-dependent covariates. Some strengths provided by this approach make it distinct from the three conventional approaches: 1) the frailty approach, 2) the copula approach, and 3) the counting process intensity modeling. For example, the copula approach imposes a strong assumption on the dependencies among failure times and doesn’t allow such dependencies to depend on covariates. In contrast, the marginal approach provides robustness by conducting semiparametric estimates of the dependency. In short, the contributions of this book consist of","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"8 1","pages":"359 - 360"},"PeriodicalIF":2.5,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73872207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Next Editor of the Journal of Quality Technology: Dr. L. Allison Jones-Farmer 《质量技术杂志》下一任编辑:L. Allison Jones-Farmer博士
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-03-15 DOI: 10.1080/00224065.2021.1902187
M. Testik, B. Colosimo
{"title":"Next Editor of the Journal of Quality Technology: Dr. L. Allison Jones-Farmer","authors":"M. Testik, B. Colosimo","doi":"10.1080/00224065.2021.1902187","DOIUrl":"https://doi.org/10.1080/00224065.2021.1902187","url":null,"abstract":"","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"182 1","pages":"217 - 217"},"PeriodicalIF":2.5,"publicationDate":"2021-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79772312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The fish patty experiment: a strip-plot look 鱼肉饼实验:一个条形图
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-03-11 DOI: 10.1080/00224065.2021.1889417
P. Goos
Abstract In this article, I provide a detailed discussion of the well-known fish patty experiment introduced in the literature by the late John A. Cornell in the first edition of his famous textbook on the design and analysis of mixture experiments. Cornell used the fish patty experiment as the motivating example for an article discussing that, for logistical reasons, many mixture-process variable experiments are run using a split-plot experimental design. More specifically, he described two possible ways in which the fish patty experiment might have been performed, both of which require a split-plot analysis of the data. These descriptions were not followed by the corresponding analyses of the fish patty data. Moreover, Cornell did not discuss the most convenient way in which the fish patty experiment could have been run, namely using a strip-plot design. In this article, I discuss the logistics leading to a strip-plot design, conduct the corresponding strip-plot analysis and contrast it with the two split-plot analyses.
在本文中,我对已故约翰·a·康奈尔(John a . Cornell)在其著名的混合实验设计与分析教科书第一版中介绍的著名的鱼饼实验进行了详细的讨论。康奈尔在一篇文章中以鱼肉饼实验为例,讨论了出于逻辑原因,许多混合过程变量实验都是使用分块实验设计进行的。更具体地说,他描述了两种可能进行鱼饼实验的方法,这两种方法都需要对数据进行分割图分析。这些描述并没有对鱼肉饼数据进行相应的分析。此外,康奈尔并没有讨论进行鱼肉饼实验最方便的方法,即使用条形图设计。在本文中,我讨论了导致带状地块设计的物流,进行了相应的带状地块分析,并与两种分割地块分析进行了对比。
{"title":"The fish patty experiment: a strip-plot look","authors":"P. Goos","doi":"10.1080/00224065.2021.1889417","DOIUrl":"https://doi.org/10.1080/00224065.2021.1889417","url":null,"abstract":"Abstract In this article, I provide a detailed discussion of the well-known fish patty experiment introduced in the literature by the late John A. Cornell in the first edition of his famous textbook on the design and analysis of mixture experiments. Cornell used the fish patty experiment as the motivating example for an article discussing that, for logistical reasons, many mixture-process variable experiments are run using a split-plot experimental design. More specifically, he described two possible ways in which the fish patty experiment might have been performed, both of which require a split-plot analysis of the data. These descriptions were not followed by the corresponding analyses of the fish patty data. Moreover, Cornell did not discuss the most convenient way in which the fish patty experiment could have been run, namely using a strip-plot design. In this article, I discuss the logistics leading to a strip-plot design, conduct the corresponding strip-plot analysis and contrast it with the two split-plot analyses.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"10 1","pages":"236 - 248"},"PeriodicalIF":2.5,"publicationDate":"2021-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79210090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Cluster-based data filtering for manufacturing big data systems 基于聚类的制造业大数据系统数据过滤
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-03-05 DOI: 10.1080/00224065.2021.1889420
Yifu Li, Xinwei Deng, Shan Ba, W. Myers, William A. Brenneman, Steve J. Lange, Ronald Zink, R. Jin
Abstract A manufacturing system collects big and heterogeneous data for tasks such as product quality modeling and data-driven decision-making. However, as the size of data grows, timely and effective data utilization becomes challenging. We propose an unsupervised data filtering method to reduce manufacturing big data sets with multi-variate continuous variables into informative small data sets. Furthermore, to determine the appropriate proportion of data to be filtered, we propose a filtering information criterion (FIC) to balance the tradeoff between the filtered data size and the information preserved. The case study of a babycare manufacturing and a simulation study have shown the effectiveness of the proposed method.
制造系统收集大量异构数据,用于产品质量建模和数据驱动决策等任务。然而,随着数据规模的增长,及时有效地利用数据变得具有挑战性。提出了一种无监督数据过滤方法,将具有多变量连续变量的制造业大数据集简化为信息量大的小数据集。此外,为了确定要过滤的数据的适当比例,我们提出了一个过滤信息标准(FIC)来平衡过滤后的数据大小和保留的信息之间的权衡。通过对某婴儿护理用品生产企业的实例研究和仿真研究,验证了该方法的有效性。
{"title":"Cluster-based data filtering for manufacturing big data systems","authors":"Yifu Li, Xinwei Deng, Shan Ba, W. Myers, William A. Brenneman, Steve J. Lange, Ronald Zink, R. Jin","doi":"10.1080/00224065.2021.1889420","DOIUrl":"https://doi.org/10.1080/00224065.2021.1889420","url":null,"abstract":"Abstract A manufacturing system collects big and heterogeneous data for tasks such as product quality modeling and data-driven decision-making. However, as the size of data grows, timely and effective data utilization becomes challenging. We propose an unsupervised data filtering method to reduce manufacturing big data sets with multi-variate continuous variables into informative small data sets. Furthermore, to determine the appropriate proportion of data to be filtered, we propose a filtering information criterion (FIC) to balance the tradeoff between the filtered data size and the information preserved. The case study of a babycare manufacturing and a simulation study have shown the effectiveness of the proposed method.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"152 1","pages":"290 - 302"},"PeriodicalIF":2.5,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79585482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Optimal design subsampling from Big Datasets 基于大数据集的最优设计子抽样
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-03-04 DOI: 10.1080/00224065.2021.1889418
L. Deldossi, C. Tommasi
Abstract Big Data are huge amounts of digital information that rarely result from properly planned surveys; as a consequence they often contain redundant observations. When the aim is to answer particular questions of interest, we suggest selecting a subsample of units that contains the majority of the information to achieve this goal. Selection methods driven by the theory of optimal design incorporate the inferential purposes and thus perform better than standard sampling schemes.
大数据是大量的数字信息,很少来自适当计划的调查;因此,它们常常包含多余的观察结果。当目标是回答感兴趣的特定问题时,我们建议选择包含大部分信息的单元子样本来实现这一目标。由最优设计理论驱动的选择方法结合了推理目的,因此比标准抽样方案执行得更好。
{"title":"Optimal design subsampling from Big Datasets","authors":"L. Deldossi, C. Tommasi","doi":"10.1080/00224065.2021.1889418","DOIUrl":"https://doi.org/10.1080/00224065.2021.1889418","url":null,"abstract":"Abstract Big Data are huge amounts of digital information that rarely result from properly planned surveys; as a consequence they often contain redundant observations. When the aim is to answer particular questions of interest, we suggest selecting a subsample of units that contains the majority of the information to achieve this goal. Selection methods driven by the theory of optimal design incorporate the inferential purposes and thus perform better than standard sampling schemes.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"45 1","pages":"93 - 101"},"PeriodicalIF":2.5,"publicationDate":"2021-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77437589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Forward stepwise random forest analysis for experimental designs 实验设计的前向逐步随机森林分析
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-01-28 DOI: 10.1080/00224065.2020.1865853
Chang-Yun Lin
Abstract In experimental designs, it is usually assumed that the data follow normal distributions and the models have linear structures. In practice, experimenters may encounter different types of responses and be uncertain about model structures. If this is the case, traditional methods, such as the ANOVA and regression, are not suitable for data analysis and model selection. We introduce the random forest analysis, which is a powerful machine learning method capable of analyzing numerical and categorical data with complicated model structures. To perform model selection and factor identification with the random forest method, we propose a forward stepwise algorithm and develop Python and R codes based on minimizing the OOB error. Six examples including simulation and case studies are provided. We compare the performance of the proposed method and some frequently used analysis methods. Results show that the forward stepwise random forest analysis, in general, has a high power for identifying active factors and selects models that have high prediction accuracy.
在实验设计中,通常假设数据服从正态分布,模型具有线性结构。在实践中,实验者可能会遇到不同类型的反应,并且对模型结构不确定。如果是这种情况,传统的方法,如方差分析和回归,不适合进行数据分析和模型选择。随机森林分析是一种强大的机器学习方法,能够分析具有复杂模型结构的数值和分类数据。为了使用随机森林方法进行模型选择和因子识别,我们提出了一种前向逐步算法,并基于最小化OOB误差开发了Python和R代码。给出了包括仿真和案例研究在内的六个实例。我们比较了所提出的方法和一些常用的分析方法的性能。结果表明,一般情况下,正演逐步随机森林分析在识别主动因素和选择预测精度较高的模型方面具有较高的能力。
{"title":"Forward stepwise random forest analysis for experimental designs","authors":"Chang-Yun Lin","doi":"10.1080/00224065.2020.1865853","DOIUrl":"https://doi.org/10.1080/00224065.2020.1865853","url":null,"abstract":"Abstract In experimental designs, it is usually assumed that the data follow normal distributions and the models have linear structures. In practice, experimenters may encounter different types of responses and be uncertain about model structures. If this is the case, traditional methods, such as the ANOVA and regression, are not suitable for data analysis and model selection. We introduce the random forest analysis, which is a powerful machine learning method capable of analyzing numerical and categorical data with complicated model structures. To perform model selection and factor identification with the random forest method, we propose a forward stepwise algorithm and develop Python and R codes based on minimizing the OOB error. Six examples including simulation and case studies are provided. We compare the performance of the proposed method and some frequently used analysis methods. Results show that the forward stepwise random forest analysis, in general, has a high power for identifying active factors and selects models that have high prediction accuracy.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"22 1","pages":"488 - 504"},"PeriodicalIF":2.5,"publicationDate":"2021-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84776529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A spatiotemporal prediction approach for a 3D thermal field from sensor networks 基于传感器网络的三维热场时空预测方法
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-01-22 DOI: 10.1080/00224065.2020.1851618
Di Wang, Kaibo Liu, Xi Zhang
Abstract Thermal fields exist widely in engineering systems and are critical for engineering operation, product quality and system safety in many industries. An accurate prediction of thermal field distribution, that is, acquiring any location of interest in a thermal field at the present and future time, is essential to provide useful information for the surveillance, maintenance, and improvement of a system. However, thermal field prediction using data acquired from sensor networks is challenging due to data sparsity and missing data problems. To address this issue, we propose a field spatiotemporal prediction approach based on transfer learning techniques by studying the dynamics of a 3 D thermal field from multiple homogeneous fields. Our model characterizes the spatiotemporal dynamics of the local thermal field variations by considering the spatiotemporal correlation of the fields and harnessing the information from homogeneous fields to acquire an accurate thermal field distribution in the future. A real case study of thermal fields during grain storage is conducted to validate our proposed approach. Grain thermal field prediction results provide a deep insight of grain quality during storage, which is helpful for the manager of grain storage to make further decisions about grain quality control and maintenance.
热场广泛存在于工程系统中,对工程运行、产品质量和系统安全具有重要意义。准确预测热场分布,即获取当前和未来热场中任何感兴趣的位置,对于系统的监视、维护和改进提供有用的信息至关重要。然而,由于数据稀疏和数据缺失问题,利用从传感器网络获取的数据进行热场预测具有挑战性。为了解决这一问题,我们提出了一种基于迁移学习技术的场时空预测方法,通过研究多个均匀场的三维热场动力学。该模型考虑了局部热场的时空相关性,并利用均匀场的信息来获得准确的热场分布,从而刻画了局部热场变化的时空动态特征。最后,以粮食储存过程中的热场为例,对本文方法进行了验证。粮食热场预测结果为深入了解储粮过程中的粮食质量状况提供了依据,为储粮管理者进一步制定粮食质量控制和维护决策提供了依据。
{"title":"A spatiotemporal prediction approach for a 3D thermal field from sensor networks","authors":"Di Wang, Kaibo Liu, Xi Zhang","doi":"10.1080/00224065.2020.1851618","DOIUrl":"https://doi.org/10.1080/00224065.2020.1851618","url":null,"abstract":"Abstract Thermal fields exist widely in engineering systems and are critical for engineering operation, product quality and system safety in many industries. An accurate prediction of thermal field distribution, that is, acquiring any location of interest in a thermal field at the present and future time, is essential to provide useful information for the surveillance, maintenance, and improvement of a system. However, thermal field prediction using data acquired from sensor networks is challenging due to data sparsity and missing data problems. To address this issue, we propose a field spatiotemporal prediction approach based on transfer learning techniques by studying the dynamics of a 3 D thermal field from multiple homogeneous fields. Our model characterizes the spatiotemporal dynamics of the local thermal field variations by considering the spatiotemporal correlation of the fields and harnessing the information from homogeneous fields to acquire an accurate thermal field distribution in the future. A real case study of thermal fields during grain storage is conducted to validate our proposed approach. Grain thermal field prediction results provide a deep insight of grain quality during storage, which is helpful for the manager of grain storage to make further decisions about grain quality control and maintenance.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"1 1","pages":"215 - 235"},"PeriodicalIF":2.5,"publicationDate":"2021-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88334330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Journal of Quality Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1