首页 > 最新文献

Journal of Quality Technology最新文献

英文 中文
Understanding elections through statistics by Ole J. Forsberg, CRC press, Taylor & Francis group, boca Raton, FL, 2020, 225 pp., $69.95, ISBN 978-0367895372 《通过统计理解选举》,奥勒·j·福斯伯格著,CRC出版社,泰勒和弗朗西斯集团,佛罗里达州博卡拉顿,2020年,225页,69.95美元,ISBN 978-0367895372
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-07-12 DOI: 10.1080/00224065.2021.1947163
Shuai Huang
This book offers a good introduction to some statistical methods used in elections. It has two parts. The first part contains four chapters that cover estimation methods for polls. The main technical problem is the estimation of the proportion of the population holding a particular preference in voting. The analytic core of the problem is binomial distribution and the sampling and estimation procedures center around this distribution. As in the real world there are always complications. Various remedies are provided to address these complications. The author has done a great job of introducing the critical concepts and considerations in both the problem formulation and solution. For example, when introducing the importance of weighting in deriving the estimate of the poll, the author pretends to write a press release of his poll result on the issue of gender fairness in the military. It is clear that if a different source of demographics statistics is used, the poll result is quite different. Examples as such are quite useful for readers to understand the subject matter and its complexity. The second part of the book covers a few techniques to detect frauds and anomalies by examining the election results. Some techniques build on an interesting premise that humans are bad at mimicking randomness. This echoes what Fisher (1958) had said, “if one tries to think of numbers at random, one thinks of numbers very far from at random.” The Benford test is introduced in detail, including its history and its interesting applications in analyzing election data to detect anomaly based on the distributions of the leading digits reported by different precincts. The differential invalidation and some regression models are introduced as well. Spatial correlations could be modeled by using the geographical information in the data. The book concludes with a detailed discussion on data from Sri Lanka since 1994. This is a useful book that can help a broad range of readers to appreciate the power of statistics in understanding the election process from an analytic and scientific perspective. On top of the techniques introduced in the book, there are anecdotes and comments and insights that can enrich the reading experience. E.g., as in the preface the statement from a Nicaraguan leader “Indeed, you won the elections, but I won the count.” or the comment in the end of Chapter 4 “as with many things in statistics, increasing quality in one area tends to reduce quality in another.” Statistical techniques in this book are tightly bonded with the contexts and the backgrounds of their application. After reading the book, I appreciate the book has helped me understand a complex problem in a complex world. Not everything is what it appears to be, but we can equip ourselves with sufficient knowledge and useful tools to help us look at the data in every angle and really feel the data as it is.
这本书很好地介绍了选举中使用的一些统计方法。它有两部分。第一部分包括四章,介绍了民意调查的估计方法。主要的技术问题是估计在投票中持有特定偏好的人口比例。问题的分析核心是二项分布,抽样和估计过程围绕着这个分布。就像在现实世界中一样,总会有复杂的事情发生。提供了各种补救措施来解决这些并发症。作者做了一个伟大的工作,介绍了关键的概念和考虑问题的制定和解决。例如,在介绍加权对得出民意调查估计数的重要性时,作者假装写了一篇关于军队中性别公平问题的民意调查结果的新闻稿。很明显,如果使用不同的人口统计来源,民意调查结果就会大不相同。这样的例子对于读者理解主题及其复杂性非常有用。书的第二部分介绍了通过检查选举结果来发现欺诈和异常现象的一些技术。一些技术建立在一个有趣的前提上,即人类不擅长模仿随机性。这与Fisher(1958)所说的相呼应,“如果一个人试图随机地思考数字,那么他想到的数字就远非随机。”详细介绍了Benford测试,包括它的历史和它在分析选举数据中有趣的应用,根据不同选区报告的领先数字的分布来检测异常。并介绍了微分失效和一些回归模型。利用数据中的地理信息可以建立空间相关性模型。本书最后对斯里兰卡自1994年以来的数据进行了详细讨论。这是一本有用的书,可以帮助广泛的读者从分析和科学的角度理解选举过程的统计力量。除了书中介绍的技巧外,书中还有轶事、评论和见解,可以丰富阅读体验。例如,在序言中,一位尼加拉瓜领导人说:“的确,你们赢得了选举,但我赢得了计票。或者第四章末尾的评论,“就像统计学中的许多事情一样,一个领域的质量提高往往会降低另一个领域的质量。”本书中的统计技术与上下文及其应用的背景紧密结合。读完这本书,我很感激这本书帮助我理解了一个复杂世界中的一个复杂问题。并不是所有的事情都像它看起来的那样,但我们可以用足够的知识和有用的工具来装备自己,帮助我们从各个角度看待数据,真正感受到数据的本来面目。
{"title":"Understanding elections through statistics by Ole J. Forsberg, CRC press, Taylor & Francis group, boca Raton, FL, 2020, 225 pp., $69.95, ISBN 978-0367895372","authors":"Shuai Huang","doi":"10.1080/00224065.2021.1947163","DOIUrl":"https://doi.org/10.1080/00224065.2021.1947163","url":null,"abstract":"This book offers a good introduction to some statistical methods used in elections. It has two parts. The first part contains four chapters that cover estimation methods for polls. The main technical problem is the estimation of the proportion of the population holding a particular preference in voting. The analytic core of the problem is binomial distribution and the sampling and estimation procedures center around this distribution. As in the real world there are always complications. Various remedies are provided to address these complications. The author has done a great job of introducing the critical concepts and considerations in both the problem formulation and solution. For example, when introducing the importance of weighting in deriving the estimate of the poll, the author pretends to write a press release of his poll result on the issue of gender fairness in the military. It is clear that if a different source of demographics statistics is used, the poll result is quite different. Examples as such are quite useful for readers to understand the subject matter and its complexity. The second part of the book covers a few techniques to detect frauds and anomalies by examining the election results. Some techniques build on an interesting premise that humans are bad at mimicking randomness. This echoes what Fisher (1958) had said, “if one tries to think of numbers at random, one thinks of numbers very far from at random.” The Benford test is introduced in detail, including its history and its interesting applications in analyzing election data to detect anomaly based on the distributions of the leading digits reported by different precincts. The differential invalidation and some regression models are introduced as well. Spatial correlations could be modeled by using the geographical information in the data. The book concludes with a detailed discussion on data from Sri Lanka since 1994. This is a useful book that can help a broad range of readers to appreciate the power of statistics in understanding the election process from an analytic and scientific perspective. On top of the techniques introduced in the book, there are anecdotes and comments and insights that can enrich the reading experience. E.g., as in the preface the statement from a Nicaraguan leader “Indeed, you won the elections, but I won the count.” or the comment in the end of Chapter 4 “as with many things in statistics, increasing quality in one area tends to reduce quality in another.” Statistical techniques in this book are tightly bonded with the contexts and the backgrounds of their application. After reading the book, I appreciate the book has helped me understand a complex problem in a complex world. Not everything is what it appears to be, but we can equip ourselves with sufficient knowledge and useful tools to help us look at the data in every angle and really feel the data as it is.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"9 1","pages":"122 - 122"},"PeriodicalIF":2.5,"publicationDate":"2021-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80102706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Boost-R: Gradient boosted trees for recurrence data Boost-R:用于递归数据的梯度增强树
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-07-03 DOI: 10.1080/00224065.2021.1948373
Xiao Liu, Rong Pan
Abstract Recurrence data arise from multi-disciplinary domains spanning reliability, cyber security, healthcare, online retailing, etc. This paper investigates an additive-tree-based approach, known as Boost-R (Boosting for Recurrence Data), for recurrent event data with both static and dynamic features. Boost-R constructs an ensemble of gradient boosted additive trees to estimate the cumulative intensity function of the recurrent event process, where a new tree is added to the ensemble by minimizing the regularized L 2 distance between the observed and predicted cumulative intensity. Unlike conventional regression trees, a time-dependent function is constructed by Boost-R on each tree leaf. The sum of these functions, from multiple trees, yields the ensemble estimator of the cumulative intensity. The divide-and-conquer nature of tree-based methods is appealing when hidden sub-populations exist within a heterogeneous population. The non-parametric nature of regression trees helps to avoid parametric assumptions on the complex interactions between event processes and features. Critical insights and advantages of Boost-R are investigated through comprehensive numerical examples. Datasets and computer code of Boost-R are made available on GitHub. To our best knowledge, Boost-R is the first gradient boosted additive-tree-based approach for modeling large-scale recurrent event data with both static and dynamic feature information.
重复数据来自多学科领域,包括可靠性、网络安全、医疗保健、在线零售等。本文研究了一种基于加性树的方法,称为Boost-R (Boosting for recurrent Data),用于具有静态和动态特征的循环事件数据。Boost-R构建了一个梯度增强的加性树集合来估计循环事件过程的累积强度函数,其中通过最小化观测到的和预测的累积强度之间的正则化l2距离,将新树添加到集合中。与传统的回归树不同,Boost-R在每个树叶上构建了一个时间相关的函数。这些函数的和,从多个树,产生累积强度的集合估计。当隐藏的子种群存在于异质种群中时,基于树的方法的分而治之的特性很有吸引力。回归树的非参数性质有助于避免对事件过程和特征之间复杂的相互作用进行参数假设。通过全面的数值实例研究了Boost-R的关键见解和优势。Boost-R的数据集和计算机代码可在GitHub上获得。据我们所知,Boost-R是第一个基于梯度增强加性树的方法,用于对具有静态和动态特征信息的大规模循环事件数据建模。
{"title":"Boost-R: Gradient boosted trees for recurrence data","authors":"Xiao Liu, Rong Pan","doi":"10.1080/00224065.2021.1948373","DOIUrl":"https://doi.org/10.1080/00224065.2021.1948373","url":null,"abstract":"Abstract Recurrence data arise from multi-disciplinary domains spanning reliability, cyber security, healthcare, online retailing, etc. This paper investigates an additive-tree-based approach, known as Boost-R (Boosting for Recurrence Data), for recurrent event data with both static and dynamic features. Boost-R constructs an ensemble of gradient boosted additive trees to estimate the cumulative intensity function of the recurrent event process, where a new tree is added to the ensemble by minimizing the regularized L 2 distance between the observed and predicted cumulative intensity. Unlike conventional regression trees, a time-dependent function is constructed by Boost-R on each tree leaf. The sum of these functions, from multiple trees, yields the ensemble estimator of the cumulative intensity. The divide-and-conquer nature of tree-based methods is appealing when hidden sub-populations exist within a heterogeneous population. The non-parametric nature of regression trees helps to avoid parametric assumptions on the complex interactions between event processes and features. Critical insights and advantages of Boost-R are investigated through comprehensive numerical examples. Datasets and computer code of Boost-R are made available on GitHub. To our best knowledge, Boost-R is the first gradient boosted additive-tree-based approach for modeling large-scale recurrent event data with both static and dynamic feature information.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"11 1","pages":"545 - 565"},"PeriodicalIF":2.5,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78715817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Change point detection and issue localization based on fleet-wide fault data 基于全车队故障数据的变更点检测和问题定位
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-06-30 DOI: 10.1080/00224065.2021.1937409
Zhanpan Zhang, N. Doganaksoy
Abstract Modern industrial assets (e.g., generators, turbines, engines) are outfitted with numerous sensors to monitor key operating and environmental variables. Unusual sensor readings, such as high temperature, excessive vibration, or low current, could trigger rule-based actions (also known as faults) that range from warning alarms to immediate shutdown of the asset to prevent potential damage. In the case study of this article, a wind park experienced a sudden surge in vibration-induced shutdowns. We utilize fault data logs from the park with the goal of detecting common change points across turbines. Another important goal is the localization of fault occurrences to an identifiable set of turbines. The literature on change point detection and localization for multiple assets is highly sparse. Our technical development is based on the generalized linear modeling framework. We combine well-known solutions to change point detection for a single asset with a heuristics-based approach to identify a common change point(s) for multiple assets. The performance of the proposed detection and localization algorithms is evaluated through synthetic (Monte Carlo) fault data streams. Several novel performance metrics are defined to characterize different aspects of a change point detection algorithm for multiple assets. For the case study example, the proposed methodology identified the change point and the subset of affected turbines with a high degree of accuracy. The problem described here warrants further study to accommodate general fault distributions, change point detection algorithms, and very large fleet sizes.
现代工业资产(如发电机、涡轮机、发动机)配备了许多传感器来监测关键的操作和环境变量。异常的传感器读数,如高温、过度振动或低电流,可能触发基于规则的动作(也称为故障),从警告警报到立即关闭资产,以防止潜在的损坏。在本文的案例研究中,一个风电场经历了振动引起的停机突然激增。我们利用来自园区的故障数据日志,目标是检测涡轮机之间的共同变化点。另一个重要目标是将故障定位到一组可识别的涡轮机上。关于多资产变化点检测和定位的文献非常稀少。我们的技术开发是基于广义线性建模框架。我们将众所周知的解决方案与基于启发式的方法结合起来,用于单个资产的变更点检测,以识别多个资产的公共变更点。通过综合(蒙特卡罗)故障数据流对所提出的检测和定位算法的性能进行了评估。定义了几个新的性能指标来描述多个资产的变化点检测算法的不同方面。对于案例研究示例,提出的方法以高精度确定了变化点和受影响的涡轮机子集。这里描述的问题值得进一步研究,以适应一般的故障分布、变化点检测算法和非常大的车队规模。
{"title":"Change point detection and issue localization based on fleet-wide fault data","authors":"Zhanpan Zhang, N. Doganaksoy","doi":"10.1080/00224065.2021.1937409","DOIUrl":"https://doi.org/10.1080/00224065.2021.1937409","url":null,"abstract":"Abstract Modern industrial assets (e.g., generators, turbines, engines) are outfitted with numerous sensors to monitor key operating and environmental variables. Unusual sensor readings, such as high temperature, excessive vibration, or low current, could trigger rule-based actions (also known as faults) that range from warning alarms to immediate shutdown of the asset to prevent potential damage. In the case study of this article, a wind park experienced a sudden surge in vibration-induced shutdowns. We utilize fault data logs from the park with the goal of detecting common change points across turbines. Another important goal is the localization of fault occurrences to an identifiable set of turbines. The literature on change point detection and localization for multiple assets is highly sparse. Our technical development is based on the generalized linear modeling framework. We combine well-known solutions to change point detection for a single asset with a heuristics-based approach to identify a common change point(s) for multiple assets. The performance of the proposed detection and localization algorithms is evaluated through synthetic (Monte Carlo) fault data streams. Several novel performance metrics are defined to characterize different aspects of a change point detection algorithm for multiple assets. For the case study example, the proposed methodology identified the change point and the subset of affected turbines with a high degree of accuracy. The problem described here warrants further study to accommodate general fault distributions, change point detection algorithms, and very large fleet sizes.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"10 1","pages":"453 - 465"},"PeriodicalIF":2.5,"publicationDate":"2021-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90219229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Complex geometries in additive manufacturing: A new solution for lattice structure modeling and monitoring 增材制造中的复杂几何:晶格结构建模和监测的新解决方案
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-06-30 DOI: 10.1080/00224065.2021.1926377
B. Colosimo, M. Grasso, Federica Garghetti, B. Rossi
Abstract The production of novel types of complex shapes is nowadays enabled by new manufacturing paradigms such as additive manufacturing, also known as 3D printing. The continuous increase of shape complexity imposes new challenges in terms of inspection, product qualification and process monitoring methodologies. Previously proposed methods for 2.5D free-form surfaces are no longer applicable in the presence of this kind of new full 3D geometries. This paper aims to tackle this challenge by presenting a statistical quality monitoring approach for structures that cannot be described in terms of parametric models. The goal consists of identifying out-of-control geometrical distortions by analyzing either local variations within the part or changes from part to part. The proposed approach involves an innovative solution for modeling the deviation between the nominal geometry (the originating 3D model) and the real geometry (measured via x-ray computed tomography) by slicing the shapes and estimating the deviation slice by slice. 3D deviation maps are then transformed into 1D deviation profiles enabling the use of a profile monitoring scheme for local defect detection. The feasibility and potential of this method are demonstrated by focusing on a category of complex shapes where an elemental geometry regularly repeats in space. These shapes are known as lattice structures, or metamaterials, and their trabecular shape is thought to provide innovative mechanical and functional performance. The performance of the proposed method is shown in real and simulated case studies.
如今,新型复杂形状的生产是由新的制造范式(如增材制造,也称为3D打印)实现的。形状复杂性的不断增加对检验、产品鉴定和过程监控方法提出了新的挑战。以前提出的2.5D自由曲面的方法不再适用于这种新的全三维几何形状。本文旨在通过提出一种不能用参数模型描述的结构的统计质量监测方法来解决这一挑战。目标包括通过分析零件内部的局部变化或零件之间的变化来识别失控的几何变形。所提出的方法涉及一种创新的解决方案,通过对形状进行切片并逐片估计偏差,来建模标称几何形状(原始3D模型)与实际几何形状(通过x射线计算机断层扫描测量)之间的偏差。然后将3D偏差图转换为1D偏差轮廓,从而可以使用轮廓监测方案进行局部缺陷检测。这种方法的可行性和潜力是通过集中在一个复杂的形状类别,其中一个基本的几何形状在空间中有规律地重复证明。这些形状被称为晶格结构或超材料,它们的小梁形状被认为提供了创新的机械和功能性能。实例分析和仿真结果表明了该方法的有效性。
{"title":"Complex geometries in additive manufacturing: A new solution for lattice structure modeling and monitoring","authors":"B. Colosimo, M. Grasso, Federica Garghetti, B. Rossi","doi":"10.1080/00224065.2021.1926377","DOIUrl":"https://doi.org/10.1080/00224065.2021.1926377","url":null,"abstract":"Abstract The production of novel types of complex shapes is nowadays enabled by new manufacturing paradigms such as additive manufacturing, also known as 3D printing. The continuous increase of shape complexity imposes new challenges in terms of inspection, product qualification and process monitoring methodologies. Previously proposed methods for 2.5D free-form surfaces are no longer applicable in the presence of this kind of new full 3D geometries. This paper aims to tackle this challenge by presenting a statistical quality monitoring approach for structures that cannot be described in terms of parametric models. The goal consists of identifying out-of-control geometrical distortions by analyzing either local variations within the part or changes from part to part. The proposed approach involves an innovative solution for modeling the deviation between the nominal geometry (the originating 3D model) and the real geometry (measured via x-ray computed tomography) by slicing the shapes and estimating the deviation slice by slice. 3D deviation maps are then transformed into 1D deviation profiles enabling the use of a profile monitoring scheme for local defect detection. The feasibility and potential of this method are demonstrated by focusing on a category of complex shapes where an elemental geometry regularly repeats in space. These shapes are known as lattice structures, or metamaterials, and their trabecular shape is thought to provide innovative mechanical and functional performance. The performance of the proposed method is shown in real and simulated case studies.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"2 1","pages":"392 - 414"},"PeriodicalIF":2.5,"publicationDate":"2021-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78524869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Nonparametric monitoring of sunspot number observations 太阳黑子数观测的非参数监测
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-06-25 DOI: 10.1080/00224065.2022.2041376
S. Mathieu, L. Lefèvre, R. von Sachs, V. Delouille, Christian Ritter, F. Clette
Abstract Solar activity is an important driver of long-term climate trends and must be accounted for in climate models. Unfortunately, direct measurements of this quantity over long periods do not exist. The only observation related to solar activity whose records reach back to the seventeenth century are sunspots. Surprisingly, determining the number of sunspots consistently over time has remained until today a challenging statistical problem. It arises from the need of consolidating data from multiple observing stations around the world in a context of low signal-to-noise ratios, non-stationarity, missing data, nonstandard distributions and errors of different kind. The data from some stations experience therefore severe and various deviations over time. In this paper, we apply a systematic statistical approach for monitoring these complex and important series. It consists of three steps essential for successful treatment of the data: smoothing on multiple time-scales, monitoring using block bootstrap calibrated CUSUM charts and classifying of out-of-control situations by support vector techniques. This approach allows us to detect a wide range of anomalies (such as sudden jumps or more progressive drifts), unseen in previous analyses. It helps us to identify the causes of major deviations, which are often observer or equipment related. Their detection and identification will contribute to improve future observations. Their elimination or correction in past data will lead to a more precise reconstruction of the world reference index for solar activity: the International Sunspot Number.
太阳活动是长期气候趋势的重要驱动因素,必须在气候模式中加以考虑。不幸的是,长期对这一数量的直接测量是不存在的。唯一与太阳活动有关的观测记录可以追溯到17世纪,那就是太阳黑子。令人惊讶的是,直到今天,持续确定太阳黑子的数量仍然是一个具有挑战性的统计问题。它源于在低信噪比、非平稳性、数据缺失、非标准分布和不同类型误差的背景下,需要整合来自世界各地多个观测站的数据。因此,一些台站的数据随着时间的推移出现了严重和各种各样的偏差。在本文中,我们应用系统的统计方法来监测这些复杂而重要的序列。它包括成功处理数据所必需的三个步骤:在多个时间尺度上平滑,使用块引导校准的CUSUM图进行监测,以及通过支持向量技术对失控情况进行分类。这种方法允许我们检测大范围的异常(如突然跳跃或更渐进的漂移),在以前的分析中看不到。它帮助我们确定主要偏差的原因,这些偏差通常与观察者或设备有关。它们的探测和识别将有助于改进今后的观测。它们在过去数据中的消除或修正将导致更精确地重建世界太阳活动参考指数:国际太阳黑子数。
{"title":"Nonparametric monitoring of sunspot number observations","authors":"S. Mathieu, L. Lefèvre, R. von Sachs, V. Delouille, Christian Ritter, F. Clette","doi":"10.1080/00224065.2022.2041376","DOIUrl":"https://doi.org/10.1080/00224065.2022.2041376","url":null,"abstract":"Abstract Solar activity is an important driver of long-term climate trends and must be accounted for in climate models. Unfortunately, direct measurements of this quantity over long periods do not exist. The only observation related to solar activity whose records reach back to the seventeenth century are sunspots. Surprisingly, determining the number of sunspots consistently over time has remained until today a challenging statistical problem. It arises from the need of consolidating data from multiple observing stations around the world in a context of low signal-to-noise ratios, non-stationarity, missing data, nonstandard distributions and errors of different kind. The data from some stations experience therefore severe and various deviations over time. In this paper, we apply a systematic statistical approach for monitoring these complex and important series. It consists of three steps essential for successful treatment of the data: smoothing on multiple time-scales, monitoring using block bootstrap calibrated CUSUM charts and classifying of out-of-control situations by support vector techniques. This approach allows us to detect a wide range of anomalies (such as sudden jumps or more progressive drifts), unseen in previous analyses. It helps us to identify the causes of major deviations, which are often observer or equipment related. Their detection and identification will contribute to improve future observations. Their elimination or correction in past data will lead to a more precise reconstruction of the world reference index for solar activity: the International Sunspot Number.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"21 1","pages":"104 - 118"},"PeriodicalIF":2.5,"publicationDate":"2021-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81646009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Robust multivariate control chart based on shrinkage for individual observations 基于单个观察值收缩的鲁棒多变量控制图
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-06-14 DOI: 10.1080/00224065.2021.1930617
Elisa Cabana, R. Lillo
Abstract A robust multivariate quality control technique for individual observations is proposed, based on the robust reweighted shrinkage estimators. A simulation study is done to check the performance and compare the method with the classical Hotelling approach, and the robust alternative based on the reweighted minimum covariance determinant estimator. The results show the appropriateness of the method even when the dimension or the Phase I contamination are high, with both independent and correlated variables, showing additional advantages about computational efficiency. The approach is illustrated with two real data-set examples from production processes.
摘要提出了一种基于鲁棒重加权收缩估计量的多变量质量控制方法。通过仿真研究验证了该方法的性能,并与经典的Hotelling方法以及基于重加权最小协方差行行式估计的鲁棒替代方法进行了比较。结果表明,即使在维度或第一阶段污染较高的情况下,独立变量和相关变量都具有该方法的适用性,并且在计算效率方面具有额外的优势。用生产过程中的两个真实数据集示例说明了该方法。
{"title":"Robust multivariate control chart based on shrinkage for individual observations","authors":"Elisa Cabana, R. Lillo","doi":"10.1080/00224065.2021.1930617","DOIUrl":"https://doi.org/10.1080/00224065.2021.1930617","url":null,"abstract":"Abstract A robust multivariate quality control technique for individual observations is proposed, based on the robust reweighted shrinkage estimators. A simulation study is done to check the performance and compare the method with the classical Hotelling approach, and the robust alternative based on the reweighted minimum covariance determinant estimator. The results show the appropriateness of the method even when the dimension or the Phase I contamination are high, with both independent and correlated variables, showing additional advantages about computational efficiency. The approach is illustrated with two real data-set examples from production processes.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"1 1","pages":"415 - 440"},"PeriodicalIF":2.5,"publicationDate":"2021-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79893083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Entropy-based adaptive design for contour finding and estimating reliability 基于熵的轮廓查找与可靠性估计自适应设计
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-05-24 DOI: 10.1080/00224065.2022.2053795
D. Cole, R. Gramacy, J. Warner, G. Bomarito, P. Leser, W. Leser
Abstract In reliability analysis, methods used to estimate failure probability are often limited by the costs associated with model evaluations. Many of these methods, such as multifidelity importance sampling (MFIS), rely upon a computationally efficient surrogate model like a Gaussian process (GP) to quickly generate predictions. The quality of the GP fit, particularly in the vicinity of the failure region(s), is instrumental in supplying accurately predicted failures for such strategies. We introduce an entropy-based GP adaptive design that, when paired with MFIS, provides more accurate failure probability estimates and with higher confidence. We show that our greedy data acquisition strategy better identifies multiple failure regions compared to existing contour-finding schemes. We then extend the method to batch selection, without sacrificing accuracy. Illustrative examples are provided on benchmark data as well as an application to an impact damage simulator for National Aeronautics and Space Administration (NASA) spacesuits.
摘要在可靠性分析中,用于估计失效概率的方法往往受到模型评估成本的限制。其中许多方法,如多保真度重要抽样(MFIS),依赖于计算效率高的替代模型,如高斯过程(GP)来快速生成预测。GP拟合的质量,特别是在失效区域附近,对于为此类策略提供准确预测的失效至关重要。我们引入了一种基于熵的GP自适应设计,当与MFIS配对时,可以提供更准确的故障概率估计和更高的置信度。我们证明,与现有的轮廓查找方案相比,我们的贪婪数据采集策略可以更好地识别多个故障区域。然后,我们将该方法扩展到批量选择,而不牺牲准确性。给出了基于基准数据的说明性实例以及在美国国家航空航天局(NASA)宇航服冲击损伤模拟器中的应用。
{"title":"Entropy-based adaptive design for contour finding and estimating reliability","authors":"D. Cole, R. Gramacy, J. Warner, G. Bomarito, P. Leser, W. Leser","doi":"10.1080/00224065.2022.2053795","DOIUrl":"https://doi.org/10.1080/00224065.2022.2053795","url":null,"abstract":"Abstract In reliability analysis, methods used to estimate failure probability are often limited by the costs associated with model evaluations. Many of these methods, such as multifidelity importance sampling (MFIS), rely upon a computationally efficient surrogate model like a Gaussian process (GP) to quickly generate predictions. The quality of the GP fit, particularly in the vicinity of the failure region(s), is instrumental in supplying accurately predicted failures for such strategies. We introduce an entropy-based GP adaptive design that, when paired with MFIS, provides more accurate failure probability estimates and with higher confidence. We show that our greedy data acquisition strategy better identifies multiple failure regions compared to existing contour-finding schemes. We then extend the method to batch selection, without sacrificing accuracy. Illustrative examples are provided on benchmark data as well as an application to an impact damage simulator for National Aeronautics and Space Administration (NASA) spacesuits.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"37 1","pages":"43 - 60"},"PeriodicalIF":2.5,"publicationDate":"2021-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80598161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Estimating pure-error from near replicates in design of experiments 在实验设计中估计近重复的纯误差
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-05-17 DOI: 10.1080/00224065.2021.1920347
Caleb King, T. Bzik, P. Parker
Abstract In design of experiments, setting exact replicates of factor settings enables estimation of pure-error; a model-independent estimate of experimental error useful in communicating inherent system noise and testing model lack-of-fit. Often in practice, the factor levels for replicates are precisely measured rather than precisely set, resulting in near-replicates. This can result in inflated estimates of pure-error due to uncompensated set-point variation. In this article, we review previous strategies for estimating pure-error from near-replicates and propose a simple alternative. We derive key analytical properties and investigate them via simulation. Finally, we illustrate the new approach with an application.
在实验设计中,设置因子设置的精确重复可以估计纯误差;一种与模型无关的实验误差估计,可用于传递系统固有噪声和测试模型拟合不足。通常在实践中,重复的因子水平是精确测量的,而不是精确设置的,导致近似重复。由于未补偿的设定点变化,这可能导致虚高的纯误差估计。在本文中,我们回顾了以前用于估计近重复纯误差的策略,并提出了一种简单的替代方法。我们推导出关键的分析性质,并通过模拟研究它们。最后,我们用一个应用程序来说明这种新方法。
{"title":"Estimating pure-error from near replicates in design of experiments","authors":"Caleb King, T. Bzik, P. Parker","doi":"10.1080/00224065.2021.1920347","DOIUrl":"https://doi.org/10.1080/00224065.2021.1920347","url":null,"abstract":"Abstract In design of experiments, setting exact replicates of factor settings enables estimation of pure-error; a model-independent estimate of experimental error useful in communicating inherent system noise and testing model lack-of-fit. Often in practice, the factor levels for replicates are precisely measured rather than precisely set, resulting in near-replicates. This can result in inflated estimates of pure-error due to uncompensated set-point variation. In this article, we review previous strategies for estimating pure-error from near-replicates and propose a simple alternative. We derive key analytical properties and investigate them via simulation. Finally, we illustrate the new approach with an application.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"24 1","pages":"102 - 117"},"PeriodicalIF":2.5,"publicationDate":"2021-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81562697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Predictive Control Charts (PCC): A Bayesian approach in online monitoring of short runs 预测控制图(PCC):短期运行在线监测中的贝叶斯方法
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-05-13 DOI: 10.1080/00224065.2021.1916413
Konstantinos Bourazas, Dimitrios Kiagias, P. Tsiamyrtzis
Abstract Performing online monitoring for short horizon data is a challenging, though cost effective benefit. Self-starting methods attempt to address this issue adopting a hybrid scheme that executes calibration and monitoring simultaneously. In this work, we propose a Bayesian alternative that will utilize prior information and possible historical data (via power priors), offering a head-start in online monitoring, putting emphasis on outlier detection. For cases of complete prior ignorance, the objective Bayesian version will be provided. Charting will be based on the predictive distribution and the methodological framework will be derived in a general way, to facilitate discrete and continuous data from any distribution that belongs to the regular exponential family (with Normal, Poisson and Binomial being the most representative). Being in the Bayesian arena, we will be able to not only perform process monitoring, but also draw online inference regarding the unknown process parameter(s). An extended simulation study will evaluate the proposed methodology against frequentist based competitors and it will cover topics regarding prior sensitivity and model misspecification robustness. A continuous and a discrete real data set will illustrate its use in practice. Technical details, algorithms, guidelines on prior elicitation and R-codes are provided in appendices and supplementary material. Short production runs and online phase I monitoring are among the best candidates to benefit from the developed methodology.
对短视距数据进行在线监测是一项具有挑战性的工作,但具有成本效益。自启动方法试图通过同时执行校准和监测的混合方案来解决这个问题。在这项工作中,我们提出了一种贝叶斯替代方案,该方案将利用先验信息和可能的历史数据(通过功率先验),在在线监测中提供领先优势,重点放在异常值检测上。对于完全先验无知的情况,将提供客观的贝叶斯版本。图表将基于预测分布,方法框架将以一般方式导出,以方便来自任何属于规则指数族的分布的离散和连续数据(以正态分布,泊松分布和二项式分布为最具代表性)。在贝叶斯领域,我们不仅可以进行过程监控,还可以对未知的过程参数进行在线推断。一项扩展的仿真研究将对基于频率主义者的竞争对手评估所提出的方法,并将涵盖有关先验灵敏度和模型错配鲁棒性的主题。一个连续和离散的实际数据集将说明它在实践中的应用。技术细节、算法、预先引出和r码的指南在附录和补充材料中提供。短期生产运行和在线第一阶段监测是从开发的方法中受益的最佳候选。
{"title":"Predictive Control Charts (PCC): A Bayesian approach in online monitoring of short runs","authors":"Konstantinos Bourazas, Dimitrios Kiagias, P. Tsiamyrtzis","doi":"10.1080/00224065.2021.1916413","DOIUrl":"https://doi.org/10.1080/00224065.2021.1916413","url":null,"abstract":"Abstract Performing online monitoring for short horizon data is a challenging, though cost effective benefit. Self-starting methods attempt to address this issue adopting a hybrid scheme that executes calibration and monitoring simultaneously. In this work, we propose a Bayesian alternative that will utilize prior information and possible historical data (via power priors), offering a head-start in online monitoring, putting emphasis on outlier detection. For cases of complete prior ignorance, the objective Bayesian version will be provided. Charting will be based on the predictive distribution and the methodological framework will be derived in a general way, to facilitate discrete and continuous data from any distribution that belongs to the regular exponential family (with Normal, Poisson and Binomial being the most representative). Being in the Bayesian arena, we will be able to not only perform process monitoring, but also draw online inference regarding the unknown process parameter(s). An extended simulation study will evaluate the proposed methodology against frequentist based competitors and it will cover topics regarding prior sensitivity and model misspecification robustness. A continuous and a discrete real data set will illustrate its use in practice. Technical details, algorithms, guidelines on prior elicitation and R-codes are provided in appendices and supplementary material. Short production runs and online phase I monitoring are among the best candidates to benefit from the developed methodology.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"31 1","pages":"367 - 391"},"PeriodicalIF":2.5,"publicationDate":"2021-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89514912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Two-level orthogonal screening designs with 80, 96, and 112 runs, and up to 29 factors 两水平正交筛选设计,包括80、96和112组试验,多达29个因素
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-05-11 DOI: 10.1080/00224065.2021.1916412
Alan R. Vazquez, E. Schoen, P. Goos
Abstract Due to recent advances in the development of laboratory equipment, large screening experiments can now be conducted to study the joint impact of up to a few dozen factors. While much is known about orthogonal designs involving 64 and 128 runs, there is a lack of literature on screening designs with intermediate run sizes. In this article, we therefore construct screening designs with 80, 96 and 112 runs which allow the main effects to be estimated independently from the two-factor interactions and limit the aliasing among the interactions. We motivate our work using a 14-factor tuberculosis inhibition experiment and compare our new designs with alternatives from the literature using simulations.
由于最近实验室设备的发展进步,现在可以进行大型筛选实验来研究多达几十个因素的共同影响。虽然我们对涉及64和128个试验的正交设计了解很多,但缺乏关于中等试验规模筛选设计的文献。因此,在本文中,我们构建了80次、96次和112次的筛选设计,允许主效应独立于两因素相互作用的估计,并限制相互作用之间的混叠。我们使用14因素结核病抑制实验来激励我们的工作,并使用模拟将我们的新设计与文献中的替代方案进行比较。
{"title":"Two-level orthogonal screening designs with 80, 96, and 112 runs, and up to 29 factors","authors":"Alan R. Vazquez, E. Schoen, P. Goos","doi":"10.1080/00224065.2021.1916412","DOIUrl":"https://doi.org/10.1080/00224065.2021.1916412","url":null,"abstract":"Abstract Due to recent advances in the development of laboratory equipment, large screening experiments can now be conducted to study the joint impact of up to a few dozen factors. While much is known about orthogonal designs involving 64 and 128 runs, there is a lack of literature on screening designs with intermediate run sizes. In this article, we therefore construct screening designs with 80, 96 and 112 runs which allow the main effects to be estimated independently from the two-factor interactions and limit the aliasing among the interactions. We motivate our work using a 14-factor tuberculosis inhibition experiment and compare our new designs with alternatives from the literature using simulations.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"78 1","pages":"338 - 358"},"PeriodicalIF":2.5,"publicationDate":"2021-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90084561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Journal of Quality Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1