首页 > 最新文献

Journal of Quality Technology最新文献

英文 中文
Multi-sensor based landslide monitoring via transfer learning 基于迁移学习的多传感器滑坡监测
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2021-08-17 DOI: 10.1080/00224065.2021.1960936
Wendong Li, F. Tsung, Zhenli Song, Ke Zhang, D. Xiang
Abstract Landslides are severe geographical activities that result in large quantities of rock and debris flowing down hill-slopes, leading to thousands of casualties and billions of dollars in infrastructure damage every year worldwide. For detecting landslides, on-site sensor systems are widely applied for data collection and many existing statistical process control methods can be adopted for modeling and monitoring. However, the conventional methods may perform poorly or even inapplicable when the sensors have different set-up times and end times, especially when the system includes newly deployed sensors with limited data collected. To make effective use of such new sensors immediately after deployment, we propose a novel multi-sensor based charting scheme for dynamic landslide modeling and monitoring by using transfer learning. A regularized parameter-based transfer learning approach integrated with the ordered LASSO is first proposed to effectively transfer information from old sensors with sufficient historical data to new ones with limited data. The approach considers the similarities not only between the autoregressive coefficients of different sensors, but also between the temporal correlation patterns. A control chart is then proposed for monitoring the newly deployed sensors sequentially based on the generalized likelihood ratio. Extensive simulation results and a real data example of landslide monitoring demonstrate the effectiveness of our proposed method.
滑坡是一种严重的地理活动,导致大量的岩石和碎屑从山坡上流下,每年在世界范围内造成成千上万的人员伤亡和数十亿美元的基础设施损失。在滑坡检测中,现场传感器系统被广泛应用于数据采集,现有的许多统计过程控制方法可用于建模和监测。然而,当传感器具有不同的设置时间和结束时间时,传统方法可能表现不佳甚至不适用,特别是当系统包含新部署的传感器且收集的数据有限时。为了在部署后立即有效地利用这些新传感器,我们提出了一种基于迁移学习的动态滑坡建模和监测的基于多传感器的新制图方案。首先提出了一种基于正则化参数的迁移学习方法,该方法与有序LASSO相结合,可以有效地将历史数据充足的旧传感器信息传递给数据有限的新传感器。该方法不仅考虑了不同传感器自回归系数之间的相似性,而且考虑了时间相关模式之间的相似性。然后提出了一种基于广义似然比的控制图,用于对新部署的传感器进行顺序监控。大量的仿真结果和一个实际的滑坡监测数据实例证明了本文方法的有效性。
{"title":"Multi-sensor based landslide monitoring via transfer learning","authors":"Wendong Li, F. Tsung, Zhenli Song, Ke Zhang, D. Xiang","doi":"10.1080/00224065.2021.1960936","DOIUrl":"https://doi.org/10.1080/00224065.2021.1960936","url":null,"abstract":"Abstract Landslides are severe geographical activities that result in large quantities of rock and debris flowing down hill-slopes, leading to thousands of casualties and billions of dollars in infrastructure damage every year worldwide. For detecting landslides, on-site sensor systems are widely applied for data collection and many existing statistical process control methods can be adopted for modeling and monitoring. However, the conventional methods may perform poorly or even inapplicable when the sensors have different set-up times and end times, especially when the system includes newly deployed sensors with limited data collected. To make effective use of such new sensors immediately after deployment, we propose a novel multi-sensor based charting scheme for dynamic landslide modeling and monitoring by using transfer learning. A regularized parameter-based transfer learning approach integrated with the ordered LASSO is first proposed to effectively transfer information from old sensors with sufficient historical data to new ones with limited data. The approach considers the similarities not only between the autoregressive coefficients of different sensors, but also between the temporal correlation patterns. A control chart is then proposed for monitoring the newly deployed sensors sequentially based on the generalized likelihood ratio. Extensive simulation results and a real data example of landslide monitoring demonstrate the effectiveness of our proposed method.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2021-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82028594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
An adaptive sensor selection framework for multisensor prognostics 多传感器预测的自适应传感器选择框架
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2021-08-17 DOI: 10.1080/00224065.2021.1960934
Minhee Kim, Jing-Ru C. Cheng, Kaibo Liu
Abstract Recent advances in sensor technology have made it possible to monitor the degradation of a system using multiple sensors simultaneously. Accordingly, many neural network-based prognostic models have been proposed to use observed multiple sensor signals as inputs and estimate the degradation status or failure time of the system. Although these models have achieved promising prognostic performance, it is still difficult to interpret the extracted features, and the models are often used in a black-box manner providing only the final results. In this study, a novel sensor selection framework is proposed to address this challenge by adaptively deciding which sensors to use at the moment to enhance remaining useful life prediction. The contributions of this work are summarized as follows: (1) being generic and can be attached to a variety of existing neural network-based prognostic models; (2) being trained in a unified manner to optimize both the sensor selection and prognostic accuracies simultaneously; (3) improving the interpretability of the model by explaining how different sensors contribute to the final remaining useful life prediction of individual systems over time; and (4) introducing several regularization techniques to ensure the stability of the training process. We validate the proposed framework using a series of numerical studies on the degradation of aircraft gas turbine engines.
传感器技术的最新进展使得同时使用多个传感器监测系统的退化成为可能。因此,人们提出了许多基于神经网络的预测模型,将观察到的多个传感器信号作为输入,并估计系统的退化状态或故障时间。虽然这些模型已经取得了很好的预测效果,但仍然难以解释提取的特征,并且模型通常以黑盒方式使用,仅提供最终结果。在本研究中,提出了一种新的传感器选择框架,通过自适应决定当前使用哪些传感器来增强剩余使用寿命预测,来解决这一挑战。本工作的贡献总结如下:(1)具有通用性,可以附加到各种现有的基于神经网络的预测模型;(2)以统一的方式进行训练,以同时优化传感器选择和预测精度;(3)通过解释不同传感器对单个系统最终剩余使用寿命预测的贡献,提高模型的可解释性;(4)引入了几种正则化技术来保证训练过程的稳定性。我们通过一系列关于飞机燃气涡轮发动机退化的数值研究来验证所提出的框架。
{"title":"An adaptive sensor selection framework for multisensor prognostics","authors":"Minhee Kim, Jing-Ru C. Cheng, Kaibo Liu","doi":"10.1080/00224065.2021.1960934","DOIUrl":"https://doi.org/10.1080/00224065.2021.1960934","url":null,"abstract":"Abstract Recent advances in sensor technology have made it possible to monitor the degradation of a system using multiple sensors simultaneously. Accordingly, many neural network-based prognostic models have been proposed to use observed multiple sensor signals as inputs and estimate the degradation status or failure time of the system. Although these models have achieved promising prognostic performance, it is still difficult to interpret the extracted features, and the models are often used in a black-box manner providing only the final results. In this study, a novel sensor selection framework is proposed to address this challenge by adaptively deciding which sensors to use at the moment to enhance remaining useful life prediction. The contributions of this work are summarized as follows: (1) being generic and can be attached to a variety of existing neural network-based prognostic models; (2) being trained in a unified manner to optimize both the sensor selection and prognostic accuracies simultaneously; (3) improving the interpretability of the model by explaining how different sensors contribute to the final remaining useful life prediction of individual systems over time; and (4) introducing several regularization techniques to ensure the stability of the training process. We validate the proposed framework using a series of numerical studies on the degradation of aircraft gas turbine engines.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2021-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89722065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Order-of-addition mixture experiments 加阶混合实验
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2021-08-12 DOI: 10.1080/00224065.2021.1960935
Nicholas Rios, Dennis K. J. Lin
Abstract In a mixture experiment, m components are mixed to produce a response. The total amount of the mixture is a constant. Existing literature on mixture designs ignores the order of addition of the mixture components. This paper considers the Order-of-Addition (OofA) mixture experiment, where the response depends on both the mixture proportions of components and their order of addition. Empirical study demonstrates that if mixture-order interactions exist, then the optimal mixture proportions identified by traditional models may be misleading. Full Mixture OofA designs are created which ensure orthogonality between mixture model terms and addition order effects. These designs allow for the estimation of (1) typical mixture model parameters and (2) order-of-addition effects. Moreover, models which include both main effects and key mixture-order interactions are introduced.
在混合实验中,将m个组分混合以产生响应。混合物的总量是一个常数。现有的混合料设计文献忽略了混合料组分的加入顺序。本文考虑了一种混合试验,其响应既取决于各组分的混合比例,也取决于各组分的加入顺序。实证研究表明,如果存在混合级相互作用,那么传统模型确定的最优混合比例可能具有误导性。创建了完全混合OofA设计,以确保混合模型项和加阶效应之间的正交性。这些设计允许估计(1)典型的混合模型参数和(2)加阶效应。此外,还介绍了包括主效应和关键混合阶相互作用的模型。
{"title":"Order-of-addition mixture experiments","authors":"Nicholas Rios, Dennis K. J. Lin","doi":"10.1080/00224065.2021.1960935","DOIUrl":"https://doi.org/10.1080/00224065.2021.1960935","url":null,"abstract":"Abstract In a mixture experiment, m components are mixed to produce a response. The total amount of the mixture is a constant. Existing literature on mixture designs ignores the order of addition of the mixture components. This paper considers the Order-of-Addition (OofA) mixture experiment, where the response depends on both the mixture proportions of components and their order of addition. Empirical study demonstrates that if mixture-order interactions exist, then the optimal mixture proportions identified by traditional models may be misleading. Full Mixture OofA designs are created which ensure orthogonality between mixture model terms and addition order effects. These designs allow for the estimation of (1) typical mixture model parameters and (2) order-of-addition effects. Moreover, models which include both main effects and key mixture-order interactions are introduced.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2021-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88300537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Online automatic anomaly detection for photovoltaic systems using thermography imaging and low rank matrix decomposition 基于热成像和低秩矩阵分解的光伏系统在线自动异常检测
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2021-08-05 DOI: 10.1080/00224065.2021.1948372
Qian Wang, K. Paynabar, M. Pacella
Abstract Faults occurred during the operational lifetime of photovoltaic (PV) systems can cause energy loss, system shutdown, as well as possible fire risks. Therefore, it is crucial to detect anomalies and faults to control the system’s performance and ensure its reliability. Comparing to traditional monitoring techniques based on an on-site visual inspection and/ or electrical measuring devices, the combination of drones and infrared thermography imaging evidently provides the means for faster and less expensive PV monitoring. However, the literature in this area lacks automatic and implementable algorithms for PV fault detection, particularly, using raw aerial thermography, with precise performance evaluation. The objective of this paper is, thus, to build a fully automatic online monitoring framework. We propose an analytical framework for online analysis of the raw video streams of aerial thermography. This framework integrates image processing and statistical machine learning techniques. We validate the effectiveness of the proposed framework and provide sufficient details to facilitate its implementation by practitioners. Two challenges hinder direct fault detection on raw PV images. One is that raw PV images often have non-smooth backgrounds that can impact the detection performance. This background needs to be removed before fault detection. However, this is a daunting task given the perspective of images. To deal with this challenge, we utilize the Transform Invariant Low-rank Textures (TILT) method to orthogonalize the perspective before applying edge detection to crop out the background and aligning the cropped images. The other issue is that the regular hot spots at the bottom edges of the solar panels are normal and should not be detected as anomalies. This makes the intensity-based detection method in the literature fail. These hot spots are part of the low-rank pattern of the image sequence. On the other hand, the hot spots caused by anomalies deviate from the normal low-rank pattern of the PV cells. Therefore, we propose a methodology that relies on Robust Principal Component Analysis (RPCA), which can separate sparse corrupted anomalous components from a low-rank background. The RPCA is applied to the PV images for simultaneous detection and isolation of anomalies. In addition to RPCA, we suggest a set of post-processing procedures for image denoising, and segmentation. The proposed algorithm is developed using 20 normal (with no anomalies) training samples and 100 test samples. The results showed that the algorithm successfully detects the anomalies with a recall of 0.80 and detects the significant anomalies with the maximum recall of 1. Our method outperforms two benchmark methods in terms of F1 score by 44.5% and 114.3%. The small number of false alarms is mostly due to irregular image patterns at the end of a PV array or an extreme non-orthogonal perspective. Since the number of false alarms is not large, it does not disrupt
光伏发电系统在使用寿命期间发生的故障会造成能量损失、系统停机,并可能存在火灾风险。因此,检测异常和故障是控制系统性能和保证系统可靠性的关键。与基于现场目视检查和/或电气测量设备的传统监测技术相比,无人机和红外热成像的结合显然提供了更快、更便宜的光伏监测手段。然而,该领域的文献缺乏用于光伏故障检测的自动和可实现的算法,特别是使用原始航空热像仪进行精确的性能评估。因此,本文的目标是构建一个全自动在线监测框架。我们提出了一个分析框架,用于在线分析航空热成像的原始视频流。该框架集成了图像处理和统计机器学习技术。我们验证建议框架的有效性,并提供足够的细节,以方便从业员实施。两个挑战阻碍了对原始PV图像的直接故障检测。首先,原始PV图像通常具有不光滑的背景,这会影响检测性能。在进行故障检测前,需要清除该背景。然而,考虑到图像的视角,这是一项艰巨的任务。为了应对这一挑战,我们利用变换不变低秩纹理(TILT)方法对透视进行正交化,然后应用边缘检测裁剪背景并对齐裁剪后的图像。另一个问题是,太阳能电池板底部边缘的常规热点是正常的,不应该被检测为异常。这使得文献中基于强度的检测方法失效。这些热点是图像序列低秩模式的一部分。另一方面,异常引起的热点偏离了PV电池正常的低秩模式。因此,我们提出了一种基于鲁棒主成分分析(RPCA)的方法,该方法可以从低秩背景中分离稀疏的损坏异常成分。将RPCA应用于PV图像,同时检测和隔离异常。除了RPCA之外,我们还提出了一套用于图像去噪和分割的后处理程序。该算法使用20个正常(无异常)训练样本和100个测试样本开发。结果表明,该算法以0.80的召回率成功检测异常,以1的最大召回率成功检测显著异常。在F1得分方面,我们的方法比两种基准方法分别高出44.5%和114.3%。少量误报主要是由于PV阵列末端的不规则图像模式或极端非正交视角造成的。由于假警报的数量并不多,它不会破坏检查过程,并且它们可以很容易地被鉴定人员离线识别。平均计算时间为6.32秒/张,实现对光伏板的在线自动检测。
{"title":"Online automatic anomaly detection for photovoltaic systems using thermography imaging and low rank matrix decomposition","authors":"Qian Wang, K. Paynabar, M. Pacella","doi":"10.1080/00224065.2021.1948372","DOIUrl":"https://doi.org/10.1080/00224065.2021.1948372","url":null,"abstract":"Abstract Faults occurred during the operational lifetime of photovoltaic (PV) systems can cause energy loss, system shutdown, as well as possible fire risks. Therefore, it is crucial to detect anomalies and faults to control the system’s performance and ensure its reliability. Comparing to traditional monitoring techniques based on an on-site visual inspection and/ or electrical measuring devices, the combination of drones and infrared thermography imaging evidently provides the means for faster and less expensive PV monitoring. However, the literature in this area lacks automatic and implementable algorithms for PV fault detection, particularly, using raw aerial thermography, with precise performance evaluation. The objective of this paper is, thus, to build a fully automatic online monitoring framework. We propose an analytical framework for online analysis of the raw video streams of aerial thermography. This framework integrates image processing and statistical machine learning techniques. We validate the effectiveness of the proposed framework and provide sufficient details to facilitate its implementation by practitioners. Two challenges hinder direct fault detection on raw PV images. One is that raw PV images often have non-smooth backgrounds that can impact the detection performance. This background needs to be removed before fault detection. However, this is a daunting task given the perspective of images. To deal with this challenge, we utilize the Transform Invariant Low-rank Textures (TILT) method to orthogonalize the perspective before applying edge detection to crop out the background and aligning the cropped images. The other issue is that the regular hot spots at the bottom edges of the solar panels are normal and should not be detected as anomalies. This makes the intensity-based detection method in the literature fail. These hot spots are part of the low-rank pattern of the image sequence. On the other hand, the hot spots caused by anomalies deviate from the normal low-rank pattern of the PV cells. Therefore, we propose a methodology that relies on Robust Principal Component Analysis (RPCA), which can separate sparse corrupted anomalous components from a low-rank background. The RPCA is applied to the PV images for simultaneous detection and isolation of anomalies. In addition to RPCA, we suggest a set of post-processing procedures for image denoising, and segmentation. The proposed algorithm is developed using 20 normal (with no anomalies) training samples and 100 test samples. The results showed that the algorithm successfully detects the anomalies with a recall of 0.80 and detects the significant anomalies with the maximum recall of 1. Our method outperforms two benchmark methods in terms of F1 score by 44.5% and 114.3%. The small number of false alarms is mostly due to irregular image patterns at the end of a PV array or an extreme non-orthogonal perspective. Since the number of false alarms is not large, it does not disrupt","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2021-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78111308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Introduction to time series modeling with applications in R 介绍时间序列建模与R中的应用程序
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2021-07-30 DOI: 10.1080/00224065.2021.1951147
A. Iquebal
{"title":"Introduction to time series modeling with applications in R","authors":"A. Iquebal","doi":"10.1080/00224065.2021.1951147","DOIUrl":"https://doi.org/10.1080/00224065.2021.1951147","url":null,"abstract":"","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2021-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85825395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Advanced Survival Models 高级生存模型
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2021-07-27 DOI: 10.1080/00224065.2021.1958720
Caleb King
Presenting a thorough review of selected topics in survival analysis, Dr. Legrand’s Advanced Survival Models is an excellent reference for students and/or practitioners. The book covers four advanced topics: frailty models, cure models, competing risks, and joint modeling of time-dependent covariates. Each topic is addressed with great care, balancing coverage with intimate detail so that readers come away with a comfortable level of knowledge about each topic. The book is divided into six chapters. The first chapter covers basic survival analysis concepts and presents six medical datasets (many of which are publicly available) for illustrating the models going forward. Each dataset is explained in great detail, including the context of data collection and the meaning of each variable. The data are analyzed using primarily R code throughout with a sprinkling of SAS code as well. The second chapter then presents a brief review of classical survival analysis techniques. It is in this chapter that readers get a taste of the level of detail with which Dr. Legrand discusses each of the advanced models: parametric models, semi-parametric models, non-parametric models, Cox proportional hazards, accelerated failure time; all are given their due diligence and illustrated with the data provided, so that the reader is presented with the breadth of methodology available at even the basic level. For the remaining four chapters, the format is similar. The chapter opens with an overall introduction of the topic, effectively summarizing the contents to come. Next, the primary model varieties are presented with sufficient context to understand their origins as well as their areas of appropriate use. Next, the primary methods of estimating the models are discussed. Finally, the chapter ends with illustration of the models using one or more of the datasets. In each case, Dr. Legrand presents enough detail so that the reader becomes intimately familiar with the basic concepts and estimation procedures. As an illustration of her effectiveness, I was not aware of the existence of cure models prior to reading this book. Now, I feel confident enough on the subject that I would be comfortable explaining it to another person. Where there is the opportunity for more specialized models and/or estimation procedures, multiple references are provided that discuss such models and/or procedures in greater detail. I found the references satisfactory for further study on a particular topic. While the book overall is a fairly easy read, there are several editorial “glitches” that, though not sufficient to cause confusion and misunderstanding, were still noticeable and tended to happen more frequently than one would expect. Most of these “glitches” consist of typographical errors and awkward sentence structures. In addition, the material was a bit repetitive when moving from the introduction to the main material in each chapter. However, I consider both of these to be very minor inc
Legrand博士的《高级生存模型》对生存分析中选定的主题进行了全面的回顾,是学生和/或从业者的极好参考。该书涵盖了四个高级主题:脆弱性模型,治疗模型,竞争风险,以及时间相关协变量的联合建模。每一个主题都是非常小心的,平衡覆盖与亲密的细节,让读者带着一个舒适的知识水平离开每个主题。这本书分为六章。第一章涵盖了基本的生存分析概念,并提供了六个医疗数据集(其中许多是公开的)来说明未来的模型。每个数据集都有非常详细的解释,包括数据收集的上下文和每个变量的含义。数据的分析主要使用R代码,同时也使用少量的SAS代码。第二章简要回顾了经典的生存分析技术。正是在这一章中,读者可以体会到Legrand博士讨论每个高级模型的详细程度:参数模型、半参数模型、非参数模型、Cox比例风险、加速故障时间;所有这些都给予了他们的尽职调查,并与所提供的数据进行了说明,以便读者能够在基本水平上获得广泛的方法。其余四章的格式类似。本章以对主题的总体介绍开始,有效地总结了接下来的内容。接下来,介绍了主要模型品种的充分背景,以了解它们的起源以及它们的适当使用领域。其次,讨论了模型估计的主要方法。最后,本章以使用一个或多个数据集的模型说明结束。在每种情况下,Legrand博士都提供了足够的细节,以便读者能够非常熟悉基本概念和估计过程。为了说明她的有效性,在阅读这本书之前,我并不知道治疗模型的存在。现在,我对这个话题有足够的信心,我可以放心地向另一个人解释。在有机会使用更专门化的模型和/或评估过程的地方,提供了更多的参考资料来更详细地讨论这些模型和/或过程。我发现这些参考文献对某一特定主题的进一步研究是满意的。虽然这本书总体上读起来相当容易,但有几个编辑上的“小故障”,虽然不足以引起混淆和误解,但仍然很明显,而且发生的频率往往比人们预期的要高。这些“小故障”大多由排版错误和笨拙的句子结构组成。此外,从每章的引言到主要内容的转换过程中,材料有一点重复。然而,我认为这两个都是非常小的不便,因为它们都不会影响我对材料的理解。总之,我认为这本书对任何生存分析的实践者或研究者来说都是无价的参考。我也认为这是一本很好的研究生读物。我非常感谢勒格朗博士将这些先进模型的知识带入统计界。
{"title":"Advanced Survival Models","authors":"Caleb King","doi":"10.1080/00224065.2021.1958720","DOIUrl":"https://doi.org/10.1080/00224065.2021.1958720","url":null,"abstract":"Presenting a thorough review of selected topics in survival analysis, Dr. Legrand’s Advanced Survival Models is an excellent reference for students and/or practitioners. The book covers four advanced topics: frailty models, cure models, competing risks, and joint modeling of time-dependent covariates. Each topic is addressed with great care, balancing coverage with intimate detail so that readers come away with a comfortable level of knowledge about each topic. The book is divided into six chapters. The first chapter covers basic survival analysis concepts and presents six medical datasets (many of which are publicly available) for illustrating the models going forward. Each dataset is explained in great detail, including the context of data collection and the meaning of each variable. The data are analyzed using primarily R code throughout with a sprinkling of SAS code as well. The second chapter then presents a brief review of classical survival analysis techniques. It is in this chapter that readers get a taste of the level of detail with which Dr. Legrand discusses each of the advanced models: parametric models, semi-parametric models, non-parametric models, Cox proportional hazards, accelerated failure time; all are given their due diligence and illustrated with the data provided, so that the reader is presented with the breadth of methodology available at even the basic level. For the remaining four chapters, the format is similar. The chapter opens with an overall introduction of the topic, effectively summarizing the contents to come. Next, the primary model varieties are presented with sufficient context to understand their origins as well as their areas of appropriate use. Next, the primary methods of estimating the models are discussed. Finally, the chapter ends with illustration of the models using one or more of the datasets. In each case, Dr. Legrand presents enough detail so that the reader becomes intimately familiar with the basic concepts and estimation procedures. As an illustration of her effectiveness, I was not aware of the existence of cure models prior to reading this book. Now, I feel confident enough on the subject that I would be comfortable explaining it to another person. Where there is the opportunity for more specialized models and/or estimation procedures, multiple references are provided that discuss such models and/or procedures in greater detail. I found the references satisfactory for further study on a particular topic. While the book overall is a fairly easy read, there are several editorial “glitches” that, though not sufficient to cause confusion and misunderstanding, were still noticeable and tended to happen more frequently than one would expect. Most of these “glitches” consist of typographical errors and awkward sentence structures. In addition, the material was a bit repetitive when moving from the introduction to the main material in each chapter. However, I consider both of these to be very minor inc","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2021-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78408105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Controlling the conditional false alarm rate for the MEWMA control chart 控制MEWMA控制图的条件虚警率
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2021-07-21 DOI: 10.1080/00224065.2021.1947162
Burcu Aytaçoğlu, Anne R. Driscoll, W. Woodall
Abstract An integral part of the design of control charts, including the multivariate exponentially weighted moving average (MEWMA) control chart, is the determination of the appropriate control limits for prospective monitoring. Methods using Markov chain analyses, integral equations, and simulation have been proposed to determine the MEWMA chart limits when the limits are based on a specified in-control average run length (ARL) value. A drawback of the usual approach is that the conditional false alarm rate (CFAR) for these charts varies over time in what might be in an unexpected and undesirable way. We define the CFAR as the probability of a false alarm given no previous false alarm. We do not condition on the results of a Phase I sample, as done by others, in studies of the effect of estimation error on control chart performance. We propose the use of dynamic probability control limits (DPCLs) to keep the CFAR constant over time at a specified value. The CFAR at any time, however, could be controlled to be any specified value using our approach. Using simulation, we determine the DPCLs for the MEWMA control chart being used to monitor the mean vector with an assumed known variance-covariance matrix. We consider cases where the sample size is both fixed and time-varying. For varying sample sizes, the DPCLs adapt automatically to any change in the sample size distribution. In all cases, the CFAR is held closely to a fixed value and the resulting in-control run length performance follows closely to that of the geometric distribution.
控制图的设计,包括多变量指数加权移动平均(MEWMA)控制图的设计,一个不可缺少的部分是确定适当的控制范围,以进行前瞻性监测。提出了利用马尔可夫链分析、积分方程和仿真的方法来确定MEWMA图的极限,当极限基于指定的控制平均运行长度(ARL)值时。通常方法的一个缺点是,这些图表的条件虚警率(CFAR)可能会以一种意想不到的、不希望看到的方式随时间变化。我们将CFAR定义为在没有先前虚警的情况下出现虚警的概率。在研究估计误差对控制图性能的影响时,我们不像其他人那样以第一阶段样本的结果为条件。我们建议使用动态概率控制限制(DPCLs)来保持CFAR随时间恒定在一个指定的值。但是,可以使用我们的方法将任何时候的CFAR控制为任何指定的值。通过仿真,我们确定了MEWMA控制图的DPCLs,该控制图用于用假设已知的方差-协方差矩阵监测均值向量。我们考虑样本量既固定又随时间变化的情况。对于不同的样本量,dpcl可以自动适应样本量分布的任何变化。在所有情况下,CFAR都保持在一个固定值附近,由此产生的受控运行长度性能与几何分布密切相关。
{"title":"Controlling the conditional false alarm rate for the MEWMA control chart","authors":"Burcu Aytaçoğlu, Anne R. Driscoll, W. Woodall","doi":"10.1080/00224065.2021.1947162","DOIUrl":"https://doi.org/10.1080/00224065.2021.1947162","url":null,"abstract":"Abstract An integral part of the design of control charts, including the multivariate exponentially weighted moving average (MEWMA) control chart, is the determination of the appropriate control limits for prospective monitoring. Methods using Markov chain analyses, integral equations, and simulation have been proposed to determine the MEWMA chart limits when the limits are based on a specified in-control average run length (ARL) value. A drawback of the usual approach is that the conditional false alarm rate (CFAR) for these charts varies over time in what might be in an unexpected and undesirable way. We define the CFAR as the probability of a false alarm given no previous false alarm. We do not condition on the results of a Phase I sample, as done by others, in studies of the effect of estimation error on control chart performance. We propose the use of dynamic probability control limits (DPCLs) to keep the CFAR constant over time at a specified value. The CFAR at any time, however, could be controlled to be any specified value using our approach. Using simulation, we determine the DPCLs for the MEWMA control chart being used to monitor the mean vector with an assumed known variance-covariance matrix. We consider cases where the sample size is both fixed and time-varying. For varying sample sizes, the DPCLs adapt automatically to any change in the sample size distribution. In all cases, the CFAR is held closely to a fixed value and the resulting in-control run length performance follows closely to that of the geometric distribution.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2021-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79977580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Hierarchical point process models for recurring safety critical events involving commercial truck drivers: A reliability framework for human performance modeling 涉及商用卡车驾驶员的重复安全关键事件的分层点过程模型:人类行为建模的可靠性框架
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2021-07-15 DOI: 10.1080/00224065.2021.1939815
Miao Cai, Amir Mehdizadeh, Qiong Hu, Mohammad Ali Alamdar Yazdi, A. Vinel, Karen C. Davis, Hong Xian, F. Megahed, S. Rigdon
Abstract Quality in the trucking industry involves several facets, including on-time performance and safety. In the largest naturalistic driving study to-date, with 496 drivers and 13 M miles driven, we address two safety questions: (a) does the occurrence of safety critical events increase during a driving shift? and (b) what is the effect of rest breaks on the incidence of those events? To address these two questions, we adopt point process models, commonly used to assess the reliability of repairable systems, to model the occurrence/likelihood of safety critical events. To account for driver differences, driver-level random effects were also assumed. Our results show that: (a) the intensity for hard brakes decreases throughout a shift, (b) rest breaks reduce the likelihood of activation of the automated collision mitigation system, and (c) there is a fair amount of variability among drivers. Given that a hard brake (a less severe safety critical event) is more common in the beginning of the shift, it can potentially be explained by an increased likelihood of being in a local/city road and/or increased likelihood of aggressive driving behavior early in a driver’s shift. Furthermore, we quantified the impact of rest breaks in reducing engagement of the more severe automated collision mitigation system, providing data-driven evidence on the importance of rest-break scheduling for trucking safety. Properties of the approach were also investigated through a simulation study, where we examined the consequences of an incorrect specification of the Bayesian priors.
卡车运输行业的质量涉及几个方面,包括准时性能和安全。在迄今为止最大的自然驾驶研究中,有496名司机和13m英里的驾驶,我们解决了两个安全问题:(a)在驾驶换挡期间,安全关键事件的发生是否增加?(b)休息对这些事件发生的影响是什么?为了解决这两个问题,我们采用通常用于评估可修复系统可靠性的点过程模型来模拟安全关键事件的发生/可能性。为了解释驾驶员差异,还假设了驾驶员级别的随机效应。我们的研究结果表明:(a)硬刹车的强度在整个换挡过程中降低,(b)休息时间降低了激活自动碰撞缓解系统的可能性,(c)驾驶员之间存在相当大的可变性。考虑到硬刹车(一种不太严重的安全关键事件)在换挡开始时更常见,这可能是由于驾驶员在本地/城市道路上行驶的可能性增加和/或在驾驶员换挡早期出现攻击性驾驶行为的可能性增加。此外,我们量化了休息休息对减少更严重的自动碰撞缓解系统的影响,为卡车运输安全的休息休息调度的重要性提供了数据驱动的证据。我们还通过模拟研究调查了该方法的特性,其中我们检查了贝叶斯先验规范不正确的后果。
{"title":"Hierarchical point process models for recurring safety critical events involving commercial truck drivers: A reliability framework for human performance modeling","authors":"Miao Cai, Amir Mehdizadeh, Qiong Hu, Mohammad Ali Alamdar Yazdi, A. Vinel, Karen C. Davis, Hong Xian, F. Megahed, S. Rigdon","doi":"10.1080/00224065.2021.1939815","DOIUrl":"https://doi.org/10.1080/00224065.2021.1939815","url":null,"abstract":"Abstract Quality in the trucking industry involves several facets, including on-time performance and safety. In the largest naturalistic driving study to-date, with 496 drivers and 13 M miles driven, we address two safety questions: (a) does the occurrence of safety critical events increase during a driving shift? and (b) what is the effect of rest breaks on the incidence of those events? To address these two questions, we adopt point process models, commonly used to assess the reliability of repairable systems, to model the occurrence/likelihood of safety critical events. To account for driver differences, driver-level random effects were also assumed. Our results show that: (a) the intensity for hard brakes decreases throughout a shift, (b) rest breaks reduce the likelihood of activation of the automated collision mitigation system, and (c) there is a fair amount of variability among drivers. Given that a hard brake (a less severe safety critical event) is more common in the beginning of the shift, it can potentially be explained by an increased likelihood of being in a local/city road and/or increased likelihood of aggressive driving behavior early in a driver’s shift. Furthermore, we quantified the impact of rest breaks in reducing engagement of the more severe automated collision mitigation system, providing data-driven evidence on the importance of rest-break scheduling for trucking safety. Properties of the approach were also investigated through a simulation study, where we examined the consequences of an incorrect specification of the Bayesian priors.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2021-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81797217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Understanding elections through statistics by Ole J. Forsberg, CRC press, Taylor & Francis group, boca Raton, FL, 2020, 225 pp., $69.95, ISBN 978-0367895372 《通过统计理解选举》,奥勒·j·福斯伯格著,CRC出版社,泰勒和弗朗西斯集团,佛罗里达州博卡拉顿,2020年,225页,69.95美元,ISBN 978-0367895372
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2021-07-12 DOI: 10.1080/00224065.2021.1947163
Shuai Huang
This book offers a good introduction to some statistical methods used in elections. It has two parts. The first part contains four chapters that cover estimation methods for polls. The main technical problem is the estimation of the proportion of the population holding a particular preference in voting. The analytic core of the problem is binomial distribution and the sampling and estimation procedures center around this distribution. As in the real world there are always complications. Various remedies are provided to address these complications. The author has done a great job of introducing the critical concepts and considerations in both the problem formulation and solution. For example, when introducing the importance of weighting in deriving the estimate of the poll, the author pretends to write a press release of his poll result on the issue of gender fairness in the military. It is clear that if a different source of demographics statistics is used, the poll result is quite different. Examples as such are quite useful for readers to understand the subject matter and its complexity. The second part of the book covers a few techniques to detect frauds and anomalies by examining the election results. Some techniques build on an interesting premise that humans are bad at mimicking randomness. This echoes what Fisher (1958) had said, “if one tries to think of numbers at random, one thinks of numbers very far from at random.” The Benford test is introduced in detail, including its history and its interesting applications in analyzing election data to detect anomaly based on the distributions of the leading digits reported by different precincts. The differential invalidation and some regression models are introduced as well. Spatial correlations could be modeled by using the geographical information in the data. The book concludes with a detailed discussion on data from Sri Lanka since 1994. This is a useful book that can help a broad range of readers to appreciate the power of statistics in understanding the election process from an analytic and scientific perspective. On top of the techniques introduced in the book, there are anecdotes and comments and insights that can enrich the reading experience. E.g., as in the preface the statement from a Nicaraguan leader “Indeed, you won the elections, but I won the count.” or the comment in the end of Chapter 4 “as with many things in statistics, increasing quality in one area tends to reduce quality in another.” Statistical techniques in this book are tightly bonded with the contexts and the backgrounds of their application. After reading the book, I appreciate the book has helped me understand a complex problem in a complex world. Not everything is what it appears to be, but we can equip ourselves with sufficient knowledge and useful tools to help us look at the data in every angle and really feel the data as it is.
这本书很好地介绍了选举中使用的一些统计方法。它有两部分。第一部分包括四章,介绍了民意调查的估计方法。主要的技术问题是估计在投票中持有特定偏好的人口比例。问题的分析核心是二项分布,抽样和估计过程围绕着这个分布。就像在现实世界中一样,总会有复杂的事情发生。提供了各种补救措施来解决这些并发症。作者做了一个伟大的工作,介绍了关键的概念和考虑问题的制定和解决。例如,在介绍加权对得出民意调查估计数的重要性时,作者假装写了一篇关于军队中性别公平问题的民意调查结果的新闻稿。很明显,如果使用不同的人口统计来源,民意调查结果就会大不相同。这样的例子对于读者理解主题及其复杂性非常有用。书的第二部分介绍了通过检查选举结果来发现欺诈和异常现象的一些技术。一些技术建立在一个有趣的前提上,即人类不擅长模仿随机性。这与Fisher(1958)所说的相呼应,“如果一个人试图随机地思考数字,那么他想到的数字就远非随机。”详细介绍了Benford测试,包括它的历史和它在分析选举数据中有趣的应用,根据不同选区报告的领先数字的分布来检测异常。并介绍了微分失效和一些回归模型。利用数据中的地理信息可以建立空间相关性模型。本书最后对斯里兰卡自1994年以来的数据进行了详细讨论。这是一本有用的书,可以帮助广泛的读者从分析和科学的角度理解选举过程的统计力量。除了书中介绍的技巧外,书中还有轶事、评论和见解,可以丰富阅读体验。例如,在序言中,一位尼加拉瓜领导人说:“的确,你们赢得了选举,但我赢得了计票。或者第四章末尾的评论,“就像统计学中的许多事情一样,一个领域的质量提高往往会降低另一个领域的质量。”本书中的统计技术与上下文及其应用的背景紧密结合。读完这本书,我很感激这本书帮助我理解了一个复杂世界中的一个复杂问题。并不是所有的事情都像它看起来的那样,但我们可以用足够的知识和有用的工具来装备自己,帮助我们从各个角度看待数据,真正感受到数据的本来面目。
{"title":"Understanding elections through statistics by Ole J. Forsberg, CRC press, Taylor & Francis group, boca Raton, FL, 2020, 225 pp., $69.95, ISBN 978-0367895372","authors":"Shuai Huang","doi":"10.1080/00224065.2021.1947163","DOIUrl":"https://doi.org/10.1080/00224065.2021.1947163","url":null,"abstract":"This book offers a good introduction to some statistical methods used in elections. It has two parts. The first part contains four chapters that cover estimation methods for polls. The main technical problem is the estimation of the proportion of the population holding a particular preference in voting. The analytic core of the problem is binomial distribution and the sampling and estimation procedures center around this distribution. As in the real world there are always complications. Various remedies are provided to address these complications. The author has done a great job of introducing the critical concepts and considerations in both the problem formulation and solution. For example, when introducing the importance of weighting in deriving the estimate of the poll, the author pretends to write a press release of his poll result on the issue of gender fairness in the military. It is clear that if a different source of demographics statistics is used, the poll result is quite different. Examples as such are quite useful for readers to understand the subject matter and its complexity. The second part of the book covers a few techniques to detect frauds and anomalies by examining the election results. Some techniques build on an interesting premise that humans are bad at mimicking randomness. This echoes what Fisher (1958) had said, “if one tries to think of numbers at random, one thinks of numbers very far from at random.” The Benford test is introduced in detail, including its history and its interesting applications in analyzing election data to detect anomaly based on the distributions of the leading digits reported by different precincts. The differential invalidation and some regression models are introduced as well. Spatial correlations could be modeled by using the geographical information in the data. The book concludes with a detailed discussion on data from Sri Lanka since 1994. This is a useful book that can help a broad range of readers to appreciate the power of statistics in understanding the election process from an analytic and scientific perspective. On top of the techniques introduced in the book, there are anecdotes and comments and insights that can enrich the reading experience. E.g., as in the preface the statement from a Nicaraguan leader “Indeed, you won the elections, but I won the count.” or the comment in the end of Chapter 4 “as with many things in statistics, increasing quality in one area tends to reduce quality in another.” Statistical techniques in this book are tightly bonded with the contexts and the backgrounds of their application. After reading the book, I appreciate the book has helped me understand a complex problem in a complex world. Not everything is what it appears to be, but we can equip ourselves with sufficient knowledge and useful tools to help us look at the data in every angle and really feel the data as it is.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2021-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80102706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Boost-R: Gradient boosted trees for recurrence data Boost-R:用于递归数据的梯度增强树
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2021-07-03 DOI: 10.1080/00224065.2021.1948373
Xiao Liu, Rong Pan
Abstract Recurrence data arise from multi-disciplinary domains spanning reliability, cyber security, healthcare, online retailing, etc. This paper investigates an additive-tree-based approach, known as Boost-R (Boosting for Recurrence Data), for recurrent event data with both static and dynamic features. Boost-R constructs an ensemble of gradient boosted additive trees to estimate the cumulative intensity function of the recurrent event process, where a new tree is added to the ensemble by minimizing the regularized L 2 distance between the observed and predicted cumulative intensity. Unlike conventional regression trees, a time-dependent function is constructed by Boost-R on each tree leaf. The sum of these functions, from multiple trees, yields the ensemble estimator of the cumulative intensity. The divide-and-conquer nature of tree-based methods is appealing when hidden sub-populations exist within a heterogeneous population. The non-parametric nature of regression trees helps to avoid parametric assumptions on the complex interactions between event processes and features. Critical insights and advantages of Boost-R are investigated through comprehensive numerical examples. Datasets and computer code of Boost-R are made available on GitHub. To our best knowledge, Boost-R is the first gradient boosted additive-tree-based approach for modeling large-scale recurrent event data with both static and dynamic feature information.
重复数据来自多学科领域,包括可靠性、网络安全、医疗保健、在线零售等。本文研究了一种基于加性树的方法,称为Boost-R (Boosting for recurrent Data),用于具有静态和动态特征的循环事件数据。Boost-R构建了一个梯度增强的加性树集合来估计循环事件过程的累积强度函数,其中通过最小化观测到的和预测的累积强度之间的正则化l2距离,将新树添加到集合中。与传统的回归树不同,Boost-R在每个树叶上构建了一个时间相关的函数。这些函数的和,从多个树,产生累积强度的集合估计。当隐藏的子种群存在于异质种群中时,基于树的方法的分而治之的特性很有吸引力。回归树的非参数性质有助于避免对事件过程和特征之间复杂的相互作用进行参数假设。通过全面的数值实例研究了Boost-R的关键见解和优势。Boost-R的数据集和计算机代码可在GitHub上获得。据我们所知,Boost-R是第一个基于梯度增强加性树的方法,用于对具有静态和动态特征信息的大规模循环事件数据建模。
{"title":"Boost-R: Gradient boosted trees for recurrence data","authors":"Xiao Liu, Rong Pan","doi":"10.1080/00224065.2021.1948373","DOIUrl":"https://doi.org/10.1080/00224065.2021.1948373","url":null,"abstract":"Abstract Recurrence data arise from multi-disciplinary domains spanning reliability, cyber security, healthcare, online retailing, etc. This paper investigates an additive-tree-based approach, known as Boost-R (Boosting for Recurrence Data), for recurrent event data with both static and dynamic features. Boost-R constructs an ensemble of gradient boosted additive trees to estimate the cumulative intensity function of the recurrent event process, where a new tree is added to the ensemble by minimizing the regularized L 2 distance between the observed and predicted cumulative intensity. Unlike conventional regression trees, a time-dependent function is constructed by Boost-R on each tree leaf. The sum of these functions, from multiple trees, yields the ensemble estimator of the cumulative intensity. The divide-and-conquer nature of tree-based methods is appealing when hidden sub-populations exist within a heterogeneous population. The non-parametric nature of regression trees helps to avoid parametric assumptions on the complex interactions between event processes and features. Critical insights and advantages of Boost-R are investigated through comprehensive numerical examples. Datasets and computer code of Boost-R are made available on GitHub. To our best knowledge, Boost-R is the first gradient boosted additive-tree-based approach for modeling large-scale recurrent event data with both static and dynamic feature information.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78715817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Journal of Quality Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1