首页 > 最新文献

Journal of Quality Technology最新文献

英文 中文
funcharts: control charts for multivariate functional data in R 函数图:R中用于多变量函数数据的控制图
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2022-07-19 DOI: 10.1080/00224065.2023.2219012
Christian Capezza, Fabio Centofanti, A. Lepore, A. Menafoglio, B. Palumbo, S. Vantini
Modern statistical process monitoring (SPM) applications focus on profile monitoring, i.e., the monitoring of process quality characteristics that can be modeled as profiles, also known as functional data. Despite the large interest in the profile monitoring literature, there is still a lack of software to facilitate its practical application. This article introduces the funcharts R package that implements recent developments on the SPM of multivariate functional quality characteristics, possibly adjusted by the influence of additional variables, referred to as covariates. The package also implements the real-time version of all control charting procedures to monitor profiles partially observed up to an intermediate domain point. The package is illustrated both through its built-in data generator and a real-case study on the SPM of Ro-Pax ship CO2 emissions during navigation, which is based on the ShipNavigation data provided in the Supplementary Material.
现代统计过程监控(SPM)应用侧重于概要监控,即对可以建模为概要的过程质量特征的监控,也称为功能数据。尽管对剖面监测文献有很大的兴趣,但仍然缺乏促进其实际应用的软件。本文介绍了funcharts R包,它实现了多变量函数质量特征的SPM的最新发展,可能会受到附加变量(称为协变量)的影响进行调整。该包还实现了所有控制图表程序的实时版本,以监视部分观察到的配置文件,直至中间域点。该软件包通过其内置的数据生成器和基于补充材料中提供的船舶导航数据的Ro-Pax船舶航行期间二氧化碳排放SPM的实际案例研究进行了说明。
{"title":"funcharts: control charts for multivariate functional data in R","authors":"Christian Capezza, Fabio Centofanti, A. Lepore, A. Menafoglio, B. Palumbo, S. Vantini","doi":"10.1080/00224065.2023.2219012","DOIUrl":"https://doi.org/10.1080/00224065.2023.2219012","url":null,"abstract":"Modern statistical process monitoring (SPM) applications focus on profile monitoring, i.e., the monitoring of process quality characteristics that can be modeled as profiles, also known as functional data. Despite the large interest in the profile monitoring literature, there is still a lack of software to facilitate its practical application. This article introduces the funcharts R package that implements recent developments on the SPM of multivariate functional quality characteristics, possibly adjusted by the influence of additional variables, referred to as covariates. The package also implements the real-time version of all control charting procedures to monitor profiles partially observed up to an intermediate domain point. The package is illustrated both through its built-in data generator and a real-case study on the SPM of Ro-Pax ship CO2 emissions during navigation, which is based on the ShipNavigation data provided in the Supplementary Material.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2022-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84184134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Monitoring reliability under competing risks using field data 利用现场数据监测竞争风险下的可靠性
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2022-07-13 DOI: 10.1080/00224065.2022.2080617
F. Pascual, Joseph P. Navelski
Abstract Many modern products fail due to one of multiple causes called competing risks. In this article, we propose variable features for monitoring product failure by control charts under competing risks. Failure reports arrive one at a time from a sample of population of units. Features are derived from both the reports and the assumed competing-risk statistical model. To assess the efficacy of different feature subsets in detecting shifts in the failure-time process, we consider control charts based on random forests and compare the average run length performances under different shift scenarios. We demonstrate the control charts with both simulated data sets and actual field data set from a consulting problem. We also propose graphical fault-diagnosis methods for identifying assignable causes of alarm signals. Control charts based on the proposed features will provide valuable information to manufacturers in planning for warranty, part-replacement, or repair.
许多现代产品的失败都是由于竞争风险造成的。在本文中,我们提出了在竞争风险下用控制图监测产品失效的可变特征。故障报告每次从单元总体样本中到达一个。特征来源于报告和假设的竞争风险统计模型。为了评估不同特征子集在故障时间过程中检测位移的有效性,我们考虑了基于随机森林的控制图,并比较了不同位移场景下的平均运行长度性能。我们用一个咨询问题的模拟数据集和实际现场数据集来演示控制图。我们还提出了图形故障诊断方法来识别报警信号的可分配原因。基于所提出的特性的控制图将为制造商在计划保修、零件更换或维修时提供有价值的信息。
{"title":"Monitoring reliability under competing risks using field data","authors":"F. Pascual, Joseph P. Navelski","doi":"10.1080/00224065.2022.2080617","DOIUrl":"https://doi.org/10.1080/00224065.2022.2080617","url":null,"abstract":"Abstract Many modern products fail due to one of multiple causes called competing risks. In this article, we propose variable features for monitoring product failure by control charts under competing risks. Failure reports arrive one at a time from a sample of population of units. Features are derived from both the reports and the assumed competing-risk statistical model. To assess the efficacy of different feature subsets in detecting shifts in the failure-time process, we consider control charts based on random forests and compare the average run length performances under different shift scenarios. We demonstrate the control charts with both simulated data sets and actual field data set from a consulting problem. We also propose graphical fault-diagnosis methods for identifying assignable causes of alarm signals. Control charts based on the proposed features will provide valuable information to manufacturers in planning for warranty, part-replacement, or repair.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2022-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74895174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and Analysis of Experiments and Observational Studies using R 使用R设计和分析实验和观察性研究
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2022-07-12 DOI: 10.1080/00224065.2022.2096515
Joseph D. Conklin
{"title":"Design and Analysis of Experiments and Observational Studies using R","authors":"Joseph D. Conklin","doi":"10.1080/00224065.2022.2096515","DOIUrl":"https://doi.org/10.1080/00224065.2022.2096515","url":null,"abstract":"","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2022-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85600900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Data-level transfer learning for degradation modeling and prognosis 退化建模和预测的数据级迁移学习
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2022-06-15 DOI: 10.1080/00224065.2022.2081103
Amirhossein Fallahdizcheh, Chao Wang
Abstract The typical way to conduct data-driven prognosis is to train a degradation model with historical data, then apply the model to predict failure for in-service units. Most existing works assume the historical data and in-service data are from the same process. In practice, however, different but related processes can share similar degradation patterns. Thus, the historical data from these processes are expected to provide useful prognosis information for each other. In this article, we propose a data-level transfer learning framework to extract useful and shared information from different processes to benefit the prognosis of in-service units. In this framework, the degradation data in each process is modeled by a mixed effects model. To facilitate the information sharing among different mixed effects models, a hierarchical Bayesian structure is proposed to model and connect the distributions of mixed effects in different mixed models. Because the degradation paths in different processes are rarely the same, the dimension of the mixed effects/regressor in each process can be different. To handle this issue, we propose a tailored linear transformation to marginalize or expand the distributions of mixed effects in different degradation processes to achieve consistent dimensions. The transferred information is finally incorporated with the degradation data from in-service units to conduct prognosis. The proposed method is validated and compared with various benchmarks in extensive numerical studies and two case studies. The results show the proposed method can successfully transfer useful information in different processes to benefit the prognosis.
摘要数据驱动预测的典型方法是利用历史数据训练退化模型,然后应用该模型对在役单元进行故障预测。大多数现有的工作假设历史数据和在役数据来自同一过程。然而,在实践中,不同但相关的过程可以共享类似的退化模式。因此,这些过程的历史数据有望为彼此提供有用的预后信息。在本文中,我们提出了一个数据级迁移学习框架,从不同的过程中提取有用的和共享的信息,以有利于在役单位的预测。在这个框架中,每个过程中的退化数据用混合效应模型来建模。为了方便不同混合效应模型之间的信息共享,提出了一种层次贝叶斯结构来建模和连接不同混合模型中混合效应的分布。由于不同过程的退化路径很少相同,因此每个过程的混合效应/回归量的维数可能不同。为了解决这一问题,我们提出了一种定制的线性变换,以边缘化或扩大不同降解过程中混合效应的分布,以获得一致的维度。最后将传递的信息与在役单元的退化数据相结合,进行预测。所提出的方法在广泛的数值研究和两个案例研究中得到了验证并与各种基准进行了比较。结果表明,该方法能够在不同的过程中成功传递有用信息,有利于预测。
{"title":"Data-level transfer learning for degradation modeling and prognosis","authors":"Amirhossein Fallahdizcheh, Chao Wang","doi":"10.1080/00224065.2022.2081103","DOIUrl":"https://doi.org/10.1080/00224065.2022.2081103","url":null,"abstract":"Abstract The typical way to conduct data-driven prognosis is to train a degradation model with historical data, then apply the model to predict failure for in-service units. Most existing works assume the historical data and in-service data are from the same process. In practice, however, different but related processes can share similar degradation patterns. Thus, the historical data from these processes are expected to provide useful prognosis information for each other. In this article, we propose a data-level transfer learning framework to extract useful and shared information from different processes to benefit the prognosis of in-service units. In this framework, the degradation data in each process is modeled by a mixed effects model. To facilitate the information sharing among different mixed effects models, a hierarchical Bayesian structure is proposed to model and connect the distributions of mixed effects in different mixed models. Because the degradation paths in different processes are rarely the same, the dimension of the mixed effects/regressor in each process can be different. To handle this issue, we propose a tailored linear transformation to marginalize or expand the distributions of mixed effects in different degradation processes to achieve consistent dimensions. The transferred information is finally incorporated with the degradation data from in-service units to conduct prognosis. The proposed method is validated and compared with various benchmarks in extensive numerical studies and two case studies. The results show the proposed method can successfully transfer useful information in different processes to benefit the prognosis.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2022-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81621824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Spatio-temporal process monitoring using exponentially weighted spatial LASSO 利用指数加权空间LASSO进行时空过程监测
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2022-06-02 DOI: 10.1080/00224065.2022.2081104
Peihua Qiu, Kai-zuan Yang
Abstract Spatio-temporal process monitoring (STPM) has received a considerable attention recently due to its broad applications in environment monitoring, disease surveillance, streaming image processing, and more. Because spatio-temporal data often have complicated structure, including latent spatio-temporal data correlation, complex spatio-temporal mean structure, and nonparametric data distribution, STPM is a challenging research problem. In practice, if a spatio-temporal process has a distributional shift (e.g., mean shift) started at a specific time point, then the spatial locations with the shift are usually clustered in small regions. This kind of spatial feature of the shift has not been considered in the existing STPM literature yet. In this paper, we develop a new STPM method that takes into account the spatial feature of the shift in its construction. The new method combines the ideas of exponentially weighted moving average in the temporal domain for online process monitoring and spatial LASSO in the spatial domain for accommodating the spatial feature of a future shift. It can also accommodate the complicated spatio-temporal data structure well. Both simulation studies and a real-data application show that it can provide a reliable and effective tool for different STPM applications.
时空过程监测(spatial -temporal process monitoring, STPM)由于在环境监测、疾病监测、流图像处理等领域的广泛应用,近年来受到了广泛的关注。由于时空数据往往具有复杂的结构,包括潜在的时空相关性、复杂的时空平均结构和非参数的数据分布等,因此时空pm是一个具有挑战性的研究问题。在实践中,如果一个时空过程在特定的时间点发生了分布偏移(例如,均值偏移),那么发生偏移的空间位置通常聚集在小区域中。这种转移的空间特征在现有的STPM文献中尚未被考虑。在本文中,我们开发了一种新的STPM方法,该方法在其构造中考虑了位移的空间特征。该方法结合了指数加权移动平均的思想,在时域用于在线过程监测,空间LASSO在空间域用于适应未来变化的空间特征。它还能很好地适应复杂的时空数据结构。仿真研究和实际应用表明,该方法可以为不同的STPM应用提供可靠有效的工具。
{"title":"Spatio-temporal process monitoring using exponentially weighted spatial LASSO","authors":"Peihua Qiu, Kai-zuan Yang","doi":"10.1080/00224065.2022.2081104","DOIUrl":"https://doi.org/10.1080/00224065.2022.2081104","url":null,"abstract":"Abstract Spatio-temporal process monitoring (STPM) has received a considerable attention recently due to its broad applications in environment monitoring, disease surveillance, streaming image processing, and more. Because spatio-temporal data often have complicated structure, including latent spatio-temporal data correlation, complex spatio-temporal mean structure, and nonparametric data distribution, STPM is a challenging research problem. In practice, if a spatio-temporal process has a distributional shift (e.g., mean shift) started at a specific time point, then the spatial locations with the shift are usually clustered in small regions. This kind of spatial feature of the shift has not been considered in the existing STPM literature yet. In this paper, we develop a new STPM method that takes into account the spatial feature of the shift in its construction. The new method combines the ideas of exponentially weighted moving average in the temporal domain for online process monitoring and spatial LASSO in the spatial domain for accommodating the spatial feature of a future shift. It can also accommodate the complicated spatio-temporal data structure well. Both simulation studies and a real-data application show that it can provide a reliable and effective tool for different STPM applications.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2022-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74938549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Book review: Introduction to statistical process control 书评:统计过程控制导论
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2022-04-12 DOI: 10.1080/00224065.2022.2060150
W. Woodall
{"title":"Book review: Introduction to statistical process control","authors":"W. Woodall","doi":"10.1080/00224065.2022.2060150","DOIUrl":"https://doi.org/10.1080/00224065.2022.2060150","url":null,"abstract":"","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2022-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84374739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A comprehensive toolbox for the gamma distribution: The gammadist package gamma分布的综合工具箱:gamma包
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2022-04-04 DOI: 10.1080/00224065.2022.2053794
Piao Chen, Kilian Buis, Xiujie Zhao
Abstract The gamma distribution is one of the most important parametric models in probability theory and statistics. Although a multitude of studies have theoretically investigated the properties of the gamma distribution in the literature, there is still a serious lack of tailored statistical tools to facilitate its practical applications. To fill the gap, this paper develops a comprehensive R package for the gamma distribution. In specific, the R package focuses on the following three important tasks: generate the gamma random variables, estimate the model parameters, and construct statistical limits, including confidence limits, prediction limits, and tolerance limits based on the gamma random variables. The proposed package encompasses the state-of-the-art methods of the gamma distribution in the literature and its usage is illustrated by a real application.
摘要分布是概率论和统计学中最重要的参数模型之一。虽然大量的研究已经从理论上研究了文献中伽马分布的性质,但仍然严重缺乏定制的统计工具来促进其实际应用。为了填补这一空白,本文为伽马分布开发了一个全面的R包。具体来说,R包侧重于以下三个重要任务:生成gamma随机变量,估计模型参数,以及基于gamma随机变量构建统计限,包括置信限、预测限和容差限。建议的包包括文献中最先进的伽玛分布方法,并通过实际应用说明其用法。
{"title":"A comprehensive toolbox for the gamma distribution: The gammadist package","authors":"Piao Chen, Kilian Buis, Xiujie Zhao","doi":"10.1080/00224065.2022.2053794","DOIUrl":"https://doi.org/10.1080/00224065.2022.2053794","url":null,"abstract":"Abstract The gamma distribution is one of the most important parametric models in probability theory and statistics. Although a multitude of studies have theoretically investigated the properties of the gamma distribution in the literature, there is still a serious lack of tailored statistical tools to facilitate its practical applications. To fill the gap, this paper develops a comprehensive R package for the gamma distribution. In specific, the R package focuses on the following three important tasks: generate the gamma random variables, estimate the model parameters, and construct statistical limits, including confidence limits, prediction limits, and tolerance limits based on the gamma random variables. The proposed package encompasses the state-of-the-art methods of the gamma distribution in the literature and its usage is illustrated by a real application.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2022-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80021599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Introduction to High-Dimensional Statistics, Christophe Giraud. Chapman& Hall/CRC Press, 2021, 364 pp., $72.00 hardcover, ISBN 978-0-367-71622-6. 《高维统计导论》,Christophe Giraud著。查普曼和霍尔/CRC出版社,2021,364页,72.00美元精装,ISBN 978-0-367-71622-6。
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2022-03-07 DOI: 10.1080/00224065.2022.2041378
Caleb King
A deep dive into the theoretical underpinnings of common high-dimensional statistical 20 techniques, Dr. Giraud’s Introduction to High-Dimensional Statistics is a good reference for those who wish to explore the mathematical foundations of state-of-the-art multivariate methods. The book covers a wide array of topics, from estimation bounds to multivariate regression and even clustering. In this 2nd edition, Dr. Giraud expands his work to include more recent advances and statistical methods. The book consists of 12 chapters, starting with a brief introduction to the complexities of conducting statistics in high dimensions. The book then proceeds similar to a standard statistical textbook, moving from properties of statistical estimators to statistical modeling, including regression and then other more advanced topics. Each chapter concludes with a set of exercises, many of which are portions of proofs from the chapter left for the reader. All that being said, do not let the title of the book fool you. By the author’s own admission, this is not an introduction on the same level as Hastie et al.’s Elements of Statistical Learning. Instead, the focus of this book is on the mathematical foundations of high-dimensional techniques, proving theorems regarding properties of estimators. I must confess this is not quite what I expected upon first look; one truly cannot judge a book by its cover. That is not to say that this book is lacking. It is impressive in its efficient, yet thorough, presentation of the theory. I especially appreciated how the author took time at the beginning to illustrate some of the strange behavior one encounters in very high dimensions. However, I did find it jarring that mathematical notation was very often used without much introduction. There is an appendix with notations at the end of the book, but I would’ve rather had a bit more interpretation within the text rather than having to flip back and forth. There were also a few typographical errors and partial omissions of formulas, though I can’t be sure if this was part of the text or bugs in the software I used to read the digital version. In summary, this book would certainly make for a good graduate level textbook in an advanced course on statistical methods. If you are willing to put the necessary time and investment into rigorously exploring the foundations of high-dimensional statistics, than you can hardly do better than this book.
深入研究了常见的高维统计技术的理论基础,Giraud博士的《高维统计导论》对于那些希望探索最先进的多元方法的数学基础的人来说是一个很好的参考。这本书涵盖了广泛的主题,从估计边界到多元回归甚至聚类。在这第二版中,吉罗博士扩展了他的工作,包括更多的最新进展和统计方法。本书由12章组成,首先简要介绍了在高维中进行统计的复杂性。然后,本书与标准统计教科书类似,从统计估计器的属性转移到统计建模,包括回归和其他更高级的主题。每章以一组练习结束,其中许多是本章为读者留下的证明部分。话虽如此,不要被书名骗了。根据作者自己的承认,这不是与Hastie等人的统计学习要素相同水平的介绍。相反,本书的重点是高维技术的数学基础,证明了关于估计器性质的定理。我得承认,这和我第一眼看到的不太一样;不能以貌取人。这并不是说这本书有缺陷。它对这一理论的阐述既有效又透彻,令人印象深刻。我特别欣赏作者如何在开始时花时间来说明人们在非常高的维度上遇到的一些奇怪的行为。然而,我确实发现数学符号在没有太多介绍的情况下经常被使用是不和谐的。在书的末尾有一个附录和注释,但我宁愿在文本中有更多的解释,而不是来回翻阅。还有一些排版错误和公式的部分遗漏,尽管我不能确定这是文本的一部分还是我用来阅读数字版本的软件的错误。总之,这本书肯定会成为一个很好的研究生水平的教科书在统计方法的高级课程。如果你愿意投入必要的时间和投资来严格探索高维统计的基础,那么你很难比这本书做得更好。
{"title":"Introduction to High-Dimensional Statistics, Christophe Giraud. Chapman& Hall/CRC Press, 2021, 364 pp., $72.00 hardcover, ISBN 978-0-367-71622-6.","authors":"Caleb King","doi":"10.1080/00224065.2022.2041378","DOIUrl":"https://doi.org/10.1080/00224065.2022.2041378","url":null,"abstract":"A deep dive into the theoretical underpinnings of common high-dimensional statistical 20 techniques, Dr. Giraud’s Introduction to High-Dimensional Statistics is a good reference for those who wish to explore the mathematical foundations of state-of-the-art multivariate methods. The book covers a wide array of topics, from estimation bounds to multivariate regression and even clustering. In this 2nd edition, Dr. Giraud expands his work to include more recent advances and statistical methods. The book consists of 12 chapters, starting with a brief introduction to the complexities of conducting statistics in high dimensions. The book then proceeds similar to a standard statistical textbook, moving from properties of statistical estimators to statistical modeling, including regression and then other more advanced topics. Each chapter concludes with a set of exercises, many of which are portions of proofs from the chapter left for the reader. All that being said, do not let the title of the book fool you. By the author’s own admission, this is not an introduction on the same level as Hastie et al.’s Elements of Statistical Learning. Instead, the focus of this book is on the mathematical foundations of high-dimensional techniques, proving theorems regarding properties of estimators. I must confess this is not quite what I expected upon first look; one truly cannot judge a book by its cover. That is not to say that this book is lacking. It is impressive in its efficient, yet thorough, presentation of the theory. I especially appreciated how the author took time at the beginning to illustrate some of the strange behavior one encounters in very high dimensions. However, I did find it jarring that mathematical notation was very often used without much introduction. There is an appendix with notations at the end of the book, but I would’ve rather had a bit more interpretation within the text rather than having to flip back and forth. There were also a few typographical errors and partial omissions of formulas, though I can’t be sure if this was part of the text or bugs in the software I used to read the digital version. In summary, this book would certainly make for a good graduate level textbook in an advanced course on statistical methods. If you are willing to put the necessary time and investment into rigorously exploring the foundations of high-dimensional statistics, than you can hardly do better than this book.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2022-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90561275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Modeling and Computation in Python Python中的贝叶斯建模与计算
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2022-02-25 DOI: 10.1080/00224065.2022.2041379
Shuai Huang
This book is useful for readers who want to hone their skills in Bayesian modeling and computation. Written by experts in the area of Bayesian software and major contributors to some existing widely used Bayesian computational tools, this book covers not only basic Bayesian probabilistic inference but also a range of models from linear models (and mixed effect models, hierarchical models, splines, etc) to time series models such as the state space model. It also covers the Bayesian additive regression trees. Almost all the concepts and techniques are implemented using PyMC3, Tensorflow Probability (TFP), ArviZ and other libraries. By doing all the modeling, computation, and data analysis, the authors not only show how these things work, but also show how and why things don’t work by emphasis on exploratory data analysis, model comparison, and diagnostics. To learn from the book, readers may need some statistical background such as basic training in statistics and probability theory. Some understanding of Bayesian modeling and inference is also needed, such as the concepts of prior, likelihood, posterior, the bayes’s law, and Monte Carlo sampling. Some experience with Python would also be very beneficial for readers to get started on this journey of Bayesian modeling. The authors suggested a few books as possible preliminaries for their book. I feel that the readers may also benefit from reading Andrew Gelman’s book, Bayesian Data Analysis, Chapman & Hall/CRC, 3rd Edition, 2013. Of course, as the authors pointed it out, this book is not for a Bayesian Reader but a Bayesian practitioner. The book is more of an interactive experience for Bayesian practitioners by learning all the computational tools to model and to negotiate with data for a good modeling practice. On the other hand, if readers have already had experience with real-world data analysis using Python or R or other similar tools, even if this book is their first experience with Bayesian modeling and computation, readers may still learn a lot from this book. There are an abundance of figures and detailed explanations of how things are done and how the results are interpreted. Picking up these details would need some trained sensibility when dealing with real-world data, but aspiring and experienced practitioners should find all the details useful and impressive. And there are also many big picture schematic drawings to help readers connect all the details with overall concepts such as end-to-end workflows. The Figure 9.1 is a remarkable example. Overall, as Kevin Murphy pointed out in the Forward, “this is a valuable addition to the literature, which should hopefully further the adoption of Bayesian methods”. I highly recommend readers who are interested in learning Bayesian models and their applications in practice to have this book on their bookshelf.
这本书对想要磨练贝叶斯建模和计算技能的读者很有用。由贝叶斯软件领域的专家和一些现有广泛使用的贝叶斯计算工具的主要贡献者撰写,本书不仅涵盖了基本的贝叶斯概率推断,还涵盖了从线性模型(和混合效应模型,层次模型,样条等)到时间序列模型(如状态空间模型)的一系列模型。它还涵盖了贝叶斯加性回归树。几乎所有的概念和技术都是使用PyMC3、Tensorflow Probability (TFP)、ArviZ和其他库实现的。通过进行所有的建模、计算和数据分析,作者不仅展示了这些东西是如何工作的,而且通过强调探索性数据分析、模型比较和诊断,还展示了事情是如何以及为什么不工作的。为了从这本书中学习,读者可能需要一些统计背景,如统计和概率论的基本训练。还需要对贝叶斯建模和推理有一定的了解,例如先验、似然、后验、贝叶斯定律和蒙特卡罗抽样的概念。对于开始贝叶斯建模之旅的读者来说,一些Python的经验也是非常有益的。作者们推荐了几本书作为他们这本书可能的序言。我觉得读者也可以从Andrew Gelman的书中受益,Bayesian Data Analysis, Chapman & Hall/CRC, 3rd Edition, 2013。当然,正如作者所指出的,这本书不是为贝叶斯读者而写,而是为贝叶斯实践者而写。这本书更多的是贝叶斯实践者的互动体验,通过学习所有的计算工具来建模和与数据协商,以获得良好的建模实践。另一方面,如果读者已经有了使用Python或R或其他类似工具进行实际数据分析的经验,即使本书是他们第一次使用贝叶斯建模和计算,读者仍然可以从本书中学到很多东西。书中有大量的数据和详细的解释,说明事情是如何完成的,结果是如何解释的。在处理真实世界的数据时,获取这些细节需要一些训练有素的敏感性,但是有抱负和经验丰富的从业者应该会发现所有的细节都是有用的和令人印象深刻的。此外,还有许多大图原理图,帮助读者将所有细节与整体概念(如端到端工作流)联系起来。图9.1就是一个很好的例子。总的来说,正如Kevin Murphy在前言中指出的,“这是对文献的一个有价值的补充,它应该有望进一步采用贝叶斯方法”。我强烈建议对学习贝叶斯模型及其在实践中的应用感兴趣的读者把这本书放在书架上。
{"title":"Bayesian Modeling and Computation in Python","authors":"Shuai Huang","doi":"10.1080/00224065.2022.2041379","DOIUrl":"https://doi.org/10.1080/00224065.2022.2041379","url":null,"abstract":"This book is useful for readers who want to hone their skills in Bayesian modeling and computation. Written by experts in the area of Bayesian software and major contributors to some existing widely used Bayesian computational tools, this book covers not only basic Bayesian probabilistic inference but also a range of models from linear models (and mixed effect models, hierarchical models, splines, etc) to time series models such as the state space model. It also covers the Bayesian additive regression trees. Almost all the concepts and techniques are implemented using PyMC3, Tensorflow Probability (TFP), ArviZ and other libraries. By doing all the modeling, computation, and data analysis, the authors not only show how these things work, but also show how and why things don’t work by emphasis on exploratory data analysis, model comparison, and diagnostics. To learn from the book, readers may need some statistical background such as basic training in statistics and probability theory. Some understanding of Bayesian modeling and inference is also needed, such as the concepts of prior, likelihood, posterior, the bayes’s law, and Monte Carlo sampling. Some experience with Python would also be very beneficial for readers to get started on this journey of Bayesian modeling. The authors suggested a few books as possible preliminaries for their book. I feel that the readers may also benefit from reading Andrew Gelman’s book, Bayesian Data Analysis, Chapman & Hall/CRC, 3rd Edition, 2013. Of course, as the authors pointed it out, this book is not for a Bayesian Reader but a Bayesian practitioner. The book is more of an interactive experience for Bayesian practitioners by learning all the computational tools to model and to negotiate with data for a good modeling practice. On the other hand, if readers have already had experience with real-world data analysis using Python or R or other similar tools, even if this book is their first experience with Bayesian modeling and computation, readers may still learn a lot from this book. There are an abundance of figures and detailed explanations of how things are done and how the results are interpreted. Picking up these details would need some trained sensibility when dealing with real-world data, but aspiring and experienced practitioners should find all the details useful and impressive. And there are also many big picture schematic drawings to help readers connect all the details with overall concepts such as end-to-end workflows. The Figure 9.1 is a remarkable example. Overall, as Kevin Murphy pointed out in the Forward, “this is a valuable addition to the literature, which should hopefully further the adoption of Bayesian methods”. I highly recommend readers who are interested in learning Bayesian models and their applications in practice to have this book on their bookshelf.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78872910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
cpss: an package for change-point detection by sample-splitting methods Cpss:一个通过样本分割方法进行变更点检测的包
IF 2.5 2区 工程技术 Q2 Engineering Pub Date : 2022-02-23 DOI: 10.1080/00224065.2022.2035284
Guanghui Wang, Changliang Zou
Abstract Change-point detection is a popular statistical method for Phase I analysis in statistical process control. The cpss package has been developed to provide users with multiple choices of change-point searching algorithms for a variety of frequently considered parametric change-point models, including the univariate and multivariate mean and/or (co)variance change models, changes in linear models and generalized linear models, and change models in exponential families. In particular, it integrates the recently proposed COPSS criterion to determine the number of change-points in a data-driven fashion that avoids selecting or specifying additional tuning parameters in existing approaches. Hence it is more convenient to use in practical applications. In addition, the cpss package brings great possibilities to handle user-customized change-point models.
摘要变化点检测是统计过程控制中常用的一种统计方法。cpss软件包的开发为用户提供了多种选择的变化点搜索算法,用于各种经常考虑的参数变化点模型,包括单变量和多变量均值和/或(co)方差变化模型,线性模型和广义线性模型的变化,以及指数族的变化模型。特别是,它集成了最近提出的COPSS标准,以数据驱动的方式确定更改点的数量,从而避免在现有方法中选择或指定额外的调优参数。因此在实际应用中使用更为方便。此外,cpss包为处理用户自定义的变更点模型提供了很大的可能性。
{"title":"cpss: an package for change-point detection by sample-splitting methods","authors":"Guanghui Wang, Changliang Zou","doi":"10.1080/00224065.2022.2035284","DOIUrl":"https://doi.org/10.1080/00224065.2022.2035284","url":null,"abstract":"Abstract Change-point detection is a popular statistical method for Phase I analysis in statistical process control. The cpss package has been developed to provide users with multiple choices of change-point searching algorithms for a variety of frequently considered parametric change-point models, including the univariate and multivariate mean and/or (co)variance change models, changes in linear models and generalized linear models, and change models in exponential families. In particular, it integrates the recently proposed COPSS criterion to determine the number of change-points in a data-driven fashion that avoids selecting or specifying additional tuning parameters in existing approaches. Hence it is more convenient to use in practical applications. In addition, the cpss package brings great possibilities to handle user-customized change-point models.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2022-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85866418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Journal of Quality Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1