首页 > 最新文献

Journal of Quality Technology最新文献

英文 中文
Powerful and robust dispersion contrasts for replicated orthogonal designs 重复正交设计的强大且稳健的色散对比
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-10-21 DOI: 10.1080/00224065.2021.1991250
Richard N. McGrath, Baffour Koduah
Abstract A popular approach for estimating location and dispersion effects in replicated designs under the common assumption of normal and independent errors is to use two linked generalized linear models (glms). This approach uses an asymptotic estimate for the variance of dispersion effect estimates and is very sensitive to the normality assumption. It is also possible to identify dispersion effects (after a logarithmic transformation) by using methods developed for identifying location effects in unreplicated designs. One such method is rather robust to the normality assumption but lacks power relative to the glm approach. We introduce a hybrid approach that strikes a balance between power and robustness when used for dispersion effect identification.
在常见的正态和独立误差假设下,估计重复设计中的位置和色散效应的常用方法是使用两个链接的广义线性模型(glms)。该方法对离散效应估计的方差使用渐近估计,并且对正态性假设非常敏感。通过使用在非重复设计中用于识别位置效应的方法,也可以识别色散效应(经过对数变换)。其中一种方法对正态性假设具有相当强的鲁棒性,但相对于glm方法缺乏能力。我们引入了一种混合方法,在功率和鲁棒性之间取得平衡,用于色散效应识别。
{"title":"Powerful and robust dispersion contrasts for replicated orthogonal designs","authors":"Richard N. McGrath, Baffour Koduah","doi":"10.1080/00224065.2021.1991250","DOIUrl":"https://doi.org/10.1080/00224065.2021.1991250","url":null,"abstract":"Abstract A popular approach for estimating location and dispersion effects in replicated designs under the common assumption of normal and independent errors is to use two linked generalized linear models (glms). This approach uses an asymptotic estimate for the variance of dispersion effect estimates and is very sensitive to the normality assumption. It is also possible to identify dispersion effects (after a logarithmic transformation) by using methods developed for identifying location effects in unreplicated designs. One such method is rather robust to the normality assumption but lacks power relative to the glm approach. We introduce a hybrid approach that strikes a balance between power and robustness when used for dispersion effect identification.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"30 1","pages":"573 - 588"},"PeriodicalIF":2.5,"publicationDate":"2021-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83323856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A critique of a variety of “memory-based” process monitoring methods 对各种“基于内存”的进程监控方法的批判
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-10-20 DOI: 10.1080/00224065.2022.2034487
S. Knoth, Nesma A. Saleh, Mahmoud A. Mahmoud, W. Woodall, V. Tercero-Gómez
Abstract Many extensions and modifications have been made to standard process monitoring methods such as the exponentially weighted moving average (EWMA) chart and the cumulative sum (CUSUM) chart. In addition, new schemes have been proposed based on alternative weighting of past data, usually to put greater emphasis on past data and less weight on current and recent data. In other cases, the output of one process monitoring method, such as the EWMA statistic, is used as the input to another method, such as the CUSUM chart. Often the recursive formula for a control chart statistic is itself used recursively to form a new control chart statistic. We find the use of these ad hoc methods to be unjustified. Statistical performance comparisons justifying the use of these methods have been either flawed by focusing only on zero-state run length metrics or by making comparisons to an unnecessarily weak competitor.
摘要指数加权移动平均(EWMA)图和累积和(CUSUM)图等标准过程监控方法得到了许多扩展和改进。此外,还提出了基于过去数据的替代加权的新方案,通常更多地强调过去数据,而较少地重视当前和最近的数据。在其他情况下,一种流程监控方法(如EWMA统计)的输出被用作另一种方法(如CUSUM图)的输入。通常,控制图统计的递归公式本身被递归地用于形成新的控制图统计。我们认为使用这些特别的方法是不合理的。证明使用这些方法的统计性能比较存在缺陷,要么只关注零状态运行长度指标,要么与不必要的弱竞争对手进行比较。
{"title":"A critique of a variety of “memory-based” process monitoring methods","authors":"S. Knoth, Nesma A. Saleh, Mahmoud A. Mahmoud, W. Woodall, V. Tercero-Gómez","doi":"10.1080/00224065.2022.2034487","DOIUrl":"https://doi.org/10.1080/00224065.2022.2034487","url":null,"abstract":"Abstract Many extensions and modifications have been made to standard process monitoring methods such as the exponentially weighted moving average (EWMA) chart and the cumulative sum (CUSUM) chart. In addition, new schemes have been proposed based on alternative weighting of past data, usually to put greater emphasis on past data and less weight on current and recent data. In other cases, the output of one process monitoring method, such as the EWMA statistic, is used as the input to another method, such as the CUSUM chart. Often the recursive formula for a control chart statistic is itself used recursively to form a new control chart statistic. We find the use of these ad hoc methods to be unjustified. Statistical performance comparisons justifying the use of these methods have been either flawed by focusing only on zero-state run length metrics or by making comparisons to an unnecessarily weak competitor.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"165 1","pages":"18 - 42"},"PeriodicalIF":2.5,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76906948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
ASQ Books ASQ的书
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-10-20 DOI: 10.1080/00224065.2021.1987815
Fin Rooney Full of helpful questions and thoughtprovoking ideas, this book offers practitioners guidance in having meaningful conversations with their senior managers and influence how they view quality. For senior managers, it offers a framework for devising a coordinated quality strategy, involving every department in quality, and showing how an embedded quality strategy can create virtuous circles of improvement.
这本书充满了有用的问题和发人深省的想法,为从业者提供了指导,帮助他们与高级经理进行有意义的对话,并影响他们对质量的看法。对于高级管理人员,它提供了一个框架来设计一个协调的质量战略,涉及到质量的每个部门,并展示了一个嵌入的质量战略如何创造改善的良性循环。
{"title":"ASQ Books","authors":"","doi":"10.1080/00224065.2021.1987815","DOIUrl":"https://doi.org/10.1080/00224065.2021.1987815","url":null,"abstract":"Fin Rooney Full of helpful questions and thoughtprovoking ideas, this book offers practitioners guidance in having meaningful conversations with their senior managers and influence how they view quality. For senior managers, it offers a framework for devising a coordinated quality strategy, involving every department in quality, and showing how an embedded quality strategy can create virtuous circles of improvement.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"64 1","pages":"586 - 586"},"PeriodicalIF":2.5,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85788420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence and statistics for quality technology: an introduction to the special issue 人工智能与质量技术统计:特刊导论
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-10-20 DOI: 10.1080/00224065.2021.1987806
B. Colosimo, Enrique del Castillo, L. A. Jones‐Farmer, K. Paynabar
Abstract In many applied and industrial settings, the use of Artificial Intelligence (AI) for quality technology is gaining growing attention. AI refers to the broad set of techniques which replicate human cognitive and analytical skills for problem solving, including Machine Learning, Neural Networks and Deep Learning. This paper presents a brief introduction to the special issue, where AI-based solutions are presented to solve problems that are typically faced in the area of quality technology. Limits and advantages of AI-based solutions are briefly discussed to stimulate creative attention to novel solutions and new directions for future research.
在许多应用和工业环境中,使用人工智能(AI)进行质量技术越来越受到关注。人工智能指的是一系列广泛的技术,这些技术可以复制人类解决问题的认知和分析技能,包括机器学习、神经网络和深度学习。本文简要介绍了这一特殊问题,其中提出了基于人工智能的解决方案,以解决质量技术领域通常面临的问题。简要讨论了基于人工智能的解决方案的局限性和优势,以激发对新解决方案和未来研究新方向的创造性关注。
{"title":"Artificial intelligence and statistics for quality technology: an introduction to the special issue","authors":"B. Colosimo, Enrique del Castillo, L. A. Jones‐Farmer, K. Paynabar","doi":"10.1080/00224065.2021.1987806","DOIUrl":"https://doi.org/10.1080/00224065.2021.1987806","url":null,"abstract":"Abstract In many applied and industrial settings, the use of Artificial Intelligence (AI) for quality technology is gaining growing attention. AI refers to the broad set of techniques which replicate human cognitive and analytical skills for problem solving, including Machine Learning, Neural Networks and Deep Learning. This paper presents a brief introduction to the special issue, where AI-based solutions are presented to solve problems that are typically faced in the area of quality technology. Limits and advantages of AI-based solutions are briefly discussed to stimulate creative attention to novel solutions and new directions for future research.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"68 1","pages":"443 - 453"},"PeriodicalIF":2.5,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74036574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
ASQ Membership ASQ会员
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-10-20 DOI: 10.1080/00224065.2021.1987816
{"title":"ASQ Membership","authors":"","doi":"10.1080/00224065.2021.1987816","DOIUrl":"https://doi.org/10.1080/00224065.2021.1987816","url":null,"abstract":"","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"95 1","pages":"587 - 587"},"PeriodicalIF":2.5,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85689095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge-infused process monitoring for quality improvement in solar cell manufacturing processes 为太阳能电池制造过程的质量改进提供知识注入的过程监控
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-10-20 DOI: 10.1080/00224065.2021.1983491
Juan Du, Xi Zhang, Wei Ou
Abstract Solar conversion efficiency (SCE), an important quality metric in solar cell manufacturing processes, is related to chemical vapor deposition in the epitaxy stage based on the photoelectric effect. A large number of solar cell fabrication plants still lack online process monitoring strategies at the epitaxy stage and instead use offline inspections after fabrication is completed. Consequently, production efficiency is reduced due to offline inspections and the quality of wafers in downstream manufacturing stages is uncertain because only a small portion of wafers can be inspected due to random sampling within a single batch. A knowledge-infused monitoring strategy in the epitaxy stage of solar cell manufacturing processes that enables the direct link of online process monitoring to quality SCE is proposed in this study. A customized nonlinear model based on light interference with parameters that can be physically interpreted and largely accepted by practitioners is proposed to capture key information of reflectance signals. Multiple process features are extracted and a general correlation-based variable ranking procedure is adopted in this nonlinear model to rank SCE-correlated key process features. This model enables online process monitoring of key features at the epitaxy stage and allows practitioners to apply timely remedies in case of unexpected conditions. The proposed knowledge-infused process monitoring approach fully considers the physical knowledge from light interference and interpretability of parameters in the established nonlinear model correlated with the quality metric SCE to facilitate the online process monitoring at the epitaxy stage. A real solar cell manufacturing case shows the effectiveness of the proposed monitoring strategy.
摘要太阳能转换效率(SCE)是太阳能电池制造过程中重要的质量指标,它与外延阶段基于光电效应的化学气相沉积有关。大量的太阳能电池制造厂在外延阶段仍然缺乏在线过程监控策略,而是在制造完成后使用离线检测。因此,由于离线检查,生产效率降低,下游制造阶段的晶圆质量不确定,因为在单个批次中随机抽样,只能检查一小部分晶圆。本研究提出了一种在太阳能电池制造过程外延阶段的知识注入监测策略,使在线过程监测与质量SCE直接联系起来。提出了一种基于光干涉的定制非线性模型,该模型具有可物理解释且被从业者广泛接受的参数,用于捕获反射信号的关键信息。该非线性模型提取了多个过程特征,并采用一种通用的基于相关性的变量排序方法对sce相关的关键过程特征进行排序。该模型支持在外延阶段对关键特性进行在线过程监控,并允许从业者在出现意外情况时及时采取补救措施。所提出的知识注入过程监测方法充分考虑了光干涉的物理知识和所建立的非线性模型中与质量度量SCE相关的参数的可解释性,便于在外延阶段进行在线过程监测。一个实际的太阳能电池制造案例表明了所提出的监测策略的有效性。
{"title":"Knowledge-infused process monitoring for quality improvement in solar cell manufacturing processes","authors":"Juan Du, Xi Zhang, Wei Ou","doi":"10.1080/00224065.2021.1983491","DOIUrl":"https://doi.org/10.1080/00224065.2021.1983491","url":null,"abstract":"Abstract Solar conversion efficiency (SCE), an important quality metric in solar cell manufacturing processes, is related to chemical vapor deposition in the epitaxy stage based on the photoelectric effect. A large number of solar cell fabrication plants still lack online process monitoring strategies at the epitaxy stage and instead use offline inspections after fabrication is completed. Consequently, production efficiency is reduced due to offline inspections and the quality of wafers in downstream manufacturing stages is uncertain because only a small portion of wafers can be inspected due to random sampling within a single batch. A knowledge-infused monitoring strategy in the epitaxy stage of solar cell manufacturing processes that enables the direct link of online process monitoring to quality SCE is proposed in this study. A customized nonlinear model based on light interference with parameters that can be physically interpreted and largely accepted by practitioners is proposed to capture key information of reflectance signals. Multiple process features are extracted and a general correlation-based variable ranking procedure is adopted in this nonlinear model to rank SCE-correlated key process features. This model enables online process monitoring of key features at the epitaxy stage and allows practitioners to apply timely remedies in case of unexpected conditions. The proposed knowledge-infused process monitoring approach fully considers the physical knowledge from light interference and interpretability of parameters in the established nonlinear model correlated with the quality metric SCE to facilitate the online process monitoring at the epitaxy stage. A real solar cell manufacturing case shows the effectiveness of the proposed monitoring strategy.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"55 1","pages":"561 - 572"},"PeriodicalIF":2.5,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74381137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Structural tensor-on-tensor regression with interaction effects and its application to a hot rolling process 具有相互作用的结构张量对张量回归及其在热轧过程中的应用
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-09-30 DOI: 10.1080/00224065.2021.1973931
Huihui Miao, Andi Wang, Bing Li, Jianjun Shi
Abstract This paper proposes a method of Structural Tensor-On-Tensosr regression considering the Interaction effects (STOTI). To alleviate the curse of dimensionality and resolve computational challenge, the STOTI method describes the specific structure of the main and interaction effect tensors indicated by the prior knowledge of the data using corresponding regularization terms on their appropriate modes. We designed an ADMM consensus algorithm to estimate these coefficient tensors. Extensive simulations and a real case study of the hot rolling process verified the superiority of the proposed method in terms of estimation and prediction accuracy.
提出了一种考虑相互作用效应的结构张量-张量回归方法。为了减轻维数诅咒和解决计算挑战,STOTI方法使用相应的正则化项来描述由数据的先验知识表示的主效应张量和交互效应张量的特定结构。我们设计了一种ADMM一致性算法来估计这些系数张量。大量的仿真和热轧过程的实际案例研究验证了该方法在估计和预测精度方面的优越性。
{"title":"Structural tensor-on-tensor regression with interaction effects and its application to a hot rolling process","authors":"Huihui Miao, Andi Wang, Bing Li, Jianjun Shi","doi":"10.1080/00224065.2021.1973931","DOIUrl":"https://doi.org/10.1080/00224065.2021.1973931","url":null,"abstract":"Abstract This paper proposes a method of Structural Tensor-On-Tensosr regression considering the Interaction effects (STOTI). To alleviate the curse of dimensionality and resolve computational challenge, the STOTI method describes the specific structure of the main and interaction effect tensors indicated by the prior knowledge of the data using corresponding regularization terms on their appropriate modes. We designed an ADMM consensus algorithm to estimate these coefficient tensors. Extensive simulations and a real case study of the hot rolling process verified the superiority of the proposed method in terms of estimation and prediction accuracy.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"39 1","pages":"547 - 560"},"PeriodicalIF":2.5,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87233166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Explanatory Model Analysis: Explore, Explain, and Examine Predictive Models 解释模型分析:探索、解释和检验预测模型
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-09-21 DOI: 10.1080/00224065.2021.1977101
Bing Si
Predictive models aim to guess, a.k.a., predict, values of a variable of interest based on other variables. It has been used throughout the entire human history and many statistical models have been developed for prediction during the last century. This book covers methods for exploration of predictive models from both instance level and dataset level. It would be a valuable addition to the Chapman & Hall/CRC’s Data Science Series. Together with other books that have published in the book series, this book provides a unique perspective into applied data science to guide data science practitioners who are interested in exploring, explaining, and examining data in real-world applications with both R and Python. Predictive models constitute an important component in the big picture of machine learning and data science approaches and require standard analytical steps such as model specification, model estimation, and model fitness diagnosis. Most of published books in this field focus on how to use these statistical methods to make predictions for different types of datasets, while lack of tools for model exploration and, in particular, model explanation (obtaining insights from model-based prediction) and model examination (evaluation of model performance and understanding its weakness). In contrast, this book is a novel effort that provides a deep understanding to all the steps with extensive validation and justification methods, leading to a better and faster interpretable data analysis. The book is well organized with three parts. It starts with an overview of basic concepts in Chapters 1-4 and then presents the instance-level exploration and datasetlevel exploration in Chapters 5-13 and Chapters 14-20, respectively. The overview part introduces basic and essential knowledge on model development process, software installation, and how to perform classic predictive models using software. The instance-level exploration part covers methods to help better understand “how a model yields a prediction for a particular single observation” for predictive models with both a small and a large number of exploratory variables. The last part is about dataset-level exploration that discusses “how do the model predictions perform overall, for an entire set of observations?” Although a basic understanding of programming languages would be beneficial, the coding part in this book is designed to be self-contained and friendly to readers without programming background as well. Additionally, it is worth noting that the readers are expected to have a certain level of knowledge about different types of data science models, such as logistic regression, support vector machine, and gradient boosting, and understand which kind of research questions each model can address. For example, given a research question aiming at predicting patient survival (yes/no) after surgery from other variables, e.g., age, symptoms, and medical history, the reader should be able to identify th
预测模型的目的是猜测,也就是预测基于其他变量的感兴趣变量的值。它在整个人类历史中一直被使用,在上个世纪,人们开发了许多统计模型来进行预测。这本书涵盖了从实例级和数据集级探索预测模型的方法。这将是查普曼和霍尔/CRC的数据科学系列的一个有价值的补充。与该系列中已出版的其他书籍一起,本书提供了应用数据科学的独特视角,以指导对使用R和Python在实际应用中探索、解释和检查数据感兴趣的数据科学从业者。预测模型是机器学习和数据科学方法的重要组成部分,需要标准的分析步骤,如模型规范、模型估计和模型适应度诊断。该领域出版的大多数书籍都侧重于如何使用这些统计方法对不同类型的数据集进行预测,而缺乏模型探索,特别是模型解释(从基于模型的预测中获得见解)和模型检验(评估模型性能并了解其弱点)的工具。相比之下,这本书是一个新颖的努力,提供了一个深刻的理解与广泛的验证和论证方法的所有步骤,导致一个更好和更快的可解释的数据分析。这本书组织得很好,分为三部分。它首先概述了第1-4章的基本概念,然后分别在第5-13章和第14-20章介绍了实例级探索和数据集级探索。概述部分介绍了模型开发过程、软件安装以及如何使用软件执行经典预测模型的基本和必要知识。实例级探索部分涵盖了帮助更好地理解具有少量和大量探索变量的预测模型的“模型如何产生对特定单个观测的预测”的方法。最后一部分是关于数据集级别的探索,讨论了“对于整个观测集,模型预测的总体表现如何?”虽然对编程语言有基本的了解是有益的,但本书中的编码部分被设计为对没有编程背景的读者也是独立和友好的。此外,值得注意的是,读者应该对不同类型的数据科学模型(如逻辑回归、支持向量机和梯度增强)有一定程度的了解,并了解每种模型可以解决哪种研究问题。例如,给定一个研究问题,旨在从其他变量(如年龄、症状和病史)预测手术后患者的生存(是/否),读者应该能够识别出感兴趣的因变量,生存,是一个二元变量,然后考虑逻辑回归模型作为开始预测建模的自然选择。总的来说,这本书是一本适合数据科学从业者使用R或Python软件学习预测模型的探索性数据分析及其应用的参考书。
{"title":"Explanatory Model Analysis: Explore, Explain, and Examine Predictive Models","authors":"Bing Si","doi":"10.1080/00224065.2021.1977101","DOIUrl":"https://doi.org/10.1080/00224065.2021.1977101","url":null,"abstract":"Predictive models aim to guess, a.k.a., predict, values of a variable of interest based on other variables. It has been used throughout the entire human history and many statistical models have been developed for prediction during the last century. This book covers methods for exploration of predictive models from both instance level and dataset level. It would be a valuable addition to the Chapman & Hall/CRC’s Data Science Series. Together with other books that have published in the book series, this book provides a unique perspective into applied data science to guide data science practitioners who are interested in exploring, explaining, and examining data in real-world applications with both R and Python. Predictive models constitute an important component in the big picture of machine learning and data science approaches and require standard analytical steps such as model specification, model estimation, and model fitness diagnosis. Most of published books in this field focus on how to use these statistical methods to make predictions for different types of datasets, while lack of tools for model exploration and, in particular, model explanation (obtaining insights from model-based prediction) and model examination (evaluation of model performance and understanding its weakness). In contrast, this book is a novel effort that provides a deep understanding to all the steps with extensive validation and justification methods, leading to a better and faster interpretable data analysis. The book is well organized with three parts. It starts with an overview of basic concepts in Chapters 1-4 and then presents the instance-level exploration and datasetlevel exploration in Chapters 5-13 and Chapters 14-20, respectively. The overview part introduces basic and essential knowledge on model development process, software installation, and how to perform classic predictive models using software. The instance-level exploration part covers methods to help better understand “how a model yields a prediction for a particular single observation” for predictive models with both a small and a large number of exploratory variables. The last part is about dataset-level exploration that discusses “how do the model predictions perform overall, for an entire set of observations?” Although a basic understanding of programming languages would be beneficial, the coding part in this book is designed to be self-contained and friendly to readers without programming background as well. Additionally, it is worth noting that the readers are expected to have a certain level of knowledge about different types of data science models, such as logistic regression, support vector machine, and gradient boosting, and understand which kind of research questions each model can address. For example, given a research question aiming at predicting patient survival (yes/no) after surgery from other variables, e.g., age, symptoms, and medical history, the reader should be able to identify th","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"26 1","pages":"486 - 486"},"PeriodicalIF":2.5,"publicationDate":"2021-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89635696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 62
Probability and statistical inference: From basic principles to advanced models 概率与统计推断:从基本原理到高级模型
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-09-16 DOI: 10.1080/00224065.2021.1977100
Bi Si
{"title":"Probability and statistical inference: From basic principles to advanced models","authors":"Bi Si","doi":"10.1080/00224065.2021.1977100","DOIUrl":"https://doi.org/10.1080/00224065.2021.1977100","url":null,"abstract":"","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"10 1","pages":"485 - 485"},"PeriodicalIF":2.5,"publicationDate":"2021-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73379453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Robustness with respect to class imbalance in artificial intelligence classification algorithms 人工智能分类算法中类不平衡的鲁棒性
IF 2.5 2区 工程技术 Q2 ENGINEERING, INDUSTRIAL Pub Date : 2021-08-30 DOI: 10.1080/00224065.2021.1963200
J. Lian, Laura J. Freeman, Yili Hong, Xinwei Deng
Abstract Artificial intelligence (AI) algorithms, such as deep learning and XGboost, are used in numerous applications including autonomous driving, manufacturing process optimization and medical diagnostics. The robustness of AI algorithms is of great interest as inaccurate prediction could result in safety concerns and limit the adoption of AI systems. In this paper, we propose a framework based on design of experiments to systematically investigate the robustness of AI classification algorithms. A robust classification algorithm is expected to have high accuracy and low variability under different application scenarios. The robustness can be affected by a wide range of factors such as the imbalance of class labels in the training dataset, the chosen prediction algorithm, the chosen dataset of the application, and a change of distribution in the training and test datasets. To investigate the robustness of AI classification algorithms, we conduct a comprehensive set of mixture experiments to collect prediction performance results. Then statistical analyses are conducted to understand how various factors affect the robustness of AI classification algorithms. We summarize our findings and provide suggestions to practitioners in AI applications.
人工智能(AI)算法,如深度学习和XGboost,被用于自动驾驶、制造流程优化和医疗诊断等众多应用中。人工智能算法的鲁棒性非常有趣,因为不准确的预测可能导致安全问题并限制人工智能系统的采用。在本文中,我们提出了一个基于实验设计的框架来系统地研究AI分类算法的鲁棒性。在不同的应用场景下,鲁棒分类算法应具有较高的准确率和较低的可变性。鲁棒性可能受到多种因素的影响,如训练数据集中类标签的不平衡、所选择的预测算法、应用程序所选择的数据集以及训练和测试数据集中分布的变化。为了研究人工智能分类算法的鲁棒性,我们进行了一组全面的混合实验来收集预测性能结果。然后进行统计分析,了解各种因素如何影响人工智能分类算法的鲁棒性。我们总结了我们的发现,并为人工智能应用的从业者提供了建议。
{"title":"Robustness with respect to class imbalance in artificial intelligence classification algorithms","authors":"J. Lian, Laura J. Freeman, Yili Hong, Xinwei Deng","doi":"10.1080/00224065.2021.1963200","DOIUrl":"https://doi.org/10.1080/00224065.2021.1963200","url":null,"abstract":"Abstract Artificial intelligence (AI) algorithms, such as deep learning and XGboost, are used in numerous applications including autonomous driving, manufacturing process optimization and medical diagnostics. The robustness of AI algorithms is of great interest as inaccurate prediction could result in safety concerns and limit the adoption of AI systems. In this paper, we propose a framework based on design of experiments to systematically investigate the robustness of AI classification algorithms. A robust classification algorithm is expected to have high accuracy and low variability under different application scenarios. The robustness can be affected by a wide range of factors such as the imbalance of class labels in the training dataset, the chosen prediction algorithm, the chosen dataset of the application, and a change of distribution in the training and test datasets. To investigate the robustness of AI classification algorithms, we conduct a comprehensive set of mixture experiments to collect prediction performance results. Then statistical analyses are conducted to understand how various factors affect the robustness of AI classification algorithms. We summarize our findings and provide suggestions to practitioners in AI applications.","PeriodicalId":54769,"journal":{"name":"Journal of Quality Technology","volume":"303 1","pages":"505 - 525"},"PeriodicalIF":2.5,"publicationDate":"2021-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77936639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Journal of Quality Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1