Ricardo Accorsi Casonatto , Tales De Pádua Grillo Souza , Ari Melo Mariano
{"title":"Quality and Risk Management in Data Mining: A CRISP-DM Perspective.","authors":"Ricardo Accorsi Casonatto , Tales De Pádua Grillo Souza , Ari Melo Mariano","doi":"10.1016/j.procs.2024.08.257","DOIUrl":null,"url":null,"abstract":"<div><p>The area of data science knowledge responsible for dealing with this new reality is diffuse, including mathematics, statistics, computing, engineering, psychology, and administration, among many other areas that make up a new scenario that is still changing. Different models have emerged over the years to systematize the procedures to be followed. Among them, CRISP-DM (Cross Industry Standard Process for Data Mining) has become one of the most widespread in the industry. However, the lack of detailed instructions means the framework is often incorrectly used. Therefore, this research aims to present a utilitarian and didactic model based on the latest advances in the literature and through the lens of production engineering. In order to achieve this objective, exploratory research was carried out based on a systematic review and subsequent categorization of each of the CRISP-DM steps, detailing the authors’ contributions to each stage. In addition, it is proposed that guidelines from the areas of Quality Management and Risk Management be added to the subject, consolidating a useful and didactic model of relevance.</p></div>","PeriodicalId":20465,"journal":{"name":"Procedia Computer Science","volume":"242 ","pages":"Pages 161-168"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1877050924019768/pdf?md5=db8eb2579aadeaa41b23b028ffc4301a&pid=1-s2.0-S1877050924019768-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Procedia Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877050924019768","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The area of data science knowledge responsible for dealing with this new reality is diffuse, including mathematics, statistics, computing, engineering, psychology, and administration, among many other areas that make up a new scenario that is still changing. Different models have emerged over the years to systematize the procedures to be followed. Among them, CRISP-DM (Cross Industry Standard Process for Data Mining) has become one of the most widespread in the industry. However, the lack of detailed instructions means the framework is often incorrectly used. Therefore, this research aims to present a utilitarian and didactic model based on the latest advances in the literature and through the lens of production engineering. In order to achieve this objective, exploratory research was carried out based on a systematic review and subsequent categorization of each of the CRISP-DM steps, detailing the authors’ contributions to each stage. In addition, it is proposed that guidelines from the areas of Quality Management and Risk Management be added to the subject, consolidating a useful and didactic model of relevance.