Mònica González-Carrasco, Silvana Aciar, Ferran Casas, Xavier Oriol, Ramon Fabregat, Sara Malo
{"title":"A Machine Learning Approach to Well-Being in Late Childhood and Early Adolescence: The Children’s Worlds Data Case","authors":"Mònica González-Carrasco, Silvana Aciar, Ferran Casas, Xavier Oriol, Ramon Fabregat, Sara Malo","doi":"10.1007/s11205-024-03429-1","DOIUrl":null,"url":null,"abstract":"<p>Explaining what leads to higher or lower levels of subjective well-being (SWB) in childhood and adolescence is one of the cornerstones within this field of studies, since it can lead to the development of more focused preventive and promotion actions. Although many indicators of SWB have been identified, selecting one over the other to obtain a reasonably short list poses a challenge, given that models are particularly sensitive to the indicators considered.Two Machine Learning (ML) algorithms, one based on Extreme Gradient Boosting and Random Forest and the other on Lineal Regression, were applied to 77 indicators included in the 3rd wave of the Children’s Worlds project and then compared. ExtremeGradient Boosting outperforms the other two, while Lineal Regression outperforms Random Forest. Moreover, the Extreme Gradient Boosting algorithm was used to compare models for each of the 35 participating countries with that of the pooled sample on the basis of responses from 93,349 children and adolescents collected through a representative sampling and belonging to the 10 and 12-year-olds age groups. Large differences were detected by country with regard to the importance of these 77 indicators in explaining the scores for the five-item-version of the CWSWBS5 (Children’s Worlds Subjective Well-Being Scale). The process followed highlights the greater capacity of some ML techniques in providing models with higher explanatory power and less error, and in more clearly differentiating between the contributions of the different indicators to explain children’s and adolescents’ SWB. This finding is useful when it comes to designing shorter but more reliable questionnaires (a selection of 29 indicators were used in this case).</p>","PeriodicalId":21943,"journal":{"name":"Social Indicators Research","volume":"3 1","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Social Indicators Research","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1007/s11205-024-03429-1","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL SCIENCES, INTERDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Explaining what leads to higher or lower levels of subjective well-being (SWB) in childhood and adolescence is one of the cornerstones within this field of studies, since it can lead to the development of more focused preventive and promotion actions. Although many indicators of SWB have been identified, selecting one over the other to obtain a reasonably short list poses a challenge, given that models are particularly sensitive to the indicators considered.Two Machine Learning (ML) algorithms, one based on Extreme Gradient Boosting and Random Forest and the other on Lineal Regression, were applied to 77 indicators included in the 3rd wave of the Children’s Worlds project and then compared. ExtremeGradient Boosting outperforms the other two, while Lineal Regression outperforms Random Forest. Moreover, the Extreme Gradient Boosting algorithm was used to compare models for each of the 35 participating countries with that of the pooled sample on the basis of responses from 93,349 children and adolescents collected through a representative sampling and belonging to the 10 and 12-year-olds age groups. Large differences were detected by country with regard to the importance of these 77 indicators in explaining the scores for the five-item-version of the CWSWBS5 (Children’s Worlds Subjective Well-Being Scale). The process followed highlights the greater capacity of some ML techniques in providing models with higher explanatory power and less error, and in more clearly differentiating between the contributions of the different indicators to explain children’s and adolescents’ SWB. This finding is useful when it comes to designing shorter but more reliable questionnaires (a selection of 29 indicators were used in this case).
期刊介绍:
Since its foundation in 1974, Social Indicators Research has become the leading journal on problems related to the measurement of all aspects of the quality of life. The journal continues to publish results of research on all aspects of the quality of life and includes studies that reflect developments in the field. It devotes special attention to studies on such topics as sustainability of quality of life, sustainable development, and the relationship between quality of life and sustainability. The topics represented in the journal cover and involve a variety of segmentations, such as social groups, spatial and temporal coordinates, population composition, and life domains. The journal presents empirical, philosophical and methodological studies that cover the entire spectrum of society and are devoted to giving evidences through indicators. It considers indicators in their different typologies, and gives special attention to indicators that are able to meet the need of understanding social realities and phenomena that are increasingly more complex, interrelated, interacted and dynamical. In addition, it presents studies aimed at defining new approaches in constructing indicators.