Pub Date : 2023-06-13DOI: 10.1007/s10260-023-00707-z
Fulvia Mecatti, Charalambos Sismanidis, Emanuela Furfaro, Pier Luigi Conti
A new class of sampling strategies is proposed that can be applied to population-based surveys targeting a rare trait that is unevenly spread over an area of interest. Our proposal is characterised by the ability to tailor the data collection to specific features and challenges of the survey at hand. It is based on integrating an adaptive component into a sequential selection, which aims both to intensify the detection of positive cases, upon exploiting the spatial clustering, and to provide a flexible framework to manage logistics and budget constraints. A class of estimators is also proposed to account for the selection bias, that are proved unbiased for the population mean (prevalence) as well as consistent and asymptotically Normal distributed. Unbiased variance estimation is also provided. A ready-to-implement weighting system is developed for estimation purposes. Two special strategies included in the proposed class are presented, that are based on the Poisson sampling and proved more efficient. The selection of primary sampling units is also illustrated for tuberculosis prevalence surveys, which are recommended in many countries and supported by the World Health Organisation as an emblematic example of the need for an improved sampling design. Simulation results are given in the tuberculosis application to illustrate the strengths and weaknesses of the proposed sequential adaptive sampling strategies with respect to traditional cross-sectional non-informative sampling as currently suggested by World Health Organisation guidelines.
{"title":"Sequential adaptive strategies for sampling rare clustered populations.","authors":"Fulvia Mecatti, Charalambos Sismanidis, Emanuela Furfaro, Pier Luigi Conti","doi":"10.1007/s10260-023-00707-z","DOIUrl":"10.1007/s10260-023-00707-z","url":null,"abstract":"<p><p>A new class of sampling strategies is proposed that can be applied to population-based surveys targeting a rare trait that is unevenly spread over an area of interest. Our proposal is characterised by the ability to tailor the data collection to specific features and challenges of the survey at hand. It is based on integrating an adaptive component into a sequential selection, which aims both to intensify the detection of positive cases, upon exploiting the spatial clustering, and to provide a flexible framework to manage logistics and budget constraints. A class of estimators is also proposed to account for the selection bias, that are proved unbiased for the population mean (prevalence) as well as consistent and asymptotically Normal distributed. Unbiased variance estimation is also provided. A ready-to-implement weighting system is developed for estimation purposes. Two special strategies included in the proposed class are presented, that are based on the Poisson sampling and proved more efficient. The selection of primary sampling units is also illustrated for tuberculosis prevalence surveys, which are recommended in many countries and supported by the World Health Organisation as an emblematic example of the need for an improved sampling design. Simulation results are given in the tuberculosis application to illustrate the strengths and weaknesses of the proposed sequential adaptive sampling strategies with respect to traditional cross-sectional non-informative sampling as currently suggested by World Health Organisation guidelines.</p>","PeriodicalId":53154,"journal":{"name":"Statistical Methods and Applications","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2023-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10262937/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10195253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-05-16DOI: 10.1007/s10260-023-00701-5
Gianpaolo Zammarchi, Francesco Mola, Claudio Conversano
[This corrects the article DOI: 10.1007/s10260-023-00690-5.].
[这更正了文章DOI:10.1007/s10260-03-00690-5.]。
{"title":"Correction to: Using sentiment analysis to evaluate the impact of the COVID-19 outbreak on Italy's country reputation and stock market performance.","authors":"Gianpaolo Zammarchi, Francesco Mola, Claudio Conversano","doi":"10.1007/s10260-023-00701-5","DOIUrl":"10.1007/s10260-023-00701-5","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.1007/s10260-023-00690-5.].</p>","PeriodicalId":53154,"journal":{"name":"Statistical Methods and Applications","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2023-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10186309/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10045358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-05-02DOI: 10.1007/s10260-023-00703-3
Gianni Betti, Vasco Molini, Dan Pavelesku
In this paper we aim to propose a new method for improving the design effect of household surveys based on a two-stage design in which the first stage clusters, or Primary Selection Units (PSUs), are stratified along administrative boundaries. Improvement of the design effect can result in more precise survey estimates (smaller standard errors and confidence intervals) or in the reduction of the necessary sample size, i.e. a reduction in the budget needed for a survey. The proposed method is based on the availability of a previously conducted poverty maps, i.e. spatial descriptions of the distribution of per capita consumption expenditures, that are finely disaggregated in small geographic units, such as cities, municipalities, districts or other administrative partitions of a country that are directly linked to PSUs. Such information is then used to select PSUs with systematic sampling by introducing further implicit stratification in the survey design, so as to maximise the improvement of the design effect. Since per capita consumption expenditures estimated at PSU level from the poverty mapping are affected by (small) standard errors, in the paper we also perform a simulation study in order to take into account this addition variability.
{"title":"Using poverty maps to improve the design of household surveys: the evidence from Tunisia.","authors":"Gianni Betti, Vasco Molini, Dan Pavelesku","doi":"10.1007/s10260-023-00703-3","DOIUrl":"10.1007/s10260-023-00703-3","url":null,"abstract":"<p><p>In this paper we aim to propose a new method for improving the design effect of household surveys based on a two-stage design in which the first stage clusters, or Primary Selection Units (PSUs), are stratified along administrative boundaries. Improvement of the design effect can result in more precise survey estimates (smaller standard errors and confidence intervals) or in the reduction of the necessary sample size, i.e. a reduction in the budget needed for a survey. The proposed method is based on the availability of a previously conducted poverty maps, i.e. spatial descriptions of the distribution of per capita consumption expenditures, that are finely disaggregated in small geographic units, such as cities, municipalities, districts or other administrative partitions of a country that are directly linked to PSUs. Such information is then used to select PSUs with systematic sampling by introducing further <i>implicit stratification</i> in the survey design, so as to maximise the improvement of the design effect. Since per capita consumption expenditures estimated at PSU level from the poverty mapping are affected by (small) standard errors, in the paper we also perform a simulation study in order to take into account this addition variability.</p>","PeriodicalId":53154,"journal":{"name":"Statistical Methods and Applications","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2023-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10152429/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10045359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-04-03DOI: 10.1007/s10260-023-00690-5
Gianpaolo Zammarchi, Francesco Mola, Claudio Conversano
During the recent Coronavirus disease 2019 (COVID-19) outbreak, the microblogging service Twitter has been widely used to share opinions and reactions to events. Italy was one of the first European countries to be severely affected by the outbreak and to establish lockdown and stay-at-home orders, potentially leading to country reputation damage. We resort to sentiment analysis to investigate changes in opinions about Italy reported on Twitter before and after the COVID-19 outbreak. Using different lexicons-based methods, we find a breakpoint corresponding to the date of the first established case of COVID-19 in Italy that causes a relevant change in sentiment scores used as a proxy of the country's reputation. Next, we demonstrate that sentiment scores about Italy are associated with the values of the FTSE-MIB index, the Italian Stock Exchange main index, as they serve as early detection signals of changes in the values of FTSE-MIB. Lastly, we evaluate whether different machine learning classifiers were able to determine the polarity of tweets posted before and after the outbreak with a different level of accuracy.
{"title":"Using sentiment analysis to evaluate the impact of the COVID-19 outbreak on Italy's country reputation and stock market performance.","authors":"Gianpaolo Zammarchi, Francesco Mola, Claudio Conversano","doi":"10.1007/s10260-023-00690-5","DOIUrl":"10.1007/s10260-023-00690-5","url":null,"abstract":"<p><p>During the recent Coronavirus disease 2019 (COVID-19) outbreak, the microblogging service Twitter has been widely used to share opinions and reactions to events. Italy was one of the first European countries to be severely affected by the outbreak and to establish lockdown and stay-at-home orders, potentially leading to country reputation damage. We resort to sentiment analysis to investigate changes in opinions about Italy reported on Twitter before and after the COVID-19 outbreak. Using different lexicons-based methods, we find a breakpoint corresponding to the date of the first established case of COVID-19 in Italy that causes a relevant change in sentiment scores used as a proxy of the country's reputation. Next, we demonstrate that sentiment scores about Italy are associated with the values of the FTSE-MIB index, the Italian Stock Exchange main index, as they serve as early detection signals of changes in the values of FTSE-MIB. Lastly, we evaluate whether different machine learning classifiers were able to determine the polarity of tweets posted before and after the outbreak with a different level of accuracy.</p>","PeriodicalId":53154,"journal":{"name":"Statistical Methods and Applications","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2023-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10068702/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9715215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-03-30DOI: 10.1007/s10260-023-00688-z
G Alleva, G Arbia, P D Falorsi, V Nardelli, A Zuliani
The COVID-19 pandemic presents an unprecedented clinical and healthcare challenge for the many medical researchers who are attempting to prevent its worldwide spread. It also presents a challenge for statisticians involved in designing appropriate sampling plans to estimate the crucial parameters of the pandemic. These plans are necessary for monitoring and surveillance of the phenomenon and evaluating health policies. In this respect, we can use spatial information and aggregate data regarding the number of verified infections (either hospitalized or in compulsory quarantine) to improve the standard two-stage sampling design broadly adopted for studying human populations. We present an optimal spatial sampling design based on spatially balanced sampling techniques. We prove its relative performance analytically in comparison to other competing sampling plans, and we also study its properties through a series of Monte Carlo experiments. Considering the optimal theoretical properties of the proposed sampling plan and its feasibility, we discuss suboptimal designs that approximate well optimality and are more readily applicable.
{"title":"Optimal two-stage spatial sampling design for estimating critical parameters of SARS-CoV-2 epidemic: Efficiency versus feasibility.","authors":"G Alleva, G Arbia, P D Falorsi, V Nardelli, A Zuliani","doi":"10.1007/s10260-023-00688-z","DOIUrl":"10.1007/s10260-023-00688-z","url":null,"abstract":"<p><p>The COVID-19 pandemic presents an unprecedented clinical and healthcare challenge for the many medical researchers who are attempting to prevent its worldwide spread. It also presents a challenge for statisticians involved in designing appropriate sampling plans to estimate the crucial parameters of the pandemic. These plans are necessary for monitoring and surveillance of the phenomenon and evaluating health policies. In this respect, we can use spatial information and aggregate data regarding the number of verified infections (either hospitalized or in compulsory quarantine) to improve the standard two-stage sampling design broadly adopted for studying human populations. We present an optimal spatial sampling design based on spatially balanced sampling techniques. We prove its relative performance analytically in comparison to other competing sampling plans, and we also study its properties through a series of Monte Carlo experiments. Considering the optimal theoretical properties of the proposed sampling plan and its feasibility, we discuss suboptimal designs that approximate well optimality and are more readily applicable.</p>","PeriodicalId":53154,"journal":{"name":"Statistical Methods and Applications","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10062269/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9769381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-30DOI: 10.1007/s10260-023-00681-6
Angela Maria D'Uggento, Alfonso Piscitelli, Nunziata Ribecco, Germana Scepi
In recent years, the increasing number of natural disasters has raised concerns about the sustainability of our planet's future. As young people comprise the generation that will suffer from the negative effects of climate change, they have become involved in a new climate activism that is also gaining interest in the public debate thanks to the Fridays for Future (FFF) movement. This paper analyses the results of a survey of 1,138 young people in a southern Italian region to explore their perceptions of the extent of environmental problems and their participation in protests of green movements such as the FFF. The statistical analyses perform an ordinal classification tree using an original impurity measure considering both the ordinal nature of the response variable and the heterogeneity of its ordered categories. The results show that respondents are concerned about the threat of climate change and participate in the FFF to claim their right to a healthier planet and encourage people to adopt environmentally friendly practices in their lifestyles. Young people feel they are global citizens, connected through the Internet and social media, and show greater sensitivity to the planet's environmental problems, so they are willing to take effective action to demand sustainable policies from decision-makers. When planning public policies that will affect future generations, it is important for policymakers to know the demands and opinions of key stakeholders, especially young people, in order to plan the most appropriate measures, such as climate change mitigation.
{"title":"Perceived climate change risk and global green activism among young people.","authors":"Angela Maria D'Uggento, Alfonso Piscitelli, Nunziata Ribecco, Germana Scepi","doi":"10.1007/s10260-023-00681-6","DOIUrl":"10.1007/s10260-023-00681-6","url":null,"abstract":"<p><p>In recent years, the increasing number of natural disasters has raised concerns about the sustainability of our planet's future. As young people comprise the generation that will suffer from the negative effects of climate change, they have become involved in a new climate activism that is also gaining interest in the public debate thanks to the Fridays for Future (FFF) movement. This paper analyses the results of a survey of 1,138 young people in a southern Italian region to explore their perceptions of the extent of environmental problems and their participation in protests of green movements such as the FFF. The statistical analyses perform an ordinal classification tree using an original impurity measure considering both the ordinal nature of the response variable and the heterogeneity of its ordered categories. The results show that respondents are concerned about the threat of climate change and participate in the FFF to claim their right to a healthier planet and encourage people to adopt environmentally friendly practices in their lifestyles. Young people feel they are global citizens, connected through the Internet and social media, and show greater sensitivity to the planet's environmental problems, so they are willing to take effective action to demand sustainable policies from decision-makers. When planning public policies that will affect future generations, it is important for policymakers to know the demands and opinions of key stakeholders, especially young people, in order to plan the most appropriate measures, such as climate change mitigation.</p>","PeriodicalId":53154,"journal":{"name":"Statistical Methods and Applications","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2023-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9885933/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10654157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-08DOI: 10.1007/s10260-022-00677-8
Federico Fioravanti, Fernando Delbianco, Fernando Tohmé
Results in contact sports like Rugby are mainly interpreted in terms of the ability and/or luck of teams. But this neglects the important role of the motivation of players, reflected in the effort exerted in the game. Here we present a Bayesian hierarchical model to infer the main features that explain score differences in rugby matches of the English Premiership Rugby 2020/2021 season. The main result is that, indeed, effort (seen as a ratio between the number of tries and the scoring kick attempts) is highly relevant to explain outcomes in those matches.
{"title":"The relative importance of ability, luck and motivation in team sports: a Bayesian model of performance in the English Rugby Premiership.","authors":"Federico Fioravanti, Fernando Delbianco, Fernando Tohmé","doi":"10.1007/s10260-022-00677-8","DOIUrl":"10.1007/s10260-022-00677-8","url":null,"abstract":"<p><p>Results in contact sports like Rugby are mainly interpreted in terms of the ability and/or luck of teams. But this neglects the important role of the <i>motivation</i> of players, reflected in the effort exerted in the game. Here we present a Bayesian hierarchical model to infer the main features that explain score differences in rugby matches of the English Premiership Rugby 2020/2021 season. The main result is that, indeed, <i>effort</i> (seen as a ratio between the number of tries and the scoring kick attempts) is highly relevant to explain outcomes in those matches.</p>","PeriodicalId":53154,"journal":{"name":"Statistical Methods and Applications","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9734849/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10766447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-10-24DOI: 10.1007/s10260-022-00666-x
Martina Vittorietti, Ornella Giambalvo, Vincenzo Giuseppe Genova, Fabio Aiello
Students' and graduates' mobility is an interesting topic of discussion especially for the Italian education system and universities. The main reasons for migration and for the so called brain drain, can be found in the socio-economic context and in the famous North-South divide. Measuring mobility and understanding its dynamic over time and space are not trivial tasks. Most of the studies in the related literature focus on the determinants of such phenomenon, in this paper, instead, combining tools coming from graph theory and Topological Data Analysis we propose a new measure for the attitude to mobility. Each mobility trajectory is represented by a graph and the importance of the features constituting the graph are evaluated over time using persistence diagrams. The attitude to mobility of the students is then ranked computing the distance between the individual persistence diagram and the theoretical persistence diagram of the stayer student. The new approach is used for evaluating the mobility of the students that in 2008 enrolled in an Italian university. The relation between attitude to mobility and the main socio-demographic variables is investigated.
{"title":"A new measure for the attitude to mobility of Italian students and graduates: a topological data analysis approach.","authors":"Martina Vittorietti, Ornella Giambalvo, Vincenzo Giuseppe Genova, Fabio Aiello","doi":"10.1007/s10260-022-00666-x","DOIUrl":"10.1007/s10260-022-00666-x","url":null,"abstract":"<p><p>Students' and graduates' mobility is an interesting topic of discussion especially for the Italian education system and universities. The main reasons for migration and for the so called brain drain, can be found in the socio-economic context and in the famous North-South divide. Measuring mobility and understanding its dynamic over time and space are not trivial tasks. Most of the studies in the related literature focus on the determinants of such phenomenon, in this paper, instead, combining tools coming from graph theory and Topological Data Analysis we propose a new measure for the attitude to mobility. Each mobility trajectory is represented by a graph and the importance of the features constituting the graph are evaluated over time using persistence diagrams. The attitude to mobility of the students is then ranked computing the distance between the individual persistence diagram and the theoretical persistence diagram of the stayer student. The new approach is used for evaluating the mobility of the students that in 2008 enrolled in an Italian university. The relation between attitude to mobility and the main socio-demographic variables is investigated.</p>","PeriodicalId":53154,"journal":{"name":"Statistical Methods and Applications","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9589705/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40656803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-10-24DOI: 10.1007/s10260-022-00668-9
Andrea Pastore, Stefano F Tonellato, Emanuele Aliverti, Stefano Campostrini
Morbidity is one of the key aspects for assessing populations' well-being. In particular, chronic diseases negatively affect the quality of life in the old age and the risk that more years added to lives are years of disability and illness. Novel analysis, interventions and policies are required to understand and potentially mitigate this issue. In this article, we focus on investigating whether in Italy the compression of morbidity is in act in the recent years, parallely to an increase of life expectancy. Our analysis rely on large repeated cross-sectional data from the national surveillance system passi, providing deep insights on the evolution of morbidity together with other socio-demographical variables. In addition, we investigate differences in morbidity across subgroups, focusing on disparities by gender, level of education and economic difficulties, and assessing the evolution of these differences across the period 2013-2019.
{"title":"When does morbidity start? An analysis of changes in morbidity between 2013 and 2019 in Italy.","authors":"Andrea Pastore, Stefano F Tonellato, Emanuele Aliverti, Stefano Campostrini","doi":"10.1007/s10260-022-00668-9","DOIUrl":"10.1007/s10260-022-00668-9","url":null,"abstract":"<p><p>Morbidity is one of the key aspects for assessing populations' well-being. In particular, chronic diseases negatively affect the quality of life in the old age and the risk that more years added to lives are years of disability and illness. Novel analysis, interventions and policies are required to understand and potentially mitigate this issue. In this article, we focus on investigating whether in Italy the compression of morbidity is in act in the recent years, parallely to an increase of life expectancy. Our analysis rely on large repeated cross-sectional data from the national surveillance system passi, providing deep insights on the evolution of morbidity together with other socio-demographical variables. In addition, we investigate differences in morbidity across subgroups, focusing on disparities by gender, level of education and economic difficulties, and assessing the evolution of these differences across the period 2013-2019.</p>","PeriodicalId":53154,"journal":{"name":"Statistical Methods and Applications","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9589774/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40656802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-10-12DOI: 10.1007/s10260-022-00661-2
Viviana Carcaiso, Leonardo Grilli
The extension of quantile regression to count data raises several issues. We compare the traditional approach, based on transforming the count variable using jittering, with a recently proposed approach in which the coefficients of quantile regression are modelled by parametric functions. We exploit both methods to analyse university students' data to evaluate the effect of emergency remote teaching due to COVID-19 on the number of credits earned by the students. The coefficients modelling approach performs a smoothing that is especially convenient in the tails of the distribution, preventing abrupt changes in the point estimates and increasing precision. Nonetheless, model selection is challenging because of the wide range of options and the limited availability of diagnostic tools. Thus the jittering approach remains fundamental to guide the choice of the parametric functions.
{"title":"Quantile regression for count data: jittering versus regression coefficients modelling in the analysis of credits earned by university students after remote teaching.","authors":"Viviana Carcaiso, Leonardo Grilli","doi":"10.1007/s10260-022-00661-2","DOIUrl":"10.1007/s10260-022-00661-2","url":null,"abstract":"<p><p>The extension of quantile regression to count data raises several issues. We compare the traditional approach, based on transforming the count variable using jittering, with a recently proposed approach in which the coefficients of quantile regression are modelled by parametric functions. We exploit both methods to analyse university students' data to evaluate the effect of emergency remote teaching due to COVID-19 on the number of credits earned by the students. The coefficients modelling approach performs a smoothing that is especially convenient in the tails of the distribution, preventing abrupt changes in the point estimates and increasing precision. Nonetheless, model selection is challenging because of the wide range of options and the limited availability of diagnostic tools. Thus the jittering approach remains fundamental to guide the choice of the parametric functions.</p>","PeriodicalId":53154,"journal":{"name":"Statistical Methods and Applications","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9554398/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33515378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}