Pub Date : 2024-11-01Epub Date: 2024-07-02DOI: 10.1007/s13571-024-00336-w
Qing Yin, Jong-Hyeon Jeong, Xu Qin, Shyamal D Peddada, Jennifer J Adibi
Often linear regression is used to estimate mediation effects. In many instances the underlying relationships may not be linear. Although, the exact functional form of the relationship may be unknown, based on the underlying science, one may hypothesize the shape of the relationship. For these reasons, we develop a novel shape-restricted inference-based methodology for conducting mediation analysis. This work is motivated by an application in fetal endocrinology where researchers are interested in understanding the effects of pesticide application on birth weight, with human chorionic gonadotropin (hCG) as the mediator. Using the proposed methodology on a population-level prenatal screening program data, with hCG as the mediator, we discovered that while the natural direct effects suggest a positive association between pesticide application and birth weight, the natural indirect effects were negative.
{"title":"Mediation Analysis using Semi-parametric Shape-Restricted Regression with Applications.","authors":"Qing Yin, Jong-Hyeon Jeong, Xu Qin, Shyamal D Peddada, Jennifer J Adibi","doi":"10.1007/s13571-024-00336-w","DOIUrl":"10.1007/s13571-024-00336-w","url":null,"abstract":"<p><p>Often linear regression is used to estimate mediation effects. In many instances the underlying relationships may not be linear. Although, the exact functional form of the relationship may be unknown, based on the underlying science, one may hypothesize the shape of the relationship. For these reasons, we develop a novel shape-restricted inference-based methodology for conducting mediation analysis. This work is motivated by an application in fetal endocrinology where researchers are interested in understanding the effects of pesticide application on birth weight, with human chorionic gonadotropin (hCG) as the mediator. Using the proposed methodology on a population-level prenatal screening program data, with hCG as the mediator, we discovered that while the natural direct effects suggest a positive association between pesticide application and birth weight, the natural indirect effects were negative.</p>","PeriodicalId":85487,"journal":{"name":"Sankhya. Series B. [Methodological.]","volume":"86 2","pages":"669-689"},"PeriodicalIF":0.0,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11615969/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142782112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-12-01Epub Date: 2019-02-07DOI: 10.1007/s13571-018-0183-0
Runmin Shi, Faming Liang, Qifan Song, Ye Luo, Malay Ghosh
The drastic improvement in data collection and acquisition technologies has enabled scientists to collect a great amount of data. With the growing dataset size, typically comes a growing complexity of data structures and of complex models to account for the data structures. How to estimate the parameters of complex models has put a great challenge on current statistical methods. This paper proposes a blockwise consistency approach as a potential solution to the problem, which works by iteratively finding consistent estimates for each block of parameters conditional on the current estimates of the parameters in other blocks. The blockwise consistency approach decomposes the high-dimensional parameter estimation problem into a series of lower-dimensional parameter estimation problems, which often have much simpler structures than the original problem and thus can be easily solved. Moreover, under the framework provided by the blockwise consistency approach, a variety of methods, such as Bayesian and frequentist methods, can be jointly used to achieve a consistent estimator for the original high-dimensional complex model. The blockwise consistency approach is illustrated using two high-dimensional problems, variable selection and multivariate regression. The results of both problems show that the blockwise consistency approach can provide drastic improvements over the existing methods. Extension of the blockwise consistency approach to many other complex models is straightforward.
{"title":"A Blockwise Consistency Method for Parameter Estimation of Complex Models.","authors":"Runmin Shi, Faming Liang, Qifan Song, Ye Luo, Malay Ghosh","doi":"10.1007/s13571-018-0183-0","DOIUrl":"10.1007/s13571-018-0183-0","url":null,"abstract":"<p><p>The drastic improvement in data collection and acquisition technologies has enabled scientists to collect a great amount of data. With the growing dataset size, typically comes a growing complexity of data structures and of complex models to account for the data structures. How to estimate the parameters of complex models has put a great challenge on current statistical methods. This paper proposes a <i>blockwise consistency</i> approach as a potential solution to the problem, which works by iteratively finding consistent estimates for each block of parameters conditional on the current estimates of the parameters in other blocks. The blockwise consistency approach decomposes the high-dimensional parameter estimation problem into a series of lower-dimensional parameter estimation problems, which often have much simpler structures than the original problem and thus can be easily solved. Moreover, under the framework provided by the blockwise consistency approach, a variety of methods, such as Bayesian and frequentist methods, can be jointly used to achieve a consistent estimator for the original high-dimensional complex model. The blockwise consistency approach is illustrated using two high-dimensional problems, variable selection and multivariate regression. The results of both problems show that the blockwise consistency approach can provide drastic improvements over the existing methods. Extension of the blockwise consistency approach to many other complex models is straightforward.</p>","PeriodicalId":85487,"journal":{"name":"Sankhya. Series B. [Methodological.]","volume":"80 1 Suppl","pages":"179-223"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8026010/pdf/nihms-996656.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25583525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2011-05-01DOI: 10.1007/s13571-011-0019-7
Rabi Bhattacharya, Lizhen Lin
We consider the finite sample performance of a new nonparametric method for bioassay and benchmark analysis in risk assessment, which averages isotonic MLEs based on disjoint subgroups of dosages, and whose asymptotic behavior is essentially optimal (Bhattacharya and Lin (2010)). It is compared with three other methods, including the leading kernel-based method, called DNP, due to Dette et al. (2005) and Dette and Scheder (2010). In simulation studies, the present method, termed NAM, outperforms the DNP in the majority of cases considered, although both methods generally do well. In small samples, NAM and DNP both outperform the MLE.
我们考虑了一种新的用于风险评估的生物测定和基准分析的非参数方法的有限样本性能,该方法基于不相交的剂量亚组平均等渗MLEs,其渐近行为本质上是最优的(Bhattacharya和Lin(2010))。将其与其他三种方法进行比较,包括由Dette et al.(2005)和Dette and Scheder(2010)提出的基于核的领先方法DNP。在模拟研究中,目前的方法,称为NAM,在考虑的大多数情况下优于DNP,尽管两种方法通常都做得很好。在小样本中,NAM和DNP都优于MLE。
{"title":"NONPARAMETRIC BENCHMARK ANALYSIS IN RISK ASSESSMENT: A COMPARATIVE STUDY BY SIMULATION AND DATA ANALYSIS.","authors":"Rabi Bhattacharya, Lizhen Lin","doi":"10.1007/s13571-011-0019-7","DOIUrl":"https://doi.org/10.1007/s13571-011-0019-7","url":null,"abstract":"<p><p>We consider the finite sample performance of a new nonparametric method for bioassay and benchmark analysis in risk assessment, which averages isotonic MLEs based on disjoint subgroups of dosages, and whose asymptotic behavior is essentially optimal (Bhattacharya and Lin (2010)). It is compared with three other methods, including the leading kernel-based method, called <i>DNP</i>, due to Dette et al. (2005) and Dette and Scheder (2010). In simulation studies, the present method, termed <i>NAM</i>, outperforms the <i>DNP</i> in the majority of cases considered, although both methods generally do well. In small samples, NAM and DNP both outperform the MLE.</p>","PeriodicalId":85487,"journal":{"name":"Sankhya. Series B. [Methodological.]","volume":"73 1","pages":"144-163"},"PeriodicalIF":0.0,"publicationDate":"2011-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s13571-011-0019-7","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"31476113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Rejoinder on Discussion of: What's So Special About Semiparametric Methods?","authors":"Michael R Kosorok","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":85487,"journal":{"name":"Sankhya. Series B. [Methodological.]","volume":"71-A 2","pages":"369-371"},"PeriodicalIF":0.0,"publicationDate":"2009-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2903066/pdf/nihms195720.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"29131410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The number of scientific publications on semiparametric methods per year has been steadily increasing since the early 1980s. This increased interest has happened in spite of the fact that the novelty of semiparametrics for its own sake has run its course, and semiparametric methods are by now considered classical. The underlying reasons for this continued interest include the genuine scientific utility of semiparametric models combined with the breadth and depth of the many theoretical questions that remain to be answered. Empirical process techniques are an essential research tool for many of these questions. Moreover, both semiparametric methods and empirical processes are playing an increasingly valuable role in high dimensional data analysis and in other emerging areas in statistics. The topics are very fruitful and intriguing for new researchers to engage in. Graduate programs in statistics, biostatistics and econometrics can and should include more empirical processes and semiparametrics in their teaching in order to ensure a sufficient supply of suitably qualified researchers.
{"title":"What's So Special About Semiparametric Methods?","authors":"Michael R Kosorok","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The number of scientific publications on semiparametric methods per year has been steadily increasing since the early 1980s. This increased interest has happened in spite of the fact that the novelty of semiparametrics for its own sake has run its course, and semiparametric methods are by now considered classical. The underlying reasons for this continued interest include the genuine scientific utility of semiparametric models combined with the breadth and depth of the many theoretical questions that remain to be answered. Empirical process techniques are an essential research tool for many of these questions. Moreover, both semiparametric methods and empirical processes are playing an increasingly valuable role in high dimensional data analysis and in other emerging areas in statistics. The topics are very fruitful and intriguing for new researchers to engage in. Graduate programs in statistics, biostatistics and econometrics can and should include more empirical processes and semiparametrics in their teaching in order to ensure a sufficient supply of suitably qualified researchers.</p>","PeriodicalId":85487,"journal":{"name":"Sankhya. Series B. [Methodological.]","volume":"71-A 2","pages":"331-353"},"PeriodicalIF":0.0,"publicationDate":"2009-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2903063/pdf/nihms195719.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"29131409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this article we introduce a new procedure for estimating population parameters under inequality constraints (known as order restrictions) when the unrestricted maximum liklelihood estimator (UMLE) is multivariate normally distributed with a known covariance matrix. Furthermore, a Dunnett-type test procedure along with the corresponding simultaneous confidence intervals are proposed for drawing inferences on elementary contrasts of population parameters under order restrictions. The proposed methodology is motivated by estimation and testing problems encountered in the analysis of covariance models. It is well-known that the restricted maximum likelihood estimator (RMLE) may perform poorly under certain conditions in terms of quadratic loss. For example, when the UMLE is distributed according to multivariate normal distribution with means satisfying simple tree order restriction and the dimension of the population mean vector is large. We investigate the performance of the proposed estimator analytically as well as using computer simulations and discover that the proposed method does not fail in the situations where RMLE fails. We illustrate the proposed methodology by re-analyzing a recently published rat uterotrophic bioassay data.
{"title":"Statistical inference under order restrictions in analysis of covariance using a modified restricted maximum likelihood estimator.","authors":"Joshua Betcher, Shyamal D Peddada","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>In this article we introduce a new procedure for estimating population parameters under inequality constraints (known as order restrictions) when the unrestricted maximum liklelihood estimator (UMLE) is multivariate normally distributed with a known covariance matrix. Furthermore, a Dunnett-type test procedure along with the corresponding simultaneous confidence intervals are proposed for drawing inferences on elementary contrasts of population parameters under order restrictions. The proposed methodology is motivated by estimation and testing problems encountered in the analysis of covariance models. It is well-known that the restricted maximum likelihood estimator (RMLE) may perform poorly under certain conditions in terms of quadratic loss. For example, when the UMLE is distributed according to multivariate normal distribution with means satisfying simple tree order restriction and the dimension of the population mean vector is large. We investigate the performance of the proposed estimator analytically as well as using computer simulations and discover that the proposed method does not fail in the situations where RMLE fails. We illustrate the proposed methodology by re-analyzing a recently published rat uterotrophic bioassay data.</p>","PeriodicalId":85487,"journal":{"name":"Sankhya. Series B. [Methodological.]","volume":"71 1","pages":"79-96"},"PeriodicalIF":0.0,"publicationDate":"2009-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2955899/pdf/nihms202681.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"29359373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
"A class of analytical models to study the distribution of maternal age at different births from the data on age-specific fertility rates has been presented. Deriving the distributions and means of maternal age at birth of any specific order, final parity and at next-to-last birth, we have extended the approach to estimate parity progression ratios and the ultimate parity distribution of women in the population.... We illustrate computations of various components of the model expressions with the current fertility experiences of the United States for 1970."
{"title":"Some analytical models to estimate maternal age at birth using age-specific fertility rates.","authors":"A Pandey, C M Suchindran","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>\"A class of analytical models to study the distribution of maternal age at different births from the data on age-specific fertility rates has been presented. Deriving the distributions and means of maternal age at birth of any specific order, final parity and at next-to-last birth, we have extended the approach to estimate parity progression ratios and the ultimate parity distribution of women in the population.... We illustrate computations of various components of the model expressions with the current fertility experiences of the United States for 1970.\"</p>","PeriodicalId":85487,"journal":{"name":"Sankhya. Series B. [Methodological.]","volume":"57 1","pages":"142-50"},"PeriodicalIF":0.0,"publicationDate":"1995-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"22029290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
"In this paper, a set of two probability models have been derived to describe the variation in the length of open birth interval of women having given birth to a child during the last 'T' years of their current reproductive age. The first model is derived by assuming the reproduction process as steady-state, the second is obtained by varying the fecundability parameter involved in the first model after the last birth. These models are applied to the three sets of data, one collected from [the Indian] Varanasi-survey, 1969-70 and the other two generated from the data on age-specific fertility rates using the life table technique. The biological parameters such as fecundability and secondary sterility have been estimated using some simple procedure of estimation."
{"title":"On some stochastic models of open birth interval.","authors":"V K Tiwari, S N Dwivedi","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>\"In this paper, a set of two probability models have been derived to describe the variation in the length of open birth interval of women having given birth to a child during the last 'T' years of their current reproductive age. The first model is derived by assuming the reproduction process as steady-state, the second is obtained by varying the fecundability parameter involved in the first model after the last birth. These models are applied to the three sets of data, one collected from [the Indian] Varanasi-survey, 1969-70 and the other two generated from the data on age-specific fertility rates using the life table technique. The biological parameters such as fecundability and secondary sterility have been estimated using some simple procedure of estimation.\"</p>","PeriodicalId":85487,"journal":{"name":"Sankhya. Series B. [Methodological.]","volume":"56 1","pages":"26-38"},"PeriodicalIF":0.0,"publicationDate":"1994-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"22019020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On some bivariate distributions of number of births.","authors":"B N Bhattacharya, D C Nath","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":85487,"journal":{"name":"Sankhya. Series B. [Methodological.]","volume":"47 3","pages":"372-84"},"PeriodicalIF":0.0,"publicationDate":"1985-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"22006550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Marriage trends and their demographic implications.","authors":"M Majumdar, A D Gupta","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":85487,"journal":{"name":"Sankhya. Series B. [Methodological.]","volume":"31 3-4","pages":"491-500"},"PeriodicalIF":0.0,"publicationDate":"1969-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"22009629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}