Estimating the mean of a small sample under the two parameter lognormal distribution

P. Hingley
{"title":"Estimating the mean of a small sample under the two parameter lognormal distribution","authors":"P. Hingley","doi":"10.11145/TEXTS.2018.02.027","DOIUrl":null,"url":null,"abstract":"When making statistical inferences about the means of small samples, the confidence limits for the mean are often calculated assuming a normal distribution. But many biological variables follow the lognormal distribution \\cite{Johnson}, for example the birth weights of babies (EG data at \\cite{Iannelli}). Here, sampling distributions (probability density functions) are found for the maximum likelihood estimates (MLE) of sample mean and variance when data are lognormally distributed. They are derived analytically, making some use of the Technique for Estimator Densities (TED) \\cite{Hingley}, and then checked by using simulations with random numbers. For an I.I.D. sample of size n with lognormal estimation, the sample mean has a lognormal distribution that is conditional on the variance. The distribution of the sample variance of the logarithms follows the usual transform of the central chi-squared distribution. The joint distribution of the sample mean and variance shows the extent to which the mean is affected by the variance. When a normal distribution is wrongly used for estimation on lognormally distributed data, the sample mean still has a lognormal distribution. But the distribution for the MLE of the variance differs. From the distribution for one observation, that for larger sample sizes can be approached by using convolutions. The assumption of a normal estimation model biases the confidence interval for the mean. There is a discussion of the extent to which this is of practical importance when estimating means for small samples of birth weights and other lognormally distributed data sets.","PeriodicalId":370233,"journal":{"name":"Biomath Communications Supplement","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomath Communications Supplement","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11145/TEXTS.2018.02.027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

When making statistical inferences about the means of small samples, the confidence limits for the mean are often calculated assuming a normal distribution. But many biological variables follow the lognormal distribution \cite{Johnson}, for example the birth weights of babies (EG data at \cite{Iannelli}). Here, sampling distributions (probability density functions) are found for the maximum likelihood estimates (MLE) of sample mean and variance when data are lognormally distributed. They are derived analytically, making some use of the Technique for Estimator Densities (TED) \cite{Hingley}, and then checked by using simulations with random numbers. For an I.I.D. sample of size n with lognormal estimation, the sample mean has a lognormal distribution that is conditional on the variance. The distribution of the sample variance of the logarithms follows the usual transform of the central chi-squared distribution. The joint distribution of the sample mean and variance shows the extent to which the mean is affected by the variance. When a normal distribution is wrongly used for estimation on lognormally distributed data, the sample mean still has a lognormal distribution. But the distribution for the MLE of the variance differs. From the distribution for one observation, that for larger sample sizes can be approached by using convolutions. The assumption of a normal estimation model biases the confidence interval for the mean. There is a discussion of the extent to which this is of practical importance when estimating means for small samples of birth weights and other lognormally distributed data sets.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在双参数对数正态分布下估计小样本的均值
在对小样本的均值进行统计推断时,通常假设正态分布来计算均值的置信限。但许多生物变量遵循对数正态分布\cite{Johnson},例如婴儿的出生体重(EG数据见\cite{Iannelli})。在这里,当数据是对数正态分布时,找到样本均值和方差的最大似然估计(MLE)的抽样分布(概率密度函数)。它们是解析导出的,使用了一些估计密度技术(TED) \cite{Hingley},然后通过使用随机数模拟进行验证。对于大小为n且具有对数正态估计的i.i.d样本,样本均值具有以方差为条件的对数正态分布。对数的样本方差的分布遵循通常的中心卡方分布的变换。样本均值和方差的联合分布表明了均值受方差影响的程度。当错误地使用正态分布来估计对数正态分布的数据时,样本均值仍然是对数正态分布。但是方差的最大似然值的分布是不同的。从一个观测值的分布来看,对于更大的样本量,可以使用卷积来接近。正态估计模型的假设使平均值的置信区间产生偏差。在估计出生体重和其他对数正态分布数据集的小样本平均值时,这在多大程度上具有实际重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A class of individual-based models Estimating the mean of a small sample under the two parameter lognormal distribution Analysis of model for the transmission dynamics of Zika with sterile insect technique Modelling of activator-inhibitor dynamics via nonlocal integral operators
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1