不偏不倚地根据数据拟合方程

Chris Tofallis
{"title":"不偏不倚地根据数据拟合方程","authors":"Chris Tofallis","doi":"arxiv-2409.02573","DOIUrl":null,"url":null,"abstract":"We consider the problem of fitting a relationship (e.g. a potential\nscientific law) to data involving multiple variables. Ordinary (least squares)\nregression is not suitable for this because the estimated relationship will\ndiffer according to which variable is chosen as being dependent, and the\ndependent variable is unrealistically assumed to be the only variable which has\nany measurement error (noise). We present a very general method for estimating\na linear functional relationship between multiple noisy variables, which are\ntreated impartially, i.e. no distinction between dependent and independent\nvariables. The data are not assumed to follow any distribution, but all\nvariables are treated as being equally reliable. Our approach extends the\ngeometric mean functional relationship to multiple dimensions. This is\nespecially useful with variables measured in different units, as it is\nnaturally scale-invariant, whereas orthogonal regression is not. This is\nbecause our approach is not based on minimizing distances, but on the symmetric\nconcept of correlation. The estimated coefficients are easily obtained from the\ncovariances or correlations, and correspond to geometric means of associated\nleast squares coefficients. The ease of calculation will hopefully allow\nwidespread application of impartial fitting to estimate relationships in a\nneutral way.","PeriodicalId":501293,"journal":{"name":"arXiv - ECON - Econometrics","volume":"36 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fitting an Equation to Data Impartially\",\"authors\":\"Chris Tofallis\",\"doi\":\"arxiv-2409.02573\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider the problem of fitting a relationship (e.g. a potential\\nscientific law) to data involving multiple variables. Ordinary (least squares)\\nregression is not suitable for this because the estimated relationship will\\ndiffer according to which variable is chosen as being dependent, and the\\ndependent variable is unrealistically assumed to be the only variable which has\\nany measurement error (noise). We present a very general method for estimating\\na linear functional relationship between multiple noisy variables, which are\\ntreated impartially, i.e. no distinction between dependent and independent\\nvariables. The data are not assumed to follow any distribution, but all\\nvariables are treated as being equally reliable. Our approach extends the\\ngeometric mean functional relationship to multiple dimensions. This is\\nespecially useful with variables measured in different units, as it is\\nnaturally scale-invariant, whereas orthogonal regression is not. This is\\nbecause our approach is not based on minimizing distances, but on the symmetric\\nconcept of correlation. The estimated coefficients are easily obtained from the\\ncovariances or correlations, and correspond to geometric means of associated\\nleast squares coefficients. The ease of calculation will hopefully allow\\nwidespread application of impartial fitting to estimate relationships in a\\nneutral way.\",\"PeriodicalId\":501293,\"journal\":{\"name\":\"arXiv - ECON - Econometrics\",\"volume\":\"36 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - ECON - Econometrics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.02573\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - ECON - Econometrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.02573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

我们考虑的问题是将一种关系(如潜在的科学定律)拟合到涉及多个变量的数据中。普通(最小二乘)回归并不适合这一问题,因为估计的关系会因选择哪个变量作为因变量而不同,而且因变量被不切实际地假定为唯一存在测量误差(噪声)的变量。我们提出了一种非常通用的方法,用于估计多个噪声变量之间的线性函数关系。数据不假定服从任何分布,但所有变量都被视为同样可靠。我们的方法将几何平均数函数关系扩展到多个维度。这对于以不同单位测量的变量尤其有用,因为它天然是尺度不变的,而正交回归则不然。这是因为我们的方法不是基于距离最小化,而是基于相关性的对称概念。估计系数很容易从协方差或相关性中获得,并对应于相关最小二乘法系数的几何平均数。计算的简便性有望使公正拟合得到广泛应用,从而以中性的方式估算相关关系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Fitting an Equation to Data Impartially
We consider the problem of fitting a relationship (e.g. a potential scientific law) to data involving multiple variables. Ordinary (least squares) regression is not suitable for this because the estimated relationship will differ according to which variable is chosen as being dependent, and the dependent variable is unrealistically assumed to be the only variable which has any measurement error (noise). We present a very general method for estimating a linear functional relationship between multiple noisy variables, which are treated impartially, i.e. no distinction between dependent and independent variables. The data are not assumed to follow any distribution, but all variables are treated as being equally reliable. Our approach extends the geometric mean functional relationship to multiple dimensions. This is especially useful with variables measured in different units, as it is naturally scale-invariant, whereas orthogonal regression is not. This is because our approach is not based on minimizing distances, but on the symmetric concept of correlation. The estimated coefficients are easily obtained from the covariances or correlations, and correspond to geometric means of associated least squares coefficients. The ease of calculation will hopefully allow widespread application of impartial fitting to estimate relationships in a neutral way.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Simple robust two-stage estimation and inference for generalized impulse responses and multi-horizon causality GPT takes the SAT: Tracing changes in Test Difficulty and Math Performance of Students A Simple and Adaptive Confidence Interval when Nuisance Parameters Satisfy an Inequality Why you should also use OLS estimation of tail exponents On LASSO Inference for High Dimensional Predictive Regression
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1