首页 > 最新文献

Model Assisted Statistics and Applications最新文献

英文 中文
A validity test for a multivariate linear measurement error model 多元线性测量误差模型的有效性检验
Q4 Mathematics Pub Date : 2024-03-14 DOI: 10.3233/mas-231494
Alexander Kukush, Igor Mandel
A criterion is proposed for testing the hypothesis about the nature of the error variance in the dependent variable in a linear model, which separates correctly and incorrectly specified models. In the former one, only the measurement errors determine the variance (i.e., the dependent variable is correctly explained by the independent ones, up to measurement errors), while the latter model lacks some independent covariates (or has a nonlinear structure). The proposed MEMV (Measurement Error Model Validity) test checks the validity of the model when both dependent and independent covariates are measured with errors. The criterion has an asymptotic character, but numerical simulations outlined approximate boundaries where estimates make sense. A practical example of the test’s implementation is discussed in detail – it shows the test’s ability to detect wrong specifications even in seemingly perfect models. Estimations of the errors due to the omission of the variables in the model are provided in the simulation study. The relation between measurement errors and model specification has not been studied earlier, and the proposed criterion may stimulate future research in this important area.
为检验线性模型中因变量误差方差性质的假设提出了一个标准,该标准将正确和不正确的模型区分开来。在前者中,只有测量误差决定方差(即因变量由独立变量正确解释,直至测量误差),而后者模型缺乏一些独立协变量(或具有非线性结构)。所提出的 MEMV(测量误差模型有效性)检验是在因变量和独立协变量的测量都存在误差的情况下检验模型的有效性。该标准具有渐近特性,但数值模拟勾勒出了估算有意义的近似边界。本文详细讨论了实施该检验的一个实际例子--它显示了该检验即使在看似完美的模型中也能检测出错误的规格。在模拟研究中提供了因模型中遗漏变量而导致的误差估算。测量误差与模型规范之间的关系尚未得到深入研究,所提出的标准可能会促进未来在这一重要领域的研究。
{"title":"A validity test for a multivariate linear measurement error model","authors":"Alexander Kukush, Igor Mandel","doi":"10.3233/mas-231494","DOIUrl":"https://doi.org/10.3233/mas-231494","url":null,"abstract":"A criterion is proposed for testing the hypothesis about the nature of the error variance in the dependent variable in a linear model, which separates correctly and incorrectly specified models. In the former one, only the measurement errors determine the variance (i.e., the dependent variable is correctly explained by the independent ones, up to measurement errors), while the latter model lacks some independent covariates (or has a nonlinear structure). The proposed MEMV (Measurement Error Model Validity) test checks the validity of the model when both dependent and independent covariates are measured with errors. The criterion has an asymptotic character, but numerical simulations outlined approximate boundaries where estimates make sense. A practical example of the test’s implementation is discussed in detail – it shows the test’s ability to detect wrong specifications even in seemingly perfect models. Estimations of the errors due to the omission of the variables in the model are provided in the simulation study. The relation between measurement errors and model specification has not been studied earlier, and the proposed criterion may stimulate future research in this important area.","PeriodicalId":35000,"journal":{"name":"Model Assisted Statistics and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140242742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Random processes in modeling durability of building and structure 建筑物和结构耐久性建模中的随机过程
Q4 Mathematics Pub Date : 2024-03-14 DOI: 10.3233/mas-231489
Boris I. Vilge
The problem of buildings and structures durability during their long-term operation in aggressive environments is discussed. It is closely connected with the problem of destruction of composite materials as a random process of birth, development, and death of defects at chemical corrosion. The content of the study is to analyze the influence of non-stationary chemical reactions occurring in real conditions on these processes. The analysis is carried out taking into account the incidental physical phenomena affecting corrosion, the most important of which is the diffusion of initial substances and reaction products occurring under different hydrodynamic conditions, i.e., the nature of fluid motion and its pressure which stimulates corrosion. The synthesis of three processes – chemical kinetics, diffusion, and hydrodynamics – allows us to study one of the possible scenarios of corrosion process development as a Markov process based on the statistical theory of the “weakest” link and the joint distribution of the random material and geometric parameters. Numerical analysis of changes in elastic and electrical properties of composite materials under these conditions allows us to substantiate the technology of non-destructive control of changes in the state of the structure. The research results are based on mathematical and statistical modeling.
本文讨论了建筑物和结构在侵蚀性环境中长期运行的耐久性问题。该问题与复合材料的破坏问题密切相关,即在化学腐蚀过程中缺陷的产生、发展和死亡的随机过程。研究的内容是分析在实际条件下发生的非稳态化学反应对这些过程的影响。分析时考虑了影响腐蚀的附带物理现象,其中最重要的是在不同流体力学条件下发生的初始物质和反应产物的扩散,即流体运动的性质及其刺激腐蚀的压力。将化学动力学、扩散和流体力学这三个过程综合起来,我们就可以根据 "最薄弱 "环节的统计理论以及随机材料和几何参数的联合分布,将腐蚀过程发展的一种可能情况作为马尔可夫过程进行研究。通过对这些条件下复合材料的弹性和电特性变化进行数值分析,我们可以证实对结构状态变化进行无损控制的技术。研究成果以数学和统计建模为基础。
{"title":"Random processes in modeling durability of building and structure","authors":"Boris I. Vilge","doi":"10.3233/mas-231489","DOIUrl":"https://doi.org/10.3233/mas-231489","url":null,"abstract":"The problem of buildings and structures durability during their long-term operation in aggressive environments is discussed. It is closely connected with the problem of destruction of composite materials as a random process of birth, development, and death of defects at chemical corrosion. The content of the study is to analyze the influence of non-stationary chemical reactions occurring in real conditions on these processes. The analysis is carried out taking into account the incidental physical phenomena affecting corrosion, the most important of which is the diffusion of initial substances and reaction products occurring under different hydrodynamic conditions, i.e., the nature of fluid motion and its pressure which stimulates corrosion. The synthesis of three processes – chemical kinetics, diffusion, and hydrodynamics – allows us to study one of the possible scenarios of corrosion process development as a Markov process based on the statistical theory of the “weakest” link and the joint distribution of the random material and geometric parameters. Numerical analysis of changes in elastic and electrical properties of composite materials under these conditions allows us to substantiate the technology of non-destructive control of changes in the state of the structure. The research results are based on mathematical and statistical modeling.","PeriodicalId":35000,"journal":{"name":"Model Assisted Statistics and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140242177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple conformity tests to assess deviations from the Newcomb-Benford Law (NBL): A replication of Koch and Okamura (2020) 评估纽科姆-本福德定律(NBL)偏差的多重一致性检验:对 Koch 和 Okamura(2020 年)的复制
Q4 Mathematics Pub Date : 2024-03-14 DOI: 10.3233/mas-231459
Dalson Figueiredo, Lucas Silva
In this paper, we critically reevaluate Koch and Okamura’s (2020) conclusions on the conformity of Chinese COVID-19 data with Benford’s Law. Building on Figueiredo et al. (2022), we adopt a framework that combines multiple tests, including Chi-square, Kolmogorov-Smirnov, Euclidean Distance, Mean Absolute Deviation, Distortion Factor, and Mantissa Distribution. The primary rationale behind employing multiple tests is to enhance the robustness of our inference. The main finding of the study indicates that COVID-19 infections in China do not adhere to the distribution expected under Benford’s Law, nor does it align with the figures observed in the U.S. and Italy. The usefulness of deviations from Benford’s Law in detecting misreported or fraudulent data remains controversial. However, addressing this question requires a more careful statistical analysis than what is presented in the Koch and Okamura (2020) paper. By employing a combination of several tests using fully transparent procedures, we establish a more reliable approach to evaluating conformity to the Newcomb-Benford Law in applied research.
在本文中,我们对 Koch 和 Okamura(2020 年)关于中国 COVID-19 数据符合本福德定律的结论进行了批判性的重新评估。在 Figueiredo 等人(2022 年)的基础上,我们采用了一个结合多种检验方法的框架,包括 Chi-square、Kolmogorov-Smirnov、Euclidean Distance、Mean Absolute Deviation、Distortion Factor 和 Mantissa Distribution。采用多重检验的主要原因是为了增强推论的稳健性。研究的主要发现表明,中国的 COVID-19 感染并不符合本福德定律的预期分布,也不符合在美国和意大利观察到的数据。偏离本福德定律的数据是否有助于发现误报或欺诈性数据仍存在争议。然而,要解决这一问题,需要进行比 Koch 和 Okamura(2020 年)论文更细致的统计分析。通过结合使用完全透明的程序进行多项测试,我们建立了一种更可靠的方法来评估应用研究是否符合纽科姆-本福德定律。
{"title":"Multiple conformity tests to assess deviations from the Newcomb-Benford Law (NBL): A replication of Koch and Okamura (2020)","authors":"Dalson Figueiredo, Lucas Silva","doi":"10.3233/mas-231459","DOIUrl":"https://doi.org/10.3233/mas-231459","url":null,"abstract":"In this paper, we critically reevaluate Koch and Okamura’s (2020) conclusions on the conformity of Chinese COVID-19 data with Benford’s Law. Building on Figueiredo et al. (2022), we adopt a framework that combines multiple tests, including Chi-square, Kolmogorov-Smirnov, Euclidean Distance, Mean Absolute Deviation, Distortion Factor, and Mantissa Distribution. The primary rationale behind employing multiple tests is to enhance the robustness of our inference. The main finding of the study indicates that COVID-19 infections in China do not adhere to the distribution expected under Benford’s Law, nor does it align with the figures observed in the U.S. and Italy. The usefulness of deviations from Benford’s Law in detecting misreported or fraudulent data remains controversial. However, addressing this question requires a more careful statistical analysis than what is presented in the Koch and Okamura (2020) paper. By employing a combination of several tests using fully transparent procedures, we establish a more reliable approach to evaluating conformity to the Newcomb-Benford Law in applied research.","PeriodicalId":35000,"journal":{"name":"Model Assisted Statistics and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140242700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Fourier approach and testing for cointegration in LSTAR model 傅立叶方法和 LSTAR 模型的协整检验
Q4 Mathematics Pub Date : 2024-03-14 DOI: 10.3233/mas-231468
Gülşah Sedefoğlu, Burak Güriş
This paper proposes a new logistic smooth transition autoregressive (LSTAR) cointegration test by combining the Fourier function to catch the structural changes that occur over time and the LSTAR model to consider the nonlinearity. Logistic and exponential functions are the main transition functions defining the nonlinearity in STAR models since the exponential smooth transition autoregressive (ESTAR) and LSTAR models can explain the different structures of economic variables. The Fourier approach is a simple and effective way to model the structural changes in time series as an alternative to dummy variables. The most significant advantage of the method is that it does not require prior knowledge about the date, number of breaks, or forms. Monte Carlo simulation results show that the proposed test has good size and power properties for different sample sizes and parameters. The results also revealed that the power performance of the proposed test and the Fourier ESTAR test offered by Güriş and Sedefoğlu (2022) are close to each other. The steps of the Fourier-based tests are illustrated by providing an empirical example of testing the validity of the purchasing power parity (PPP) hypothesis in Türkiye.
本文提出了一种新的逻辑平稳过渡自回归(LSTAR)协整检验方法,将捕捉随时间发生的结构变化的傅立叶函数与考虑非线性的 LSTAR 模型相结合。由于指数平稳过渡自回归模型(ESTAR)和 LSTAR 模型可以解释经济变量的不同结构,因此对数函数和指数函数是 STAR 模型中定义非线性的主要过渡函数。傅立叶方法是建立时间序列结构变化模型的一种简单而有效的方法,可以替代虚拟变量。该方法的最大优点是不需要事先了解日期、中断次数或形式。蒙特卡罗模拟结果表明,对于不同的样本量和参数,所提出的检验具有良好的规模和功率特性。结果还显示,拟议检验的功率性能与 Güriş 和 Sedefoğlu (2022 年)提供的傅立叶ESTAR 检验接近。通过提供一个测试土耳其购买力平价假设有效性的实证例子,说明了基于傅立叶检验的步骤。
{"title":"The Fourier approach and testing for cointegration in LSTAR model","authors":"Gülşah Sedefoğlu, Burak Güriş","doi":"10.3233/mas-231468","DOIUrl":"https://doi.org/10.3233/mas-231468","url":null,"abstract":"This paper proposes a new logistic smooth transition autoregressive (LSTAR) cointegration test by combining the Fourier function to catch the structural changes that occur over time and the LSTAR model to consider the nonlinearity. Logistic and exponential functions are the main transition functions defining the nonlinearity in STAR models since the exponential smooth transition autoregressive (ESTAR) and LSTAR models can explain the different structures of economic variables. The Fourier approach is a simple and effective way to model the structural changes in time series as an alternative to dummy variables. The most significant advantage of the method is that it does not require prior knowledge about the date, number of breaks, or forms. Monte Carlo simulation results show that the proposed test has good size and power properties for different sample sizes and parameters. The results also revealed that the power performance of the proposed test and the Fourier ESTAR test offered by Güriş and Sedefoğlu (2022) are close to each other. The steps of the Fourier-based tests are illustrated by providing an empirical example of testing the validity of the purchasing power parity (PPP) hypothesis in Türkiye.","PeriodicalId":35000,"journal":{"name":"Model Assisted Statistics and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140244185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parametric test based on the bootstrapping approach for the MANOVA under a Behrens-Fisher problem 贝伦斯-费舍尔问题下基于引导方法的 MANOVA 参数检验
Q4 Mathematics Pub Date : 2024-03-14 DOI: 10.3233/mas-231449
Jatsada Singthongchai, Noppakun Thongmual, Nirun Nitisuk
This article presents a comparison of multivariate normal mean vectors under covariance positive definite matrices. We introduce an improved parametric bootstrap (IPB) approach for addressing the multivariate Behrens-Fisher problem, specifically focusing on cases with unequal covariance matrices. Additionally, we evaluate the performance of the IPB test by comparing it with three existing tests: the parametric bootstrap (PB) test, the generalized variable (GV) test, and the Johansen test. Through Monte Carlo simulation, our results demonstrate that both the IPB test and the PB test exhibit superior control over Type I error rates compared to the GV and Johansen tests. Notably, the IPB test outperforms the PB test in terms of controlling Type I error rates. Consequently, our study concludes that the IPB test represents a preferred statistical method for testing the equality of mean vectors in the multivariate Behrens-Fisher problem.
本文比较了协方差正定矩阵下的多元正态均值向量。我们引入了一种改进的参数自举(IPB)方法来解决多变量贝伦斯-费舍尔问题,特别关注协方差矩阵不相等的情况。此外,我们还将 IPB 检验与参数自举(PB)检验、广义变量(GV)检验和 Johansen 检验这三种现有检验进行了比较,从而评估了 IPB 检验的性能。通过蒙特卡罗模拟,我们的结果表明,与 GV 和 Johansen 检验相比,IPB 检验和 PB 检验都能更好地控制 I 类错误率。值得注意的是,IPB 检验在控制 I 类错误率方面优于 PB 检验。因此,我们的研究得出结论,IPB 检验是检验多元 Behrens-Fisher 问题中均值向量相等性的首选统计方法。
{"title":"Parametric test based on the bootstrapping approach for the MANOVA under a Behrens-Fisher problem","authors":"Jatsada Singthongchai, Noppakun Thongmual, Nirun Nitisuk","doi":"10.3233/mas-231449","DOIUrl":"https://doi.org/10.3233/mas-231449","url":null,"abstract":"This article presents a comparison of multivariate normal mean vectors under covariance positive definite matrices. We introduce an improved parametric bootstrap (IPB) approach for addressing the multivariate Behrens-Fisher problem, specifically focusing on cases with unequal covariance matrices. Additionally, we evaluate the performance of the IPB test by comparing it with three existing tests: the parametric bootstrap (PB) test, the generalized variable (GV) test, and the Johansen test. Through Monte Carlo simulation, our results demonstrate that both the IPB test and the PB test exhibit superior control over Type I error rates compared to the GV and Johansen tests. Notably, the IPB test outperforms the PB test in terms of controlling Type I error rates. Consequently, our study concludes that the IPB test represents a preferred statistical method for testing the equality of mean vectors in the multivariate Behrens-Fisher problem.","PeriodicalId":35000,"journal":{"name":"Model Assisted Statistics and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140244709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On two new modified tawn copulas 关于两个新的修正陶氏协方
Q4 Mathematics Pub Date : 2024-03-14 DOI: 10.3233/mas-231451
Christophe Chesneau
At its core, copula theory focuses on constructing a copula function, which characterizes the structure of dependence between random variables. In particular, the creation of extreme value copulas is crucial because they allow accurate modeling of extreme dependence that traditional copulas can ignore. In this article, we propose theoretical developments on this subject by proposing two new extreme value copulas. They aim to extend the functionalities of the so-called Tawn copula. This is of interest because the Tawn copula is known to be a powerful tool in modeling joint distributions, particularly in capturing asymmetric and upper tail dependences, making it valuable for analyzing extreme events and tail risk. The proposed copulas are designed to go beyond these attractive features. On the mathematical side, they are derived from new Pickands dependence functions; one modifies the Pickands dependence function of the Tawn copula by using a polynomial-exponential function, and the other does the same but by introducing a power function. The proofs are based on diverse differentiation, arrangement, and inequality techniques. Overall, the created copulas are attractive because (i) they possess modulable levels of asymmetry, (ii) they depend on several tuning parameters, making them very flexible in terms of upper tail dependence in particular, and (iii) they benefit from interesting correlation ranges of values. Several figures and value tables support the theoretical findings.
协整理论的核心是构建协整函数,它描述了随机变量之间的依赖结构。尤其是极值协方函数的建立至关重要,因为它们可以准确地模拟传统协方函数无法忽略的极端依赖性。在本文中,我们提出了两个新的极值协方差,从而对这一主题进行了理论上的发展。它们旨在扩展所谓的 Tawn 协方差的功能。众所周知,Tawn copula 是联合分布建模的有力工具,尤其是在捕捉非对称和上尾部依赖性方面,这使其在分析极端事件和尾部风险方面具有重要价值。所提出的 copulas 在设计上超越了这些吸引人的特点。在数学方面,它们是从新的 Pickands 依赖函数中推导出来的;其中一个通过使用多项式-指数函数修改了 Tawn copula 的 Pickands 依赖函数,另一个通过引入幂函数做了同样的修改。证明基于不同的微分、排列和不等式技术。总的来说,所创建的共线方程很有吸引力,因为:(i) 它们具有可调节的不对称程度;(ii) 它们依赖于多个调整参数,特别是在上尾依赖性方面非常灵活;(iii) 它们受益于有趣的相关值范围。一些数字和数值表支持这些理论发现。
{"title":"On two new modified tawn copulas","authors":"Christophe Chesneau","doi":"10.3233/mas-231451","DOIUrl":"https://doi.org/10.3233/mas-231451","url":null,"abstract":"At its core, copula theory focuses on constructing a copula function, which characterizes the structure of dependence between random variables. In particular, the creation of extreme value copulas is crucial because they allow accurate modeling of extreme dependence that traditional copulas can ignore. In this article, we propose theoretical developments on this subject by proposing two new extreme value copulas. They aim to extend the functionalities of the so-called Tawn copula. This is of interest because the Tawn copula is known to be a powerful tool in modeling joint distributions, particularly in capturing asymmetric and upper tail dependences, making it valuable for analyzing extreme events and tail risk. The proposed copulas are designed to go beyond these attractive features. On the mathematical side, they are derived from new Pickands dependence functions; one modifies the Pickands dependence function of the Tawn copula by using a polynomial-exponential function, and the other does the same but by introducing a power function. The proofs are based on diverse differentiation, arrangement, and inequality techniques. Overall, the created copulas are attractive because (i) they possess modulable levels of asymmetry, (ii) they depend on several tuning parameters, making them very flexible in terms of upper tail dependence in particular, and (iii) they benefit from interesting correlation ranges of values. Several figures and value tables support the theoretical findings.","PeriodicalId":35000,"journal":{"name":"Model Assisted Statistics and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140243916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification of solid waste generation areas in the greater accra region using machine learning algorithms 使用机器学习算法对大阿克拉地区的固体废物产生区域进行分类
Q4 Mathematics Pub Date : 2023-12-27 DOI: 10.3233/mas-231440
C. Chapman-Wardy, Eric Ocran, Samuel Iddi, L. Asiedu
Solid waste management has become a challenge for developing countries mainly because of surging economic activities, rapid urbanisation and rise in community living standards. Many researchers have identified its related problems and have recommended solutions while others have established models to forecast the amount of solid waste generated over a period. However, an efficient and effective management of solid waste requires adequate categorisation of solid waste generation areas to aid in the provision of area-specific or targeted solutions for each categorised area. In this study, we used primary data on some important socio-demographic variables (household size, house type, predominant religion of household, age and educational level of household head, residency type household waste disposal method, frequency of waste collection etc) and the amount of solid waste generated from 2102 households in Greater Accra Region, Ghana. We assessed the classification performances of a traditional statistical classifiers and some selected machine learning algorithms in classifying the surveyed areas in Greater Accra into low, medium, and high solid waste generation areas. The Support Vector Machine with the Cubic Kernel was found to be the best performing classifier with a Specificity of 86%, Sensitivity, Precision and Accuracy of 73% and Area under the curve (AUC) of 0.90. The Support Vector Machine with the Cubic Kernel is therefore recommended as a suitable algorithm for the categorisation of solid waste generation areas. Stakeholders responsible for solid waste management could leverage on the evidence from this study to categorise their waste generation areas and to proffer targeted community-based interventions.
固体废物管理已成为发展中国家面临的一项挑战,这主要是因为经济活动激增、城市化进程加快以及社区生活水平提高。许多研究人员发现了相关问题并提出了解决方案,还有一些研究人员建立了模型来预测一段时间内产生的固体废物量。然而,要对固体废物进行高效和有效的管理,就必须对固体废物产生区域进行适当分类,以便为每个分类区域提供特定或有针对性的解决方案。在这项研究中,我们使用了加纳大阿克拉地区 2102 个家庭的一些重要社会人口变量(家庭规模、房屋类型、家庭主要宗教信仰、户主年龄和教育水平、居住类型、家庭废物处理方法、废物收集频率等)和固体废物产生量的原始数据。我们评估了传统统计分类器和一些选定的机器学习算法在将大阿克拉地区的调查区域分为低、中和高固体废物产生区域时的分类性能。结果发现,带有立方核的支持向量机是性能最好的分类器,其特异性为 86%,灵敏度、精确度和准确度均为 73%,曲线下面积 (AUC) 为 0.90。因此,建议将带有立方核的支持向量机作为固体废物产生区域分类的合适算法。负责固体废物管理的利益相关者可以利用本研究的证据对其废物产生区域进行分类,并提出有针对性的社区干预措施。
{"title":"Classification of solid waste generation areas in the greater accra region using machine learning algorithms","authors":"C. Chapman-Wardy, Eric Ocran, Samuel Iddi, L. Asiedu","doi":"10.3233/mas-231440","DOIUrl":"https://doi.org/10.3233/mas-231440","url":null,"abstract":"Solid waste management has become a challenge for developing countries mainly because of surging economic activities, rapid urbanisation and rise in community living standards. Many researchers have identified its related problems and have recommended solutions while others have established models to forecast the amount of solid waste generated over a period. However, an efficient and effective management of solid waste requires adequate categorisation of solid waste generation areas to aid in the provision of area-specific or targeted solutions for each categorised area. In this study, we used primary data on some important socio-demographic variables (household size, house type, predominant religion of household, age and educational level of household head, residency type household waste disposal method, frequency of waste collection etc) and the amount of solid waste generated from 2102 households in Greater Accra Region, Ghana. We assessed the classification performances of a traditional statistical classifiers and some selected machine learning algorithms in classifying the surveyed areas in Greater Accra into low, medium, and high solid waste generation areas. The Support Vector Machine with the Cubic Kernel was found to be the best performing classifier with a Specificity of 86%, Sensitivity, Precision and Accuracy of 73% and Area under the curve (AUC) of 0.90. The Support Vector Machine with the Cubic Kernel is therefore recommended as a suitable algorithm for the categorisation of solid waste generation areas. Stakeholders responsible for solid waste management could leverage on the evidence from this study to categorise their waste generation areas and to proffer targeted community-based interventions.","PeriodicalId":35000,"journal":{"name":"Model Assisted Statistics and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139153193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian inference of financial networks 金融网络的贝叶斯推断
Q4 Mathematics Pub Date : 2023-12-27 DOI: 10.3233/mas-231456
Juan Sosa, Brenda Betancourt
Network data arises naturally in a wide variety of applications in different fields. In this article we discuss in detail the statistical modeling of financial networks. The structure of such networks red has not been studied thoroughly in the past, mainly due to limited accessible data. We explore the structure of a real trading network corresponding to transactions within the natural gas future market over a four-year period. The detection of meaningful communities of actors within networks is particularly relevant to understand the topology of a complex system like this. We explore the usage of stochastic block models in conjunction with a nonparametric Bayesian approach in order to identify clusters of traders in a flexible modeling framework. Our findings strongly indicate that the proposed models are highly reliable at detecting community structures.
网络数据自然出现在不同领域的各种应用中。本文将详细讨论金融网络的统计建模。主要由于可获得的数据有限,过去对此类网络结构的研究并不深入。我们探讨了一个真实交易网络的结构,该网络与天然气期货市场四年内的交易相对应。在网络中检测有意义的行动者群体对于了解这样一个复杂系统的拓扑结构尤为重要。我们探索了随机块模型与非参数贝叶斯方法的结合使用,以便在一个灵活的建模框架内识别交易者集群。我们的研究结果有力地表明,所提出的模型在检测群体结构方面非常可靠。
{"title":"Bayesian inference of financial networks","authors":"Juan Sosa, Brenda Betancourt","doi":"10.3233/mas-231456","DOIUrl":"https://doi.org/10.3233/mas-231456","url":null,"abstract":"Network data arises naturally in a wide variety of applications in different fields. In this article we discuss in detail the statistical modeling of financial networks. The structure of such networks red has not been studied thoroughly in the past, mainly due to limited accessible data. We explore the structure of a real trading network corresponding to transactions within the natural gas future market over a four-year period. The detection of meaningful communities of actors within networks is particularly relevant to understand the topology of a complex system like this. We explore the usage of stochastic block models in conjunction with a nonparametric Bayesian approach in order to identify clusters of traders in a flexible modeling framework. Our findings strongly indicate that the proposed models are highly reliable at detecting community structures.","PeriodicalId":35000,"journal":{"name":"Model Assisted Statistics and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139154694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tests for skewness parameter of skew log Laplace distribution 倾斜对数拉普拉斯分布的倾斜参数测试
Q4 Mathematics Pub Date : 2023-12-27 DOI: 10.3233/mas-221423
Pradnya P. Khandeparkar, Vaijayanti Dixit
Laplace probability density function with additional shape parameter that regulates the degree of skewness is a skew Laplace distribution. The various forms of skew Laplace distribution are found in the literature, the distributions defined by Mc Gill (1962), Holla and Bhattacharya (1968), Lingappaiah (1988), Fernandez and Steel (1998). The skew log Laplace distribution is the probability distribution of a random variable whose logarithm follows a skew Laplace distribution. In this paper, the classical optimum tests for skewness parameter of skew log Laplace distribution (SLLD) derived from Lingappaiah (1988) distribution are discussed. Uniformly most powerful test, uniformly most powerful unbiased test and Wald’s sequential probability ratio test for skewness parameter are compared. The exact likelihood ratio test and Neyman structure test for testing skewness parameter when scale parameter is known are derived. Finally, the underreported income of Road Transport Company is analysed on the basis of the tests derived in this paper.
拉普拉斯概率密度函数带有调节偏斜程度的附加形状参数,这就是偏斜拉普拉斯分布。偏斜拉普拉斯分布的形式多种多样,文献中有 Mc Gill (1962)、Holla 和 Bhattacharya (1968)、Lingappaiah (1988)、Fernandez 和 Steel (1998) 所定义的分布。偏斜对数拉普拉斯分布是指对数服从偏斜拉普拉斯分布的随机变量的概率分布。本文讨论了由 Lingappaiah(1988)分布导出的偏斜对拉普拉斯分布(SLLD)偏斜参数的经典最优检验。比较了斜度参数的均匀最强检验、均匀最强无偏检验和 Wald 连续似然比检验。得出了在尺度参数已知的情况下检验偏度参数的精确似然比检验和奈曼结构检验。最后,根据本文得出的检验方法对道路运输公司的少报收入进行了分析。
{"title":"Tests for skewness parameter of skew log Laplace distribution","authors":"Pradnya P. Khandeparkar, Vaijayanti Dixit","doi":"10.3233/mas-221423","DOIUrl":"https://doi.org/10.3233/mas-221423","url":null,"abstract":"Laplace probability density function with additional shape parameter that regulates the degree of skewness is a skew Laplace distribution. The various forms of skew Laplace distribution are found in the literature, the distributions defined by Mc Gill (1962), Holla and Bhattacharya (1968), Lingappaiah (1988), Fernandez and Steel (1998). The skew log Laplace distribution is the probability distribution of a random variable whose logarithm follows a skew Laplace distribution. In this paper, the classical optimum tests for skewness parameter of skew log Laplace distribution (SLLD) derived from Lingappaiah (1988) distribution are discussed. Uniformly most powerful test, uniformly most powerful unbiased test and Wald’s sequential probability ratio test for skewness parameter are compared. The exact likelihood ratio test and Neyman structure test for testing skewness parameter when scale parameter is known are derived. Finally, the underreported income of Road Transport Company is analysed on the basis of the tests derived in this paper.","PeriodicalId":35000,"journal":{"name":"Model Assisted Statistics and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139154553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Maximum likelihood estimation of a change point for Poisson distributed data 泊松分布数据变化点的最大似然估计
Q4 Mathematics Pub Date : 2023-12-27 DOI: 10.3233/mas-231448
Alex Paparas, S. Fotopoulos, V. Jandhyala, Dimitris Paparas
In this study we develop a change point methodology to identify and estimate changes in the parameter of a Poisson distribution. The proposed methodology considers the case when the Poisson parameter changes abruptly at an unknown point of time. For this case, the maximum likelihood estimate of the change point and its asymptotic distribution are pursued. Mainly, we carry out a large scale simulation study for evaluating the appropriateness of the asymptotic distribution of the mle from the view point of finite samples, and also for evaluating the closeness under known and unknown parameters. The simulations study also compares the mle with that of a Bayesian estimate. Then, the methodology is applied to three examples. First, we uncover changes in the number of homicides in California using monthly data from January 2002 until December 2020. Secondly, data about deaths of females caused by stomach cancer is considered to detect possible changes in the numbers recorded from 1930 to 2011. Thirdly, British coal mining disasters from 1851 to 1962 in which more than 10 men were killed are analyzed.
在本研究中,我们开发了一种变化点方法,用于识别和估计泊松分布参数的变化。所提出的方法考虑了泊松参数在未知时间点突然变化的情况。对于这种情况,我们追求变化点的最大似然估计及其渐近分布。我们主要进行了大规模的模拟研究,从有限样本的角度评估了 mle 的渐近分布的适当性,并评估了在已知和未知参数下的接近性。模拟研究还将 mle 与贝叶斯估计进行了比较。然后,将该方法应用于三个实例。首先,我们利用 2002 年 1 月至 2020 年 12 月的月度数据,揭示了加利福尼亚州凶杀案数量的变化。其次,我们考虑了胃癌导致的女性死亡数据,以发现从 1930 年到 2011 年记录的数字可能发生的变化。第三,分析了 1851 年至 1962 年期间英国发生的死亡人数超过 10 人的煤矿灾难。
{"title":"Maximum likelihood estimation of a change point for Poisson distributed data","authors":"Alex Paparas, S. Fotopoulos, V. Jandhyala, Dimitris Paparas","doi":"10.3233/mas-231448","DOIUrl":"https://doi.org/10.3233/mas-231448","url":null,"abstract":"In this study we develop a change point methodology to identify and estimate changes in the parameter of a Poisson distribution. The proposed methodology considers the case when the Poisson parameter changes abruptly at an unknown point of time. For this case, the maximum likelihood estimate of the change point and its asymptotic distribution are pursued. Mainly, we carry out a large scale simulation study for evaluating the appropriateness of the asymptotic distribution of the mle from the view point of finite samples, and also for evaluating the closeness under known and unknown parameters. The simulations study also compares the mle with that of a Bayesian estimate. Then, the methodology is applied to three examples. First, we uncover changes in the number of homicides in California using monthly data from January 2002 until December 2020. Secondly, data about deaths of females caused by stomach cancer is considered to detect possible changes in the numbers recorded from 1930 to 2011. Thirdly, British coal mining disasters from 1851 to 1962 in which more than 10 men were killed are analyzed.","PeriodicalId":35000,"journal":{"name":"Model Assisted Statistics and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139153237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Model Assisted Statistics and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1