首页 > 最新文献

SIAM undergraduate research online最新文献

英文 中文
Opinion Dynamics with Slowly Evolving Zealot Populations 与缓慢进化的狂热人群的意见动态
Pub Date : 2023-01-01 DOI: 10.1137/22s1515306
Ashlyn DeGroot, E. Schmidt
. We introduce and analyze a model for opinion dynamics comprised of nonlinear ODEs. The variables are the proportion of moderates in the population who hold opinion A, the proportion of zealots who hold opinion A, and the proportion of zealots who hold opinion B (not A). The zealots are willing to change their opinion at a much slower rate than the moderates. Our model takes into account such things as the inherent attractiveness of one opinion over the other, the indoctrination of moderates by the zealots, and deradicalization of the zealots by the moderates. A combination of theoretical and numerical analysis shows there are many different types of asymptotic configurations of the population. Many of these correspond to critical points of the system. The most intriguing finding is that if both A and B are roughly equally attractive, and the rate of indoctrination is roughly equal to the rate of deradicalization, then there will be a stable periodic orbit. The dynamics of this orbit show that a precursor to an opinion being dominant is that the proportion of zealots for the opinion must first grow to some critical value. Moreover, when the periodic orbit exists, there are no other solutions which allow for coexistence between the two opinions.
. 介绍并分析了一个由非线性ode组成的意见动态模型。变量是人口中持意见A的温和派的比例,持意见A的狂热者的比例,以及持意见B(不是A)的狂热者的比例。狂热者愿意以比温和派慢得多的速度改变他们的意见。我们的模型考虑了一些因素,比如一种观点相对于另一种观点的内在吸引力,狂热者对温和派的灌输,以及温和派对狂热者的去激进化。理论分析和数值分析相结合表明,总体有许多不同类型的渐近构型。其中许多都对应于系统的临界点。最有趣的发现是,如果A和B的吸引力大致相同,灌输的速度大致等于去极端化的速度,那么就会有一个稳定的周期轨道。这一轨道的动态表明,一种意见占主导地位的前兆是,这种意见的狂热者的比例必须首先增长到某个临界值。此外,当周期轨道存在时,不存在允许两种观点共存的其他解。
{"title":"Opinion Dynamics with Slowly Evolving Zealot Populations","authors":"Ashlyn DeGroot, E. Schmidt","doi":"10.1137/22s1515306","DOIUrl":"https://doi.org/10.1137/22s1515306","url":null,"abstract":". We introduce and analyze a model for opinion dynamics comprised of nonlinear ODEs. The variables are the proportion of moderates in the population who hold opinion A, the proportion of zealots who hold opinion A, and the proportion of zealots who hold opinion B (not A). The zealots are willing to change their opinion at a much slower rate than the moderates. Our model takes into account such things as the inherent attractiveness of one opinion over the other, the indoctrination of moderates by the zealots, and deradicalization of the zealots by the moderates. A combination of theoretical and numerical analysis shows there are many different types of asymptotic configurations of the population. Many of these correspond to critical points of the system. The most intriguing finding is that if both A and B are roughly equally attractive, and the rate of indoctrination is roughly equal to the rate of deradicalization, then there will be a stable periodic orbit. The dynamics of this orbit show that a precursor to an opinion being dominant is that the proportion of zealots for the opinion must first grow to some critical value. Moreover, when the periodic orbit exists, there are no other solutions which allow for coexistence between the two opinions.","PeriodicalId":93373,"journal":{"name":"SIAM undergraduate research online","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64317464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Malaria Early Warning Application for Individual Risk Assessment 疟疾预警应用于个人风险评估
Pub Date : 2023-01-01 DOI: 10.1137/23s154875x
Janiah Kyle, Sagar Sadak, Cayden Goeringer
As one of the oldest known diseases to inflict humanity (since the Agricultural Revolution about 12,000 years ago), malaria has proven to be a significant global challenge. Many intervention strategies have been undertaken in the last few decades such as widespread insecticide-treated bed nets (ITN), long-lasting insecticidal nets (LLIN) and indoor residual spraying (IRS). Yet even with great success, malaria continues to be a ravaging disease requiring inventive solutions. In this study, we develop a malaria early warning system, which utilizes an adapted Ross-MacDonald model to assess individual risk and disease epidemiology. Strategies for achieving a disease-free equilibrium state are also shown by performing local asymptotic stability analysis. It is important to note that the stages of the mosquito life-cycle are highly influenced by weather conditions, both in the aquatic and adult stages, as well as by the use of insecticides (either through ITN/LLIN use or via IRS). Therefore, we consider regional data parameters, such as weather conditions, parasite rate and resistance, to estimate deviated risk from the baseline, with the final product being a progressive web application (i.e. a web and mobile app). Such a product has widespread application primarily in holoendemic areas in Africa to inform both native and tourist populations of their relative risk of contracting malaria.
{"title":"Malaria Early Warning Application for Individual Risk Assessment","authors":"Janiah Kyle, Sagar Sadak, Cayden Goeringer","doi":"10.1137/23s154875x","DOIUrl":"https://doi.org/10.1137/23s154875x","url":null,"abstract":"As one of the oldest known diseases to inflict humanity (since the Agricultural Revolution about 12,000 years ago), malaria has proven to be a significant global challenge. Many intervention strategies have been undertaken in the last few decades such as widespread insecticide-treated bed nets (ITN), long-lasting insecticidal nets (LLIN) and indoor residual spraying (IRS). Yet even with great success, malaria continues to be a ravaging disease requiring inventive solutions. In this study, we develop a malaria early warning system, which utilizes an adapted Ross-MacDonald model to assess individual risk and disease epidemiology. Strategies for achieving a disease-free equilibrium state are also shown by performing local asymptotic stability analysis. It is important to note that the stages of the mosquito life-cycle are highly influenced by weather conditions, both in the aquatic and adult stages, as well as by the use of insecticides (either through ITN/LLIN use or via IRS). Therefore, we consider regional data parameters, such as weather conditions, parasite rate and resistance, to estimate deviated risk from the baseline, with the final product being a progressive web application (i.e. a web and mobile app). Such a product has widespread application primarily in holoendemic areas in Africa to inform both native and tourist populations of their relative risk of contracting malaria.","PeriodicalId":93373,"journal":{"name":"SIAM undergraduate research online","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135507033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding a Measure for Synchrony: Spike Time Tiling Coefficient Method 理解同步度量:尖峰时间平铺系数法
Pub Date : 2023-01-01 DOI: 10.1137/23s1576104
Kevin Li, Evan Huang, Bill Sun
Synchrony is an important feature of brain activities for the coordination of neural information and is also related to some neuronal disorders. Around 40 different measures have been proposed in literature for quantifying the synchrony of spike trains and the list is still growing. The main issue is that it is not clear to users which one to use and how measurements correspond to different features of synchrony. In this work, instead of looking at all methods at once, we focus on investigating one of the popular measures in the field of neuroscience: Spike Time Tiling Coefficient (STTC) proposed by Cutts and Eglen in 2014. We simulate three scenarios of neural spike trains and study how STTC values depend on distributions and phase shifts of spike trains. Firstly, we study pairs of simple periodic binary time series. We derive an analytical formula showing that the dependence of the STTC value on the phase shift is symmetric and has a general trend where the maximum value of STTC occurs when the phase shift is zero and the minimum value occurs when the phase shift is half of the period. Secondly, we investigate pairs of “periodic” normally distributed spike trains. While we observe the similar trends shown in the first scenario, we notice an exception. We also observe a general trend where the STTC value decreases as the standard deviation of the normal distribution increases. Thirdly, we study pairs of Poisson distributed spike trains. Using properties of the Poisson distribution, we generate pairs of Poisson distributed spike trains with certain overlap ratios and study the relationship between STTC and the overlap ratio. In general, this relationship is nonlinear. We observe that as the synchronicity window decreases towards zero, this nonlinear relationship tends toward a linear relationship. We derive analytical formulas to describe this nonlinear relationship and quantitatively evaluate its closeness to a linear relationship as the syn-chronicity window decreases towards zero. Through studying STTC, we notice that when the synchronicity window is too large, the problem of dividing by zero occurs in the calculation of STTC. To avoid such a problem, we derive an upper bound for the synchronicity window. We also argue that STTC can only approach − 1, and show a case to demonstrate this argument
{"title":"Understanding a Measure for Synchrony: Spike Time Tiling Coefficient Method","authors":"Kevin Li, Evan Huang, Bill Sun","doi":"10.1137/23s1576104","DOIUrl":"https://doi.org/10.1137/23s1576104","url":null,"abstract":"Synchrony is an important feature of brain activities for the coordination of neural information and is also related to some neuronal disorders. Around 40 different measures have been proposed in literature for quantifying the synchrony of spike trains and the list is still growing. The main issue is that it is not clear to users which one to use and how measurements correspond to different features of synchrony. In this work, instead of looking at all methods at once, we focus on investigating one of the popular measures in the field of neuroscience: Spike Time Tiling Coefficient (STTC) proposed by Cutts and Eglen in 2014. We simulate three scenarios of neural spike trains and study how STTC values depend on distributions and phase shifts of spike trains. Firstly, we study pairs of simple periodic binary time series. We derive an analytical formula showing that the dependence of the STTC value on the phase shift is symmetric and has a general trend where the maximum value of STTC occurs when the phase shift is zero and the minimum value occurs when the phase shift is half of the period. Secondly, we investigate pairs of “periodic” normally distributed spike trains. While we observe the similar trends shown in the first scenario, we notice an exception. We also observe a general trend where the STTC value decreases as the standard deviation of the normal distribution increases. Thirdly, we study pairs of Poisson distributed spike trains. Using properties of the Poisson distribution, we generate pairs of Poisson distributed spike trains with certain overlap ratios and study the relationship between STTC and the overlap ratio. In general, this relationship is nonlinear. We observe that as the synchronicity window decreases towards zero, this nonlinear relationship tends toward a linear relationship. We derive analytical formulas to describe this nonlinear relationship and quantitatively evaluate its closeness to a linear relationship as the syn-chronicity window decreases towards zero. Through studying STTC, we notice that when the synchronicity window is too large, the problem of dividing by zero occurs in the calculation of STTC. To avoid such a problem, we derive an upper bound for the synchronicity window. We also argue that STTC can only approach − 1, and show a case to demonstrate this argument","PeriodicalId":93373,"journal":{"name":"SIAM undergraduate research online","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135310699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Symmetry and Free Boundary Points in a Class of Linear Ordinary Differential Equations 一类线性常微分方程的对称和自由边界点
Pub Date : 2023-01-01 DOI: 10.1137/21s1454110
F. Jiang, Zhengyang Guo, Dong'ang Liu, Yanghao Wang
Project
项目
{"title":"Symmetry and Free Boundary Points in a Class of Linear Ordinary Differential Equations","authors":"F. Jiang, Zhengyang Guo, Dong'ang Liu, Yanghao Wang","doi":"10.1137/21s1454110","DOIUrl":"https://doi.org/10.1137/21s1454110","url":null,"abstract":"Project","PeriodicalId":93373,"journal":{"name":"SIAM undergraduate research online","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64315921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Numerical Analysis of Crowding Effects in Competing Species 竞争物种拥挤效应的数值分析
Pub Date : 2023-01-01 DOI: 10.1137/22s151042x
B. Carlson
. In recent decades, scientists have observed that the mortality rate of some competing species increases superlinearly as populations grow to unsustainable levels. This is modeled by terms representing crowding effects in a system of nonlinear differential equations that describes population growth of two species competing for resources under the effects of crowding. After applying nondimensionalization to reduce parameters in the system, the stability of the steady state solutions of the system is examined. A semi-implicit numerical scheme is proposed which guarantees the positivity of the solutions. The long term behavior of the numerical solutions is studied. The error estimate between the numerical solution and the true solution is given.
. 近几十年来,科学家们观察到,随着种群增长到不可持续的水平,一些竞争物种的死亡率呈超线性增长。这是通过在非线性微分方程系统中表示拥挤效应的术语来建模的,该系统描述了在拥挤效应下两个物种竞争资源的种群增长。采用无量纲化方法对系统进行参数化简,验证了系统稳态解的稳定性。提出了一种保证解正的半隐式数值格式。研究了数值解的长期特性。给出了数值解与真解之间的误差估计。
{"title":"Numerical Analysis of Crowding Effects in Competing Species","authors":"B. Carlson","doi":"10.1137/22s151042x","DOIUrl":"https://doi.org/10.1137/22s151042x","url":null,"abstract":". In recent decades, scientists have observed that the mortality rate of some competing species increases superlinearly as populations grow to unsustainable levels. This is modeled by terms representing crowding effects in a system of nonlinear differential equations that describes population growth of two species competing for resources under the effects of crowding. After applying nondimensionalization to reduce parameters in the system, the stability of the steady state solutions of the system is examined. A semi-implicit numerical scheme is proposed which guarantees the positivity of the solutions. The long term behavior of the numerical solutions is studied. The error estimate between the numerical solution and the true solution is given.","PeriodicalId":93373,"journal":{"name":"SIAM undergraduate research online","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64317385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implementation of the Boneh-Franklin IBE Scheme 实施Boneh-Franklin IBE计划
Pub Date : 2023-01-01 DOI: 10.1137/22s1532901
Florence Lam
In this paper and accompanying software, we give a fully functional implementation of the Boneh-Franklin Identity-Based Encryption (IBE) scheme using the Weil pairing, which runs efficiently even with primes of cryptographic size. We describe the conceptual framework of the IBE, give background on the Weil pairing. Further, we discuss the challenges in the process of creating a functional implementation, and how we overcame them. The reader is encouraged to experiment with the accompanying software, which is written in SageMath.
在本文和附带的软件中,我们给出了使用Weil配对的Boneh-Franklin基于身份的加密(IBE)方案的全功能实现,该方案即使在加密大小的素数下也能有效运行。我们描述了IBE的概念框架,给出了Weil配对的背景。此外,我们还讨论了创建功能实现过程中的挑战,以及我们如何克服它们。我们鼓励读者试用附带的软件,该软件是用SageMath编写的。
{"title":"Implementation of the Boneh-Franklin IBE Scheme","authors":"Florence Lam","doi":"10.1137/22s1532901","DOIUrl":"https://doi.org/10.1137/22s1532901","url":null,"abstract":"In this paper and accompanying software, we give a fully functional implementation of the Boneh-Franklin Identity-Based Encryption (IBE) scheme using the Weil pairing, which runs efficiently even with primes of cryptographic size. We describe the conceptual framework of the IBE, give background on the Weil pairing. Further, we discuss the challenges in the process of creating a functional implementation, and how we overcame them. The reader is encouraged to experiment with the accompanying software, which is written in SageMath.","PeriodicalId":93373,"journal":{"name":"SIAM undergraduate research online","volume":"121 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64317903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modelling the Evolutionary Dynamics of an Infectious Disease with an Initial Asymptomatic Infection Stage with Recovery 具有初始无症状感染阶段和恢复阶段的传染病的进化动力学建模
Pub Date : 2023-01-01 DOI: 10.1137/22s151755x
S. Manivannan
{"title":"Modelling the Evolutionary Dynamics of an Infectious Disease with an Initial Asymptomatic Infection Stage with Recovery","authors":"S. Manivannan","doi":"10.1137/22s151755x","DOIUrl":"https://doi.org/10.1137/22s151755x","url":null,"abstract":"","PeriodicalId":93373,"journal":{"name":"SIAM undergraduate research online","volume":"29 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64317922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Long-time L2 Stability for an IMEX Discretization of the 1D Fujita Equation 一维Fujita方程IMEX离散化的长时间L2稳定性
Pub Date : 2023-01-01 DOI: 10.1137/23s1556940
Victoria Luongo
We study an efficient time-stepping scheme for the 1D Fujita equation that is implicit for the linear terms but explicit for the nonlinear terms. We analyze the long-time stability of the scheme for varying parameter values, which reveal parameter value regimes in which the method is stable. We provide numerical results that illustrate the theory and show the analytically derived stability conditions are sufficient to achieve long-time stability results
{"title":"Long-time L2 Stability for an IMEX Discretization of the 1D Fujita Equation","authors":"Victoria Luongo","doi":"10.1137/23s1556940","DOIUrl":"https://doi.org/10.1137/23s1556940","url":null,"abstract":"We study an efficient time-stepping scheme for the 1D Fujita equation that is implicit for the linear terms but explicit for the nonlinear terms. We analyze the long-time stability of the scheme for varying parameter values, which reveal parameter value regimes in which the method is stable. We provide numerical results that illustrate the theory and show the analytically derived stability conditions are sufficient to achieve long-time stability results","PeriodicalId":93373,"journal":{"name":"SIAM undergraduate research online","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135403480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comparative Study of Penalized Regression and Machine Learning Algorithms in High Dimensional Scenarios 高维场景下惩罚回归与机器学习算法的比较研究
Pub Date : 2023-01-01 DOI: 10.1137/22s1538302
Connor Shrader, Gabriel Ackall
. With the prevalence of big data in recent years, the importance of modeling high dimensional data and selecting important features has increased greatly. High dimensional data is common in many fields such as genome decoding, rare disease identification, and environmental modeling. However, most traditional regression machine learning models are not designed to handle high dimensional data or conduct variable selection. In this paper, we investigate the use of penalized regression meth-ods such as ridge, least absolute shrinkage and selection operation, elastic net, smoothly clipped absolute deviation, and minimax concave penalty compared to traditional machine learning models such as random forest, XGBoost, and support vector machines. We compare these models using factorial design methods for Monte Carlo simulations in 540 environments, with factors being the response variable, number of predictors, number of samples, signal to noise ratio, covariance matrix, and correlation strength. We also compare different models using empirical data to evaluate their viability in real-world scenarios. We evaluate the models using the training and test mean squared error, variable selection accuracy, β -sensitivity, and β -specificity. We found that the performance of penalized regression models is comparable with traditional machine learning algorithms in most high-dimensional situations. The analysis helps to create a greater understanding of the strengths and weaknesses of each model type and provide a reference for other researchers on which machine learning techniques they should use, depending on a range of factors and data environments. Our study shows that penalized regression techniques should be included in predictive modelers’ toolbox.
. 随着近年来大数据的普及,对高维数据进行建模和选取重要特征的重要性大大提高。高维数据在基因组解码、罕见疾病鉴定和环境建模等许多领域都很常见。然而,大多数传统的回归机器学习模型并不是为处理高维数据或进行变量选择而设计的。在本文中,我们与传统的机器学习模型(如随机森林、XGBoost和支持向量机)相比,研究了惩罚回归方法(如脊、最小绝对收缩和选择操作、弹性网、平滑剪裁绝对偏差和最小最大凹惩罚)的使用。我们使用因子设计方法在540个环境中进行蒙特卡罗模拟,比较这些模型,因子为响应变量、预测因子数量、样本数量、信噪比、协方差矩阵和相关强度。我们还使用经验数据比较了不同的模型,以评估它们在现实世界场景中的可行性。我们使用训练和检验均方误差、变量选择准确性、β敏感性和β特异性来评估模型。我们发现,在大多数高维情况下,惩罚回归模型的性能与传统机器学习算法相当。该分析有助于更好地理解每种模型类型的优缺点,并为其他研究人员提供参考,根据一系列因素和数据环境,他们应该使用哪种机器学习技术。我们的研究表明惩罚回归技术应该包含在预测建模者的工具箱中。
{"title":"A Comparative Study of Penalized Regression and Machine Learning Algorithms in High Dimensional Scenarios","authors":"Connor Shrader, Gabriel Ackall","doi":"10.1137/22s1538302","DOIUrl":"https://doi.org/10.1137/22s1538302","url":null,"abstract":". With the prevalence of big data in recent years, the importance of modeling high dimensional data and selecting important features has increased greatly. High dimensional data is common in many fields such as genome decoding, rare disease identification, and environmental modeling. However, most traditional regression machine learning models are not designed to handle high dimensional data or conduct variable selection. In this paper, we investigate the use of penalized regression meth-ods such as ridge, least absolute shrinkage and selection operation, elastic net, smoothly clipped absolute deviation, and minimax concave penalty compared to traditional machine learning models such as random forest, XGBoost, and support vector machines. We compare these models using factorial design methods for Monte Carlo simulations in 540 environments, with factors being the response variable, number of predictors, number of samples, signal to noise ratio, covariance matrix, and correlation strength. We also compare different models using empirical data to evaluate their viability in real-world scenarios. We evaluate the models using the training and test mean squared error, variable selection accuracy, β -sensitivity, and β -specificity. We found that the performance of penalized regression models is comparable with traditional machine learning algorithms in most high-dimensional situations. The analysis helps to create a greater understanding of the strengths and weaknesses of each model type and provide a reference for other researchers on which machine learning techniques they should use, depending on a range of factors and data environments. Our study shows that penalized regression techniques should be included in predictive modelers’ toolbox.","PeriodicalId":93373,"journal":{"name":"SIAM undergraduate research online","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64317667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
In Pursuit of Higher Power Through Integrated Multivariate Regression 基于多元回归的更高权力追求
Pub Date : 2023-01-01 DOI: 10.1137/23s1584344
Ryan Shahbaba
. Univariate regression models are commonly used in statistics and machine learning to examine the relationship between an outcome variable and a set of explanatory variables, and possibly use this relationship to predict the unknown values of the outcome variable. However, when dealing with multiple outcome variables that are interrelated, multivariate regression models are preferred. These models simultaneously capture the dependencies between outcome variables and their collective relationships with explanatory variables. While multivariate regression models provide a rigorous and comprehensive understanding of factors associated with outcomes of interest, they have several limitations including: increased model complexity, larger sample size requirements, and lack of interpretability. To address these issues, we propose an alternative approach, called Integrated Multivariate Regression (IMR) that reduces the dimensionality of the outcome variables by transforming them into one or more derived outcome variables that retain important information. Using simulated and real data, we demonstrate that IMR simplifies the analysis and increases statistical power by reducing the number of parameters, while simultaneously maintaining interpretability and accounting for interdependencies among the outcome variables.
{"title":"In Pursuit of Higher Power Through Integrated Multivariate Regression","authors":"Ryan Shahbaba","doi":"10.1137/23s1584344","DOIUrl":"https://doi.org/10.1137/23s1584344","url":null,"abstract":". Univariate regression models are commonly used in statistics and machine learning to examine the relationship between an outcome variable and a set of explanatory variables, and possibly use this relationship to predict the unknown values of the outcome variable. However, when dealing with multiple outcome variables that are interrelated, multivariate regression models are preferred. These models simultaneously capture the dependencies between outcome variables and their collective relationships with explanatory variables. While multivariate regression models provide a rigorous and comprehensive understanding of factors associated with outcomes of interest, they have several limitations including: increased model complexity, larger sample size requirements, and lack of interpretability. To address these issues, we propose an alternative approach, called Integrated Multivariate Regression (IMR) that reduces the dimensionality of the outcome variables by transforming them into one or more derived outcome variables that retain important information. Using simulated and real data, we demonstrate that IMR simplifies the analysis and increases statistical power by reducing the number of parameters, while simultaneously maintaining interpretability and accounting for interdependencies among the outcome variables.","PeriodicalId":93373,"journal":{"name":"SIAM undergraduate research online","volume":"133 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135056966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
SIAM undergraduate research online
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1