Estimating the Minimum Sample Size for Neural Network Model Fitting-A Monte Carlo Simulation Study.

IF 2.5 3区 心理学 Q2 PSYCHOLOGY, MULTIDISCIPLINARY Behavioral Sciences Pub Date : 2025-02-14 DOI:10.3390/bs15020211
Yongtian Cheng, Konstantinos Vassilis Petrides, Johnson Li
{"title":"Estimating the Minimum Sample Size for Neural Network Model Fitting-A Monte Carlo Simulation Study.","authors":"Yongtian Cheng, Konstantinos Vassilis Petrides, Johnson Li","doi":"10.3390/bs15020211","DOIUrl":null,"url":null,"abstract":"<p><p>In the era of machine learning, many psychological studies use machine learning methods. Specifically, neural networks, a set of machine learning methods that exhibit exceptional performance in various tasks, have been used on psychometric datasets for supervised model fitting. From the computer scientist's perspective, psychometric independent variables are typically ordinal and low-dimensional-characteristics that can significantly impact model performance. To our knowledge, there is no guidance about the sample planning suggestion for this task. Therefore, we conducted a simulation study to test the performance of an NN with different sample sizes and the simulation of both linear and nonlinear relationships. We proposed the minimum sample size for the neural network model fitting with two criteria: the performance of 95% of the models is close to the theoretical maximum, and 80% of the models can outperform the linear model. The findings of this simulation study show that the performance of neural networks can be unstable with ordinal variables as independent variables, and we suggested that neural networks should not be used on ordinal independent variables with at least common nonlinear relationships in psychology. Further suggestions and research directions are also provided.</p>","PeriodicalId":8742,"journal":{"name":"Behavioral Sciences","volume":"15 2","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11851979/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavioral Sciences","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.3390/bs15020211","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHOLOGY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

In the era of machine learning, many psychological studies use machine learning methods. Specifically, neural networks, a set of machine learning methods that exhibit exceptional performance in various tasks, have been used on psychometric datasets for supervised model fitting. From the computer scientist's perspective, psychometric independent variables are typically ordinal and low-dimensional-characteristics that can significantly impact model performance. To our knowledge, there is no guidance about the sample planning suggestion for this task. Therefore, we conducted a simulation study to test the performance of an NN with different sample sizes and the simulation of both linear and nonlinear relationships. We proposed the minimum sample size for the neural network model fitting with two criteria: the performance of 95% of the models is close to the theoretical maximum, and 80% of the models can outperform the linear model. The findings of this simulation study show that the performance of neural networks can be unstable with ordinal variables as independent variables, and we suggested that neural networks should not be used on ordinal independent variables with at least common nonlinear relationships in psychology. Further suggestions and research directions are also provided.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
神经网络模型拟合最小样本量的估计——蒙特卡罗模拟研究。
在机器学习时代,许多心理学研究都使用了机器学习的方法。具体来说,神经网络是一组在各种任务中表现出色的机器学习方法,已被用于心理测量数据集进行监督模型拟合。从计算机科学家的角度来看,心理测量自变量通常是有序和低维的特征,可以显著影响模型的性能。据我们所知,没有关于这个任务的样本规划建议的指导。因此,我们进行了模拟研究,以测试具有不同样本量的神经网络的性能以及线性和非线性关系的模拟。我们提出了神经网络模型拟合的最小样本量标准:95%的模型的性能接近理论最大值,80%的模型可以优于线性模型。仿真研究结果表明,当自变量为有序变量时,神经网络的性能可能会不稳定,因此我们建议神经网络不应该用于至少具有常见非线性关系的有序自变量。并提出了进一步研究的建议和方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Behavioral Sciences
Behavioral Sciences Social Sciences-Development
CiteScore
2.60
自引率
7.70%
发文量
429
审稿时长
11 weeks
期刊最新文献
Correction: Ferris et al. (2025) Chaining Differential Reinforcement of Compliance and Functional Communication Training to Treat Challenging Behavior Maintained by Negative Reinforcement. Behavioral Sciences, 15(7), 891. From Individual Behavior to Systemic Insight: A Bibliometric and Content Analysis of COM-B Applications in Responsible Consumption. Adapting a Behavioral Intervention for Caregivers of Children with Down Syndrome or Fragile X Syndrome: A Pilot Study of RUBI-DD. Reconstructing Multilingual Development Research: Shifting from a Monolingual Bias and Toward a Developmental Systems Framework. Developing Messages to Prevent Smokeless Tobacco and Nicotine Pouch Uptake Among Early Career Rural Firefighters in California: A Qualitative Study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1