Prediction of Autism and Dyslexia Using Machine Learning and Clinical Data Balancing

S. Shilaskar, S. Bhatlawande, Shivpriya Deshmukh, Harshal Dhande
Published in: 2023 International Conference on Advances in Intelligent Computing and Applications (AICAPS), 2023-02-01
DOI: 10.1109/AICAPS57044.2023.10074161 (https://doi.org/10.1109/AICAPS57044.2023.10074161)
Citation count: 0

Abstract

The prevalence of autism spectrum disorder (ASD) and dyslexia is growing faster than ever. Identifying the traits of dyslexia and autism through screening tests is costly and time-consuming. Thanks to advances in artificial intelligence and machine learning (ML), autism and dyslexia may be predicted at a very young age. Although several studies have applied a variety of approaches, none has given a clear account of how to predict autism and dyslexia traits across age groups. This study attempts to build an ML-based prediction model for ASD and dyslexia that applies to people of any age. It examines Random Forest, SVM with linear, polynomial, RBF, and sigmoid kernels, XGBoost, Decision Tree, Logistic Regression, Naïve Bayes, and KNN for predicting and assessing ASD and dyslexia in children, adolescents, and adults. The proposed model and the AQ-10 screening tool were assessed on a real data set collected from individuals with and without autistic traits. The dyslexia data comprise 3644 cases with 197 attributes: 196 independent variables and one dependent variable. The autism data comprise 704 cases with 22 attributes: 21 independent variables and one binary dependent variable (YES or NO). The results showed that the proposed prediction model gave better results on the data set in terms of accuracy, precision, recall, and F1 score.
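The four metrics named in the abstract, and the class balancing mentioned in the title, can be illustrated with a minimal sketch. The abstract does not say which balancing technique the authors used, so the random oversampling below is an assumption, as are the function names `binary_metrics` and `random_oversample` and the toy labels, none of which come from the paper.

```python
import random
from collections import Counter


def binary_metrics(y_true, y_pred, positive="YES"):
    """Return accuracy, precision, recall and F1 for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn
    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rows of smaller classes until all
    classes match the size of the largest one (an assumed technique)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, l in zip(rows, labels) if l == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


# Toy labels, made up for illustration (not the paper's data).
y_true = ["YES", "YES", "YES", "NO", "NO", "NO", "NO", "NO"]
y_pred = ["YES", "YES", "NO", "NO", "NO", "NO", "YES", "NO"]
metrics = binary_metrics(y_true, y_pred)

# Balance the toy data set: 3 YES vs 5 NO becomes 5 vs 5.
rows = [[i] for i in range(len(y_true))]
balanced_rows, balanced_labels = random_oversample(rows, y_true)
balanced_counts = Counter(balanced_labels)
```

In practice, oversampling of this kind is usually applied to the training split only, so that duplicated rows do not leak into the data used for evaluation.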