{"title":"Analysis of Elementary Math Word Problems Based on AI Deep Learning","authors":"Mingzhe Li","doi":"10.25236/ajms.2023.040310","DOIUrl":null,"url":null,"abstract":": Natural language processing (NLP) has greatly advanced in machine learning, but math education software lacks AI integration for solving math word problems in English. We propose using the BertGen pre-trained Transformer model, along with the MAWPS dataset augmented by our dataset augmenter. The Transformer model, with its multi-head attention mechanisms, excels at capturing long-range dependencies and referential relationships, crucial for math word problems at the primary school level. Our accuracy tests and performance on different datasets validate the effectiveness and generalizability of our approach. Moreover, our augmented dataset outperforms smaller unaugmented datasets, while maintaining diversity. The math word problem augmenter can be adapted for other math problem sets, supporting future research in the field.","PeriodicalId":372277,"journal":{"name":"Academic Journal of Mathematical Sciences","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Academic Journal of Mathematical Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25236/ajms.2023.040310","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
: Natural language processing (NLP) has greatly advanced in machine learning, but math education software lacks AI integration for solving math word problems in English. We propose using the BertGen pre-trained Transformer model, along with the MAWPS dataset augmented by our dataset augmenter. The Transformer model, with its multi-head attention mechanisms, excels at capturing long-range dependencies and referential relationships, crucial for math word problems at the primary school level. Our accuracy tests and performance on different datasets validate the effectiveness and generalizability of our approach. Moreover, our augmented dataset outperforms smaller unaugmented datasets, while maintaining diversity. The math word problem augmenter can be adapted for other math problem sets, supporting future research in the field.