{"title":"Predicting Flake Mass: A View from Machine Learning","authors":"Guillermo Bustos-Pérez, J. Baena","doi":"10.1080/01977261.2021.1881267","DOIUrl":null,"url":null,"abstract":"ABSTRACT Estimating flake mass based on remaining attributes bears an important relationship for the interpretation of lithic assemblages. Previous works have pointed out the relationship between flake attributes and prediction of flake mass. This study builds on previous works by using data from an experimental collection of flakes. Estimated mass was arrived at by generating a multiple linear regression model that combines several predictive variables. Variable selection for model training was carried out by using best subset selection, which evaluates all possible combinations of variables. Evaluation of the model was performed by computing common machine learning statistics along with estimated percentage error. Results make it possible to determine the best variables and estimate their relationships with flake mass. On the other hand, results also show that although the model is slightly biased and performs adequately, it has a limited inferential ability, especially when compared with other methods/indexes employed to estimate reduction.","PeriodicalId":45597,"journal":{"name":"Lithic Technology","volume":"46 1","pages":"130 - 142"},"PeriodicalIF":1.5000,"publicationDate":"2021-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/01977261.2021.1881267","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lithic Technology","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1080/01977261.2021.1881267","RegionNum":3,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ANTHROPOLOGY","Score":null,"Total":0}
引用次数: 3
Abstract
ABSTRACT Estimating flake mass based on remaining attributes bears an important relationship for the interpretation of lithic assemblages. Previous works have pointed out the relationship between flake attributes and prediction of flake mass. This study builds on previous works by using data from an experimental collection of flakes. Estimated mass was arrived at by generating a multiple linear regression model that combines several predictive variables. Variable selection for model training was carried out by using best subset selection, which evaluates all possible combinations of variables. Evaluation of the model was performed by computing common machine learning statistics along with estimated percentage error. Results make it possible to determine the best variables and estimate their relationships with flake mass. On the other hand, results also show that although the model is slightly biased and performs adequately, it has a limited inferential ability, especially when compared with other methods/indexes employed to estimate reduction.