{"title":"Evaluate Machine Learning Models Used for Upscaling Surface Ocean CO2 Measurements","authors":"J. Zeng, Zheng-Hong Tan","doi":"10.1109/AMCON.2018.8614876","DOIUrl":null,"url":null,"abstract":"Upscaling measurements from ground-based sites or underway monitoring to a regional or global scale provides important information to policy makers for environmental management and to researchers looking for a better understanding of relevant issues. We used the reconstruction of global surface ocean CO2 as an example to evaluate the performance of four machine learning models: Random Forest (RF), Support Vector Machine (SVM), Feedforward Neural Network (FNN), and Self-Organization Map (SOM). The results show the performance from high to low as RF, SVM, FNN, and SOM. However, the overall differences of modelled CO2 among the four models are insignificant. Considering the discrete characteristics of RF, it is recommended to use SVM or FNN when the number of data point is not large and continuous estimations are expected. RF has an advantage particularly when the number of data points is very large and the data include categorial variables.","PeriodicalId":438307,"journal":{"name":"2018 IEEE International Conference on Advanced Manufacturing (ICAM)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Advanced Manufacturing (ICAM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AMCON.2018.8614876","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Upscaling measurements from ground-based sites or underway monitoring to a regional or global scale provides important information to policy makers for environmental management and to researchers looking for a better understanding of relevant issues. We used the reconstruction of global surface ocean CO2 as an example to evaluate the performance of four machine learning models: Random Forest (RF), Support Vector Machine (SVM), Feedforward Neural Network (FNN), and Self-Organization Map (SOM). The results show the performance from high to low as RF, SVM, FNN, and SOM. However, the overall differences of modelled CO2 among the four models are insignificant. Considering the discrete characteristics of RF, it is recommended to use SVM or FNN when the number of data point is not large and continuous estimations are expected. RF has an advantage particularly when the number of data points is very large and the data include categorial variables.