R. Brum, Lúcia M. A. Drummond, Maria Clicia Stelling de Castro, George Teodoro
{"title":"优化云环境下联邦学习的计算成本","authors":"R. Brum, Lúcia M. A. Drummond, Maria Clicia Stelling de Castro, George Teodoro","doi":"10.1109/SBAC-PADW53941.2021.00016","DOIUrl":null,"url":null,"abstract":"Federated Learning is a strategy where distributed training datasets are processed by several clients coordinated by a central server that keeps the global learning model. This approach is very attractive in the biomedical domain because it allows the use of private datasets from multiple institutions to train a model without the need of sharing the data. Moreover, Machine Learning approaches often require larger training samples than what can be afforded by a single institution. In this work, we are interested in analyzing the performance of a Tumor-Infiltrating Lymphocytes Classification problem when solved by a federated learning approach deployed in a commercial cloud. In the presented evaluation, we consider executions of a federated learning implementation on accelerated instance types of the AWS EC2, on on-demand and spot markets, while varying the number of clients. The obtained results showed the improvement of the accuracy and execution times when the number of clients increases. They also revealed that, although the spot instances suffered from revocations, their use could significantly reduce the financial costs compared to the on-demand one.","PeriodicalId":233108,"journal":{"name":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Towards Optimizing Computational Costs of Federated Learning in Clouds\",\"authors\":\"R. Brum, Lúcia M. A. Drummond, Maria Clicia Stelling de Castro, George Teodoro\",\"doi\":\"10.1109/SBAC-PADW53941.2021.00016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Federated Learning is a strategy where distributed training datasets are processed by several clients coordinated by a central server that keeps the global learning model. This approach is very attractive in the biomedical domain because it allows the use of private datasets from multiple institutions to train a model without the need of sharing the data. Moreover, Machine Learning approaches often require larger training samples than what can be afforded by a single institution. In this work, we are interested in analyzing the performance of a Tumor-Infiltrating Lymphocytes Classification problem when solved by a federated learning approach deployed in a commercial cloud. In the presented evaluation, we consider executions of a federated learning implementation on accelerated instance types of the AWS EC2, on on-demand and spot markets, while varying the number of clients. The obtained results showed the improvement of the accuracy and execution times when the number of clients increases. They also revealed that, although the spot instances suffered from revocations, their use could significantly reduce the financial costs compared to the on-demand one.\",\"PeriodicalId\":233108,\"journal\":{\"name\":\"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SBAC-PADW53941.2021.00016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SBAC-PADW53941.2021.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Towards Optimizing Computational Costs of Federated Learning in Clouds
Federated Learning is a strategy where distributed training datasets are processed by several clients coordinated by a central server that keeps the global learning model. This approach is very attractive in the biomedical domain because it allows the use of private datasets from multiple institutions to train a model without the need of sharing the data. Moreover, Machine Learning approaches often require larger training samples than what can be afforded by a single institution. In this work, we are interested in analyzing the performance of a Tumor-Infiltrating Lymphocytes Classification problem when solved by a federated learning approach deployed in a commercial cloud. In the presented evaluation, we consider executions of a federated learning implementation on accelerated instance types of the AWS EC2, on on-demand and spot markets, while varying the number of clients. The obtained results showed the improvement of the accuracy and execution times when the number of clients increases. They also revealed that, although the spot instances suffered from revocations, their use could significantly reduce the financial costs compared to the on-demand one.