Secure, Decentralized, Privacy Preserving Machine Learning System Implementation over Blockchain

Adnesh Dhamangaonkar, Prajwal Adsul, Rohini Sarode, S. Mane
{"title":"Secure, Decentralized, Privacy Preserving Machine Learning System Implementation over Blockchain","authors":"Adnesh Dhamangaonkar, Prajwal Adsul, Rohini Sarode, S. Mane","doi":"10.1145/3475992.3476003","DOIUrl":null,"url":null,"abstract":"The traditional approach to centralized machine learning has transparency concerns. The future of machine learning is decentralized machine learning. Thus, many technological advance companies including Microsoft are also investing in researching approaches to decentralization in machine learning. With the upliftment of big data technology, designing optimized artificial intelligence algorithms is a must need. At the base of every machine learning algorithm we need data. Data is something that can be any unprocessed fact, value, text, sound or picture that is not being interpreted and analyzed. This data is not generated by just one party, multiple parties generate such data. The data will be geographically distributed amongst organizations. This pushes the need and research of distributed machine learning algorithms. In the current scenario, there is a central server which will run the machine learning algorithm and produce results, in this system obviously we need to collect all the data at that server itself. If the server is attacked then there is a problem of security of data. Also many organizations would not like to just lend their data to some third party. To address all such issues, we study all the possible ways for implementing a distributed machine learning system and propose a blockchain based distributed conservative system. Mainly, we design a Stochastic Gradient Descent (SGD) algorithm to learn a general predictive model over the trending blockchain technology, also taking care of Byzantine attack, using the within-N algorithm. Also analysis will be made on different machine learning algorithms and datasets as a part of testing, demonstrating the effectiveness of the model.","PeriodicalId":401179,"journal":{"name":"Proceedings of the 2021 3rd Blockchain and Internet of Things Conference","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 3rd Blockchain and Internet of Things Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3475992.3476003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The traditional approach to centralized machine learning has transparency concerns. The future of machine learning is decentralized machine learning. Thus, many technological advance companies including Microsoft are also investing in researching approaches to decentralization in machine learning. With the upliftment of big data technology, designing optimized artificial intelligence algorithms is a must need. At the base of every machine learning algorithm we need data. Data is something that can be any unprocessed fact, value, text, sound or picture that is not being interpreted and analyzed. This data is not generated by just one party, multiple parties generate such data. The data will be geographically distributed amongst organizations. This pushes the need and research of distributed machine learning algorithms. In the current scenario, there is a central server which will run the machine learning algorithm and produce results, in this system obviously we need to collect all the data at that server itself. If the server is attacked then there is a problem of security of data. Also many organizations would not like to just lend their data to some third party. To address all such issues, we study all the possible ways for implementing a distributed machine learning system and propose a blockchain based distributed conservative system. Mainly, we design a Stochastic Gradient Descent (SGD) algorithm to learn a general predictive model over the trending blockchain technology, also taking care of Byzantine attack, using the within-N algorithm. Also analysis will be made on different machine learning algorithms and datasets as a part of testing, demonstrating the effectiveness of the model.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
区块链上安全、分散、保护隐私的机器学习系统实现
集中式机器学习的传统方法存在透明度问题。机器学习的未来是分散的机器学习。因此,包括微软在内的许多技术领先公司也在投资研究机器学习中的去中心化方法。随着大数据技术的发展,设计优化的人工智能算法势在必行。在每个机器学习算法的基础上,我们都需要数据。数据可以是任何未经处理的事实、值、文本、声音或图像,也可以是未经解释和分析的。这些数据不是由一方产生的,而是由多方产生的。数据将在各个组织之间按地理位置分布。这推动了对分布式机器学习算法的需求和研究。在当前的场景中,有一个中央服务器将运行机器学习算法并产生结果,在这个系统中,显然我们需要在服务器本身收集所有数据。如果服务器受到攻击,那么就会出现数据安全问题。许多组织也不愿意把他们的数据借给第三方。为了解决所有这些问题,我们研究了实现分布式机器学习系统的所有可能方法,并提出了一个基于区块链的分布式保守系统。我们主要设计了一种随机梯度下降(SGD)算法来学习趋势区块链技术的通用预测模型,同时使用in- n算法来处理拜占庭攻击。作为测试的一部分,还将对不同的机器学习算法和数据集进行分析,以证明模型的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Overview of Blockchain Data Privacy Protection Blockchain Extension for PostgreSQL Data Storage Proceedings of the 2021 3rd Blockchain and Internet of Things Conference DeExam: A Decentralized Exam Administration Model using Public Blockchain Efficient Novel Privacy Preserving PoS Protocol Proof-of-concept with Algorand
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1