Detecting DeFi securities violations from token smart contract code

IF 6.9 1区 经济学 Q1 BUSINESS, FINANCE Financial Innovation Pub Date : 2024-02-20 DOI:10.1186/s40854-023-00572-5
Arianna Trozze, Bennett Kleinberg, Toby Davies
{"title":"Detecting DeFi securities violations from token smart contract code","authors":"Arianna Trozze, Bennett Kleinberg, Toby Davies","doi":"10.1186/s40854-023-00572-5","DOIUrl":null,"url":null,"abstract":"Decentralized Finance (DeFi) is a system of financial products and services built and delivered through smart contracts on various blockchains. In recent years, DeFi has gained popularity and market capitalization. However, it has also been connected to crime, particularly various types of securities violations. The lack of Know Your Customer requirements in DeFi poses challenges for governments trying to mitigate potential offenses. This study aims to determine whether this problem is suited to a machine learning approach, namely, whether we can identify DeFi projects potentially engaging in securities violations based on their tokens’ smart contract code. We adapted prior works on detecting specific types of securities violations across Ethereum by building classifiers based on features extracted from DeFi projects’ tokens’ smart contract code (specifically, opcode-based features). Our final model was a random forest model that achieved an 80% F-1 score against a baseline of 50%. Notably, we further explored the code-based features that are the most important to our model’s performance in more detail by analyzing tokens’ Solidity code and conducting cosine similarity analyses. We found that one element of the code that our opcode-based features can capture is the implementation of the SafeMath library, although this does not account for the entirety of our features. Another contribution of our study is a new dataset, comprising (a) a verified ground truth dataset for tokens involved in securities violations and (b) a set of legitimate tokens from a reputable DeFi aggregator. This paper further discusses the potential use of a model like ours by prosecutors in enforcement efforts and connects it to a wider legal context.","PeriodicalId":37175,"journal":{"name":"Financial Innovation","volume":"29 1","pages":""},"PeriodicalIF":6.9000,"publicationDate":"2024-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Financial Innovation","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1186/s40854-023-00572-5","RegionNum":1,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
引用次数: 0

Abstract

Decentralized Finance (DeFi) is a system of financial products and services built and delivered through smart contracts on various blockchains. In recent years, DeFi has gained popularity and market capitalization. However, it has also been connected to crime, particularly various types of securities violations. The lack of Know Your Customer requirements in DeFi poses challenges for governments trying to mitigate potential offenses. This study aims to determine whether this problem is suited to a machine learning approach, namely, whether we can identify DeFi projects potentially engaging in securities violations based on their tokens’ smart contract code. We adapted prior works on detecting specific types of securities violations across Ethereum by building classifiers based on features extracted from DeFi projects’ tokens’ smart contract code (specifically, opcode-based features). Our final model was a random forest model that achieved an 80% F-1 score against a baseline of 50%. Notably, we further explored the code-based features that are the most important to our model’s performance in more detail by analyzing tokens’ Solidity code and conducting cosine similarity analyses. We found that one element of the code that our opcode-based features can capture is the implementation of the SafeMath library, although this does not account for the entirety of our features. Another contribution of our study is a new dataset, comprising (a) a verified ground truth dataset for tokens involved in securities violations and (b) a set of legitimate tokens from a reputable DeFi aggregator. This paper further discusses the potential use of a model like ours by prosecutors in enforcement efforts and connects it to a wider legal context.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从代币智能合约代码中检测 DeFi 证券违规行为
去中心化金融(DeFi)是在各种区块链上通过智能合约构建和交付金融产品和服务的系统。近年来,DeFi 广受欢迎,市值不断攀升。然而,它也与犯罪有关,特别是各类证券违规行为。DeFi 中缺乏 "了解你的客户 "的要求,这给试图减少潜在违法行为的政府带来了挑战。本研究旨在确定这一问题是否适用于机器学习方法,即我们是否能根据代币的智能合约代码识别可能从事证券违规行为的 DeFi 项目。我们根据从 DeFi 项目代币的智能合约代码中提取的特征(特别是基于 opcode 的特征)构建分类器,从而调整了之前在以太坊上检测特定类型证券违规行为的工作。我们的最终模型是一个随机森林模型,它的 F-1 分数达到了 80%,而基线为 50%。值得注意的是,我们通过分析代币的 Solidity 代码并进行余弦相似性分析,进一步探索了对我们的模型性能最重要的基于代码的特征。我们发现,我们基于操作码的特征可以捕捉到的代码元素之一是 SafeMath 库的实现,尽管这并不代表我们的全部特征。我们研究的另一个贡献是建立了一个新的数据集,其中包括:(a) 一个经过验证的涉及证券违规代币的基本真实数据集;(b) 一组来自知名 DeFi 聚合器的合法代币。本文进一步讨论了检察官在执法工作中使用我们这样的模型的可能性,并将其与更广泛的法律背景联系起来。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Financial Innovation
Financial Innovation Economics, Econometrics and Finance-Finance
CiteScore
11.40
自引率
11.90%
发文量
95
审稿时长
5 weeks
期刊介绍: Financial Innovation (FIN), a Springer OA journal sponsored by Southwestern University of Finance and Economics, serves as a global academic platform for sharing research findings in all aspects of financial innovation during the electronic business era. It facilitates interactions among researchers, policymakers, and practitioners, focusing on new financial instruments, technologies, markets, and institutions. Emphasizing emerging financial products enabled by disruptive technologies, FIN publishes high-quality academic and practical papers. The journal is peer-reviewed, indexed in SSCI, Scopus, Google Scholar, CNKI, CQVIP, and more.
期刊最新文献
Machine learning in business and finance: a literature review and research opportunities Predictive crypto-asset automated market maker architecture for decentralized finance using deep reinforcement learning Cryptocurrencies under climate shocks: a dynamic network analysis of extreme risk spillovers Exploring the coherency and predictability between the stocks of artificial intelligence and energy corporations From CFOs to crypto: exploratory study unraveling factors in corporate adoption
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1