首页 > 最新文献

Applied Computing Review最新文献

英文 中文
NFT Trust Survey 信托调查
IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-03-27 DOI: 10.1145/3555776.3577824
Jean-Marc Seigneur, Suzana Moreno
Non-Fungible Tokens (NFT) have gained popularity since 2021, reaching a total market valuation of several billion US dollars, especially in art. This paper highlights the findings of our statistically representative survey of more than 1850 Americans, e.g., 5.7% have already bought an NFT. Unfortunately, that trust has been misplaced on many occasions due to technical and legal issues of most created NFTs. We detail those issues and evaluate them in the case of the most well-known NFT marketplace, i.e., OpenSea.
自2021年以来,不可替代代币(NFT)开始流行,市场总估值达到数十亿美元,尤其是在艺术领域。本文强调了我们对1850多名美国人进行的具有统计代表性的调查结果,例如,5.7%的人已经购买了NFT。不幸的是,由于大多数已创建的nft的技术和法律问题,这种信任在很多情况下都被放错了地方。我们详细介绍了这些问题,并以最知名的NFT市场OpenSea为例进行了评估。
{"title":"NFT Trust Survey","authors":"Jean-Marc Seigneur, Suzana Moreno","doi":"10.1145/3555776.3577824","DOIUrl":"https://doi.org/10.1145/3555776.3577824","url":null,"abstract":"Non-Fungible Tokens (NFT) have gained popularity since 2021, reaching a total market valuation of several billion US dollars, especially in art. This paper highlights the findings of our statistically representative survey of more than 1850 Americans, e.g., 5.7% have already bought an NFT. Unfortunately, that trust has been misplaced on many occasions due to technical and legal issues of most created NFTs. We detail those issues and evaluate them in the case of the most well-known NFT marketplace, i.e., OpenSea.","PeriodicalId":42971,"journal":{"name":"Applied Computing Review","volume":"113 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79323879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stateful Adaptive Streams with Approximate Computing and Elastic Scaling 具有近似计算和弹性缩放的有状态自适应流
IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-03-27 DOI: 10.1145/3555776.3577858
João Francisco, Miguel E. Coimbra, P. Neto, Felix Freitag, L. Veiga
The model of approximate computing can be used to increase performance or optimize resource usage in stream and graph processing. It can be used to satisfy performance requirements (e.g., throughput, lag) in stream processing by reducing the effort that applications need to process datasets. There are currently multiple stream processing platforms, and most of them do not natively support approximate results. A recent one, Stateful Functions, is an API that uses Flink to enable developers to easily build stream and graph processing applications. It also retains Flink's features like stateful computations, fault-tolerance, scalability, control events and its graph processing library Gelly. Herein we present Approxate, an extension over this platform to support approximate results. It can also support more efficient stream and graph processing by allocating available resources adaptively, driven by user-defined requirements on throughput, lag, and latency. This extension enables flexibility in computational trade-offs such as trading accuracy for performance. The user can choose which metrics should be guaranteed at the cost of others, and/or the accuracy. Approxate incorporates approximate computing (using load shedding) with adaptive accuracy and resource manegement in state-of-the-art stream processing platforms, which are not targeted in other relevant related work. It does not require significant modifications to application code, and minimizes imbalance in data source representation when dropping events.
近似计算模型可用于提高流和图形处理的性能或优化资源使用。它可以通过减少应用程序处理数据集所需的工作量来满足流处理中的性能要求(例如,吞吐量、延迟)。目前有多种流处理平台,其中大多数都不支持近似结果。最近的一个API是Stateful Functions,它使用Flink使开发人员能够轻松地构建流和图形处理应用程序。它还保留了Flink的特性,如有状态计算、容错、可扩展性、控制事件和图形处理库Gelly。在这里,我们提出了近似,在这个平台上的扩展,以支持近似结果。它还可以根据用户定义的吞吐量、延迟和延迟需求,自适应地分配可用资源,从而支持更高效的流和图形处理。这个扩展使计算权衡的灵活性,如交易精度的性能。用户可以选择以牺牲其他指标和/或准确性为代价来保证哪些指标。在最先进的流处理平台中,approximate结合了具有自适应精度和资源管理的近似计算(使用负载减少),这在其他相关工作中不是针对的。它不需要对应用程序代码进行重大修改,并且在删除事件时最大限度地减少数据源表示中的不平衡。
{"title":"Stateful Adaptive Streams with Approximate Computing and Elastic Scaling","authors":"João Francisco, Miguel E. Coimbra, P. Neto, Felix Freitag, L. Veiga","doi":"10.1145/3555776.3577858","DOIUrl":"https://doi.org/10.1145/3555776.3577858","url":null,"abstract":"The model of approximate computing can be used to increase performance or optimize resource usage in stream and graph processing. It can be used to satisfy performance requirements (e.g., throughput, lag) in stream processing by reducing the effort that applications need to process datasets. There are currently multiple stream processing platforms, and most of them do not natively support approximate results. A recent one, Stateful Functions, is an API that uses Flink to enable developers to easily build stream and graph processing applications. It also retains Flink's features like stateful computations, fault-tolerance, scalability, control events and its graph processing library Gelly. Herein we present Approxate, an extension over this platform to support approximate results. It can also support more efficient stream and graph processing by allocating available resources adaptively, driven by user-defined requirements on throughput, lag, and latency. This extension enables flexibility in computational trade-offs such as trading accuracy for performance. The user can choose which metrics should be guaranteed at the cost of others, and/or the accuracy. Approxate incorporates approximate computing (using load shedding) with adaptive accuracy and resource manegement in state-of-the-art stream processing platforms, which are not targeted in other relevant related work. It does not require significant modifications to application code, and minimizes imbalance in data source representation when dropping events.","PeriodicalId":42971,"journal":{"name":"Applied Computing Review","volume":"6 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78035215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Deployment of Mobile Robot driven Preference Learning for User-State-Specific Thermal Control in A Real-World Smart Space 在现实世界的智能空间中,移动机器人驱动的偏好学习用于用户状态特定的热控制
IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-03-27 DOI: 10.1145/3555776.3577760
Geon Kim, Hyunju Kim, Dongman Lee
Indoor Environment Quality (IEQ) is one of the most important goals for smart spaces. Thermal comfort is typically considered the most emphasized factor in IEQ that depends on personalized thermal preference. In this paper, we explore technical challenges to deploying a robot-driven personalized thermal control system that uses a mobile robot for learning user-state-specific preference efficiently. We conduct a few experiments that give a clue to overcome such challenges (i.e. low image recognition) when the system is deployed in a real world. We present future directions to improve robot-driven preference learning from the exploration.
室内环境质量(IEQ)是智能空间最重要的目标之一。热舒适通常被认为是IEQ中最重要的因素,它取决于个性化的热偏好。在本文中,我们探讨了部署机器人驱动的个性化热控制系统的技术挑战,该系统使用移动机器人有效地学习用户特定状态的偏好。当系统部署在现实世界中时,我们进行了一些实验,为克服这些挑战(即低图像识别)提供了线索。我们从探索中提出了改进机器人驱动偏好学习的未来方向。
{"title":"Towards Deployment of Mobile Robot driven Preference Learning for User-State-Specific Thermal Control in A Real-World Smart Space","authors":"Geon Kim, Hyunju Kim, Dongman Lee","doi":"10.1145/3555776.3577760","DOIUrl":"https://doi.org/10.1145/3555776.3577760","url":null,"abstract":"Indoor Environment Quality (IEQ) is one of the most important goals for smart spaces. Thermal comfort is typically considered the most emphasized factor in IEQ that depends on personalized thermal preference. In this paper, we explore technical challenges to deploying a robot-driven personalized thermal control system that uses a mobile robot for learning user-state-specific preference efficiently. We conduct a few experiments that give a clue to overcome such challenges (i.e. low image recognition) when the system is deployed in a real world. We present future directions to improve robot-driven preference learning from the exploration.","PeriodicalId":42971,"journal":{"name":"Applied Computing Review","volume":"75 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86059371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Acala: Aggregate Monitoring for Geo-Distributed Cluster Federations Acala:地理分布式集群联合的聚合监控
IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-03-27 DOI: 10.1145/3555776.3577716
Chih-Kai Huang, G. Pierre
Distributed monitoring is an essential functionality to allow large cluster federations to efficiently schedule applications on a set of available geo-distributed resources. However, periodically reporting the precise status of each available server is both unnecessary to allow accurate scheduling and unscalable when the number of servers grows. This paper proposes Acala, a monitoring framework for geo-distributed cluster federations which aims to provide the management cluster with aggregate information about the entire cluster instead of individual servers. Our evaluations, based on actual deployment under controlled environment in the geo-distributed Grid'5000 testbed, show that Acala reduces the cross-cluster network traffic by up to 99% and the scrape duration by up to 55%.
分布式监控是一项基本功能,它允许大型集群联合在一组可用的地理分布式资源上有效地调度应用程序。但是,定期报告每个可用服务器的精确状态对于实现精确的调度是不必要的,而且当服务器数量增加时也无法进行扩展。本文提出了Acala,一个用于地理分布式集群联合的监控框架,旨在为管理集群提供关于整个集群而不是单个服务器的汇总信息。我们的评估基于地理分布式网格5000测试平台在受控环境下的实际部署,表明Acala将跨集群网络流量减少了99%,将刮取时间减少了55%。
{"title":"Acala: Aggregate Monitoring for Geo-Distributed Cluster Federations","authors":"Chih-Kai Huang, G. Pierre","doi":"10.1145/3555776.3577716","DOIUrl":"https://doi.org/10.1145/3555776.3577716","url":null,"abstract":"Distributed monitoring is an essential functionality to allow large cluster federations to efficiently schedule applications on a set of available geo-distributed resources. However, periodically reporting the precise status of each available server is both unnecessary to allow accurate scheduling and unscalable when the number of servers grows. This paper proposes Acala, a monitoring framework for geo-distributed cluster federations which aims to provide the management cluster with aggregate information about the entire cluster instead of individual servers. Our evaluations, based on actual deployment under controlled environment in the geo-distributed Grid'5000 testbed, show that Acala reduces the cross-cluster network traffic by up to 99% and the scrape duration by up to 55%.","PeriodicalId":42971,"journal":{"name":"Applied Computing Review","volume":"126 12","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72444214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
FedFAME: A Data Augmentation Free Framework based on Model Contrastive Learning for Federated Semi-Supervised Learning 联邦半监督学习中基于模型对比学习的数据增强自由框架
IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-03-27 DOI: 10.1145/3555776.3577613
Shubham Malaviya, Manish Shukla, Pratik Korat, S. Lodha
Federated learning has emerged as a privacy-preserving technique to learn a machine learning model without requiring users to share their data. Our paper focuses on Federated Semi-Supervised Learning (FSSL) setting wherein users do not have domain expertise or incentives to label data on their device, and the server has access to some labeled data that is annotated by experts. The existing work in FSSL require data augmentation for model training. However, data augmentation is not well defined for prevalent domains like text and graphs. Moreover, non independent and identically distributed (non-i.i.d.) data across users is a significant challenge in federated learning. We propose a generalized framework based on model contrastive learning called FedFAME which does not require data augmentation, thus making it easy to adapt to different domains. Our experiments on image and text datasets show the robustness of FedFAME towards non-i.i.d. data. We have validated our approach by varying data imbalance across users and the number of labeled instances on the server.
联邦学习已经成为一种隐私保护技术,可以在不需要用户共享数据的情况下学习机器学习模型。我们的论文关注的是联邦半监督学习(FSSL)设置,其中用户没有领域专业知识或动机在他们的设备上标记数据,服务器可以访问一些由专家注释的标记数据。现有的FSSL工作需要数据增强来进行模型训练。然而,对于文本和图形等流行领域,数据增强并没有很好地定义。此外,跨用户的非独立和同分布(non-i.i.d)数据是联邦学习中的一个重大挑战。我们提出了一个基于模型对比学习的广义框架,称为FedFAME,它不需要数据增强,从而使其易于适应不同的领域。我们在图像和文本数据集上的实验表明了FedFAME对非识别的鲁棒性。数据。我们通过改变用户之间的数据不平衡和服务器上标记实例的数量来验证我们的方法。
{"title":"FedFAME: A Data Augmentation Free Framework based on Model Contrastive Learning for Federated Semi-Supervised Learning","authors":"Shubham Malaviya, Manish Shukla, Pratik Korat, S. Lodha","doi":"10.1145/3555776.3577613","DOIUrl":"https://doi.org/10.1145/3555776.3577613","url":null,"abstract":"Federated learning has emerged as a privacy-preserving technique to learn a machine learning model without requiring users to share their data. Our paper focuses on Federated Semi-Supervised Learning (FSSL) setting wherein users do not have domain expertise or incentives to label data on their device, and the server has access to some labeled data that is annotated by experts. The existing work in FSSL require data augmentation for model training. However, data augmentation is not well defined for prevalent domains like text and graphs. Moreover, non independent and identically distributed (non-i.i.d.) data across users is a significant challenge in federated learning. We propose a generalized framework based on model contrastive learning called FedFAME which does not require data augmentation, thus making it easy to adapt to different domains. Our experiments on image and text datasets show the robustness of FedFAME towards non-i.i.d. data. We have validated our approach by varying data imbalance across users and the number of labeled instances on the server.","PeriodicalId":42971,"journal":{"name":"Applied Computing Review","volume":"18 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87588849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated Hyperparameter Optimisation with Flower and Optuna 基于Flower和Optuna的联邦超参数优化
IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-03-27 DOI: 10.1145/3555776.3577847
J. Parra-Ullauri, Xunzheng Zhang, A. Bravalheri, R. Nejabati, D. Simeonidou
Federated learning (FL) is an emerging distributed machine learning technique in which multiple clients collaborate to learn a model under the management of a central server. An FL system depends on a set of initial conditions (i.e., hyperparameters) that affect the system's performance. However, defining a good choice of hyperparameters for the central server and clients is a challenging problem. Hyperparameter tuning in FL often requires manual or automated searches to find optimal values. Nonetheless, a noticeable limitation is the high cost of algorithm evaluation for server and client models, making the tuning process computationally expensive and time-consuming. We propose an implementation based on integrating the FL framework Flower, and the prime optimisation software Optuna for automated and efficient hyperparameter optimisation (HPO) in FL. Through this combination, it is possible to tune hyperparameters in both clients and server online, aiming to find the optimal values at runtime. We introduce the HPO factor to describe the number of rounds that the HPO will take place, and the HPO rate that defines the frequency for updating the hyperparameters and can be used for pruning. The HPO is managed by the FL server which updates clients' hyperparameters, with an HPO rate, using state-of-the-art optimisation algorithms enabled by Optuna. We tested our approach by updating multiple client models simultaneously in popular image recognition datasets which produced promising results compared to baselines.
联邦学习(FL)是一种新兴的分布式机器学习技术,其中多个客户端在中央服务器的管理下协作学习模型。FL系统依赖于一组影响系统性能的初始条件(即超参数)。然而,为中心服务器和客户机定义一个好的超参数选择是一个具有挑战性的问题。FL中的超参数调优通常需要手动或自动搜索以找到最优值。尽管如此,一个明显的限制是服务器和客户机模型的算法评估的高成本,使得优化过程在计算上昂贵且耗时。我们提出了一种基于集成FL框架Flower和主要优化软件Optuna的实现,用于FL中自动化和高效的超参数优化(HPO)。通过这种组合,可以在线调整客户端和服务器中的超参数,旨在在运行时找到最优值。我们引入了HPO因子来描述HPO将发生的轮数,以及HPO率,它定义了更新超参数的频率,并可用于修剪。HPO由FL服务器管理,该服务器使用Optuna启用的最先进的优化算法,以HPO率更新客户端的超参数。我们通过在流行的图像识别数据集中同时更新多个客户端模型来测试我们的方法,与基线相比,产生了有希望的结果。
{"title":"Federated Hyperparameter Optimisation with Flower and Optuna","authors":"J. Parra-Ullauri, Xunzheng Zhang, A. Bravalheri, R. Nejabati, D. Simeonidou","doi":"10.1145/3555776.3577847","DOIUrl":"https://doi.org/10.1145/3555776.3577847","url":null,"abstract":"Federated learning (FL) is an emerging distributed machine learning technique in which multiple clients collaborate to learn a model under the management of a central server. An FL system depends on a set of initial conditions (i.e., hyperparameters) that affect the system's performance. However, defining a good choice of hyperparameters for the central server and clients is a challenging problem. Hyperparameter tuning in FL often requires manual or automated searches to find optimal values. Nonetheless, a noticeable limitation is the high cost of algorithm evaluation for server and client models, making the tuning process computationally expensive and time-consuming. We propose an implementation based on integrating the FL framework Flower, and the prime optimisation software Optuna for automated and efficient hyperparameter optimisation (HPO) in FL. Through this combination, it is possible to tune hyperparameters in both clients and server online, aiming to find the optimal values at runtime. We introduce the HPO factor to describe the number of rounds that the HPO will take place, and the HPO rate that defines the frequency for updating the hyperparameters and can be used for pruning. The HPO is managed by the FL server which updates clients' hyperparameters, with an HPO rate, using state-of-the-art optimisation algorithms enabled by Optuna. We tested our approach by updating multiple client models simultaneously in popular image recognition datasets which produced promising results compared to baselines.","PeriodicalId":42971,"journal":{"name":"Applied Computing Review","volume":"62 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72646118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A formal analysis of Dutch Generic Integral Tunnel Design models 荷兰通用整体隧道设计模型的形式化分析
IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-03-27 DOI: 10.1145/3555776.3577786
Kevin H. J. Jilissen, P. Dieleman, J. F. Groote
The Generic Integral Tunnel Design (GITO) contains generic models for the tunnel control systems of Rijkswaterstaat, part of the Dutch Ministry of Infrastructure and Water Management. A formal verification of these models advances the safety and reliability of GITO derived tunnel control systems. In this paper, the first known large-scale formalisation of tunnel control systems is presented which transforms GITO models to the formal specification language mCRL2. This transformation is applied to two sub-systems of the GITO to analyse the correctness of the supplied models. In this formal analysis, several deficiencies in the specifications and faults in the existing models are revealed and verified solutions are proposed. Some of the presented faults even find their origin in the legally required standards.
通用整体隧道设计(GITO)包含荷兰基础设施和水资源管理部Rijkswaterstaat隧道控制系统的通用模型。对这些模型的形式化验证提高了GITO导出的隧道控制系统的安全性和可靠性。本文提出了第一个已知的隧道控制系统的大规模形式化,它将GITO模型转换为形式化规范语言mCRL2。将此转换应用于GITO的两个子系统,以分析所提供模型的正确性。在此形式化分析中,揭示了规范中的一些不足和现有模型中的缺陷,并提出了验证的解决方案。一些出现的缺陷甚至可以在法律要求的标准中找到它们的根源。
{"title":"A formal analysis of Dutch Generic Integral Tunnel Design models","authors":"Kevin H. J. Jilissen, P. Dieleman, J. F. Groote","doi":"10.1145/3555776.3577786","DOIUrl":"https://doi.org/10.1145/3555776.3577786","url":null,"abstract":"The Generic Integral Tunnel Design (GITO) contains generic models for the tunnel control systems of Rijkswaterstaat, part of the Dutch Ministry of Infrastructure and Water Management. A formal verification of these models advances the safety and reliability of GITO derived tunnel control systems. In this paper, the first known large-scale formalisation of tunnel control systems is presented which transforms GITO models to the formal specification language mCRL2. This transformation is applied to two sub-systems of the GITO to analyse the correctness of the supplied models. In this formal analysis, several deficiencies in the specifications and faults in the existing models are revealed and verified solutions are proposed. Some of the presented faults even find their origin in the legally required standards.","PeriodicalId":42971,"journal":{"name":"Applied Computing Review","volume":"56 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74734020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Are alternatives to backpropagation useful for training Binary Neural Networks? An experimental study in image classification 反向传播的替代方法对训练二元神经网络有用吗?图像分类的实验研究
IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-03-27 DOI: 10.1145/3555776.3577674
Ben Crulis, Barthélémy Serres, Cyril de Runz, G. Venturini
Current artificial neural networks are trained with parameters encoded as floating point numbers that occupy lots of memory space at inference time. Due to the increase in size of deep learning models, it is becoming very difficult to consider training and using artificial neural networks on edge devices such as smartphones. Binary neural networks promise to reduce the size of deep neural network models as well as increasing inference speed while decreasing energy consumption and so allow the deployment of more powerful models on edge devices. However, binary neural networks are still proven to be difficult to train using the backpropagation based gradient descent scheme. We propose to adapt to binary neural networks two training algorithms considered as promising alternatives to backpropagation but for continuous neural networks. We provide experimental comparative results for image classification including the backpropagation baseline on the MNIST, Fashion MNIST and CIFAR-10 datasets in both continuous and binary settings. The results demonstrate that binary neural networks can not only be trained using alternative algorithms to backpropagation but can also be shown to lead better performance and a higher tolerance to the presence or absence of batch normalization layers.
目前的人工神经网络是用浮点数编码的参数来训练的,在推理时占用了大量的内存空间。由于深度学习模型规模的增加,考虑在智能手机等边缘设备上训练和使用人工神经网络变得非常困难。二元神经网络有望减少深度神经网络模型的大小,提高推理速度,同时降低能耗,从而允许在边缘设备上部署更强大的模型。然而,使用基于反向传播的梯度下降方案训练二元神经网络仍然被证明是困难的。我们提出了适合于二元神经网络的两种训练算法,这两种算法被认为是有前途的反向传播替代方案,但适用于连续神经网络。在连续和二进制设置下,我们提供了包括MNIST、Fashion MNIST和CIFAR-10数据集上的反向传播基线图像分类的实验比较结果。结果表明,二元神经网络不仅可以使用反向传播的替代算法进行训练,而且还可以显示出更好的性能和对批处理归一化层存在或不存在的更高容忍度。
{"title":"Are alternatives to backpropagation useful for training Binary Neural Networks? An experimental study in image classification","authors":"Ben Crulis, Barthélémy Serres, Cyril de Runz, G. Venturini","doi":"10.1145/3555776.3577674","DOIUrl":"https://doi.org/10.1145/3555776.3577674","url":null,"abstract":"Current artificial neural networks are trained with parameters encoded as floating point numbers that occupy lots of memory space at inference time. Due to the increase in size of deep learning models, it is becoming very difficult to consider training and using artificial neural networks on edge devices such as smartphones. Binary neural networks promise to reduce the size of deep neural network models as well as increasing inference speed while decreasing energy consumption and so allow the deployment of more powerful models on edge devices. However, binary neural networks are still proven to be difficult to train using the backpropagation based gradient descent scheme. We propose to adapt to binary neural networks two training algorithms considered as promising alternatives to backpropagation but for continuous neural networks. We provide experimental comparative results for image classification including the backpropagation baseline on the MNIST, Fashion MNIST and CIFAR-10 datasets in both continuous and binary settings. The results demonstrate that binary neural networks can not only be trained using alternative algorithms to backpropagation but can also be shown to lead better performance and a higher tolerance to the presence or absence of batch normalization layers.","PeriodicalId":42971,"journal":{"name":"Applied Computing Review","volume":"35 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77059636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Alleviating High Gas Costs by Secure and Trustless Off-chain Execution of Smart Contracts 通过安全、无信任的智能合约链下执行来降低高昂的天然气成本
IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-03-27 DOI: 10.1145/3555776.3577833
Soroush Farokhnia, Amir Kafshdar Goharshady
Smart contracts are programs that are executed on the blockchain and can hold, manage and transfer assets in the form of cryptocurrencies. The contract's execution is then performed on-chain and is subject to consensus, i.e. every node on the blockchain network has to run the function calls and keep track of their side-effects including updates to the balances and contract's storage. The notion of gas is introduced in most programmable blockchains, which prevents DoS attacks from malicious parties who might try to slow down the network by performing time-consuming and resource-heavy computations. While the gas idea has largely succeeded in its goal of avoiding DoS attacks, the resulting fees are extremely high. For example, in June-September 2022, on Ethereum alone, there has been an average total gas usage of 2,706.8 ETH ≈ 3,938,749 USD per day. We propose a protocol for alleviating these costs by moving most of the computation off-chain while preserving enough data on-chain to guarantee an implicit consensus about the contract state and ownership of funds in case of dishonest parties. We perform extensive experiments over 3,330 real-world Solidity contracts that were involved in 327,132 transactions in June-September 2022 on Ethereum and show that our approach reduces their gas usage by 40.09 percent, which amounts to a whopping 442,651 USD.
智能合约是在区块链上执行的程序,可以以加密货币的形式持有、管理和转移资产。然后,合约的执行在链上执行,并受到共识的约束,即区块链网络上的每个节点都必须运行函数调用,并跟踪其副作用,包括更新余额和合约的存储。大多数可编程区块链中都引入了gas的概念,这可以防止恶意方的DoS攻击,恶意方可能会通过执行耗时且资源繁重的计算来减慢网络速度。虽然gas的想法在很大程度上成功地避免了DoS攻击,但由此产生的费用非常高。例如,在2022年6月至9月期间,仅在以太坊上,平均每天的总天然气使用量为2,706.8 ETH≈3,938,749美元。我们提出了一种协议,通过将大部分计算移到链下,同时在链上保留足够的数据,以保证在不诚实的各方的情况下,对合同状态和资金所有权达成隐含共识,从而降低这些成本。我们对3330份真实世界的Solidity合约进行了广泛的实验,这些合约在2022年6月至9月期间在以太坊上进行了327,132笔交易,并表明我们的方法将他们的天然气使用量减少了40.09%,这相当于高达442,651美元。
{"title":"Alleviating High Gas Costs by Secure and Trustless Off-chain Execution of Smart Contracts","authors":"Soroush Farokhnia, Amir Kafshdar Goharshady","doi":"10.1145/3555776.3577833","DOIUrl":"https://doi.org/10.1145/3555776.3577833","url":null,"abstract":"Smart contracts are programs that are executed on the blockchain and can hold, manage and transfer assets in the form of cryptocurrencies. The contract's execution is then performed on-chain and is subject to consensus, i.e. every node on the blockchain network has to run the function calls and keep track of their side-effects including updates to the balances and contract's storage. The notion of gas is introduced in most programmable blockchains, which prevents DoS attacks from malicious parties who might try to slow down the network by performing time-consuming and resource-heavy computations. While the gas idea has largely succeeded in its goal of avoiding DoS attacks, the resulting fees are extremely high. For example, in June-September 2022, on Ethereum alone, there has been an average total gas usage of 2,706.8 ETH ≈ 3,938,749 USD per day. We propose a protocol for alleviating these costs by moving most of the computation off-chain while preserving enough data on-chain to guarantee an implicit consensus about the contract state and ownership of funds in case of dishonest parties. We perform extensive experiments over 3,330 real-world Solidity contracts that were involved in 327,132 transactions in June-September 2022 on Ethereum and show that our approach reduces their gas usage by 40.09 percent, which amounts to a whopping 442,651 USD.","PeriodicalId":42971,"journal":{"name":"Applied Computing Review","volume":"137 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77813297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Sec2vec: Anomaly Detection in HTTP Traffic and Malicious URLs Sec2vec: HTTP流量和恶意url的异常检测
IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-03-27 DOI: 10.1145/3555776.3577663
Mateusz Gniewkowski, H. Maciejewski, T. Surmacz, Wiktor Walentynowicz
In this paper, we show how methods known from Natural Language Processing (NLP) can be used to detect anomalies in HTTP requests and malicious URLs. Most of the current solutions focusing on a similar problem are either rule-based or trained using manually selected features. Modern NLP methods, however, have great potential in capturing a deep understanding of samples and therefore improving the classification results. Other methods, which rely on a similar idea, often ignore the interpretability of the results, which is so important in machine learning. We are trying to fill this gap. In addition, we show to what extent the proposed solutions are resistant to concept drift. In our work, we compare three different vectorization methods: simple BoW, fastText, and the current state-of-the-art language model RoBERTa. The obtained vectors are later used in the classification task. In order to explain our results, we utilize the SHAP method. We evaluate the feasibility of our methods on four different datasets: CSIC2010, UNSW-NB15, MALICIOUSURL, and ISCX-URL2016. The first two are related to HTTP traffic, the other two contain malicious URLs. The results we show are comparable to others or better, and most importantly - interpretable.
在本文中,我们展示了如何使用自然语言处理(NLP)中已知的方法来检测HTTP请求和恶意url中的异常情况。目前针对类似问题的大多数解决方案要么是基于规则的,要么是使用手动选择的特征进行训练的。然而,现代NLP方法在获取对样本的深入理解从而改进分类结果方面具有很大的潜力。其他依赖于类似想法的方法往往忽略了结果的可解释性,而这在机器学习中非常重要。我们正在努力填补这一空白。此外,我们还展示了所提出的解决方案在多大程度上能够抵抗概念漂移。在我们的工作中,我们比较了三种不同的矢量化方法:简单的BoW、fastText和当前最先进的语言模型RoBERTa。得到的向量稍后用于分类任务。为了解释我们的结果,我们使用了SHAP方法。我们评估了我们的方法在四个不同数据集上的可行性:CSIC2010、UNSW-NB15、MALICIOUSURL和ISCX-URL2016。前两个与HTTP流量有关,另外两个包含恶意url。我们展示的结果与他人相当或更好,最重要的是-可解释。
{"title":"Sec2vec: Anomaly Detection in HTTP Traffic and Malicious URLs","authors":"Mateusz Gniewkowski, H. Maciejewski, T. Surmacz, Wiktor Walentynowicz","doi":"10.1145/3555776.3577663","DOIUrl":"https://doi.org/10.1145/3555776.3577663","url":null,"abstract":"In this paper, we show how methods known from Natural Language Processing (NLP) can be used to detect anomalies in HTTP requests and malicious URLs. Most of the current solutions focusing on a similar problem are either rule-based or trained using manually selected features. Modern NLP methods, however, have great potential in capturing a deep understanding of samples and therefore improving the classification results. Other methods, which rely on a similar idea, often ignore the interpretability of the results, which is so important in machine learning. We are trying to fill this gap. In addition, we show to what extent the proposed solutions are resistant to concept drift. In our work, we compare three different vectorization methods: simple BoW, fastText, and the current state-of-the-art language model RoBERTa. The obtained vectors are later used in the classification task. In order to explain our results, we utilize the SHAP method. We evaluate the feasibility of our methods on four different datasets: CSIC2010, UNSW-NB15, MALICIOUSURL, and ISCX-URL2016. The first two are related to HTTP traffic, the other two contain malicious URLs. The results we show are comparable to others or better, and most importantly - interpretable.","PeriodicalId":42971,"journal":{"name":"Applied Computing Review","volume":"46 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79851654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Applied Computing Review
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1