Accelerate Multi-view Inference with End-edge Collaborative Computing

IF 2 3区 计算机科学 Q3 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computer Supported Cooperative Work-The Journal of Collaborative Computing Pub Date : 2023-05-24 DOI:10.1109/CSCWD57460.2023.10152842
Wangbing Cheng, MinFeng Zhang, Fang Dong, Shucun Fu
{"title":"Accelerate Multi-view Inference with End-edge Collaborative Computing","authors":"Wangbing Cheng, MinFeng Zhang, Fang Dong, Shucun Fu","doi":"10.1109/CSCWD57460.2023.10152842","DOIUrl":null,"url":null,"abstract":"Multi-view inference can utilize visual information from several views like a human being and significantly improve accuracy in some scenes, but it inevitably incurs more computing overhead than traditional DNN inference. To meet the requirement of low latency in typical scenarios, we consider utilizing model partition technique of edge computing to speed up multi-view inference, and design a multi-view end-edge co-inference execution framework (MV-IEF) which can make use of both end and edge resources for multi-view inference tasks. However, when employing the framework simply, the efficiency of multi-view inference will be constrained by network dynamics and heterogeneity of devices corresponding to multiple views. To break this constraint, we establish an optimization model based on the framework to minimize the multi-view inference time and solve it on the basis of game theory. And meanwhile, we propose a joint optimization algorithm for multi-view resource allocation and model partition (MV-JRAMP), which can make remarkable decisions of resource allocation and model partiton according to network status and computing capabilities of devices. Finally, we build a prototype and evaluate the performance of MV-JRAMP. Experiments show that MV-JRAMP can accelerate multi-view inference by up to 3.71×.","PeriodicalId":51008,"journal":{"name":"Computer Supported Cooperative Work-The Journal of Collaborative Computing","volume":"42 1","pages":"1625-1631"},"PeriodicalIF":2.0000,"publicationDate":"2023-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Supported Cooperative Work-The Journal of Collaborative Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/CSCWD57460.2023.10152842","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Multi-view inference can utilize visual information from several views like a human being and significantly improve accuracy in some scenes, but it inevitably incurs more computing overhead than traditional DNN inference. To meet the requirement of low latency in typical scenarios, we consider utilizing model partition technique of edge computing to speed up multi-view inference, and design a multi-view end-edge co-inference execution framework (MV-IEF) which can make use of both end and edge resources for multi-view inference tasks. However, when employing the framework simply, the efficiency of multi-view inference will be constrained by network dynamics and heterogeneity of devices corresponding to multiple views. To break this constraint, we establish an optimization model based on the framework to minimize the multi-view inference time and solve it on the basis of game theory. And meanwhile, we propose a joint optimization algorithm for multi-view resource allocation and model partition (MV-JRAMP), which can make remarkable decisions of resource allocation and model partiton according to network status and computing capabilities of devices. Finally, we build a prototype and evaluate the performance of MV-JRAMP. Experiments show that MV-JRAMP can accelerate multi-view inference by up to 3.71×.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用端缘协同计算加速多视图推理
多视图推理可以像人类一样利用来自多个视图的视觉信息,并在某些场景中显着提高准确性,但它不可避免地会比传统的深度神经网络推理产生更多的计算开销。为了满足典型场景下低时延的要求,我们考虑利用边缘计算的模型划分技术来加速多视图推理,并设计了一个多视图端-边缘协同推理执行框架(MV-IEF),该框架可以同时利用端-边缘资源执行多视图推理任务。然而,当简单使用该框架时,多视图推理的效率将受到网络动态和多视图对应设备的异构性的限制。为了打破这一约束,我们建立了一个基于框架的优化模型,以最小化多视图推理时间,并基于博弈论进行求解。同时,我们提出了一种多视图资源分配和模型划分联合优化算法(MV-JRAMP),该算法能够根据设备的网络状态和计算能力做出较好的资源分配和模型划分决策。最后,建立了MV-JRAMP的原型,并对其性能进行了评估。实验表明,MV-JRAMP可将多视图推理速度提高3.71倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computer Supported Cooperative Work-The Journal of Collaborative Computing
Computer Supported Cooperative Work-The Journal of Collaborative Computing COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-
CiteScore
6.40
自引率
4.20%
发文量
31
审稿时长
>12 weeks
期刊介绍: Computer Supported Cooperative Work (CSCW): The Journal of Collaborative Computing and Work Practices is devoted to innovative research in computer-supported cooperative work (CSCW). It provides an interdisciplinary and international forum for the debate and exchange of ideas concerning theoretical, practical, technical, and social issues in CSCW. The CSCW Journal arose in response to the growing interest in the design, implementation and use of technical systems (including computing, information, and communications technologies) which support people working cooperatively, and its scope remains to encompass the multifarious aspects of research within CSCW and related areas. The CSCW Journal focuses on research oriented towards the development of collaborative computing technologies on the basis of studies of actual cooperative work practices (where ‘work’ is used in the wider sense). That is, it welcomes in particular submissions that (a) report on findings from ethnographic or similar kinds of in-depth fieldwork of work practices with a view to their technological implications, (b) report on empirical evaluations of the use of extant or novel technical solutions under real-world conditions, and/or (c) develop technical or conceptual frameworks for practice-oriented computing research based on previous fieldwork and evaluations.
期刊最新文献
Text-based Patient – Doctor Discourse Online And Patients’ Experiences of Empathy Agency, Power and Confrontation: the Role for Socially Engaged Art in CSCW with Rurban Communities in Support of Inclusion Data as Relation: Ontological Trouble in the Data-Driven Public Administration The Avatar Facial Expression Reenactment Method in the Metaverse based on Overall-Local Optical-Flow Estimation and Illumination Difference Investigating Author Research Relatedness through Crowdsourcing: A Replication Study on MTurk
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1