This paper presents an original investigation into the domain of violence detection in videos, introducing an innovative approach tailored to the unique challenges of a federated learning environment. The study encompasses a comprehensive exploration of machine learning techniques, leveraging spatio-temporal features extracted from benchmark video datasets. In a notable departure from conventional methodologies, we introduce a novel architecture, the “Diff Gated” network, designed to streamline preprocessing and training while simultaneously enhancing accuracy. Our exploration of advanced machine learning techniques, such as super-convergence and transfer learning, expands the horizons of federated learning, offering a broader range of practical applications. Moreover, our research introduces a method for seamlessly adapting centralized datasets to the federated learning context, bridging the gap between traditional machine learning and federated learning approaches. The outcome of this study is a remarkable advancement in the field of violence detection, with our federated learning model consistently outperforming state-of-the-art models, underscoring the transformative potential of our contributions. This work represents a significant step forward in the application of machine learning techniques to critical societal challenges.
{"title":"Balancing Accuracy and Training Time in Federated Learning for Violence Detection in Surveillance Videos: A Study of Neural Network Architectures","authors":"Quentin Pajon, Swan Serre, Hugo Wissocq, Léo Rabaud, Siba Haidar, Antoun Yaacoub","doi":"10.1007/s11390-024-3702-7","DOIUrl":"https://doi.org/10.1007/s11390-024-3702-7","url":null,"abstract":"<p>This paper presents an original investigation into the domain of violence detection in videos, introducing an innovative approach tailored to the unique challenges of a federated learning environment. The study encompasses a comprehensive exploration of machine learning techniques, leveraging spatio-temporal features extracted from benchmark video datasets. In a notable departure from conventional methodologies, we introduce a novel architecture, the “Diff Gated” network, designed to streamline preprocessing and training while simultaneously enhancing accuracy. Our exploration of advanced machine learning techniques, such as super-convergence and transfer learning, expands the horizons of federated learning, offering a broader range of practical applications. Moreover, our research introduces a method for seamlessly adapting centralized datasets to the federated learning context, bridging the gap between traditional machine learning and federated learning approaches. The outcome of this study is a remarkable advancement in the field of violence detection, with our federated learning model consistently outperforming state-of-the-art models, underscoring the transformative potential of our contributions. This work represents a significant step forward in the application of machine learning techniques to critical societal challenges.</p>","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"152 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22, DOI: 10.1007/s11390-024-3767-3
Fei Du, Xin-Jian Ma, Jing-Ru Yang, Yi Liu, Chao-Ran Luo, Xue-Bin Wang, Hai-Ou Jiang, Xiang Jing
Since OpenAI opened access to ChatGPT, large language models (LLMs) have become an increasingly popular topic, attracting researchers' attention from many domains. However, public researchers face several problems when developing LLMs, given that most LLMs are produced by industry and their training details are typically undisclosed. Since datasets are an essential component of LLM development, this paper conducts a holistic survey of the training datasets used in both the pre-training and fine-tuning processes. The paper first summarizes 16 pre-training datasets and 16 fine-tuning datasets used in state-of-the-art LLMs. Secondly, based on the properties of the pre-training and fine-tuning processes, it comments on pre-training datasets in terms of quality, quantity, and their relation to models, and on fine-tuning datasets in terms of quality, quantity, and concerns. The study then critically identifies the problems and research trends in current LLM datasets. It helps public researchers train and investigate LLMs through visual cases and provides useful comments to the research community regarding data development. To the best of our knowledge, this paper is the first to summarize and discuss datasets used in both autoregressive and chat LLMs. The survey offers insights and suggestions to researchers and LLM developers as they build their models, and contributes to LLM research by pointing out the existing problems of LLM studies from the perspective of data.
{"title":"A Survey of LLM Datasets: From Autoregressive Model to AI Chatbot","authors":"Fei Du, Xin-Jian Ma, Jing-Ru Yang, Yi Liu, Chao-Ran Luo, Xue-Bin Wang, Hai-Ou Jiang, Xiang Jing","doi":"10.1007/s11390-024-3767-3","DOIUrl":"https://doi.org/10.1007/s11390-024-3767-3","url":null,"abstract":"<p>Since OpenAI opened access to ChatGPT, large language models (LLMs) become an increasingly popular topic attracting researchers’ attention from abundant domains. However, public researchers meet some problems when developing LLMs given that most of the LLMs are produced by industries and the training details are typically unrevealed. Since datasets are an important setup of LLMs, this paper does a holistic survey on the training datasets used in both the pre-train and fine-tune processes. The paper first summarizes 16 pre-train datasets and 16 fine-tune datasets used in the state-of-the-art LLMs. Secondly, based on the properties of the pre-train and fine-tune processes, it comments on pre-train datasets from quality, quantity, and relation with models, and comments on fine-tune datasets from quality, quantity, and concerns. This study then critically figures out the problems and research trends that exist in current LLM datasets. The study helps public researchers train and investigate LLMs by visual cases and provides useful comments to the research community regarding data development. To the best of our knowledge, this paper is the first to summarize and discuss datasets used in both autoregressive and chat LLMs. The survey offers insights and suggestions to researchers and LLM developers as they build their models, and contributes to the LLM study by pointing out the existing problems of LLM studies from the perspective of data.</p>","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"16 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22, DOI: 10.1007/s11390-024-4143-z
Zhong-Zheng Peng, Yi-Xin Yang, Jin-Hui Tang, Jin-Shan Pan
Video colorization aims to add color to grayscale or monochrome videos. Although existing methods have achieved substantial and noteworthy results in image colorization, video colorization presents more formidable obstacles due to the additional requirement of temporal consistency. Moreover, systematic reviews of video colorization methods are rare. In this paper, we review existing state-of-the-art video colorization methods. Because maintaining spatial-temporal consistency is pivotal to video colorization, we further review these methods from that perspective to gain deeper insight into how they have evolved. Video colorization methods fall into four main categories: optical-flow based methods, scribble-based methods, exemplar-based methods, and fully automatic methods. However, optical-flow based methods rely heavily on accurate optical-flow estimation, scribble-based methods require extensive user interaction and modification, exemplar-based methods face challenges in obtaining suitable reference images, and fully automatic methods often struggle to meet specific colorization requirements. We also discuss the existing challenges and highlight several future research opportunities worth exploring.
{"title":"Video Colorization: A Survey","authors":"Zhong-Zheng Peng, Yi-Xin Yang, Jin-Hui Tang, Jin-Shan Pan","doi":"10.1007/s11390-024-4143-z","DOIUrl":"https://doi.org/10.1007/s11390-024-4143-z","url":null,"abstract":"<p>Video colorization aims to add color to grayscale or monochrome videos. Although existing methods have achieved substantial and noteworthy results in the field of image colorization, video colorization presents more formidable obstacles due to the additional necessity for temporal consistency. Moreover, there is rarely a systematic review of video colorization methods. In this paper, we aim to review existing state-of-the-art video colorization methods. In addition, maintaining spatial-temporal consistency is pivotal to the process of video colorization. To gain deeper insight into the evolution of existing methods in terms of spatial-temporal consistency, we further review video colorization methods from a novel perspective. Video colorization methods can be categorized into four main categories: optical-flow based methods, scribble-based methods, exemplar-based methods, and fully automatic methods. However, optical-flow based methods rely heavily on accurate optical-flow estimation, scribble-based methods require extensive user interaction and modifications, exemplar-based methods face challenges in obtaining suitable reference images, and fully automatic methods often struggle to meet specific colorization requirements. We also discuss the existing challenges and highlight several future research opportunities worth exploring.</p>","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"64 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22, DOI: 10.1007/s11390-024-3872-3
Lei Guan, Dong-Sheng Li, Ji-Ye Liang, Wen-Jian Wang, Ke-Shi Ge, Xi-Cheng Lu
Deep learning has become the cornerstone of artificial intelligence, playing an increasingly important role in industry and everyday life. However, as the complexity of the problems being solved increases, deep learning models become increasingly intricate, resulting in a proliferation of large language models with an astonishing number of parameters. Pipeline model parallelism (PMP) has emerged as one of the mainstream approaches to the significant challenge of training such “big models”. This paper presents a comprehensive review of PMP. It covers the basic concepts and main challenges of PMP, comprehensively compares synchronous and asynchronous pipeline schedules for PMP approaches, and discusses the main techniques to achieve load balance for both intra-node and inter-node training. Furthermore, the main techniques to optimize computation, storage, and communication are presented, and potential research directions are discussed.
{"title":"Advances of Pipeline Model Parallelism for Deep Learning Training: An Overview","authors":"Lei Guan, Dong-Sheng Li, Ji-Ye Liang, Wen-Jian Wang, Ke-Shi Ge, Xi-Cheng Lu","doi":"10.1007/s11390-024-3872-3","DOIUrl":"https://doi.org/10.1007/s11390-024-3872-3","url":null,"abstract":"<p>Deep learning has become the cornerstone of artificial intelligence, playing an increasingly important role in human production and lifestyle. However, as the complexity of problem-solving increases, deep learning models become increasingly intricate, resulting in a proliferation of large language models with an astonishing number of parameters. Pipeline model parallelism (PMP) has emerged as one of the mainstream approaches to addressing the significant challenge of training “big models”. This paper presents a comprehensive review of PMP. It covers the basic concepts and main challenges of PMP. It also comprehensively compares synchronous and asynchronous pipeline schedules for PMP approaches, and discusses the main techniques to achieve load balance for both intra-node and inter-node training. Furthermore, the main techniques to optimize computation, storage, and communication are presented, with potential research directions being discussed.</p>","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"61 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22, DOI: 10.1007/s11390-024-2883-4
Fabio Caffaro, Giuseppe Rizzo
For decades, humanity has fantasized about artificial intelligence tools able to converse fluently with human beings. Numerous efforts have been proposed, ranging from ELIZA to modern voice assistants. Despite the large interest in this research and innovation field, there is a lack of common understanding of the concept of conversational agents, and widespread over-expectations hide the current limitations of existing solutions. This work proposes a literature review on the subject with a focus on the most promising type of conversational agents: those powered by knowledge bases, which can supply the ground knowledge to hold conversations autonomously on different topics. We describe a conceptual architecture to define knowledge-enhanced conversational agents and investigate different domains of application. We conclude this work by listing some promising research pathways for future work.
{"title":"Knowledge-Enhanced Conversational Agents","authors":"Fabio Caffaro, Giuseppe Rizzo","doi":"10.1007/s11390-024-2883-4","DOIUrl":"https://doi.org/10.1007/s11390-024-2883-4","url":null,"abstract":"<p>Humanity has fantasized about artificial intelligence tools able to discuss with human beings fluently for decades. Numerous efforts have been proposed ranging from ELIZA to the modern vocal assistants. Despite the large interest in this research and innovation field, there is a lack of common understanding on the concept of conversational agents and general over expectations that hide the current limitations of existing solutions. This work proposes a literature review on the subject with a focus on the most promising type of conversational agents that are powered on top of knowledge bases and that can offer the ground knowledge to hold conversation autonomously on different topics. We describe a\u0000conceptual architecture to define the knowledge-enhanced conversational agents and investigate different domains of applications. We conclude this work by listing some promising research pathways for future work.</p>","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"31 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22, DOI: 10.1007/s11390-024-3814-0
Rui Jiang, Guang-Cong Zheng, Teng Li, Tian-Rui Yang, Jing-Dong Wang, Xi Li
Diffusion models have recently emerged as powerful generative models, producing high-fidelity samples across domains. Despite this, they face two key challenges: speeding up the time-consuming iterative generation process, and controlling and steering that process. Existing surveys provide broad overviews of diffusion model advancements but lack comprehensive coverage centered specifically on techniques for controllable generation. This survey seeks to address this gap by providing a comprehensive and coherent review of controllable generation in diffusion models. We provide a detailed taxonomy defining controllable generation for diffusion models, categorized by formulation, methodologies, and evaluation metrics. By enumerating the range of methods researchers have developed for enhanced control, we aim to establish controllable diffusion generation as a distinct subfield warranting dedicated focus. With this survey, we contextualize recent results, provide a dedicated treatment of controllable diffusion model generation, and outline limitations and future directions. To demonstrate applicability, we highlight controllable diffusion techniques for major computer vision tasks. By consolidating methods and applications for controllable diffusion models, we hope to catalyze further innovations in reliable and scalable controllable generation.
{"title":"A Survey of Multimodal Controllable Diffusion Models","authors":"Rui Jiang, Guang-Cong Zheng, Teng Li, Tian-Rui Yang, Jing-Dong Wang, Xi Li","doi":"10.1007/s11390-024-3814-0","DOIUrl":"https://doi.org/10.1007/s11390-024-3814-0","url":null,"abstract":"<p>Diffusion models have recently emerged as powerful generative models, producing high-fidelity samples across domains. Despite this, they have two key challenges, including improving the time-consuming iterative generation process and controlling and steering the generation process. Existing surveys provide broad overviews of diffusion model advancements. However, they lack comprehensive coverage specifically centered on techniques for controllable generation. This survey seeks to address this gap by providing a comprehensive and coherent review on controllable generation in diffusion models. We provide a detailed taxonomy defining controlled generation for diffusion models. Controllable generation is categorized based on the formulation, methodologies, and evaluation metrics. By enumerating the range of methods researchers have developed for enhanced control, we aim to establish controllable diffusion generation as a distinct subfield warranting dedicated focus. With this survey, we contextualize recent results, provide the dedicated treatment of controllable diffusion model generation, and outline limitations and future directions. To demonstrate applicability, we highlight controllable diffusion techniques for major computer vision tasks application. By consolidating methods and applications for controllable diffusion models, we hope to catalyze further innovations in reliable and scalable controllable generation.</p>","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"13 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22, DOI: 10.1007/s11390-023-2519-0
Yang-Su Liu, Zhen-Zhe Zheng, Fan Wu, Gui-Hai Chen
Large quantities of high-quality data are critical to the success of machine learning in diverse applications. Faced with the dilemma of data silos, where data is difficult to circulate, emerging data markets attempt to break the dilemma by facilitating data exchange on the Internet. Crowdsourcing, on the other hand, is one of the important methods for efficiently collecting large amounts of high-value data in data markets. In this paper, we investigate the joint problem of efficient data acquisition and fair budget distribution across crowdsourcing and data markets. We propose a new metric of data value, defined as the uncertainty reduction of a Bayesian machine learning model when the data is integrated into model training. Guided by this data value metric, we design a mechanism called the Shapley Value Mechanism with Individual Rationality (SV-IR), in which a greedy algorithm with a constant approximation ratio selects the most cost-efficient data brokers, and a fair compensation determination rule based on the Shapley value respects the individual rationality constraints. We further propose a fair reward distribution method for data holders with various effort levels under the charge of a data broker. We demonstrate the fairness of the compensation determination rule and the reward distribution rule by evaluating our mechanisms on two real-world datasets. The evaluation results also show that the selection algorithm in SV-IR approaches the optimal solution and outperforms state-of-the-art methods.
{"title":"When Crowdsourcing Meets Data Markets: A Fair Data Value Metric for Data Trading","authors":"Yang-Su Liu, Zhen-Zhe Zheng, Fan Wu, Gui-Hai Chen","doi":"10.1007/s11390-023-2519-0","DOIUrl":"https://doi.org/10.1007/s11390-023-2519-0","url":null,"abstract":"<p>Large-quantity and high-quality data is critical to the success of machine learning in diverse applications. Faced with the dilemma of data silos where data is difficult to circulate, emerging data markets attempt to break the dilemma by facilitating data exchange on the Internet. Crowdsourcing, on the other hand, is one of the important methods to efficiently collect large amounts of data with high-value in data markets. In this paper, we investigate the joint problem of efficient data acquisition and fair budget distribution across the crowdsourcing and data markets. We propose a new metric of data value as the uncertainty reduction of a Bayesian machine learning model by integrating the data into model training. Guided by this data value metric, we design a mechanism called Shapley Value Mechanism with Individual Rationality (SV-IR), in which we design a greedy algorithm with a constant approximation ratio to greedily select the most cost-efficient data brokers, and a fair compensation determination rule based on the Shapley value, respecting the individual rationality constraints. We further propose a fair reward distribution method for the data holders with various effort levels under the charge of a data broker. We demonstrate the fairness of the compensation determination rule and reward distribution rule by evaluating our mechanisms on two real-world datasets. The evaluation results also show that the selection algorithm in SV-IR could approach the optimal solution, and outperforms state-of-the-art methods.</p>","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"50 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22, DOI: 10.1007/s11390-024-3914-x
Yin Xu, Ming-Jun Xiao, Chen Wu, Jie Wu, Jin-Rui Zhou, He Sun
Federated learning (FL) is an emerging privacy-preserving distributed computing paradigm that enables numerous clients to collaboratively train machine learning models without transmitting their private datasets to the central server. Unlike most existing research, where the local datasets of clients are assumed to remain unchanged throughout the whole FL process, this paper addresses scenarios where clients’ datasets need to be updated periodically and the server can incentivize clients to employ datasets that are as fresh as possible for local model training. Our primary objective is to design a client selection strategy that minimizes the loss of the global FL model within a constrained budget. To this end, we introduce the concept of “Age of Information” (AoI) to quantitatively assess the freshness of local datasets and conduct a theoretical analysis of the convergence bound in our AoI-aware FL system. Based on the convergence bound, we formulate our problem as a restless multi-armed bandit (RMAB) problem. Next, we relax the RMAB problem and apply the Lagrangian Dual approach to decouple it into multiple subproblems. Finally, we propose a Whittle’s Index Based Client Selection (WICS) algorithm to determine the set of selected clients. Comprehensive simulations substantiate that the proposed algorithm effectively reduces training loss and enhances learning accuracy compared with state-of-the-art methods.
{"title":"Age-of-Information-Aware Federated Learning","authors":"Yin Xu, Ming-Jun Xiao, Chen Wu, Jie Wu, Jin-Rui Zhou, He Sun","doi":"10.1007/s11390-024-3914-x","DOIUrl":"https://doi.org/10.1007/s11390-024-3914-x","url":null,"abstract":"<p>Federated learning (FL) is an emerging privacy-preserving distributed computing paradigm, enabling numerous clients to collaboratively train machine learning models without the necessity of transmitting clients’ private datasets to the central server. Unlike most existing research where the local datasets of clients are assumed to be unchanged over time throughout the whole FL process, our study addresses such scenarios in this paper where clients’ datasets need to be updated periodically, and the server can incentivize clients to employ as fresh as possible datasets for local model training. Our primary objective is to design a client selection strategy to minimize the loss of the global model for FL loss within a constrained budget. To this end, we introduce the concept of “Age of Information” (AoI) to quantitatively assess the freshness of local datasets and conduct a theoretical analysis of the convergence bound in our AoI-aware FL system. Based on the convergence bound, we further formulate our problem as a restless multi-armed bandit (RMAB) problem. Next, we relax the RMAB problem and apply the Lagrangian Dual approach to decouple it into multiple subproblems. Finally, we propose a Whittle’s Index Based Client Selection (WICS) algorithm to determine the set of selected clients. In addition, comprehensive simulations substantiate that the proposed algorithm can effectively reduce training loss and enhance the learning accuracy compared with some state-of-the-art methods.</p>","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"29 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22, DOI: 10.1007/s11390-023-3788-3
Wei-Ming Hu, Qiang Wang, Jin Gao, Bing Li, Stephen Maybank
CNN (convolutional neural network) based real-time trackers usually do not carry out online network updates in order to maintain rapid tracking speed. This inevitably limits their adaptability to changes in object appearance. Correlation filter based trackers, in contrast, can update their model parameters online in real time. In this paper, we present an end-to-end lightweight network architecture, namely the Discriminant Correlation Filter Network (DCFNet). A differentiable DCF (discriminant correlation filter) layer is incorporated into a Siamese network architecture in order to learn the convolutional features and the correlation filter simultaneously, and the correlation filter can be efficiently updated online. In previous work, we introduced a joint scale-position space to DCFNet, forming a scale DCFNet that predicts object scale and position simultaneously. We combine the scale DCFNet with a convolutional-deconvolutional network, learning both the high-level embedding space representations and the low-level fine-grained representations of images. The adaptability of the fine-grained correlation analysis and the generalization capability of the semantic embedding are complementary for visual tracking. The back-propagation is derived in the Fourier frequency domain throughout, preserving the efficiency of the DCF. Extensive evaluations on the OTB (Object Tracking Benchmark) and VOT (Visual Object Tracking Challenge) datasets demonstrate that the proposed trackers are fast while maintaining tracking accuracy.
{"title":"DCFNet: Discriminant Correlation Filters Network for Visual Tracking","authors":"Wei-Ming Hu, Qiang Wang, Jin Gao, Bing Li, Stephen Maybank","doi":"10.1007/s11390-023-3788-3","DOIUrl":"https://doi.org/10.1007/s11390-023-3788-3","url":null,"abstract":"<p>CNN (convolutional neural network) based real time trackers usually do not carry out online network update in order to maintain rapid tracking speed. This inevitably influences the adaptability to changes in object appearance. Correlation filter based trackers can update the model parameters online in real time. In this paper, we present an end-to-end lightweight network architecture, namely Discriminant Correlation Filter Network (DCFNet). A differentiable DCF (discriminant correlation filter) layer is incorporated into a Siamese network architecture in order to learn the convolutional features and the correlation filter simultaneously. The correlation filter can be efficiently updated online. In previous work, we introduced a joint scale-position space to the DCFNet, forming a scale DCFNet which carries out the predictions of object scale and position simultaneously. We combine the scale DCFNet with the convolutional-deconvolutional network, learning both the high-level embedding space representations and the low-level fine-grained representations for images. The adaptability of the fine-grained correlation analysis and the generalization capability of the semantic embedding are complementary for visual tracking. The back-propagation is derived in the Fourier frequency domain throughout the entire work, preserving the efficiency of the DCF. Extensive evaluations on the OTB (Object Tracking Benchmark) and VOT (Visual Object Tracking Challenge) datasets demonstrate that the proposed trackers have fast speeds, while maintaining tracking accuracy.</p>","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"2 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22, DOI: 10.1007/s11390-023-2007-6
Xiao-Lu Liu, Hong-Yun Xu, Jia-Ming Chen, Zhou-Xing Su, Zhi-Peng Lyu, Jun-Wen Ding
In a local search algorithm, one of the most important features is the definition of its neighborhood, which is crucial to the algorithm's performance. In this paper, we present an analysis of neighborhood combination search for solving the single-machine scheduling problem with sequence-dependent setup time, with the objective of minimizing total weighted tardiness (SMSWT). First, we propose a new neighborhood structure named Block Swap (B1), which can be considered an extension of the previously widely used Block Move (B2) neighborhood, together with a fast incremental evaluation technique to enhance its evaluation efficiency. Second, based on the Block Swap and Block Move neighborhoods, we present two kinds of combined neighborhood structures: neighborhood union (denoted by B1⋃B2) and token-ring search (denoted by B1 → B2). Third, we incorporate the neighborhood union and token-ring search into two representative metaheuristic algorithms, the Iterated Local Search Algorithm (ILSnew) and the Hybrid Evolutionary Algorithm (HEAnew), to investigate their performance. Extensive experiments show the competitiveness of the token-ring combination of the two neighborhoods. Tested on the 120 public benchmark instances, our HEAnew is highly competitive in both solution quality and computational time compared with exact algorithms and recent metaheuristics. We have also tested HEAnew with the selected neighborhood combination search on the 64 public benchmark instances of the single-machine scheduling problem with sequence-dependent setup time; HEAnew matches the optimal or best-known results for all 64 instances. In particular, the computational time for reaching the best-known results on five challenging instances is reduced by at least 61.25%.
{"title":"Neighborhood Combination Search for Single-Machine Scheduling with Sequence-Dependent Setup Time","authors":"Xiao-Lu Liu, Hong-Yun Xu, Jia-Ming Chen, Zhou-Xing Su, Zhi-Peng Lyu, Jun-Wen Ding","doi":"10.1007/s11390-023-2007-6","DOIUrl":"https://doi.org/10.1007/s11390-023-2007-6","url":null,"abstract":"<p>In a local search algorithm, one of its most important features is the definition of its neighborhood which is crucial to the algorithm’s performance. In this paper, we present an analysis of neighborhood combination search for solving the single-machine scheduling problem with sequence-dependent setup time with the objective of minimizing total weighted tardiness (SMSWT). First, We propose a new neighborhood structure named Block Swap (B1) which can be considered as an extension of the previously widely used Block Move (B2) neighborhood, and a fast incremental evaluation technique to enhance its evaluation efficiency. Second, based on the Block Swap and Block Move neighborhoods, we present two kinds of neighborhood structures: neighborhood union (denoted by B1⋃B2) and token-ring search (denoted by B1 → B2), both of which are combinations of B1 and B2. Third, we incorporate the neighborhood union and token-ring search into two representative metaheuristic algorithms: the Iterated Local Search Algorithm (ILS<sub>new</sub>) and the Hybrid Evolutionary Algorithm (HEA<sub>new</sub>) to investigate the performance of the neighborhood union and token-ring search. Extensive experiments show the competitiveness of the token-ring search combination mechanism of the two neighborhoods. Tested on the 120 public benchmark instances, our HEA<sub>new</sub> has a highly competitive performance in solution quality and computational time compared with both the exact algorithms and recent metaheuristics. We have also tested the HEA<sub>new</sub> algorithm with the selected neighborhood combination search to deal with the 64 public benchmark instances of the single-machine scheduling problem with sequence-dependent setup time. HEA<sub>new</sub> is able to match the optimal or the best known results for all the 64 instances. In particular, the computational time for reaching the best well-known results for five challenging instances is reduced by at least 61.25%.</p>","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"64 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}