IEEE Open Journal of the Computer Society最新文献

英文中文

CTLA: Compressed Table Look up Algorithm for Open Flow Switch CTLA：开式流量开关的压缩查表算法

IEEE Open Journal of the Computer Society

Pub Date : 2024-02-02 DOI: 10.1109/OJCS.2024.3361710

Veeramani Sonai;Indira Bharathi;Muthaiah Uchimucthu;Sountharrajan S;Durga Prasad Bavirisetti

The size of the TCAM memory grows as more entries are added to the flow table of Open Flow switch. The procedure of looking up an IP address involves finding the longest prefix. In order to keep up with the link speed, the IP lookup operation in the forwarding table should also need to be speed up. TCAM's scalability and storage are constrained by its high power consumption and circuit density. The only time- or space-efficient algorithms for improvement are the subject of several research studies. In order to boost performance even further, this study focuses on time and space efficient algorithms. To strike a balance between speedy data access and efficient storage, this study proposes a combination of compression and a quick look-up mechanism to satisfy the space and speed requirements of the Open Flow switch. As the data is compressed, performance improves because less memory is required to store the look-up table and fewer bits are required to search. The look up complexity of proposed approach is

$O(log,(log;n/2))$

and average space reduction is 61%.

随着开放式流量交换机流量表条目的增加，TCAM 内存的大小也会随之增加。查找 IP 地址的过程包括查找最长的前缀。为了跟上链路速度，转发表中的 IP 查找操作也需要加快速度。TCAM 的可扩展性和存储能力受到其高能耗和电路密度的限制。目前仅有的时间或空间高效改进算法是多项研究的主题。为了进一步提高性能，本研究侧重于时间和空间高效算法。为了在快速数据访问和高效存储之间取得平衡，本研究提出了压缩和快速查找机制相结合的方法，以满足开放流交换机对空间和速度的要求。随着数据被压缩，性能也随之提高，因为存储查找表所需的内存更少，搜索所需的位数也更少。拟议方法的查找复杂度为 $O(log,(log;n/2))$，平均空间减少了 61%。

{"title":"CTLA: Compressed Table Look up Algorithm for Open Flow Switch","authors":"Veeramani Sonai;Indira Bharathi;Muthaiah Uchimucthu;Sountharrajan S;Durga Prasad Bavirisetti","doi":"10.1109/OJCS.2024.3361710","DOIUrl":"10.1109/OJCS.2024.3361710","url":null,"abstract":"The size of the TCAM memory grows as more entries are added to the flow table of Open Flow switch. The procedure of looking up an IP address involves finding the longest prefix. In order to keep up with the link speed, the IP lookup operation in the forwarding table should also need to be speed up. TCAM's scalability and storage are constrained by its high power consumption and circuit density. The only time- or space-efficient algorithms for improvement are the subject of several research studies. In order to boost performance even further, this study focuses on time and space efficient algorithms. To strike a balance between speedy data access and efficient storage, this study proposes a combination of compression and a quick look-up mechanism to satisfy the space and speed requirements of the Open Flow switch. As the data is compressed, performance improves because less memory is required to store the look-up table and fewer bits are required to search. The look up complexity of proposed approach is \u0000<inline-formula><tex-math>$O(log,(log;n/2))$</tex-math></inline-formula>\u0000 and average space reduction is 61%.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"5 ","pages":"73-82"},"PeriodicalIF":0.0,"publicationDate":"2024-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10419009","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139952219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Slingshot: Globally Favorable Local Updates for Federated Learning 弹弓：联盟学习的全局有利局部更新

IEEE Open Journal of the Computer Society

Pub Date : 2024-01-22 DOI: 10.1109/OJCS.2024.3356599

Jialiang Liu;Huawei Huang;Chun Wang;Sicong Zhou;Ruixin Li;Zibin Zheng

Federated Learning (FL), as a promising distributed learning paradigm, is proposed to solve the contradiction between the data hunger of modern machine learning and the increasingly stringent need for data privacy. However, clients naturally present different distributions of their local data and inconsistent local optima, which leads to poor model performance of FL. Many previous methods focus on mitigating objective inconsistency. Although local objective consistency can be guaranteed when the number of communication rounds is infinite, we should notice that the accumulation of global drift and the limitation on the potential of local updates are non-negligible in those previous methods. In this article, we study a new framework for data-heterogeneity FL, in which the local updates in clients towards the global optimum can accelerate FL. We propose a new approach called Slingshot. Slingshot's design goals are twofold, i.e., i) to retain the potential of local updates, and ii) to combine local and global trends. Experimental results show that Slingshot helps local updates become more globally favorable and outperforms other popular methods under various FL settings. For example, on CIFAR10, Slingshot achieves 46.52% improvement in test accuracy and 48.21× speedup for a lightweight neural network named SqueezeNet.

联邦学习（Federated Learning，FL）作为一种前景广阔的分布式学习范式，被提出来解决现代机器学习的数据饥渴与日益严格的数据隐私需求之间的矛盾。然而，客户端自然会呈现出不同的本地数据分布和不一致的本地最优值，这就导致 FL 的模型性能不佳。以前的许多方法都侧重于缓解目标不一致性。虽然当通信轮数为无限时，局部目标一致性可以得到保证，但我们应该注意到，在这些方法中，全局漂移的积累和局部更新潜力的限制是不可忽视的。在本文中，我们研究了一种新的数据异构 FL 框架，在这种框架中，客户端向全局最优的局部更新可以加速 FL。我们提出了一种名为 Slingshot 的新方法。Slingshot 的设计目标有两个方面，即 i) 保留局部更新的潜力；ii) 将局部和全局趋势结合起来。实验结果表明，Slingshot 能帮助局部更新变得更有利于全局，在各种 FL 设置下的表现优于其他流行方法。例如，在 CIFAR10 上，Slingshot 使名为 SqueezeNet 的轻量级神经网络的测试准确率提高了 46.52%，速度提高了 48.21 倍。

{"title":"Slingshot: Globally Favorable Local Updates for Federated Learning","authors":"Jialiang Liu;Huawei Huang;Chun Wang;Sicong Zhou;Ruixin Li;Zibin Zheng","doi":"10.1109/OJCS.2024.3356599","DOIUrl":"https://doi.org/10.1109/OJCS.2024.3356599","url":null,"abstract":"Federated Learning (FL), as a promising distributed learning paradigm, is proposed to solve the contradiction between the data hunger of modern machine learning and the increasingly stringent need for data privacy. However, clients naturally present different distributions of their local data and inconsistent local optima, which leads to poor model performance of FL. Many previous methods focus on mitigating objective inconsistency. Although local objective consistency can be guaranteed when the number of communication rounds is infinite, we should notice that the accumulation of global drift and the limitation on the potential of local updates are non-negligible in those previous methods. In this article, we study a new framework for data-heterogeneity FL, in which the local updates in clients towards the global optimum can accelerate FL. We propose a new approach called \u0000<italic>Slingshot</i>\u0000. Slingshot's design goals are twofold, i.e., i) to retain the potential of local updates, and ii) to combine local and global trends. Experimental results show that \u0000<italic>Slingshot</i>\u0000 helps local updates become more globally favorable and outperforms other popular methods under various FL settings. For example, on CIFAR10, \u0000<italic>Slingshot</i>\u0000 achieves 46.52% improvement in test accuracy and 48.21× speedup for a lightweight neural network named \u0000<italic>SqueezeNet</i>\u0000.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"5 ","pages":"39-49"},"PeriodicalIF":0.0,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10411043","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139715219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Low Area and Low Power FPGA Implementation of a DBSCAN-Based RF Modulation Classifier 基于 DBSCAN 的射频调制分类器的低面积、低功耗 FPGA 实现

IEEE Open Journal of the Computer Society

Pub Date : 2024-01-18 DOI: 10.1109/OJCS.2024.3355693

Bill Gavin;Tiantai Deng;Edward Ball

This paper presents a new low-area and low-power Field Programmable Gate Array (FPGA) implementation of a Radio Frequency (RF) modulation classifier based on the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm, known as DBCLASS. The proposed architecture demonstrates a novel approach for the efficient hardware realisation of the DBSCAN algorithm by utilising parallelism, a bespoke sorting algorithm, and eliminating memory access. The design achieves 100% classification accuracy with lab-captured RF data above 8 dB signal-to-noise ratio(SNR) whilst exhibiting an improvement of latency in comparison to the next quickest design by a factor of 7.5, a reduction in terms of total FPGA resources used in comparison to the next smallest complete system by a factor of 3.65, and a reduction in power consumption over the next most efficient by a factor of 4.75. The proposed design is well suited for resource-constrained applications, such as mobile cognitive radios and spectrum monitoring systems.

本文介绍了一种新的低面积、低功耗现场可编程门阵列（FPGA）实现方法，该方法基于基于密度的带噪声应用空间聚类（DBSCAN）算法（即 DBCLASS），是一种射频（RF）调制分类器。所提出的架构通过利用并行性、定制排序算法和消除内存访问，展示了一种高效硬件实现 DBSCAN 算法的新方法。该设计在实验室捕获的信噪比（SNR）超过 8 dB 的射频数据中实现了 100% 的分类准确率，同时与下一个最快的设计相比，延迟时间缩短了 7.5 倍，与下一个最小的完整系统相比，所使用的 FPGA 总资源减少了 3.65 倍，与下一个最高效的设计相比，功耗降低了 4.75 倍。所提出的设计非常适合资源受限的应用，如移动认知无线电和频谱监测系统。

引用次数: 0

IEEE Open Journal of the Computer Society Information for Authors IEEE 计算机学会公开期刊作者须知

IEEE Open Journal of the Computer Society

Pub Date : 2023-12-15 DOI: 10.1109/OJCS.2023.3239731

引用次数: 0

Prediction of Customer Behavior Changing via a Hybrid Approach 通过混合方法预测客户行为变化

IEEE Open Journal of the Computer Society

Pub Date : 2023-11-29 DOI: 10.1109/OJCS.2023.3336904

Nien-Ting Lee;Hau-Chen Lee;Joseph Hsin;Shih-Hau Fang

This study proposes a hybrid approach to predict customer churn by combining statistic approaches and machine learning models. Unlike traditional methods, where churn is defined by a fixed period of time, the proposed algorithm uses the probability of customer alive derived from the statistical model to dynamically determine the churn line. After observing customer churn through clustering over time, the proposed method segmented customers into four behaviors: new, short-term, high-value, and churn, and selected machine learning models to predict the churned customers. This combination reduces the risk to be misjudged as churn for customers with longer consumption cycles. Two public datasets were used to evaluate the hybrid approach, an online retail of U.K. gift sellers and the largest E-Commerce of Pakistan. Based on the top three learning models, the recall ranged from 0.56 to 0.72 in the former while that ranged from 0.91 to 0.95 in the latter. Results show that the proposed approach enables companies to retain important customers earlier by predicting customer churn. The proposed hybrid method requires less data than existing methods.

本研究提出了一种混合方法，通过结合统计方法和机器学习模型来预测客户流失率。与以固定时间段定义客户流失的传统方法不同，所提出的算法利用统计模型得出的客户存活概率来动态确定客户流失线。在通过聚类观察客户流失时间后，所提出的方法将客户细分为四种行为：新客户、短期客户、高价值客户和流失客户，并选择机器学习模型来预测流失客户。这种组合降低了消费周期较长的客户被误判为流失客户的风险。评估混合方法时使用了两个公共数据集，一个是英国礼品销售商的在线零售数据集，另一个是巴基斯坦最大的电子商务数据集。根据前三个学习模型，前者的召回率在 0.56 到 0.72 之间，而后者的召回率在 0.91 到 0.95 之间。结果表明，所提出的方法能让公司通过预测客户流失，更早地留住重要客户。与现有方法相比，拟议的混合方法所需的数据更少。

{"title":"Prediction of Customer Behavior Changing via a Hybrid Approach","authors":"Nien-Ting Lee;Hau-Chen Lee;Joseph Hsin;Shih-Hau Fang","doi":"10.1109/OJCS.2023.3336904","DOIUrl":"https://doi.org/10.1109/OJCS.2023.3336904","url":null,"abstract":"This study proposes a hybrid approach to predict customer churn by combining statistic approaches and machine learning models. Unlike traditional methods, where churn is defined by a fixed period of time, the proposed algorithm uses the probability of customer alive derived from the statistical model to dynamically determine the churn line. After observing customer churn through clustering over time, the proposed method segmented customers into four behaviors: new, short-term, high-value, and churn, and selected machine learning models to predict the churned customers. This combination reduces the risk to be misjudged as churn for customers with longer consumption cycles. Two public datasets were used to evaluate the hybrid approach, an online retail of U.K. gift sellers and the largest E-Commerce of Pakistan. Based on the top three learning models, the recall ranged from 0.56 to 0.72 in the former while that ranged from 0.91 to 0.95 in the latter. Results show that the proposed approach enables companies to retain important customers earlier by predicting customer churn. The proposed hybrid method requires less data than existing methods.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"5 ","pages":"27-38"},"PeriodicalIF":0.0,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10334013","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139060318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Real-Time 3-Dimensional Object Detection Based Human Action Recognition Model 基于三维物体检测的实时人体动作识别模型

IEEE Open Journal of the Computer Society

Pub Date : 2023-11-20 DOI: 10.1109/OJCS.2023.3334528

Chhaya Gupta;Nasib Singh Gill;Preeti Gulia;Sangeeta Yadav;Giovanni Pau;Mohammad Alibakhshikenari;Xiangjie Kong

Computer vision technologies have greatly improved in the last few years. Many problems have been solved using deep learning merged with more computational power. Action recognition is one of society's problems that must be addressed. Human Action Recognition (HAR) may be adopted for intelligent video surveillance systems, and the government may use the same for monitoring crimes and security purposes. This paper proposes a deep learning-based HAR model, i.e., a 3-dimensional Convolutional Network with multiplicative LSTM. The suggested model makes it easier to comprehend the tasks that an individual or team of individuals completes. The four-phase proposed model consists of a 3D Convolutional neural network (3DCNN) combined with an LSTM multiplicative recurrent network and Yolov6 for real-time object detection. The four stages of the proposed model are data fusion, feature extraction, object identification, and skeleton articulation approaches. The NTU-RGB-D, KITTI, NTU-RGB-D 120, UCF 101, and Fused datasets are some used to train the model. The suggested model surpasses other cutting-edge models by reaching an accuracy of 98.23%, 97.65%, 98.76%, 95.45%, and 97.65% on the abovementioned datasets. Other state-of-the-art (SOTA) methods compared in this study are traditional CNN, Yolov6, and CNN with BiLSTM. The results verify that actions are classified more accurately by the proposed model that combines all these techniques compared to existing ones.

在过去几年里，计算机视觉技术有了很大的进步。利用深度学习和更强的计算能力，许多问题都得到了解决。动作识别是必须解决的社会问题之一。人类动作识别（HAR）可用于智能视频监控系统，政府也可将其用于监控犯罪和安全目的。本文提出了一种基于深度学习的 HAR 模型，即带有乘法 LSTM 的三维卷积网络。所建议的模型更容易理解个人或团队完成的任务。所建议的四阶段模型由三维卷积神经网络（3DCNN）与 LSTM 乘法递归网络和用于实时物体检测的 Yolov6 组成。拟议模型的四个阶段分别是数据融合、特征提取、物体识别和骨架衔接方法。训练模型时使用了 NTU-RGB-D、KITTI、NTU-RGB-D 120、UCF 101 和 Fused 数据集。建议的模型在上述数据集上的准确率分别达到 98.23%、97.65%、98.76%、95.45% 和 97.65%，超过了其他先进模型。本研究中比较的其他先进（SOTA）方法包括传统 CNN、Yolov6 和带有 BiLSTM 的 CNN。结果证明，与现有技术相比，结合了所有这些技术的拟议模型能更准确地对动作进行分类。

{"title":"A Real-Time 3-Dimensional Object Detection Based Human Action Recognition Model","authors":"Chhaya Gupta;Nasib Singh Gill;Preeti Gulia;Sangeeta Yadav;Giovanni Pau;Mohammad Alibakhshikenari;Xiangjie Kong","doi":"10.1109/OJCS.2023.3334528","DOIUrl":"https://doi.org/10.1109/OJCS.2023.3334528","url":null,"abstract":"Computer vision technologies have greatly improved in the last few years. Many problems have been solved using deep learning merged with more computational power. Action recognition is one of society's problems that must be addressed. Human Action Recognition (HAR) may be adopted for intelligent video surveillance systems, and the government may use the same for monitoring crimes and security purposes. This paper proposes a deep learning-based HAR model, i.e., a 3-dimensional Convolutional Network with multiplicative LSTM. The suggested model makes it easier to comprehend the tasks that an individual or team of individuals completes. The four-phase proposed model consists of a 3D Convolutional neural network (3DCNN) combined with an LSTM multiplicative recurrent network and Yolov6 for real-time object detection. The four stages of the proposed model are data fusion, feature extraction, object identification, and skeleton articulation approaches. The NTU-RGB-D, KITTI, NTU-RGB-D 120, UCF 101, and Fused datasets are some used to train the model. The suggested model surpasses other cutting-edge models by reaching an accuracy of 98.23%, 97.65%, 98.76%, 95.45%, and 97.65% on the abovementioned datasets. Other state-of-the-art (SOTA) methods compared in this study are traditional CNN, Yolov6, and CNN with BiLSTM. The results verify that actions are classified more accurately by the proposed model that combines all these techniques compared to existing ones.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"5 ","pages":"14-26"},"PeriodicalIF":0.0,"publicationDate":"2023-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10323158","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139060146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Benchmark for Personalized Federated Learning 个性化联合学习的基准

IEEE Open Journal of the Computer Society

Pub Date : 2023-11-13 DOI: 10.1109/OJCS.2023.3332351

Koji Matsuda;Yuya Sasaki;Chuan Xiao;Makoto Onizuka

Federated learning is a distributed machine learning approach that allows a single server to collaboratively build machine learning models with multiple clients without sharing datasets. Since data distributions may differ across clients, data heterogeneity is a challenging issue in federated learning. To address this issue, numerous federated learning methods have been proposed to build personalized models for clients, referred to as personalized federated learning. Nevertheless, no studies comprehensively investigate the performance of personalized federated learning methods in various experimental settings such as datasets and client settings. Therefore, in this article, we aim to benchmark the performance of existing personalized federated learning methods in various settings. We first survey the experimental settings in existing studies. We then benchmark the performance of existing methods through comprehensive experiments to reveal their characteristics in computer vision and natural language processing tasks which are the most popular tasks based on our survey. Our experimental study shows that (i) large data heterogeneity often leads to highly accurate predictions and (ii) standard federated learning methods (e.g. FedAvg) with fine-tuning often outperform personalized federated learning methods.

联盟学习是一种分布式机器学习方法，它允许单个服务器与多个客户端协作构建机器学习模型，而无需共享数据集。由于各客户端的数据分布可能不同，因此数据异构是联合学习中的一个挑战性问题。为了解决这个问题，人们提出了许多联合学习方法，为客户建立个性化模型，称为个性化联合学习。然而，目前还没有研究全面考察个性化联合学习方法在数据集和客户端设置等各种实验环境下的性能。因此，本文旨在对现有个性化联合学习方法在各种环境下的性能进行基准测试。我们首先调查了现有研究中的实验设置。然后，我们通过综合实验对现有方法的性能进行基准测试，以揭示这些方法在计算机视觉和自然语言处理任务中的特性。我们的实验研究表明：(i) 大量数据的异质性往往会带来高精度的预测；(ii) 微调后的标准联合学习方法（如 FedAvg）往往优于个性化联合学习方法。

{"title":"Benchmark for Personalized Federated Learning","authors":"Koji Matsuda;Yuya Sasaki;Chuan Xiao;Makoto Onizuka","doi":"10.1109/OJCS.2023.3332351","DOIUrl":"10.1109/OJCS.2023.3332351","url":null,"abstract":"Federated learning is a distributed machine learning approach that allows a single server to collaboratively build machine learning models with multiple clients without sharing datasets. Since data distributions may differ across clients, data heterogeneity is a challenging issue in federated learning. To address this issue, numerous federated learning methods have been proposed to build personalized models for clients, referred to as personalized federated learning. Nevertheless, no studies comprehensively investigate the performance of personalized federated learning methods in various experimental settings such as datasets and client settings. Therefore, in this article, we aim to benchmark the performance of existing personalized federated learning methods in various settings. We first survey the experimental settings in existing studies. We then benchmark the performance of existing methods through comprehensive experiments to reveal their characteristics in computer vision and natural language processing tasks which are the most popular tasks based on our survey. Our experimental study shows that (i) large data heterogeneity often leads to highly accurate predictions and (ii) standard federated learning methods (e.g. FedAvg) with fine-tuning often outperform personalized federated learning methods.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"5 ","pages":"2-13"},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10316561","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135612465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Data Extraction and Question Answering on Chart Images Towards Accessibility and Data Interpretation 面向可及性和数据解释的图表图像数据提取与问答

IEEE Open Journal of the Computer Society

Pub Date : 2023-10-31 DOI: 10.1109/OJCS.2023.3328767

Shahira K C;Pulkit Joshi;Lijiya A

Graphical representations such as chart images are integral to web pages and documents. Automating data extraction from charts is possible by reverse-engineering the visualization pipeline. This study proposes a framework that automates data extraction from bar charts and integrates it with question-answering. The framework employs an object detector to recognize visual cues in the image, followed by text recognition. Mask-RCNN for plot element detection achieves a mean average precision of 95.04% at a threshold of 0.5 which decreases as the Intersection over Union (IoU) threshold increases. A contour approximation-based approach is proposed for extracting the bar coordinates, even at a higher IoU of 0.9. The textual and visual cues are associated with the legend text and preview, and the chart data is finally extracted in tabular format. We introduce an extension to the TAPAS model, called TAPAS++, by incorporating new operations and table question answering is done using TAPAS++ model. The chart summary or description is also produced in an audio format. In the future, this approach could be expanded to enable interactive question answering on charts by accepting audio inquiries from individuals with visual impairments and do more complex reasoning using Large Language Models.

图表图像等图形表示是网页和文档的组成部分。通过对可视化管道进行逆向工程，可以从图表中自动提取数据。本研究提出了一个从柱状图中自动提取数据并将其与问答集成的框架。该框架采用对象检测器来识别图像中的视觉线索，然后进行文本识别。Mask-RCNN用于地块元素检测的平均精度在阈值为0.5时达到95.04%，该阈值随着IoU阈值的增加而降低。提出了一种基于轮廓近似的方法来提取柱坐标，即使在较高的IoU为0.9时也是如此。文本和视觉提示与图例文本和预览相关联，最后以表格格式提取图表数据。我们引入了对TAPAS模型的扩展，称为TAPAS++，通过合并新的操作和使用TAPAS++模型完成的表问答。图表摘要或描述也以音频格式制作。在未来，这种方法可以扩展到通过接受视障人士的音频询问来实现图表上的交互式问答，并使用大型语言模型进行更复杂的推理。

{"title":"Data Extraction and Question Answering on Chart Images Towards Accessibility and Data Interpretation","authors":"Shahira K C;Pulkit Joshi;Lijiya A","doi":"10.1109/OJCS.2023.3328767","DOIUrl":"10.1109/OJCS.2023.3328767","url":null,"abstract":"Graphical representations such as chart images are integral to web pages and documents. Automating data extraction from charts is possible by reverse-engineering the visualization pipeline. This study proposes a framework that automates data extraction from bar charts and integrates it with question-answering. The framework employs an object detector to recognize visual cues in the image, followed by text recognition. Mask-RCNN for plot element detection achieves a mean average precision of 95.04% at a threshold of 0.5 which decreases as the Intersection over Union (IoU) threshold increases. A contour approximation-based approach is proposed for extracting the bar coordinates, even at a higher IoU of 0.9. The textual and visual cues are associated with the legend text and preview, and the chart data is finally extracted in tabular format. We introduce an extension to the TAPAS model, called TAPAS++, by incorporating new operations and table question answering is done using TAPAS++ model. The chart summary or description is also produced in an audio format. In the future, this approach could be expanded to enable interactive question answering on charts by accepting audio inquiries from individuals with visual impairments and do more complex reasoning using Large Language Models.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"4 ","pages":"314-325"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10302417","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135263025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Finding the Truth From Uncertain Time Series by Differencing 用差分法从不确定时间序列中求真

IEEE Open Journal of the Computer Society

Pub Date : 2023-10-19 DOI: 10.1109/OJCS.2023.3326150

Jizhou Sun;Delin Zhou;Bo Jiang

Time series data is ubiquitous and of great importance in real applications. But due to poor qualities and bad working conditions of sensors, time series reported by them contain more or less noises. To reduce noise, multiple sensors are usually deployed to measure an identical time series and from these observations the truth can be estimated, which derives the problem of truth discovery for uncertain time series data. Several algorithms have been proposed, but they mainly focus on minimizing the error between the estimated truth and the observations. In our study, we aim at minimizing the noise in the estimated truth. To solve this optimization problem, we first find out the level of noise produced by each sensor based on differenced time series, which can help estimating the truth wisely. Then, we propose a quadratic optimization model to minimize the noise of the estimated truth. Further, a post process is introduced to refine the result by iteration. Experimental results on both real world and synthetic data sets verify the effectiveness and efficiency of our proposed methods, respectively.

时间序列数据无处不在，在实际应用中具有重要意义。但由于传感器本身质量差，工作条件差，其上报的时间序列或多或少都含有噪声。为了降低噪声，通常部署多个传感器来测量同一时间序列，并从这些观测值中估计真值，这就产生了不确定时间序列数据的真值发现问题。已经提出了几种算法，但它们主要集中在最小化估计真值与观测值之间的误差。在我们的研究中，我们的目标是最小化估计真值中的噪声。为了解决这一优化问题，我们首先根据差分时间序列找出每个传感器产生的噪声水平，这有助于明智地估计真实值。然后，我们提出了一个二次优化模型来最小化估计真值的噪声。此外，还引入了一个后置过程，通过迭代来改进结果。在真实世界和合成数据集上的实验结果分别验证了我们提出的方法的有效性和效率。

引用次数: 0

Towards Reliable Utilization of AIGC: Blockchain-Empowered Ownership Verification Mechanism 走向AIGC的可靠利用:区块链授权的所有权验证机制

IEEE Open Journal of the Computer Society

Pub Date : 2023-09-18 DOI: 10.1109/OJCS.2023.3315835

Chuan Chen;Yihao Li;Zihou Wu;Mingfeng Xu;Rui Wang;Zibin Zheng

With the development of the blockchain technology, a decentralized and de-trusted network paradigm has been constructed, enabling multiple digital assets like NFT, to be permanently recorded and authenticated by blockchain. Also, the uniqueness and verifiability of these assets allows them to flow and generate value between any network entities. With the emergence of AI Generative Content (AIGC), the ownership of models and generative contents, which are also digital assets, has not been well protected. Both because the black-box nature of neural networks makes it difficult to mark models' ownership and because the lack of a reliable third-party verification platform. Meanwhile, the existing model-attack threat and raising ethical problems driven the research on model watermark embedding for traceability and verification, and thus the reliable basic algorithm and the verification platform are needed. In this survey, while emphasizing the importance and reason of the ownership protection in AIGC and summarizing the recent research using model watermarking, we will also introduce the achievements of blockchain in copyright in order to summarize the research history and point out future direction of model copyright validation from both the underlying technology and the supporting platform.

随着区块链技术的发展，构建了一个去中心化、去信任的网络范式，使NFT等多种数字资产能够通过区块链永久记录和认证。此外，这些资产的唯一性和可验证性允许它们在任何网络实体之间流动并产生价值。随着人工智能生成内容(AIGC)的出现，同样是数字资产的模型和生成内容的所有权并没有得到很好的保护。一方面是因为神经网络的黑盒特性使得很难标记模型的所有权，另一方面是因为缺乏可靠的第三方验证平台。同时，模型攻击威胁的存在和伦理问题的日益突出，推动了模型水印嵌入的可追溯性和可验证性研究，需要可靠的基础算法和验证平台。在本次调查中，我们在强调AIGC中所有权保护的重要性和原因的同时，总结了最近使用模型水印的研究，并介绍了区块链在版权方面的成就，以便从底层技术和支撑平台两方面总结研究历史，指出模型版权验证的未来方向。

{"title":"Towards Reliable Utilization of AIGC: Blockchain-Empowered Ownership Verification Mechanism","authors":"Chuan Chen;Yihao Li;Zihou Wu;Mingfeng Xu;Rui Wang;Zibin Zheng","doi":"10.1109/OJCS.2023.3315835","DOIUrl":"10.1109/OJCS.2023.3315835","url":null,"abstract":"With the development of the blockchain technology, a decentralized and de-trusted network paradigm has been constructed, enabling multiple digital assets like NFT, to be permanently recorded and authenticated by blockchain. Also, the uniqueness and verifiability of these assets allows them to flow and generate value between any network entities. With the emergence of AI Generative Content (AIGC), the ownership of models and generative contents, which are also digital assets, has not been well protected. Both because the black-box nature of neural networks makes it difficult to mark models' ownership and because the lack of a reliable third-party verification platform. Meanwhile, the existing model-attack threat and raising ethical problems driven the research on model watermark embedding for traceability and verification, and thus the reliable basic algorithm and the verification platform are needed. In this survey, while emphasizing the importance and reason of the ownership protection in AIGC and summarizing the recent research using model watermarking, we will also introduce the achievements of blockchain in copyright in order to summarize the research history and point out future direction of model copyright validation from both the underlying technology and the supporting platform.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"4 ","pages":"326-337"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10254223","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135502429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

IEEE Open Journal of the Computer Society

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀