Pub Date: 2023-05-19 | DOI: https://dl.acm.org/doi/10.1145/3594538
Li Yang, Xi Li, Zhuoru Ma, Lu Li, Neal Xiong, Jianfeng Ma
Gait authentication, a technique that can continuously provide identity recognition on mobile devices, has been investigated by the security community for decades. However, most existing work generalizes poorly to complex real-world environments because real-world gait data is noisy. To address this limitation, we propose an intelligent Implicit Real-time Gait Authentication (IRGA) system based on Deep Neural Networks (DNNs) to enhance the adaptability of gait authentication in practice. In the proposed system, gait data (even data containing complex interference signals) is first processed sequentially by an imperceptible collection module and a data preprocessing module to improve data quality. To illustrate and verify the suitability of our proposal, we analyze the impact of individual gait changes on the data feature distribution. Finally, a fusion neural network composed of a Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) is designed to perform feature extraction and user authentication. We evaluate the proposed IRGA system in heterogeneous complex scenarios and present state-of-the-art comparisons on three datasets. Extensive experiments demonstrate that the IRGA system simultaneously improves performance on several different metrics.
Title: IRGA: An Intelligent Implicit Real-time Gait Authentication System in Heterogeneous Complex Scenarios (ACM Transactions on Internet Technology)
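The data preprocessing stage described above can be sketched as sliding-window segmentation with per-window normalization. The window length, step, and normalization choice are illustrative assumptions (the abstract does not specify them), and `preprocess_gait` is a hypothetical name:

```python
import numpy as np

def preprocess_gait(stream, window=128, step=64):
    """Slice a 1-D accelerometer stream into overlapping windows and
    z-score normalize each one (window/step/normalization are
    illustrative; the abstract does not specify them)."""
    windows = []
    for start in range(0, len(stream) - window + 1, step):
        w = stream[start:start + window]
        w = (w - w.mean()) / (w.std() + 1e-8)   # per-window normalization
        windows.append(w)
    return np.stack(windows)

# 10 s of synthetic 50 Hz accelerometer data standing in for a gait signal.
signal = np.sin(np.linspace(0, 20 * np.pi, 500))
batch = preprocess_gait(signal)
print(batch.shape)  # (6, 128)
```

Each row of `batch` would then be fed to the CNN+LSTM fusion network as one training or authentication sample.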
Pub Date: 2023-05-19 | DOI: https://dl.acm.org/doi/10.1145/3593585
Luca Muscariello, Michele Papalini, Olivier Roques, Mauro Sardara, Arthur Tran Van
In this article, we consider security aspects of online meeting applications based on protocols such as WebRTC that leverage the Information-centric Networking (ICN) architecture to make the system fundamentally more scalable. While the scalability properties provided by ICN have been proven in recent literature, the security challenges and implications for real-time applications have not been reviewed. We show that this class of applications can benefit jointly from strong security and scalability without any major tradeoff, and with significant performance improvements over traditional WebRTC systems. To achieve this goal, the way integrity and authentication are verified in the current ICN architecture must be modified. Extensive performance analysis based on the open source implementation of Hybrid-ICN shows that real-time applications can greatly benefit from this novel network architecture in terms of strong security and scalable communications.
Title: Securing Scalable Real-time Multiparty Communications with Hybrid Information-centric Networking
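The core ICN idea of integrity-verifiable named content can be sketched with a self-certifying name that embeds the payload digest, so any node can check integrity without contacting the producer. This is illustrative only; Hybrid-ICN's actual naming, signing, and wire format differ:

```python
import hashlib
import hmac

def make_name(prefix, payload):
    """Build a self-certifying ICN-style name that embeds the SHA-256
    digest of the content, so any on-path node can check integrity
    (illustrative; Hybrid-ICN's real naming and signing differ)."""
    digest = hashlib.sha256(payload).hexdigest()
    return f"{prefix}/{digest}"

def verify(name, payload):
    """Recompute the digest and compare it to the one carried in the name."""
    claimed = name.rsplit("/", 1)[1]
    actual = hashlib.sha256(payload).hexdigest()
    return hmac.compare_digest(claimed, actual)

name = make_name("/meeting/42/audio", b"frame-bytes")
print(verify(name, b"frame-bytes"))      # True
print(verify(name, b"tampered-bytes"))   # False
```

For real-time media, per-packet digest checks like this trade signature cost for hashing cost, which is one reason verification must be adapted rather than reused as-is.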
Pub Date: 2023-05-19 | DOI: https://dl.acm.org/doi/10.1145/3586010
Pedro Victor Borges, Chantal Taconet, Sophie Chabridon, Denis Conan, Everton Cavalcante, Thais Batista
In recent years, Internet of Things (IoT) platforms have been designed to provide IoT applications with services such as device discovery, context management, and data filtering. The lack of standardization has led each IoT platform to propose its own abstractions, APIs, and data models. As a consequence, programming interactions between an IoT consuming application and an IoT platform is time-consuming and error prone, and depends on the developers' knowledge of the IoT platform. To address these issues, this article introduces IoTvar, a middleware library deployed in the IoT consumer application that manages all of its interactions with IoT platforms. IoTvar relies on declared variables that are automatically mapped to sensors and whose values are transparently updated with sensor observations through client-side proxies. This article presents the IoTvar architecture and shows how it has been integrated into the FIWARE, OM2M, and muDEBS platforms. We also report the results of experiments performed to evaluate IoTvar, showing that it reduces the effort required to declare and manage IoT variables with no significant impact on CPU, memory, and energy consumption.
Title: Taming Internet of Things Application Development with the IoTvar Middleware
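The idea of variables transparently refreshed through a client-side proxy can be sketched as follows. This is a toy Python stand-in, not the IoTvar API: the class name, the fetch callable standing in for a platform query, and the time-based refresh policy are all assumptions:

```python
import time

class IoTVariable:
    """Client-side proxy in the spirit of IoTvar: reading .value
    transparently refreshes the cached observation (sketch; the real
    middleware talks to platforms such as FIWARE and OM2M)."""
    def __init__(self, fetch, period=1.0):
        self._fetch = fetch        # callable standing in for a platform query
        self._period = period      # minimum seconds between refreshes
        self._cached = None
        self._stamp = 0.0

    @property
    def value(self):
        now = time.monotonic()
        if now - self._stamp >= self._period:   # refresh transparently
            self._cached = self._fetch()
            self._stamp = now
        return self._cached

# Simulated sensor observations arriving over time.
readings = iter([21.5, 21.7, 22.0])
temperature = IoTVariable(lambda: next(readings), period=0.0)
print(temperature.value)  # 21.5
```

The application code only ever reads `temperature.value`; all platform interaction is hidden behind the proxy, which is the effort reduction the article measures.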
Pub Date: 2023-05-19 | DOI: https://dl.acm.org/doi/10.1145/3583687
Jing Chen, Wenjun Jiang, Jie Wu, Kenli Li, Keqin Li
Point of Interest (POI) sequence recommendation is a key task in itinerary and travel route planning. Existing work usually considers temporal and spatial factors in travel planning, but the external environment, such as the weather, is usually overlooked. Weather is an important factor because it can affect a user's check-in behavior. Furthermore, most existing research assumes a static environment for POI sequence recommendation; when the external environment (e.g., the weather) changes during travel, existing methods struggle to adjust the POI sequence in time. In addition, people usually prefer attractive routes when traveling. To address these issues, we first conduct a comprehensive analysis of two real-world check-in datasets to study the effects of weather and time, as well as the features of POI sequences. Based on this, we propose a model of Dynamic Personalized POI Sequence Recommendation with fine-grained contexts (DPSR for short). It extracts user interest and POI popularity with fine-grained contexts and captures the attractiveness of a POI sequence. Next, we apply Monte Carlo Tree Search (MCTS) to simulate the process of recommending POI sequences in a dynamic environment, i.e., one where the weather and time change after each POI visit. We also consider different travel speeds, reflecting the fact that people may use different modes of transportation between POIs. To validate the efficacy of DPSR, we conduct extensive experiments. The results show that our model significantly improves recommendation accuracy, better meets user preferences, and enhances user experience.
Title: Dynamic Personalized POI Sequence Recommendation with Fine-Grained Contexts
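The Monte Carlo planning idea behind recommending in a dynamic environment can be illustrated with a toy simulator in which POI attractiveness depends on weather that may change after each visit. The POIs, scores, and rollout policy are invented for illustration, and full MCTS is simplified here to flat Monte Carlo rollouts:

```python
import random

# Toy attractiveness scores: a POI's appeal depends on the weather
# (all POIs and values are invented for illustration).
SCORE = {"museum": {"rain": 0.9, "sun": 0.5},
         "park":   {"rain": 0.1, "sun": 0.9},
         "tower":  {"rain": 0.3, "sun": 0.8}}

def rollout(seq, remaining, weather, rng):
    """Score a partial itinerary, then extend it with random POIs while
    the weather evolves randomly, mimicking a dynamic environment."""
    total = sum(SCORE[p][weather] for p in seq)
    pool = [p for p in SCORE if p not in seq]
    rng.shuffle(pool)
    for p in pool[:remaining]:
        weather = rng.choice(["rain", "sun"])   # environment changes mid-trip
        total += SCORE[p][weather]
    return total

def choose_next(seq, remaining, weather, n_sim=200, seed=0):
    """Pick the next POI by average rollout value (flat Monte Carlo;
    full MCTS would additionally grow and reuse a search tree)."""
    rng = random.Random(seed)
    best, best_val = None, float("-inf")
    for cand in (p for p in SCORE if p not in seq):
        val = sum(rollout(seq + [cand], remaining - 1, weather, rng)
                  for _ in range(n_sim)) / n_sim
        if val > best_val:
            best, best_val = cand, val
    return best

# In the rain, the indoor museum dominates the outdoor options.
print(choose_next([], remaining=2, weather="rain"))  # museum
```

Re-running `choose_next` after each visit, with the weather observed at that moment, is what lets the plan adapt mid-trip instead of being fixed up front.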
When making decisions, individuals often express their preferences linguistically. The computing-with-words methodology is a key basis for supporting linguistic decision making, and the words in that methodology may mean different things to different individuals. Thus, in this article, we propose a continual personalized individual semantics learning model to support a consensus-reaching process in large-scale linguistic group decision making. Specifically, we first derive personalized numerical scales from the data of linguistic preference relations. We then apply a clustering ensemble method to divide the large-scale group and conduct consensus management. Finally, we present a case study of intelligent route optimization in shared mobility to illustrate the usability of the proposed model, and we demonstrate its effectiveness and feasibility through a comparative analysis.
Title: Personalized Individual Semantics Learning to Support a Large-Scale Linguistic Consensus Process. Authors: Yucheng Dong, Qin Ran, Xiangrui Chao, Congcong Li, Shui Yu. DOI: https://dl.acm.org/doi/10.1145/3533432
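A toy illustration of personalized numerical scales and a simple consensus measure. The scale values and the deviation-based measure are assumptions; the paper's model learns the scales from linguistic preference relations rather than taking them as given:

```python
# Each decision maker maps the same linguistic terms onto a personal
# numerical scale in [0, 1] (values are illustrative; the paper learns
# them from linguistic preference relations).
scales = {
    "dm1": {"low": 0.2, "medium": 0.5, "high": 0.9},
    "dm2": {"low": 0.1, "medium": 0.6, "high": 0.8},
    "dm3": {"low": 0.3, "medium": 0.4, "high": 1.0},
}

def numeric_opinion(dm, word):
    """Translate one individual's linguistic judgement into their number."""
    return scales[dm][word]

def consensus_level(opinions):
    """1 minus the mean absolute deviation from the group mean: a simple
    stand-in for a consensus measure."""
    mean = sum(opinions) / len(opinions)
    return 1 - sum(abs(o - mean) for o in opinions) / len(opinions)

votes = {"dm1": "high", "dm2": "high", "dm3": "medium"}
ops = [numeric_opinion(dm, w) for dm, w in votes.items()]
print(round(consensus_level(ops), 3))  # 0.8
```

Note that two individuals saying "high" contribute different numbers (0.9 vs. 0.8), which is exactly the personalized-semantics effect the consensus process must accommodate.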
Pub Date: 2023-05-19 | DOI: https://dl.acm.org/doi/10.1145/3589342
Arvind Kumar Gangwar, Sandeep Kumar
Software Defect Prediction (SDP) is crucial to software quality assurance in software engineering. SDP analyzes software metrics data to predict defect-prone software modules in a timely manner. The prediction process is automated by constructing defect prediction classification models with machine learning techniques. These models are trained on metrics data from historical projects of similar types and, based on the learned experience, are used to predict defect-prone modules in the software currently under test. Such models perform well as long as the underlying concept is stationary, but in a dynamic software development environment their performance can degrade unexpectedly when the concept changes (concept drift). Therefore, concept drift (CD) detection is important for maintaining the overall accuracy of a prediction model. Previous SDP studies have shown that CD may occur in software defect data and that the defect prediction model in use may need to be updated to deal with it; handling CD in this way is known as CD adaptation. More work is still needed in this direction in the SDP domain. In this article, we propose a pair of paired learners (PoPL) approach for handling CD in SDP. We combine the drift detection capabilities of two independent paired learners and use the paired learner (PL) with the best recent performance for the next prediction. We experimented on various publicly available software defect datasets gathered from public data repositories. The results show that our approach outperforms existing similar works and the base PL model on various performance measures.
Title: Concept Drift in Software Defect Prediction: A Method for Detecting and Handling the Drift
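The paired-learner idea that PoPL builds on pairs a stable model trained on all history with a reactive model trained on a recent window, and suspects drift when the reactive model repeatedly beats the stable one. It can be sketched with majority-class "learners"; the window size, threshold, and adaptation step are illustrative, not the paper's configuration:

```python
from collections import Counter, deque

class PairedLearner:
    """A stable majority-class model over all history paired with a
    reactive one over a sliding window; drift is declared when the
    reactive model repeatedly beats the stable one (window and
    threshold are illustrative)."""
    def __init__(self, window=20, threshold=5):
        self.history = Counter()                # stable model's data
        self.recent = deque(maxlen=window)      # reactive model's data
        self.mismatches = 0
        self.threshold = threshold

    def _majority(self, counts):
        return counts.most_common(1)[0][0] if counts else None

    def update(self, label):
        stable = self._majority(self.history)
        reactive = self._majority(Counter(self.recent))
        # Reactive right where stable is wrong -> evidence of drift.
        if reactive == label and stable not in (None, label):
            self.mismatches += 1
        self.history[label] += 1
        self.recent.append(label)
        if self.mismatches >= self.threshold:
            self.mismatches = 0
            self.history = Counter(self.recent)  # adapt the stable model
            return True
        return False

pl = PairedLearner()
stream = ["clean"] * 40 + ["defect"] * 20       # the concept flips at index 40
drifts = [i for i, y in enumerate(stream) if pl.update(y)]
print(drifts)
```

PoPL runs two such pairs independently and routes predictions to whichever paired learner has performed best recently; the sketch above shows only the drift-detection half of one pair.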
Pub Date: 2023-05-19 | DOI: https://dl.acm.org/doi/10.1145/3589765
Hucheng Wang, Zhi Wang, Lei Zhang, Xiaonan Luo, Xinheng Wang
Fusion positioning technology requires stable and effective positioning data, which is often challenging to obtain in complex Non-Line-of-Sight (NLoS) environments. This paper proposes a fusion positioning method that remains stable and free of hop points by adjusting parameters and predicting motion trends, even when fusion data is missing on one side. The method combines acoustic signals and Inertial Measurement Unit (IMU) data, exploiting their respective advantages. Fusion is achieved with a Kalman filter, and Bayesian parameter estimation is used to tune the IMU parameters and predict motion trends. The proposed method overcomes the fusion failures caused by long-term unilateral data loss in traditional fusion positioning. Analysis of positioning trajectories and error distributions shows that the proposed method performs best in severe NLoS experiments.
Title: A Highly Stable Fusion Positioning System of Smartphone under NLoS Acoustic Indoor Environment
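The Kalman-filter fusion of acoustic fixes with IMU data can be sketched in one dimension. The noise parameters and the IMU-driven prediction model are assumptions, and a `None` acoustic sample stands in for NLoS loss, during which the filter coasts on the IMU prediction instead of producing hop points:

```python
import numpy as np

def kalman_1d(acoustic, imu_accel, dt=0.1, q=0.05, r=0.5):
    """Minimal 1-D fusion sketch: IMU acceleration drives the prediction,
    acoustic ranging provides the correction (q, r are illustrative, not
    from the paper). A None acoustic sample models NLoS loss: the filter
    predicts only, so the trajectory coasts smoothly instead of hopping."""
    x = np.array([0.0, 0.0])                 # state: [position, velocity]
    P = np.eye(2)
    F = np.array([[1.0, dt], [0.0, 1.0]])
    H = np.array([[1.0, 0.0]])
    est = []
    for z, a in zip(acoustic, imu_accel):
        # Predict with IMU acceleration as the control input.
        x = F @ x + np.array([0.5 * dt**2, dt]) * a
        P = F @ P @ F.T + q * np.eye(2)
        if z is not None:                    # acoustic fix available (LoS)
            S = H @ P @ H.T + r
            K = P @ H.T / S
            x = x + (K * (z - H @ x)).ravel()
            P = (np.eye(2) - K @ H) @ P
        est.append(x[0])
    return est

# Walk at 1 m/s; acoustic fixes drop out mid-run (simulated NLoS).
truth = [0.1 * (i + 1) for i in range(20)]
acoustic = [t if not 5 <= i < 15 else None for i, t in enumerate(truth)]
est = kalman_1d(acoustic, imu_accel=[0.0] * 20)
print(round(est[-1], 2))
```

Once the acoustic channel recovers, the grown covariance makes the filter weight the fresh fixes heavily, so the estimate re-converges to the true position without a discontinuity.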
Pub Date: 2023-05-18 | DOI: https://dl.acm.org/doi/10.1145/3543854
Yi-Bing Lin, Yuan-Fu Liao, Sin-Horng Chen, Shaw-Hwa Hwang, Yih-Ru Wang
The voice-based Internet of Multimedia Things (IoMT) combines IoT interfaces and protocols with associated voice-related information, enabling advanced applications based on human-to-device interactions. An example is Automatic Speech Recognition (ASR) for live captioning and voice translation. Three major issues of ASR for IoMT are IoT development cost, speech recognition accuracy, and execution time complexity. Regarding the first issue, most non-voice IoT applications are upgraded with the ASR feature through hard coding, which is error prone. Regarding the second, recognition accuracy must be improved for ASR. Regarding the third, many multimedia IoT services are real-time applications, and therefore the ASR delay must be short.
This article elaborates on the above issues based on an IoT platform called VoiceTalk. We built the largest Taiwanese spoken corpus to train VoiceTalk ASR (VT-ASR) and show how the VT-ASR mechanism can be transparently integrated with existing IoT applications. We consider two performance measures for VoiceTalk: speech recognition accuracy and VT-ASR delay. In the acoustic tests of PAL-Labs, VT-ASR's accuracy is 96.47%, while Google's accuracy is 94.28%. We are the first to develop an analytic model of the probability that VT-ASR for the first speaker completes before the second speaker starts talking. The measurements and analytic modeling show that the VT-ASR delay is short enough to deliver a very good user experience. Our solution has won several important government and commercial TV contracts in Taiwan. VT-ASR demonstrated better Taiwanese Mandarin speech recognition accuracy than well-known commercial products (including Google and Iflytek) in the Formosa Speech Recognition Challenge 2018 (FSR-2018) and achieved the best Taiwanese recognition accuracy among all participating ASR systems in FSR-2020.
Title: VoiceTalk: Multimedia-IoT Applications for Mixing Mandarin, Taiwanese, and English
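The delay analysis can be illustrated with a small probabilistic model. Assuming, purely for illustration (the paper's actual analytic model is not reproduced in the abstract), that the ASR delay and the silence gap before the next speaker are both exponentially distributed, P(delay < gap) has a closed form that a Monte Carlo simulation reproduces:

```python
import random

def p_done_in_time(mean_delay, mean_gap, n=100_000, seed=1):
    """Monte-Carlo estimate of P(ASR delay < gap before the next
    speaker), assuming both are exponentially distributed (an
    illustrative assumption, not the paper's model)."""
    rng = random.Random(seed)
    hits = sum(rng.expovariate(1 / mean_delay) < rng.expovariate(1 / mean_gap)
               for _ in range(n))
    return hits / n

# Closed form for two exponentials: P = mean_gap / (mean_gap + mean_delay).
analytic = 2.0 / (2.0 + 0.5)          # 0.5 s mean delay vs 2 s mean gap
simulated = p_done_in_time(0.5, 2.0)
print(analytic, round(simulated, 2))
```

Under these assumed numbers, captions for the first speaker would land before the second speaker starts 80% of the time, which shows why shrinking the mean ASR delay directly improves the user experience.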
Pub Date: 2023-05-18 | DOI: https://dl.acm.org/doi/10.1145/3584020
Yibin Xu, Jianhua Shao, Tijs Slaats, Boris Düdder
Blockchain sharding splits a blockchain into several shards where consensus is reached at the shard level rather than over the entire blockchain. It improves transaction throughput and reduces the computational resources required of individual nodes. However, deriving trustworthy consensus within a shard becomes an issue, because the longest-chain mechanisms used in conventional blockchains can no longer be applied; a vote-based consensus mechanism must be employed instead. Existing vote-based Byzantine fault tolerance consensus protocols do not offer sufficient security guarantees for sharded blockchains. First, when used to support consensus where only one block is allowed at a time (binary consensus), these protocols are susceptible to progress-hindering attacks (i.e., they can be prevented from reaching consensus). Second, when used to support a stronger type of consensus where multiple concurrent blocks are allowed (strong consensus), their tolerance of adversary nodes is low. This article proposes a new consensus protocol that addresses all of these issues. We call it MWPoW+, as its basic framework is based on the existing Multiple Winners Proof of Work (MWPoW) protocol, but it includes new mechanisms to address the issues above. MWPoW+ is a vote-based protocol for strong consensus that is asynchronous in consensus derivation but synchronous in communication. We prove that it can tolerate up to f < n/2 adversary nodes in an n-node system, as if a binary consensus protocol were used, and does not suffer from progress-hindering attacks.
{"title":"MWPoW+: A Strong Consensus Protocol for Intra-Shard Consensus in Blockchain Sharding","authors":"Yibin Xu, Jianhua Shao, Tijs Slaats, Boris Düdder","doi":"https://dl.acm.org/doi/10.1145/3584020","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3584020","url":null,"abstract":"<p>Blockchain sharding splits a blockchain into several shards, where consensus is reached at the shard level rather than over the entire blockchain. It improves transaction throughput and reduces the computational resources required of individual nodes. However, deriving trustworthy consensus within a shard becomes a challenge, as the longest-chain mechanisms used in conventional blockchains can no longer be applied; instead, a vote-based consensus mechanism must be employed. Yet existing vote-based Byzantine fault tolerance consensus protocols do not offer sufficient security guarantees for sharded blockchains. First, when used to support consensus where only one block is allowed at a time (binary consensus), these protocols are susceptible to progress-hindering attacks (i.e., they may never reach a consensus). Second, when used to support a stronger type of consensus where multiple concurrent blocks are allowed (strong consensus), their tolerance of adversary nodes is low. This article proposes a new consensus protocol, <i>MWPoW+</i>, whose basic framework is based on the existing Multiple Winners Proof of Work (MWPoW) protocol but which includes new mechanisms to address these issues. MWPoW+ is a vote-based protocol for strong consensus, asynchronous in consensus derivation but synchronous in communication. We prove that it can tolerate up to <i>f</i> < <i>n</i>/2 adversary nodes in an <i>n</i>-node system, as if using a binary consensus protocol, and does not suffer from progress-hindering attacks.</p>","PeriodicalId":50911,"journal":{"name":"ACM Transactions on Internet Technology","volume":"11 1","pages":""},"PeriodicalIF":5.3,"publicationDate":"2023-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138533417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-05-18DOI: https://dl.acm.org/doi/10.1145/3533430
Yazhou Zhang, Dan Ma, Prayag Tiwari, Chen Zhang, Mehedi Masud, Mohammad Shorfuzzaman, Dawei Song
Computational Linguistics (CL) applied to Internet of Multimedia Things (IoMT)-enabled multimedia computing applications raises several research challenges, such as real-time speech understanding, deepfake video detection, emotion recognition, and home automation. With the emergence of machine translation, CL solutions have proliferated across natural language processing (NLP) applications, and NLP support is now essential to the success of the IoMT. Sarcasm detection, a recently emerging artificial intelligence (AI) and NLP task, aims at discovering sarcastic, ironic, and metaphoric information implied in texts generated in the IoMT. It has drawn much attention from the AI and IoMT research communities. Advances in sarcasm detection and NLP techniques will provide a cost-effective, intelligent way to work with machine devices and support high-level human-to-device interaction. However, existing sarcasm detection approaches neglect the hidden stance behind texts and are thus insufficient to exploit the full potential of the task. Indeed, the stance, i.e., whether the author of a text is in favor of, against, or neutral toward the proposition or target discussed in the text, largely determines the text's actual sarcasm orientation. To fill this gap, we propose a new task: stance-level sarcasm detection (SLSD), whose goal is to uncover the author's latent stance and, based on it, identify the sarcasm polarity expressed in the text. We then propose an integrated framework consisting of Bidirectional Encoder Representations from Transformers (BERT) and a novel stance-centered graph attention network (SCGAT). Specifically, BERT is used to capture the sentence representation, and SCGAT is designed to capture stance information on the specific target. Extensive experiments are conducted on a Chinese sarcasm sentiment dataset we created and on the SemEval-2018 Task 3 English sarcasm dataset.
The experimental results demonstrate that the SCGAT framework outperforms state-of-the-art baselines by a large margin.
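The stance-centered attention component can be sketched as a standard single-head graph attention (GAT-style) layer: each node (e.g., a BERT sentence or target vector) aggregates its neighbours under learned attention weights. This is a minimal illustrative sketch of generic graph attention, not the authors' actual SCGAT implementation; all names, shapes, and the LeakyReLU slope are assumptions:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())  # subtract max for numerical stability
    return e / e.sum()

def graph_attention(h, adj, W, a):
    """Single-head GAT-style layer (illustrative only).

    h:   (N, F)  node features, e.g., BERT sentence/target vectors
    adj: (N, N)  adjacency matrix; adj[i, j] > 0 means j is a neighbour of i
    W:   (F, F') shared linear transform
    a:   (2*F',) attention parameter vector
    """
    z = h @ W                     # project all node features, shape (N, F')
    out = np.zeros_like(z)
    for i in range(z.shape[0]):
        nbrs = np.where(adj[i] > 0)[0]
        # attention logits over neighbours, with a LeakyReLU as in GAT
        e = np.array([np.concatenate([z[i], z[j]]) @ a for j in nbrs])
        e = np.where(e > 0, e, 0.2 * e)
        alpha = softmax(e)        # normalized attention weights
        out[i] = alpha @ z[nbrs]  # attention-weighted neighbour aggregation
    return out
```

On a fully connected 3-node graph with identity features and uniform parameters, every node attends equally to all neighbours and the output reduces to a plain neighbourhood average.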
{"title":"Stance-level Sarcasm Detection with BERT and Stance-centered Graph Attention Networks","authors":"Yazhou Zhang, Dan Ma, Prayag Tiwari, Chen Zhang, Mehedi Masud, Mohammad Shorfuzzaman, Dawei Song","doi":"https://dl.acm.org/doi/10.1145/3533430","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3533430","url":null,"abstract":"<p>Computational Linguistics (CL) applied to Internet of Multimedia Things (IoMT)-enabled multimedia computing applications raises several research challenges, such as real-time speech understanding, deepfake video detection, emotion recognition, and home automation. With the emergence of machine translation, CL solutions have proliferated across natural language processing (NLP) applications, and NLP support is now essential to the success of the IoMT. Sarcasm detection, a recently emerging artificial intelligence (AI) and NLP task, aims at discovering sarcastic, ironic, and metaphoric information implied in texts generated in the IoMT. It has drawn much attention from the AI and IoMT research communities. Advances in sarcasm detection and NLP techniques will provide a cost-effective, intelligent way to work with machine devices and support high-level human-to-device interaction. However, existing sarcasm detection approaches neglect the hidden stance behind texts and are thus insufficient to exploit the full potential of the task. Indeed, the stance, i.e., whether the author of a text is in favor of, against, or neutral toward the proposition or target discussed in the text, largely determines the text’s actual sarcasm orientation. To fill this gap, we propose a new task: stance-level sarcasm detection (SLSD), whose goal is to uncover the author’s latent stance and, based on it, identify the sarcasm polarity expressed in the text. We then propose an integrated framework consisting of Bidirectional Encoder Representations from Transformers (BERT) and a novel stance-centered graph attention network (SCGAT). Specifically, BERT is used to capture the sentence representation, and SCGAT is designed to capture stance information on the specific target. Extensive experiments are conducted on a Chinese sarcasm sentiment dataset we created and on the SemEval-2018 Task 3 English sarcasm dataset. The experimental results demonstrate that the SCGAT framework outperforms state-of-the-art baselines by a large margin.</p>","PeriodicalId":50911,"journal":{"name":"ACM Transactions on Internet Technology","volume":"16 1","pages":""},"PeriodicalIF":5.3,"publicationDate":"2023-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138533475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}