Scalable Computing-Practice and Experience最新文献_第8页

FedGPS: Personalized Cross-Silo Federated Learning for Internet of Things-enabled Predictive Maintenance FedGPS:面向物联网预测性维护的个性化跨筒仓联邦学习

IF 1.1 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Scalable Computing-Practice and Experience

Pub Date : 2022-12-01 DOI: 10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00137

Yuchen Jiang, Chang Ji

Predictive maintenance (PdM) has entered into a new era adopting artificial intelligence and Internet-of-Things (IoT) technologies. It is necessary for a manufacturing company to collaborate with other clients using IoT-captured production data. However, training models in a cross-silo manner is still challenging when considering data privacy. In order to tackle these challenges, a personalized cross-silo federated learning mechanism named federated global partners searching (FedGPS) is proposed. Firstly, model parameters for the participating clients are encrypted and uploaded to the central server as input. Next, FedGPS automatically determines the collaboration degrees between clients based on data distribution. After that, personalized model updates are sent back to the clients. Finally, each client conducts local updating after data decryption. The effectiveness of the FedGPS is verified in real-world cases and our method achieves 92.35% Accuracy, 98.55% Precision, 92.90% Recall, and 95.27% F1-Score comparing with other existing models from the literature.

预测性维护(PdM)已经进入了人工智能和物联网(IoT)技术的新时代。制造公司有必要使用物联网捕获的生产数据与其他客户进行协作。然而，在考虑数据隐私时，以跨竖井的方式训练模型仍然具有挑战性。为了解决这些问题，提出了一种个性化的跨竖井联邦学习机制——联邦全局伙伴搜索(federal global partners searching, FedGPS)。首先，对参与客户端的模型参数进行加密，并将其作为输入上传到中央服务器。其次，FedGPS根据数据分布自动确定客户端之间的协作程度。之后，个性化的模型更新被发送回客户端。最后，各客户端在数据解密后进行本地更新。在实际案例中验证了FedGPS的有效性，与文献中已有的模型相比，我们的方法达到了92.35%的准确率、98.55%的精密度、92.90%的召回率和95.27%的F1-Score。

{"title":"FedGPS: Personalized Cross-Silo Federated Learning for Internet of Things-enabled Predictive Maintenance","authors":"Yuchen Jiang, Chang Ji","doi":"10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00137","DOIUrl":"https://doi.org/10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00137","url":null,"abstract":"Predictive maintenance (PdM) has entered into a new era adopting artificial intelligence and Internet-of-Things (IoT) technologies. It is necessary for a manufacturing company to collaborate with other clients using IoT-captured production data. However, training models in a cross-silo manner is still challenging when considering data privacy. In order to tackle these challenges, a personalized cross-silo federated learning mechanism named federated global partners searching (FedGPS) is proposed. Firstly, model parameters for the participating clients are encrypted and uploaded to the central server as input. Next, FedGPS automatically determines the collaboration degrees between clients based on data distribution. After that, personalized model updates are sent back to the clients. Finally, each client conducts local updating after data decryption. The effectiveness of the FedGPS is verified in real-world cases and our method achieves 92.35% Accuracy, 98.55% Precision, 92.90% Recall, and 95.27% F1-Score comparing with other existing models from the literature.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"1 1","pages":"912-920"},"PeriodicalIF":1.1,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90350608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Graph-Based Information Fusion Approach for ADHD Subtype Classification 基于图的ADHD亚型分类信息融合方法

IF 1.1 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Scalable Computing-Practice and Experience

Pub Date : 2022-12-01 DOI: 10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00112

Wuliang Huang, Xinlong Jiang, Chenlong Gao, Teng Zhang, Yunbing Xing, Yiqiang Chen, Yi Zheng, Jie Li

Attention deficit hyperactivity disorder (ADHD) is a common childhood mental disorder that encompasses three subtypes. Classifying each subtype has practical significance. However, the gold standard for subtype diagnosis depends on face-to-face consultation with psychiatrists, which is limited by medical resources. This paper proposes a graph-based multimodal fusion approach to classify each subtype objectively, alleviating the pressure on psychiatrists. The proposed method leverages heterogeneous signals, including motion and speech, which are significant indicators of ADHD. We construct a personal graph where each child is a vertex, and the similarity of their personal information measures edges. Since the associations between subjects modeled by the personal graph provide rich prior knowledge, we regard the problem of subtype classification as predicting the labels of vertices on a graph. A novel graph neural network model is proposed to enable information passing between children, fusing motion and speech features under the guidance of the personal graph. We design a reading scenario and collect a multimodal dataset containing 56 children with ADHD and 50 typically developing children. Results of ADHD subtype classification demonstrate the practical value of the proposed approach. We also perform ablation studies to verify the validity of the proposed method.

注意缺陷多动障碍(ADHD)是一种常见的儿童精神障碍，包括三种亚型。对各亚型进行分类具有实际意义。然而，亚型诊断的黄金标准依赖于与精神科医生面对面的咨询，这受到医疗资源的限制。本文提出了一种基于图的多模态融合方法来客观地对每个亚型进行分类，减轻了精神科医生的压力。该方法利用了包括运动和言语在内的异质性信号，这些信号是ADHD的重要指标。我们构建了一个个人图，其中每个孩子都是一个顶点，他们的个人信息的相似性度量边。由于由个人图建模的主题之间的关联提供了丰富的先验知识，因此我们将子类型分类问题视为预测图上顶点的标签。提出了一种新的图神经网络模型，在个人图的引导下实现儿童之间的信息传递，融合运动和语音特征。我们设计了一个阅读场景，并收集了一个包含56名ADHD儿童和50名正常发育儿童的多模态数据集。ADHD亚型分类结果证明了该方法的实用价值。我们还进行了消融研究来验证所提出方法的有效性。

{"title":"A Graph-Based Information Fusion Approach for ADHD Subtype Classification","authors":"Wuliang Huang, Xinlong Jiang, Chenlong Gao, Teng Zhang, Yunbing Xing, Yiqiang Chen, Yi Zheng, Jie Li","doi":"10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00112","DOIUrl":"https://doi.org/10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00112","url":null,"abstract":"Attention deficit hyperactivity disorder (ADHD) is a common childhood mental disorder that encompasses three subtypes. Classifying each subtype has practical significance. However, the gold standard for subtype diagnosis depends on face-to-face consultation with psychiatrists, which is limited by medical resources. This paper proposes a graph-based multimodal fusion approach to classify each subtype objectively, alleviating the pressure on psychiatrists. The proposed method leverages heterogeneous signals, including motion and speech, which are significant indicators of ADHD. We construct a personal graph where each child is a vertex, and the similarity of their personal information measures edges. Since the associations between subjects modeled by the personal graph provide rich prior knowledge, we regard the problem of subtype classification as predicting the labels of vertices on a graph. A novel graph neural network model is proposed to enable information passing between children, fusing motion and speech features under the guidance of the personal graph. We design a reading scenario and collect a multimodal dataset containing 56 children with ADHD and 50 typically developing children. Results of ADHD subtype classification demonstrate the practical value of the proposed approach. We also perform ablation studies to verify the validity of the proposed method.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"14 1","pages":"714-723"},"PeriodicalIF":1.1,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88926073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Self-distilled Named Entity Recognition Based on Boundary Detection and Biaffine Attention 基于边界检测和双碱注意的自蒸馏命名实体识别

IF 1.1 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Scalable Computing-Practice and Experience

Pub Date : 2022-12-01 DOI: 10.1109/smartworld-uic-atc-scalcom-digitaltwin-pricomp-metaverse56740.2022.00162

Yong Song, Zhiwei Yan, Yukun Qin, Xiaozhou Ye, Ye Ouyang

Named Entity Recognition (NER) is an important down-streaming task in natural language processing. Span-based methods are applicable to both flat and nested entities. However, they lack explicit boundary supervision. To address this issue, we propose a multi-task and self-distilled model which combines biaffine span classification and entity boundary detection tasks. Firstly, the boundary detection and biaffine span classification models are jointly trained under a multi-task learning framework to address the problem of lacking supervision of boundaries. Then, self-distillation technique is applied on the model to reassign entity probabilities from annotated spans to surrounding spans and more entity types, further improving the accuracy of NER by soft labels that contain richer knowledge. Experiments were based on a high-density entity text dataset of the commodity titles from an e-commerce company. Finally, the experimental results show that our model exhibited a better F1 score than the existing common models.

命名实体识别(NER)是自然语言处理中一个重要的下行任务。基于跨度的方法既适用于平面实体，也适用于嵌套实体。然而，它们缺乏明确的边界监督。为了解决这一问题，我们提出了一种多任务自提取模型，该模型结合了双仿跨度分类和实体边界检测任务。首先，在多任务学习框架下，联合训练边界检测模型和biaffine跨度分类模型，解决边界缺乏监督的问题;然后，在模型上应用自蒸馏技术，将实体概率从标注的跨度重新分配到周围跨度和更多的实体类型，通过包含更丰富知识的软标签进一步提高NER的准确性。实验基于一家电子商务公司商品标题的高密度实体文本数据集。最后，实验结果表明，我们的模型比现有的常用模型具有更好的F1分数。

{"title":"Self-distilled Named Entity Recognition Based on Boundary Detection and Biaffine Attention","authors":"Yong Song, Zhiwei Yan, Yukun Qin, Xiaozhou Ye, Ye Ouyang","doi":"10.1109/smartworld-uic-atc-scalcom-digitaltwin-pricomp-metaverse56740.2022.00162","DOIUrl":"https://doi.org/10.1109/smartworld-uic-atc-scalcom-digitaltwin-pricomp-metaverse56740.2022.00162","url":null,"abstract":"Named Entity Recognition (NER) is an important down-streaming task in natural language processing. Span-based methods are applicable to both flat and nested entities. However, they lack explicit boundary supervision. To address this issue, we propose a multi-task and self-distilled model which combines biaffine span classification and entity boundary detection tasks. Firstly, the boundary detection and biaffine span classification models are jointly trained under a multi-task learning framework to address the problem of lacking supervision of boundaries. Then, self-distillation technique is applied on the model to reassign entity probabilities from annotated spans to surrounding spans and more entity types, further improving the accuracy of NER by soft labels that contain richer knowledge. Experiments were based on a high-density entity text dataset of the commodity titles from an e-commerce company. Finally, the experimental results show that our model exhibited a better F1 score than the existing common models.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"1 1","pages":"1132-1137"},"PeriodicalIF":1.1,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90184901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

TCFNet: Transformer and CNN Fusion Model for LiDAR Point Cloud Semantic Segmentation 激光雷达点云语义分割的Transformer和CNN融合模型

IF 1.1 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Scalable Computing-Practice and Experience

Pub Date : 2022-12-01 DOI: 10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00197

Lu Ren, Jianwei Niu, Zhenchao Ouyang, Zhibin Zhang, Siyi Zheng

Dynamic scene understanding based on LiDAR point clouds is one of the critical perception tasks for self-driving vehicles. Among these tasks, point cloud semantic segmentation is highly challenging. Some existing work ignores the loss of crucial information caused by sampling and projecting. Others use modules with high computational complexity because of the pursuit of precision, challenging to deploy in the vehicle platform with limited computing power. This paper proposes Fusedown/Fuse-up modules for efficient down-sampling/up-sampling feature extraction. The modules combine the transformer in vision integrating the global information of the feature map with the CNN extracting local feature information. Based on these two modules, we built the transformer and CNN fusion network called TCFNet for point cloud semantic segmentation. Experiments on the SemanticKITTI show that our suitable combination of transformer and CNN is necessary for semantic segmentation accuracy, and the mIoU of our model can reach 82.7% at 10 FPS. The code can be accessed at https://github.com/donkeyofking/TCFNet.git.

基于激光雷达点云的动态场景理解是自动驾驶汽车的关键感知任务之一。其中，点云语义分割是一项极具挑战性的任务。一些现有的工作忽略了采样和投影导致的关键信息的丢失。由于追求精度，其他使用具有高计算复杂度的模块，在计算能力有限的车辆平台上部署具有挑战性。本文提出了Fusedown/ fuseup模块，用于高效的下采样/上采样特征提取。该模块将集成全局特征信息的视觉变压器与提取局部特征信息的CNN相结合。在这两个模块的基础上，我们构建了用于点云语义分割的transformer和CNN融合网络TCFNet。在SemanticKITTI上的实验表明，我们的变压器和CNN的适当组合是提高语义分割精度的必要条件，我们的模型在10 FPS下的mIoU可以达到82.7%。代码可以在https://github.com/donkeyofking/TCFNet.git上访问。

{"title":"TCFNet: Transformer and CNN Fusion Model for LiDAR Point Cloud Semantic Segmentation","authors":"Lu Ren, Jianwei Niu, Zhenchao Ouyang, Zhibin Zhang, Siyi Zheng","doi":"10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00197","DOIUrl":"https://doi.org/10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00197","url":null,"abstract":"Dynamic scene understanding based on LiDAR point clouds is one of the critical perception tasks for self-driving vehicles. Among these tasks, point cloud semantic segmentation is highly challenging. Some existing work ignores the loss of crucial information caused by sampling and projecting. Others use modules with high computational complexity because of the pursuit of precision, challenging to deploy in the vehicle platform with limited computing power. This paper proposes Fusedown/Fuse-up modules for efficient down-sampling/up-sampling feature extraction. The modules combine the transformer in vision integrating the global information of the feature map with the CNN extracting local feature information. Based on these two modules, we built the transformer and CNN fusion network called TCFNet for point cloud semantic segmentation. Experiments on the SemanticKITTI show that our suitable combination of transformer and CNN is necessary for semantic segmentation accuracy, and the mIoU of our model can reach 82.7% at 10 FPS. The code can be accessed at https://github.com/donkeyofking/TCFNet.git.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"63 1","pages":"1366-1372"},"PeriodicalIF":1.1,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86865277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Heuristic Once Learning for Image & Text Duality Information Processing 启发式一次性学习在图像和文本对偶信息处理中的应用

IF 1.1 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Scalable Computing-Practice and Experience

Pub Date : 2022-12-01 DOI: 10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00195

L. Weigang, L. Martins, Nikson Ferreira, Christian Miranda, Lucas S. Althoff, Walner Pessoa, Mylène C. Q. Farias, Ricardo Jacobi, Mauricio Rincon

Few-shot learning is an important mechanism to minimize the need for the labeling of large amounts of data and taking advantage of transfer learning. To identify image/text input with duality property, this research proposes a “Heuristic once learning (HOL)” mechanism to investigate multi-modal input processing similar to human-like behavior. First, we create an image/text data set of big Latin letters composed of small letters and another data set composed of Arabic, Chinese and Roman numerals. Secondly, we use Convolutional Neural Networks (CNN) for pre-training the dataset of letters to get structural features. Thirdly, using the acquired knowledge, a Self-organizing Map (SOM) and Contrastive Language-Image Pretraining (CLIP) are tested separately using zero-shot learning. Siamese Networks and Vision Transformer (ViT) are also tested using one-shot learning by knowledge transfer to identify the features of unknown characters. The research results show the potential and challenges to realize HOL and make a useful attempt for the development of general agents.

Few-shot学习是一种重要的机制，可以最大限度地减少对大量数据的标记需求，并利用迁移学习。为了识别具有对偶性的图像/文本输入，本研究提出了一种“启发式一次学习(HOL)”机制来研究类似于人类行为的多模态输入处理。首先，我们创建一个由小写字母组成的大拉丁字母的图像/文本数据集和另一个由阿拉伯语、汉语和罗马数字组成的数据集。其次，我们使用卷积神经网络(CNN)对字母数据集进行预训练，得到结构特征。第三，利用学习到的知识，对自组织映射(SOM)和对比语言图像预训练(CLIP)分别进行零次学习测试。通过知识转移的一次性学习，对Siamese Networks和Vision Transformer (ViT)进行了测试，以识别未知字符的特征。研究结果显示了实现HOL的潜力和挑战，为总代理的发展做出了有益的尝试。

{"title":"Heuristic Once Learning for Image & Text Duality Information Processing","authors":"L. Weigang, L. Martins, Nikson Ferreira, Christian Miranda, Lucas S. Althoff, Walner Pessoa, Mylène C. Q. Farias, Ricardo Jacobi, Mauricio Rincon","doi":"10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00195","DOIUrl":"https://doi.org/10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00195","url":null,"abstract":"Few-shot learning is an important mechanism to minimize the need for the labeling of large amounts of data and taking advantage of transfer learning. To identify image/text input with duality property, this research proposes a “Heuristic once learning (HOL)” mechanism to investigate multi-modal input processing similar to human-like behavior. First, we create an image/text data set of big Latin letters composed of small letters and another data set composed of Arabic, Chinese and Roman numerals. Secondly, we use Convolutional Neural Networks (CNN) for pre-training the dataset of letters to get structural features. Thirdly, using the acquired knowledge, a Self-organizing Map (SOM) and Contrastive Language-Image Pretraining (CLIP) are tested separately using zero-shot learning. Siamese Networks and Vision Transformer (ViT) are also tested using one-shot learning by knowledge transfer to identify the features of unknown characters. The research results show the potential and challenges to realize HOL and make a useful attempt for the development of general agents.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"20 1","pages":"1353-1359"},"PeriodicalIF":1.1,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72708078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multimodal Hateful Memes Detection via Image Caption Supervision 通过图像标题监督的多模态仇恨模因检测

IF 1.1 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Scalable Computing-Practice and Experience

Pub Date : 2022-12-01 DOI: 10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00221

Huaicheng Fang, Fuqing Zhu, Jizhong Han, Songlin Hu

A large amount of hateful speech exist on the Internet in the form of text and images uploaded by social media users. Recently, multimodal hateful speech detection task has attracted more and more researchers to invest, producing some representative work for perceiving the negative samples. For this special multimodal task, the ability of multimodal semantic information understanding is particularly crucial. However, the existing models have insufficient understanding ability of image modality semantic compared with the text modality, due to the appearance complexity of each image. Therefore, this paper utilizes the text modality which is well understood by the model to improve understanding ability of image modality semantic. Specifically, this paper proposes an image caption supervision (ICS) auxiliary method for multimodal hateful speech detection, where the image caption is designed to supervise the feature learning of images for further understanding the semantic information. On the Facebook Hateful Memes dataset, the proposed ICS method outperforms some state-of-the-art unimodal and multimodal baselines, demonstrating the effectiveness of ICS.

大量的仇恨言论以社交媒体用户上传的文字和图片的形式存在于互联网上。近年来，多模态仇恨语音检测任务吸引了越来越多的研究者的投入，并产生了一些具有代表性的负样本感知工作。对于这种特殊的多模态任务，多模态语义信息理解能力尤为重要。然而，由于每幅图像的外观复杂性，现有模型对图像模态语义的理解能力与文本模态相比不足。因此，本文利用该模型所能理解的文本情态来提高对图像情态语义的理解能力。具体而言，本文提出了一种用于多模态仇恨语音检测的图像标题监督(ICS)辅助方法，其中图像标题被设计为监督图像的特征学习，以进一步理解语义信息。在Facebook仇恨模因数据集上，所提出的ICS方法优于一些最先进的单模态和多模态基线，证明了ICS的有效性。

{"title":"Multimodal Hateful Memes Detection via Image Caption Supervision","authors":"Huaicheng Fang, Fuqing Zhu, Jizhong Han, Songlin Hu","doi":"10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00221","DOIUrl":"https://doi.org/10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00221","url":null,"abstract":"A large amount of hateful speech exist on the Internet in the form of text and images uploaded by social media users. Recently, multimodal hateful speech detection task has attracted more and more researchers to invest, producing some representative work for perceiving the negative samples. For this special multimodal task, the ability of multimodal semantic information understanding is particularly crucial. However, the existing models have insufficient understanding ability of image modality semantic compared with the text modality, due to the appearance complexity of each image. Therefore, this paper utilizes the text modality which is well understood by the model to improve understanding ability of image modality semantic. Specifically, this paper proposes an image caption supervision (ICS) auxiliary method for multimodal hateful speech detection, where the image caption is designed to supervise the feature learning of images for further understanding the semantic information. On the Facebook Hateful Memes dataset, the proposed ICS method outperforms some state-of-the-art unimodal and multimodal baselines, demonstrating the effectiveness of ICS.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"2 2","pages":"1530-1537"},"PeriodicalIF":1.1,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72471719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Multi-Head Attention Based Dual Target Graph Collaborative Filtering Network 基于多头注意的双目标图协同过滤网络

IF 1.1 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Scalable Computing-Practice and Experience

Pub Date : 2022-12-01 DOI: 10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00086

Qinglong Peng, Bin Tang, Jinhuan Liu, Shuang Cui, Junwei Du, Yan Lu, Feng Jiang, Xu Yu

Recently, cross-domain collaborative filtering (CDCF) has been widely used to solve the data sparsity problem in recommendation systems. Therein, the dual-target cross-domain recommendation becomes a research hotspot, which aims to improve the recommendation performance of both target and source domains. Most existing approaches tend to use fixed weights or self-attention in a single representation space for the bi-directional inter-domain transfer of the user representation. However, a single representation space leads to limited representation capability, which makes the transfer of the user representation coarse-grained and inaccurate. In this paper, Multi-head Attention Based Dual Target Graph Collaborative Filtering Network (MA-DTGCF) is proposed. The core of the model is the bi-directional transfer graph convolution layer, consisting of a graph convolution layer and a bi-directional transfer layer based on a multi-head attention mechanism. The latter can achieve fine-grained and adaptive transfer of user features in multiple representation subspaces. It is worth noting that by stacking multiple bi-directional transfer graph convolutional layers, we can get high-order user and item features and achieve adaptive transfer of each order user features. Experimental results on three real datasets show that the proposed MA-DTGCF model significantly outperforms the state-of-the-art models in terms of HR and NDCG.

近年来，跨域协同过滤(CDCF)被广泛用于解决推荐系统中的数据稀疏性问题。其中，双目标跨域推荐成为一个研究热点，旨在提高目标域和源域的推荐性能。大多数现有方法倾向于在单个表示空间中使用固定权值或自关注来实现用户表示的双向域间传递。然而，单一的表示空间导致有限的表示能力，这使得用户表示的传输粗粒度和不准确。本文提出了一种基于多头注意力的双目标图协同过滤网络(MA-DTGCF)。该模型的核心是双向迁移图卷积层，由图卷积层和基于多头注意机制的双向迁移层组成。后者可以在多个表示子空间中实现用户特征的细粒度和自适应转移。值得注意的是，通过叠加多个双向传递图卷积层，我们可以得到高阶用户和物品特征，并实现每阶用户特征的自适应传递。在三个真实数据集上的实验结果表明，所提出的MA-DTGCF模型在HR和NDCG方面明显优于现有模型。

{"title":"A Multi-Head Attention Based Dual Target Graph Collaborative Filtering Network","authors":"Qinglong Peng, Bin Tang, Jinhuan Liu, Shuang Cui, Junwei Du, Yan Lu, Feng Jiang, Xu Yu","doi":"10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00086","DOIUrl":"https://doi.org/10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00086","url":null,"abstract":"Recently, cross-domain collaborative filtering (CDCF) has been widely used to solve the data sparsity problem in recommendation systems. Therein, the dual-target cross-domain recommendation becomes a research hotspot, which aims to improve the recommendation performance of both target and source domains. Most existing approaches tend to use fixed weights or self-attention in a single representation space for the bi-directional inter-domain transfer of the user representation. However, a single representation space leads to limited representation capability, which makes the transfer of the user representation coarse-grained and inaccurate. In this paper, Multi-head Attention Based Dual Target Graph Collaborative Filtering Network (MA-DTGCF) is proposed. The core of the model is the bi-directional transfer graph convolution layer, consisting of a graph convolution layer and a bi-directional transfer layer based on a multi-head attention mechanism. The latter can achieve fine-grained and adaptive transfer of user features in multiple representation subspaces. It is worth noting that by stacking multiple bi-directional transfer graph convolutional layers, we can get high-order user and item features and achieve adaptive transfer of each order user features. Experimental results on three real datasets show that the proposed MA-DTGCF model significantly outperforms the state-of-the-art models in terms of HR and NDCG.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"107 1","pages":"476-483"},"PeriodicalIF":1.1,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74476078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

What Is Next? A Generative Approach for Service Composition Recommendations 下一步是什么?服务组合推荐的生成方法

IF 1.1 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Scalable Computing-Practice and Experience

Pub Date : 2022-12-01 DOI: 10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00078

Guodong Fan, Shizhan Chen, Hongyue Wu, Ming Zhu, Xiao Xue, Zhiyong Feng

Service recommendation is important in creating composite services, workflows, e-business solutions, etc. It often takes developers a long time to Figure out what the next service is. A lot of researchers have used collaborative filtering-based or content-based approaches to recommend services for developers. However, failing to model the co-occurrence relationships between services, current approaches cannot recommend the next services for service composition. This leads to a decrease in the accuracy of service composition recommendations. To tackle this problem, this paper proposes an Encoder-Decoder-based Recommender named EDeR, which transforms the service recommendation problem into a generation problem. First, we employ a self-supervised graph embedding method to fully learn the representation of each service according to both structural and descriptive information. Then, based on the co-occurrence relationships between services, we propose an encoder-decoder model to sequentially recommend services in a way that translates user requirements into a composite service. The results obtained from experiments conducted on a real-world dataset show that EDeR outperforms the state-of-the-art approaches significantly.

服务推荐在创建组合服务、工作流、电子商务解决方案等方面非常重要。开发人员通常要花很长时间才能弄清楚下一个服务是什么。许多研究人员使用基于协作过滤或基于内容的方法向开发人员推荐服务。然而，由于无法对服务之间的共现关系进行建模，当前的方法无法为服务组合推荐下一个服务。这将导致服务组合建议的准确性降低。为了解决这一问题，本文提出了一种基于编码器-解码器的推荐器——EDeR，将服务推荐问题转化为生成问题。首先，我们采用自监督图嵌入方法，根据结构信息和描述信息充分学习每个服务的表示。然后，基于服务之间的共现关系，我们提出了一个编码器-解码器模型，以一种将用户需求转换为组合服务的方式顺序推荐服务。在真实数据集上进行的实验结果表明，EDeR的性能明显优于最先进的方法。

{"title":"What Is Next? A Generative Approach for Service Composition Recommendations","authors":"Guodong Fan, Shizhan Chen, Hongyue Wu, Ming Zhu, Xiao Xue, Zhiyong Feng","doi":"10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00078","DOIUrl":"https://doi.org/10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00078","url":null,"abstract":"Service recommendation is important in creating composite services, workflows, e-business solutions, etc. It often takes developers a long time to Figure out what the next service is. A lot of researchers have used collaborative filtering-based or content-based approaches to recommend services for developers. However, failing to model the co-occurrence relationships between services, current approaches cannot recommend the next services for service composition. This leads to a decrease in the accuracy of service composition recommendations. To tackle this problem, this paper proposes an Encoder-Decoder-based Recommender named EDeR, which transforms the service recommendation problem into a generation problem. First, we employ a self-supervised graph embedding method to fully learn the representation of each service according to both structural and descriptive information. Then, based on the co-occurrence relationships between services, we propose an encoder-decoder model to sequentially recommend services in a way that translates user requirements into a composite service. The results obtained from experiments conducted on a real-world dataset show that EDeR outperforms the state-of-the-art approaches significantly.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"20 1","pages":"409-416"},"PeriodicalIF":1.1,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74614627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multi-view Self-attention Network for Next POI Recommendation 下一个POI推荐的多视图自关注网络

IF 1.1 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Scalable Computing-Practice and Experience

Pub Date : 2022-12-01 DOI: 10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00265

Hao Li, P. Yue, Shangcheng Li, Fangqiang Yu, Chenxiao Zhang, Can Yang, Liangcun Jiang

Next Point-of-Interest (POI) recommendation has been applied by many Internet companies to enhance user travel experience. The state-of-the-art deep learning methods in next POI recommendation advocate the self-attention mechanism to model the user long-term check-in sequence. However, the existing methods ignore the interdependence between POI and POI category in the historical interaction. The POI and POI category sequences can be regarded as multi-view information of user check-in behaviors. This paper proposes a multi-view self-attention network (MVSAN) for next POI recommendation. Firstly, MVSAN uses a self-attention layer to update the feature representation of POI and POI category respectively. Then it generates the importance of POI under the condition of the POI category through a co-attention module. To make better use of geospatial information, we design a spatial candidate set filtering module to help the model improve recommendation performance. Experiments on two real check-in datasets show that MVSAN yields outstanding improvements over the state-of-the-art models in terms of recall.

下一个兴趣点(POI)推荐已经被许多互联网公司应用于提升用户的旅行体验。在接下来的POI推荐中，最先进的深度学习方法提倡自关注机制来模拟用户的长期登记序列。然而，现有的方法忽略了历史交互中POI和POI类别之间的相互依赖关系。POI和POI类别序列可以看作是用户签入行为的多视图信息。本文提出了一种多视图自关注网络(MVSAN)，用于下一个POI推荐。首先，MVSAN使用自关注层分别更新POI和POI类别的特征表示。然后通过共关注模块生成POI类别条件下的POI重要性。为了更好地利用地理空间信息，我们设计了一个空间候选集过滤模块来帮助模型提高推荐性能。在两个真实签入数据集上的实验表明，MVSAN在召回方面比最先进的模型有了显著的改进。

{"title":"Multi-view Self-attention Network for Next POI Recommendation","authors":"Hao Li, P. Yue, Shangcheng Li, Fangqiang Yu, Chenxiao Zhang, Can Yang, Liangcun Jiang","doi":"10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00265","DOIUrl":"https://doi.org/10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00265","url":null,"abstract":"Next Point-of-Interest (POI) recommendation has been applied by many Internet companies to enhance user travel experience. The state-of-the-art deep learning methods in next POI recommendation advocate the self-attention mechanism to model the user long-term check-in sequence. However, the existing methods ignore the interdependence between POI and POI category in the historical interaction. The POI and POI category sequences can be regarded as multi-view information of user check-in behaviors. This paper proposes a multi-view self-attention network (MVSAN) for next POI recommendation. Firstly, MVSAN uses a self-attention layer to update the feature representation of POI and POI category respectively. Then it generates the importance of POI under the condition of the POI category through a co-attention module. To make better use of geospatial information, we design a spatial candidate set filtering module to help the model improve recommendation performance. Experiments on two real check-in datasets show that MVSAN yields outstanding improvements over the state-of-the-art models in terms of recall.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"5 1","pages":"1825-1832"},"PeriodicalIF":1.1,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74654920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Lightweight Locally Repairable Code-based Storage Architecture for Blockchains 一个轻量级的本地可修复的基于代码的区块链存储架构

IF 1.1 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Scalable Computing-Practice and Experience

Pub Date : 2022-12-01 DOI: 10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00324

Wanning Bao, Liangmin Wang, Jie Chen

The blockchain system requires every node to preserve a complete copy of data arbitrarily, which exerts tremendous storage pressure on nodes. Some researchers applied the erasure code to reduce storage redundancy. However, code storage schemes have the problem of inefficient data communication while verifying transactions and downloading data. To solve this problem, this paper proposes a lightweight locally repairable code (LRC) storage scheme inspired by the idea of slice strategy from privacy computing. Firstly, partitioning each block into distinct transaction slices substantially reduces the amount of transmitted data required to verify a transaction. Secondly, our scheme can recover single-point data with fewer code data slices by local nodes and with less network communication overhead. At last, we analyze the performance of our scheme from theoretical perspectives and examine the storage performance and computation efficiency of our scheme from experimental perspectives. Results suggest that our scheme can effectively reduce the storage overhead while also decreasing the network communication overhead and improving the data reading efficiency.

区块链系统要求每个节点任意保留一份完整的数据副本，这给节点带来了巨大的存储压力。一些研究人员使用擦除码来减少存储冗余。然而，代码存储方案在验证事务和下载数据时存在数据通信效率低下的问题。为了解决这一问题，本文借鉴隐私计算中的切片策略思想，提出了一种轻量级的局部可修复代码(LRC)存储方案。首先，将每个块划分为不同的事务片大大减少了验证事务所需的传输数据量。其次，我们的方案可以用更少的本地节点代码数据切片和更少的网络通信开销来恢复单点数据。最后，从理论角度分析了该方案的性能，并从实验角度检验了该方案的存储性能和计算效率。结果表明，该方案可以有效降低存储开销，同时降低网络通信开销，提高数据读取效率。

{"title":"A Lightweight Locally Repairable Code-based Storage Architecture for Blockchains","authors":"Wanning Bao, Liangmin Wang, Jie Chen","doi":"10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00324","DOIUrl":"https://doi.org/10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00324","url":null,"abstract":"The blockchain system requires every node to preserve a complete copy of data arbitrarily, which exerts tremendous storage pressure on nodes. Some researchers applied the erasure code to reduce storage redundancy. However, code storage schemes have the problem of inefficient data communication while verifying transactions and downloading data. To solve this problem, this paper proposes a lightweight locally repairable code (LRC) storage scheme inspired by the idea of slice strategy from privacy computing. Firstly, partitioning each block into distinct transaction slices substantially reduces the amount of transmitted data required to verify a transaction. Secondly, our scheme can recover single-point data with fewer code data slices by local nodes and with less network communication overhead. At last, we analyze the performance of our scheme from theoretical perspectives and examine the storage performance and computation efficiency of our scheme from experimental perspectives. Results suggest that our scheme can effectively reduce the storage overhead while also decreasing the network communication overhead and improving the data reading efficiency.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"13 1","pages":"2279-2283"},"PeriodicalIF":1.1,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77688727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0