首页 > 最新文献

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning最新文献

英文 中文
Garment Metaverse: Parametric Digital Human and Dynamic Scene Try-on 服装虚拟世界:参数化数字人体和动态场景试穿
Hua Wang, Xiaoxiao Liu, Minghua Jiang, Changlong Zhou
As a new concept, the metaverse has been widely concerned by the industry, academia, media and the public. Many domestic and foreign companies have also set up in the field of the metaverse. The traditional 2D and 3D virtual fitting has not achieved breakthrough and development because of the technical problems of authenticity and timeliness. With this in mind, we developed a virtual fitting system based on the metaverse community, which includes two modules: parameterized virtual digital human modeling and multi-scene and multi-action fitting. The system realizes the construction of individualized virtual digital person. As the medium between the metaverse clothing community and the real world, it is used to achieve multi-category dynamic fitting display actions in the meta-universe clothing community platform for multi-category scenes. The system integrates the high simulation and synchronization of the metaverse with the virtual fitting system, to break the barriers of traditional virtual fitting technology and realize the combination of the garment industry and the metaverse. The experimental results show that the system can realize the construction of virtual digital human in 2.1ms at the fastest and realize the dynamic display of multi-action and multi-scene fitting.
作为一个新兴的概念,虚拟世界受到了业界、学术界、媒体和公众的广泛关注。许多国内外公司也在元宇宙领域有所建树。传统的2D和3D虚拟试衣由于真实性和时效性等技术问题,一直没有取得突破和发展。基于此,我们开发了一个基于元宇宙社区的虚拟试衣系统,包括参数化虚拟数字人体建模和多场景多动作试衣两个模块。该系统实现了个性化虚拟数字人的构建。作为元宇宙服装社区与现实世界之间的媒介,在元宇宙服装社区平台中实现多品类场景下的多品类动态试衣展示动作。该系统将虚拟试衣系统与虚拟试衣系统的高度仿真和同步性相结合,打破了传统虚拟试衣技术的壁垒,实现了服装产业与虚拟试衣的结合。实验结果表明,该系统最快可在2.1ms内实现虚拟数字人的构建,并实现多动作和多场景拟合的动态显示。
{"title":"Garment Metaverse: Parametric Digital Human and Dynamic Scene Try-on","authors":"Hua Wang, Xiaoxiao Liu, Minghua Jiang, Changlong Zhou","doi":"10.1145/3590003.3590014","DOIUrl":"https://doi.org/10.1145/3590003.3590014","url":null,"abstract":"As a new concept, the metaverse has been widely concerned by the industry, academia, media and the public. Many domestic and foreign companies have also set up in the field of the metaverse. The traditional 2D and 3D virtual fitting has not achieved breakthrough and development because of the technical problems of authenticity and timeliness. With this in mind, we developed a virtual fitting system based on the metaverse community, which includes two modules: parameterized virtual digital human modeling and multi-scene and multi-action fitting. The system realizes the construction of individualized virtual digital person. As the medium between the metaverse clothing community and the real world, it is used to achieve multi-category dynamic fitting display actions in the meta-universe clothing community platform for multi-category scenes. The system integrates the high simulation and synchronization of the metaverse with the virtual fitting system, to break the barriers of traditional virtual fitting technology and realize the combination of the garment industry and the metaverse. The experimental results show that the system can realize the construction of virtual digital human in 2.1ms at the fastest and realize the dynamic display of multi-action and multi-scene fitting.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128921393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Face Anti-spoofing Method Based on Deep Supervision 基于深度监督的人脸防欺骗方法
Hongxia Wang, Li Liu, Ailing Jia
Although face recognition technology is extensively used, it is vulnerable to various face spoofing attacks, such as photo and video attacks. Face anti-spoofing is a crucial step in the face recognition process and is particularly important for the security of identity verification. However, most of today's face anti-spoofing algorithms regard this task as an image binary classification problem, which is easy to over-fit. Therefore, this paper builds the basic deep supervised network as the baseline model and designs the central gradient convolution to extract the pixel difference information within the local region. To reduce the redundancy of gradient features, the central gradient convolution is decoupled to replace the vanilla convolution in the baseline model to form two cross-central gradient networks. A cross-feature interaction module is then built to effectively fuse the networks. And a depth uncertainty module is built for the problem that most face datasets are noisy and it is difficult for the model to extract fuzzy region features. Compared with existing methods, the proposed method performs well on the OULU-NPU, CASIA-FASD, and Replay-Attack datasets.
虽然人脸识别技术被广泛应用,但它很容易受到各种人脸欺骗攻击,如照片和视频攻击。人脸防欺骗是人脸识别过程中至关重要的一步,对身份验证的安全性尤为重要。然而,目前大多数人脸防欺骗算法都将该任务视为图像二值分类问题,容易出现过拟合。因此,本文构建基本深度监督网络作为基线模型,并设计中心梯度卷积提取局部区域内的像素差信息。为了减少梯度特征的冗余,将中心梯度卷积解耦,取代基线模型中的vanilla卷积,形成两个跨中心梯度网络。然后构建一个跨功能交互模块来有效地融合网络。针对大多数人脸数据集存在噪声,难以提取模糊区域特征的问题,建立了深度不确定性模块。与现有方法相比,该方法在OULU-NPU、CASIA-FASD和Replay-Attack数据集上表现良好。
{"title":"Face Anti-spoofing Method Based on Deep Supervision","authors":"Hongxia Wang, Li Liu, Ailing Jia","doi":"10.1145/3590003.3590023","DOIUrl":"https://doi.org/10.1145/3590003.3590023","url":null,"abstract":"Although face recognition technology is extensively used, it is vulnerable to various face spoofing attacks, such as photo and video attacks. Face anti-spoofing is a crucial step in the face recognition process and is particularly important for the security of identity verification. However, most of today's face anti-spoofing algorithms regard this task as an image binary classification problem, which is easy to over-fit. Therefore, this paper builds the basic deep supervised network as the baseline model and designs the central gradient convolution to extract the pixel difference information within the local region. To reduce the redundancy of gradient features, the central gradient convolution is decoupled to replace the vanilla convolution in the baseline model to form two cross-central gradient networks. A cross-feature interaction module is then built to effectively fuse the networks. And a depth uncertainty module is built for the problem that most face datasets are noisy and it is difficult for the model to extract fuzzy region features. Compared with existing methods, the proposed method performs well on the OULU-NPU, CASIA-FASD, and Replay-Attack datasets.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130540008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ENOSE Performance in Transient Time and Steady State Area of Gas Sensor Response for Ammonia Gas: Comparison and Study 气体传感器对氨气响应的瞬态时间和稳态区域的ENOSE性能:比较与研究
Kuan Geng, Jahangir Moshayedi Ata, Jing-hao Chen, Jiandong Hu, Hao Zhang
This paper proposed an electronic nose system that utilized a SnO2 semiconductor sensor array to detect volatile ammonia gas in farmland. All sensors were controlled by the Arduino development board. The system could collect data during both the steady-state and transient phases of sensor operation. The collected data was analyzed using PCA (principal component analysis) and MLP (Multi-layer perceptron) neural networks. The experiment was divided into two parts: The first part analyzed four concentrations of ammonia (100ppm, 200ppm, 400ppm, and Air) using PCA and MLP, which successfully distinguished the concentrations with an identification rate of over 95%. In the second part, four gases (air mixed with ammonia, pure ammonia gas, air mixed with ethanol, and pure ethanol) were analyzed using PCA and MLP, with the electronic nose system successfully distinguishing between the four types of gases. The system could read and process data during the transient phase of the sensor, and the constructed sensor array electronic nose system and acquisition method has significant potential for ammonia detection in agricultural environments.
提出了一种利用SnO2半导体传感器阵列检测农田挥发性氨气的电子鼻系统。所有传感器均由Arduino开发板控制。该系统可以在传感器工作的稳态和瞬态阶段采集数据。使用主成分分析(PCA)和多层感知器(MLP)神经网络对收集到的数据进行分析。实验分为两部分:第一部分使用PCA和MLP分析了4种浓度的氨(100ppm、200ppm、400ppm和Air),成功区分了浓度,识别率在95%以上。第二部分采用主成分分析法和MLP法对四种气体(混合氨气、纯氨气、混合乙醇气和纯乙醇气)进行分析,电子鼻系统成功区分了四种气体。该系统可以在传感器瞬态阶段读取和处理数据,所构建的传感器阵列电子鼻系统和采集方法在农业环境中氨检测具有重要的潜力。
{"title":"ENOSE Performance in Transient Time and Steady State Area of Gas Sensor Response for Ammonia Gas: Comparison and Study","authors":"Kuan Geng, Jahangir Moshayedi Ata, Jing-hao Chen, Jiandong Hu, Hao Zhang","doi":"10.1145/3590003.3590046","DOIUrl":"https://doi.org/10.1145/3590003.3590046","url":null,"abstract":"This paper proposed an electronic nose system that utilized a SnO2 semiconductor sensor array to detect volatile ammonia gas in farmland. All sensors were controlled by the Arduino development board. The system could collect data during both the steady-state and transient phases of sensor operation. The collected data was analyzed using PCA (principal component analysis) and MLP (Multi-layer perceptron) neural networks. The experiment was divided into two parts: The first part analyzed four concentrations of ammonia (100ppm, 200ppm, 400ppm, and Air) using PCA and MLP, which successfully distinguished the concentrations with an identification rate of over 95%. In the second part, four gases (air mixed with ammonia, pure ammonia gas, air mixed with ethanol, and pure ethanol) were analyzed using PCA and MLP, with the electronic nose system successfully distinguishing between the four types of gases. The system could read and process data during the transient phase of the sensor, and the constructed sensor array electronic nose system and acquisition method has significant potential for ammonia detection in agricultural environments.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127875618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Comparison of regional monitoring methods for grassland degradation based on remote sensing images 基于遥感影像的草地退化区域监测方法比较
Haoran Wang, Tianyun Xue, Zhaoran Wang, Xiangyu Bai
As an integral part of the ecosystem, grassland plays an important role in protecting water and soil, preventing wind and fixing sand and protecting biodiversity. However, some grasslands are degraded at this stage, so a grassland monitoring method is urgently needed to prevent desertification from spreading. With the rapid rise of deep learning, it is more and more popular to apply artificial intelligence methods to grassland degradation monitoring. This paper systematically and comprehensively analyzes that almost all semantic segmentation methods have been applied to relevant research on grassland degradation areas since semantic segmentation methods were applied to grassland monitoring. Then, according to the different algorithm structures of grassland extraction methods, the principles of representative algorithms are introduced in turn. Then we made a statistical analysis of the publication status, research space distribution and the number of citations of papers in this field. Finally, the analysis results are discussed, and the possible research hotspots in the future are discussed.
草地是生态系统的重要组成部分,在保水、保土、防风固沙、保护生物多样性等方面发挥着重要作用。然而,在这一阶段,一些草原正在退化,因此迫切需要一种草地监测方法来防止荒漠化的蔓延。随着深度学习的迅速兴起,将人工智能方法应用于草地退化监测越来越受欢迎。本文系统、全面地分析了自语义分割方法应用于草地监测以来,几乎所有的语义分割方法都应用于草地退化区的相关研究。然后,根据草地提取方法的算法结构不同,依次介绍了代表性算法的原理。然后对该领域的论文发表现状、研究空间分布和被引次数进行了统计分析。最后对分析结果进行了讨论,并对未来可能的研究热点进行了讨论。
{"title":"Comparison of regional monitoring methods for grassland degradation based on remote sensing images","authors":"Haoran Wang, Tianyun Xue, Zhaoran Wang, Xiangyu Bai","doi":"10.1145/3590003.3590083","DOIUrl":"https://doi.org/10.1145/3590003.3590083","url":null,"abstract":"As an integral part of the ecosystem, grassland plays an important role in protecting water and soil, preventing wind and fixing sand and protecting biodiversity. However, some grasslands are degraded at this stage, so a grassland monitoring method is urgently needed to prevent desertification from spreading. With the rapid rise of deep learning, it is more and more popular to apply artificial intelligence methods to grassland degradation monitoring. This paper systematically and comprehensively analyzes that almost all semantic segmentation methods have been applied to relevant research on grassland degradation areas since semantic segmentation methods were applied to grassland monitoring. Then, according to the different algorithm structures of grassland extraction methods, the principles of representative algorithms are introduced in turn. Then we made a statistical analysis of the publication status, research space distribution and the number of citations of papers in this field. Finally, the analysis results are discussed, and the possible research hotspots in the future are discussed.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125506916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Component for Query-based Object Detection in Crowded Scenes 拥挤场景中基于查询的对象检测组件
Shuo Mao
Query-based object detection, including DETR and Sparse R-CNN, has gained considerable attention in recent years. However, in dense scenes, end-to-end object detection methods are prone to false positives. To address this issue, we propose a graph convolution-based post-processing component to refine the output results from Sparse R-CNN. Specifically, we initially select high-scoring queries to generate true positive predictions. Subsequently, the query updater refines noisy query features using GCN. Lastly, the label assignment rule matches accepted predictions to ground truth objects, eliminates matched targets, and associates noisy predictions with the remaining ground truth objects. Our method significantly enhances performance in crowded scenes. Our method achieves 92.3% AP and 41.6% on CrowdHuman dataset, which is a challenging objection detection dataset.
基于查询的目标检测,包括DETR和Sparse R-CNN,近年来得到了相当多的关注。然而,在密集的场景中,端到端目标检测方法容易出现误报。为了解决这个问题,我们提出了一个基于图卷积的后处理组件来改进Sparse R-CNN的输出结果。具体来说,我们最初选择高分查询来生成真正预测。随后,查询更新器使用GCN对噪声查询特征进行细化。最后,标签分配规则将可接受的预测与基础真值对象相匹配,消除匹配的目标,并将噪声预测与剩余的基础真值对象相关联。我们的方法显著提高了拥挤场景下的性能。我们的方法在具有挑战性的目标检测数据集CrowdHuman上实现了92.3%的AP和41.6%的AP。
{"title":"A Component for Query-based Object Detection in Crowded Scenes","authors":"Shuo Mao","doi":"10.1145/3590003.3590039","DOIUrl":"https://doi.org/10.1145/3590003.3590039","url":null,"abstract":"Query-based object detection, including DETR and Sparse R-CNN, has gained considerable attention in recent years. However, in dense scenes, end-to-end object detection methods are prone to false positives. To address this issue, we propose a graph convolution-based post-processing component to refine the output results from Sparse R-CNN. Specifically, we initially select high-scoring queries to generate true positive predictions. Subsequently, the query updater refines noisy query features using GCN. Lastly, the label assignment rule matches accepted predictions to ground truth objects, eliminates matched targets, and associates noisy predictions with the remaining ground truth objects. Our method significantly enhances performance in crowded scenes. Our method achieves 92.3% AP and 41.6% on CrowdHuman dataset, which is a challenging objection detection dataset.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126241978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speech image data mining algorithm based on multimodal decision fusion 基于多模态决策融合的语音图像数据挖掘算法
Cong Lu, Danxing Wang, Daquan Zhang, Aiqun Yu
This paper proposes a data mining algorithm based on multimodal decision fusion, which is mainly used to solve the correlation relationship of multi-level and multi-level multimodal data, the algorithm combines the methods of statistics, queueing study, machine learning and Bayesian decision fusion, compared with the results obtained by single modality, single data and single method, the algorithm proposed in this paper retains the information contained in the data to the maximum extent, and the algorithm is applied to the analysis of both numerical and text-based data. The proposed algorithm can be further extended by modifying the data types and methods to form new methods.
本文提出了一种基于多模态决策融合的数据挖掘算法,主要用于解决多层次、多层次多模态数据的关联关系,该算法结合了统计学、排队研究、机器学习和贝叶斯决策融合等方法,与单模态、单数据、单方法得到的结果相比,本文提出的算法最大程度保留了数据中所包含的信息。并将该算法应用于数值数据和文本数据的分析。该算法可以通过修改数据类型和方法来进一步扩展,从而形成新的方法。
{"title":"Speech image data mining algorithm based on multimodal decision fusion","authors":"Cong Lu, Danxing Wang, Daquan Zhang, Aiqun Yu","doi":"10.1145/3590003.3590007","DOIUrl":"https://doi.org/10.1145/3590003.3590007","url":null,"abstract":"This paper proposes a data mining algorithm based on multimodal decision fusion, which is mainly used to solve the correlation relationship of multi-level and multi-level multimodal data, the algorithm combines the methods of statistics, queueing study, machine learning and Bayesian decision fusion, compared with the results obtained by single modality, single data and single method, the algorithm proposed in this paper retains the information contained in the data to the maximum extent, and the algorithm is applied to the analysis of both numerical and text-based data. The proposed algorithm can be further extended by modifying the data types and methods to form new methods.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121532552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross-Modal Audio-Text Retrieval via Sequential Feature Augmentation 基于序列特征增强的跨模态音频文本检索
Fuhu Song, Jifeng Hu, Che Wang, Jiao Huang, Haowen Zhang, Yi Wang
The goal of cross-modal audio-text retrieval is to retrieve the target audio clips (textual descriptions), which should be relevant to a given textual (audial) query. It is a challenging task because it necessitates learning comprehensive feature representations for two different modalities and unifying them into a common embedding space. However, most existing cross-modal audio-text retrieval approaches do not explicitly learn the sequential representation in audio features. Moreover, their method of directly employing a fully connected neural network to transform the different modalities into a common space is detrimental to sequential features. In this paper, we introduce a sequential feature augmentation framework based on reinforcement learning and feature fusion to enhance the sequential feature for cross-modal features. First, we adopt reinforcement learning to explore effective sequential features in audial and textual features. Then, a recurrent fusion module is applied as a feature enhancement component to project heterogeneous features into a common space. Extensive experiments are conducted on two prevalent datasets: the AudioCaps and the Clotho. The results demonstrate that our method gains a significant improvement over previous state-of-the-art methods.
跨模态音频-文本检索的目标是检索目标音频片段(文本描述),它应该与给定的文本(音频)查询相关。这是一项具有挑战性的任务,因为它需要学习两种不同模态的综合特征表示,并将它们统一到一个共同的嵌入空间中。然而,大多数现有的跨模态音频-文本检索方法没有明确地学习音频特征中的顺序表示。此外,他们直接使用全连接的神经网络将不同的模态转换到公共空间的方法不利于序列特征。本文提出了一种基于强化学习和特征融合的序列特征增强框架,用于增强跨模态特征的序列特征。首先,我们采用强化学习来探索听觉和文本特征中的有效序列特征。然后,应用循环融合模块作为特征增强组件,将异构特征投影到公共空间中。在两个流行的数据集上进行了广泛的实验:AudioCaps和Clotho。结果表明,我们的方法比以前的最先进的方法有了显著的改进。
{"title":"Cross-Modal Audio-Text Retrieval via Sequential Feature Augmentation","authors":"Fuhu Song, Jifeng Hu, Che Wang, Jiao Huang, Haowen Zhang, Yi Wang","doi":"10.1145/3590003.3590056","DOIUrl":"https://doi.org/10.1145/3590003.3590056","url":null,"abstract":"The goal of cross-modal audio-text retrieval is to retrieve the target audio clips (textual descriptions), which should be relevant to a given textual (audial) query. It is a challenging task because it necessitates learning comprehensive feature representations for two different modalities and unifying them into a common embedding space. However, most existing cross-modal audio-text retrieval approaches do not explicitly learn the sequential representation in audio features. Moreover, their method of directly employing a fully connected neural network to transform the different modalities into a common space is detrimental to sequential features. In this paper, we introduce a sequential feature augmentation framework based on reinforcement learning and feature fusion to enhance the sequential feature for cross-modal features. First, we adopt reinforcement learning to explore effective sequential features in audial and textual features. Then, a recurrent fusion module is applied as a feature enhancement component to project heterogeneous features into a common space. Extensive experiments are conducted on two prevalent datasets: the AudioCaps and the Clotho. The results demonstrate that our method gains a significant improvement over previous state-of-the-art methods.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"173 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121533331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Reinforcement Learning with Copy-oriented Context Awareness and Weighted Rewards for Abstractive Summarization 基于面向复制的上下文感知和抽象摘要加权奖励的深度强化学习
Caidong Tan
This paper presents a deep context-aware model with a copy mechanism based on reinforcement learning for abstractive text summarization. Our model is optimized using weighted ROUGEs as global prediction-based rewards and the self-critical policy gradient training algorithm, which can reduce the inconsistency between training and testing by directly optimizing the evaluation metrics. To alleviate the lexical diversity and component diversity problems caused by global prediction rewards, we improve the richness of the multi-head self-attention mechanism to capture context through global deep context representation with copy mechanism. We conduct experiments and demonstrate that our model outperforms many existing benchmarks over the Gigaword, LCSTS, and CNN/DM datasets. The experimental results demonstrate that our model has a significant effect on improving the quality of summarization.
本文提出了一种基于强化学习的深度上下文感知模型及其复制机制,用于抽象文本摘要。我们的模型使用加权rouge作为全局预测奖励和自批判策略梯度训练算法进行优化,通过直接优化评估指标来减少训练和测试之间的不一致性。为了缓解全局预测奖励导致的词汇多样性和成分多样性问题,我们通过复制机制的全局深度上下文表示来提高多头自注意机制捕获上下文的丰富性。我们进行了实验,并证明我们的模型在Gigaword、LCSTS和CNN/DM数据集上优于许多现有的基准测试。实验结果表明,该模型对提高摘要质量有显著效果。
{"title":"Deep Reinforcement Learning with Copy-oriented Context Awareness and Weighted Rewards for Abstractive Summarization","authors":"Caidong Tan","doi":"10.1145/3590003.3590019","DOIUrl":"https://doi.org/10.1145/3590003.3590019","url":null,"abstract":"This paper presents a deep context-aware model with a copy mechanism based on reinforcement learning for abstractive text summarization. Our model is optimized using weighted ROUGEs as global prediction-based rewards and the self-critical policy gradient training algorithm, which can reduce the inconsistency between training and testing by directly optimizing the evaluation metrics. To alleviate the lexical diversity and component diversity problems caused by global prediction rewards, we improve the richness of the multi-head self-attention mechanism to capture context through global deep context representation with copy mechanism. We conduct experiments and demonstrate that our model outperforms many existing benchmarks over the Gigaword, LCSTS, and CNN/DM datasets. The experimental results demonstrate that our model has a significant effect on improving the quality of summarization.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121193004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Real-time Emulation of MASQUE-based QUIC Proxying in LTE Networks using ns-3 LTE网络中基于masque的QUIC代理的ns-3实时仿真
Donat Scharnitzky, Zsolt Kramer, S. Molnár, A. Mihály
Tools for real-time emulation of mobile networks are valuable for researchers due to the high amount of time and resources it allows to save compared to carrying out measurements in live networks. In this paper we present the rationale, design and prototype implementation of a novel net device in the ns-3 open source network simulator that allows for end-to-end real-time emulation of LTE networks with real endpoints. We then show the performance evaluation of a QUIC proxy built on MASQUE using our emulated LTE setup. Our results confirm the intended behavior of the implementation, however, we also show the limitations of the real-time capabilities of ns-3.
移动网络的实时仿真工具对研究人员来说很有价值,因为与在实时网络中进行测量相比,它可以节省大量的时间和资源。在本文中,我们介绍了ns-3开源网络模拟器中一种新型网络设备的基本原理、设计和原型实现,该设备允许对具有真实端点的LTE网络进行端到端实时仿真。然后,我们展示了使用我们的模拟LTE设置构建在MASQUE上的QUIC代理的性能评估。我们的结果证实了实现的预期行为,然而,我们也显示了ns-3实时功能的局限性。
{"title":"Real-time Emulation of MASQUE-based QUIC Proxying in LTE Networks using ns-3","authors":"Donat Scharnitzky, Zsolt Kramer, S. Molnár, A. Mihály","doi":"10.1145/3590003.3590995","DOIUrl":"https://doi.org/10.1145/3590003.3590995","url":null,"abstract":"Tools for real-time emulation of mobile networks are valuable for researchers due to the high amount of time and resources it allows to save compared to carrying out measurements in live networks. In this paper we present the rationale, design and prototype implementation of a novel net device in the ns-3 open source network simulator that allows for end-to-end real-time emulation of LTE networks with real endpoints. We then show the performance evaluation of a QUIC proxy built on MASQUE using our emulated LTE setup. Our results confirm the intended behavior of the implementation, however, we also show the limitations of the real-time capabilities of ns-3.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121708101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Customer Service Hot event Discovery Based on Dynamic Dialogue Embedding 基于动态对话嵌入的客户服务热点事件发现
Fei Li, Yanyan Wang, Ying Feng, Qiangzhong Feng, Yuan Zhou, Dexuan Wang
Frequent customer service conversations focus on hot topics of communication users, and automatic hot topic discovery is critical to improving user experience. Traditionally, Customer service relies on operator to write traffic summaries. It leads to the source of the conversation difficult to analyze, which makes difficult to spot aggregated hotspot events. In this paper, we propose a Customer Service hot event Discovery based on dynamic dialogue embedding (CShe-D). This model includes dynamic semantic representation of customer service dialogue, clustering-based customer service hot event discovery and new hot event prediction. In the dialogue semantic embedding module, we obtain the dynamic embedding of each dialogue with combining word importance and word length based on the pre-trained language model to capture richer semantic information in different contexts. We further apply a clustering iterative algorithm with dynamic dialogue embedding to discover customer service hotspots. It can monitor the change trend of events in real time, optimize the accuracy of hot event discovery in operator customer service. Finally, the effectiveness of our CShe-D model is verified by experiments on real dialogue data in the field of customer service.
频繁的客服会话集中在通信用户的热点话题上,而热点话题的自动发现对于提升用户体验至关重要。传统上,客户服务依赖于接线员撰写流量摘要。这导致难以分析会话的来源,从而难以发现聚合的热点事件。本文提出了一种基于动态对话嵌入(CShe-D)的客服热点事件发现方法。该模型包括客户服务对话的动态语义表示、基于聚类的客户服务热点事件发现和新的热点事件预测。在对话语义嵌入模块中,我们在预先训练好的语言模型的基础上,结合词的重要性和词的长度对每个对话进行动态嵌入,从而在不同的语境中获取更丰富的语义信息。我们进一步应用动态对话嵌入聚类迭代算法来发现客户服务热点。实时监控事件变化趋势,优化运营商客服热点事件发现的准确性。最后,通过客户服务领域真实对话数据的实验验证了CShe-D模型的有效性。
{"title":"Customer Service Hot event Discovery Based on Dynamic Dialogue Embedding","authors":"Fei Li, Yanyan Wang, Ying Feng, Qiangzhong Feng, Yuan Zhou, Dexuan Wang","doi":"10.1145/3590003.3590011","DOIUrl":"https://doi.org/10.1145/3590003.3590011","url":null,"abstract":"Frequent customer service conversations focus on hot topics of communication users, and automatic hot topic discovery is critical to improving user experience. Traditionally, Customer service relies on operator to write traffic summaries. It leads to the source of the conversation difficult to analyze, which makes difficult to spot aggregated hotspot events. In this paper, we propose a Customer Service hot event Discovery based on dynamic dialogue embedding (CShe-D). This model includes dynamic semantic representation of customer service dialogue, clustering-based customer service hot event discovery and new hot event prediction. In the dialogue semantic embedding module, we obtain the dynamic embedding of each dialogue with combining word importance and word length based on the pre-trained language model to capture richer semantic information in different contexts. We further apply a clustering iterative algorithm with dynamic dialogue embedding to discover customer service hotspots. It can monitor the change trend of events in real time, optimize the accuracy of hot event discovery in operator customer service. Finally, the effectiveness of our CShe-D model is verified by experiments on real dialogue data in the field of customer service.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125175187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1