首页 > 最新文献

2022 International Joint Conference on Neural Networks (IJCNN)最新文献

英文 中文
Transfer Learning Framework for Forecasting Fresh Produce Yield and Price 预测新鲜农产品产量和价格的迁移学习框架
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892192
Islam Nasr, L. Nassar, F. Karray
Accurate estimates of fresh produce (FP) yields and prices are crucial for having fair bidding prices by retailers along with informed asking prices by farmers, leading to the best prices for customers. To have accurate estimates, the state-of-the-art deep learning (DL) models for forecasting FP yields and prices are improved in this work while a novel transfer learning (TL) framework is proposed for better generalizability. The proposed models are trained and tested using real world datasets for the Santa Barbara region in California, which contain environmental input parameters mapped to FP yield and price output parameters. Based on an aggregated measure (AGM), the proposed model, an ensemble of Attention Deep Feedforward Neural Network with Gated Recurrent Unit (GRU) units and Deep Feedforward Neural Network with embedded GRU units, is found to significantly outperform the state-of-the-art models. Beside finding the best DL, the TL framework is utilizing FP similarity, clustering, and TL techniques customized to fit the problem in hand and enhance the model generalization to other FPs. The literature similarity algorithms are improved by considering the time series features rather than the absolute values of their points. In addition, the FPs are clustered using a hierarchical clustering technique utilizing the complete linkage of a dendrogram to automate the process of finding the similarity thresholds and avoid setting them arbitrarily. Finally, the transfer learning is applied by freezing some layers of the proposed ensemble model and fine-tuning the rest leading to significant improvement in AGM compared to the best literature model.
新鲜农产品(FP)产量和价格的准确估计对于零售商的公平竞标价格以及农民的知情要价至关重要,从而为消费者带来最优惠的价格。为了获得准确的估计,本研究改进了用于预测FP产量和价格的最先进的深度学习(DL)模型,同时提出了一种新的迁移学习(TL)框架,以获得更好的泛化性。所提出的模型使用加利福尼亚州圣巴巴拉地区的真实数据集进行训练和测试,其中包含映射到FP产量和价格输出参数的环境输入参数。基于聚合度量(AGM),所提出的模型,具有门控循环单元(GRU)单元的注意力深度前馈神经网络和具有嵌入式GRU单元的深度前馈神经网络的集成,被发现显着优于最先进的模型。除了寻找最佳深度学习外,TL框架还利用FP相似度,聚类和定制的TL技术来适应手边的问题,并增强模型对其他深度学习的泛化。文献相似度算法通过考虑时间序列特征而不是其点的绝对值来改进。此外,使用分层聚类技术对FPs进行聚类,利用树形图的完整链接来自动查找相似阈值的过程,避免任意设置它们。最后,通过冻结所提出的集成模型的某些层并对其余层进行微调来应用迁移学习,与最佳文献模型相比,AGM得到了显着改善。
{"title":"Transfer Learning Framework for Forecasting Fresh Produce Yield and Price","authors":"Islam Nasr, L. Nassar, F. Karray","doi":"10.1109/IJCNN55064.2022.9892192","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892192","url":null,"abstract":"Accurate estimates of fresh produce (FP) yields and prices are crucial for having fair bidding prices by retailers along with informed asking prices by farmers, leading to the best prices for customers. To have accurate estimates, the state-of-the-art deep learning (DL) models for forecasting FP yields and prices are improved in this work while a novel transfer learning (TL) framework is proposed for better generalizability. The proposed models are trained and tested using real world datasets for the Santa Barbara region in California, which contain environmental input parameters mapped to FP yield and price output parameters. Based on an aggregated measure (AGM), the proposed model, an ensemble of Attention Deep Feedforward Neural Network with Gated Recurrent Unit (GRU) units and Deep Feedforward Neural Network with embedded GRU units, is found to significantly outperform the state-of-the-art models. Beside finding the best DL, the TL framework is utilizing FP similarity, clustering, and TL techniques customized to fit the problem in hand and enhance the model generalization to other FPs. The literature similarity algorithms are improved by considering the time series features rather than the absolute values of their points. In addition, the FPs are clustered using a hierarchical clustering technique utilizing the complete linkage of a dendrogram to automate the process of finding the similarity thresholds and avoid setting them arbitrarily. Finally, the transfer learning is applied by freezing some layers of the proposed ensemble model and fine-tuning the rest leading to significant improvement in AGM compared to the best literature model.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129995415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
TinyML for UWB-radar based presence detection TinyML用于基于uwb雷达的存在检测
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892925
Massimo Pavan, Armando Caltabiano, M. Roveri
Tiny Machine Learning (TinyML) is a novel research area aiming at designing machine and deep learning models and algorithms able to be executed on tiny devices such as Internet-of-Things units, edge devices or embedded systems. In this paper we introduce, for the first time in the literature, a TinyML solution for presence-detection based on UltrawideBand (UWB) radar, which is a particularly promising radar technology for pervasive systems. To achieve this goal we introduce a novel family of tiny convolutional neural networks for the processing of UWB-radar data characterized by a reduced memory footprint and computational demand so as to satisfy the severe technological constraints of tiny devices. From this technological perspective, UWB-radars are particularly relevant in the presence-detection scenario since they do not acquire sensitive information of users (e.g., images, videos or audio), hence preserving their privacy. The proposed solution has been successfully tested on a public-available benchmark for the indoor presence detection and on a real-world application of in-car presence detection.
微型机器学习(TinyML)是一个新颖的研究领域,旨在设计能够在微型设备(如物联网单元、边缘设备或嵌入式系统)上执行的机器和深度学习模型和算法。在本文中,我们首次在文献中介绍了一种基于超宽带(UWB)雷达的TinyML存在检测解决方案,这是一种特别有前途的普适系统雷达技术。为了实现这一目标,我们引入了一种新型的微型卷积神经网络,用于处理超宽带雷达数据,其特点是内存占用和计算需求减少,从而满足微型设备的严格技术限制。从这个技术角度来看,超宽带雷达在存在检测场景中尤为重要,因为它们不会获取用户的敏感信息(例如图像、视频或音频),从而保护了用户的隐私。所提出的解决方案已在室内存在检测的公共基准和车内存在检测的实际应用中成功测试。
{"title":"TinyML for UWB-radar based presence detection","authors":"Massimo Pavan, Armando Caltabiano, M. Roveri","doi":"10.1109/IJCNN55064.2022.9892925","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892925","url":null,"abstract":"Tiny Machine Learning (TinyML) is a novel research area aiming at designing machine and deep learning models and algorithms able to be executed on tiny devices such as Internet-of-Things units, edge devices or embedded systems. In this paper we introduce, for the first time in the literature, a TinyML solution for presence-detection based on UltrawideBand (UWB) radar, which is a particularly promising radar technology for pervasive systems. To achieve this goal we introduce a novel family of tiny convolutional neural networks for the processing of UWB-radar data characterized by a reduced memory footprint and computational demand so as to satisfy the severe technological constraints of tiny devices. From this technological perspective, UWB-radars are particularly relevant in the presence-detection scenario since they do not acquire sensitive information of users (e.g., images, videos or audio), hence preserving their privacy. The proposed solution has been successfully tested on a public-available benchmark for the indoor presence detection and on a real-world application of in-car presence detection.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134009571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Causal Network Construction Algorithm Based on Partial Rank Correlation on Time Series 时间序列上基于偏秩相关的因果网络构建算法
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9891908
J. Yang, Qiqi Chen
Identifying causal relationships from observational time-series data is a key problem in dealing with complex dynamical systems such as in the industrial or natural climate fields. Data-driven causal network construction in such systems is challenging since data sets are often high-dimensional and nonlinear. In response to this challenge, this paper combines partial rank correlation coefficients and proposes a new structure learning algorithm, TS-PRCS, suitable for time-series causal network models. In this article, we mainly make three contributions. First, we proved that partial rank correlation can be used as a standard of independence tests. Second, we combined partial rank correlation with constraint-based causality discovery methods, and proposed a causal network discovery algorithm (TS-PRCS) on time-series data based on partial rank correlation. Finally, the effectiveness of the algorithm is proven in experiments on time-series data generated by a time-series causal network model. Compared with an existing algorithm, the proposed algorithm achieves better results on high-dimensional and nonlinear data systems, and it also demonstrates good time performance. In particular, the algorithm has been applied to real data generated by a power plant. Experiments show that our method improves the ability to detect causality on time-series data, and further promotes the development of the field of causal network construction on time-series data.
从观测时间序列数据中识别因果关系是处理复杂动力系统(如工业或自然气候领域)的关键问题。由于数据集通常是高维和非线性的,因此在此类系统中数据驱动的因果网络构建具有挑战性。针对这一挑战,本文结合偏秩相关系数,提出了一种适用于时间序列因果网络模型的结构学习算法TS-PRCS。在本文中,我们主要做了三点贡献。首先,我们证明了偏秩相关可以作为独立性检验的标准。其次,将偏秩相关与基于约束的因果关系发现方法相结合,提出了一种基于偏秩相关的时序数据因果网络发现算法(TS-PRCS)。最后,在时间序列因果网络模型生成的时间序列数据上,通过实验验证了该算法的有效性。与现有算法相比,该算法在高维非线性数据系统上取得了更好的效果,并且具有良好的时间性能。并将该算法应用于某电厂的实际数据处理。实验表明,该方法提高了时间序列数据因果关系检测的能力,进一步推动了时间序列数据因果网络构建领域的发展。
{"title":"A Causal Network Construction Algorithm Based on Partial Rank Correlation on Time Series","authors":"J. Yang, Qiqi Chen","doi":"10.1109/IJCNN55064.2022.9891908","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9891908","url":null,"abstract":"Identifying causal relationships from observational time-series data is a key problem in dealing with complex dynamical systems such as in the industrial or natural climate fields. Data-driven causal network construction in such systems is challenging since data sets are often high-dimensional and nonlinear. In response to this challenge, this paper combines partial rank correlation coefficients and proposes a new structure learning algorithm, TS-PRCS, suitable for time-series causal network models. In this article, we mainly make three contributions. First, we proved that partial rank correlation can be used as a standard of independence tests. Second, we combined partial rank correlation with constraint-based causality discovery methods, and proposed a causal network discovery algorithm (TS-PRCS) on time-series data based on partial rank correlation. Finally, the effectiveness of the algorithm is proven in experiments on time-series data generated by a time-series causal network model. Compared with an existing algorithm, the proposed algorithm achieves better results on high-dimensional and nonlinear data systems, and it also demonstrates good time performance. In particular, the algorithm has been applied to real data generated by a power plant. Experiments show that our method improves the ability to detect causality on time-series data, and further promotes the development of the field of causal network construction on time-series data.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134080188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Localization of Concept Drift: Identifying the Drifting Datapoints 概念漂移的定位:漂移数据点的识别
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892374
Fabian Hinder, Valerie Vaquet, Johannes Brinkrolf, André Artelt, Barbara Hammer
The notion of concept drift refers to the phenomenon that the distribution which is underlying the observed data changes over time. As a consequence machine learning models may become inaccurate and need adjustment. While there do exist methods to detect concept drift, to find change points in data streams, or to adjust models in the presence of observed drift, the problem of localizing drift, i.e. identifying it in data space, is yet widely unsolved - in particular from a formal perspective. This problem however is of importance, since it enables an inspection of the most prominent characteristics, e.g. features, where drift manifests itself and can therefore be used to make informed decisions, e.g. efficient updates of the training set of online learning algorithms, and perform precise adjustments of the learning model. In this paper we present a general theoretical framework that reduces drift localization to a supervised machine learning problem. We construct a new method for drift localization thereon and demonstrate the usefulness of our theory and the performance of our algorithm by comparing it to other methods from the literature.
概念漂移指的是观测数据的分布随时间变化的现象。因此,机器学习模型可能会变得不准确,需要调整。虽然确实存在检测概念漂移的方法,在数据流中找到变化点,或者在观测到漂移的情况下调整模型,但是定位漂移的问题,即在数据空间中识别它,还没有得到广泛的解决-特别是从形式的角度来看。然而,这个问题是很重要的,因为它可以检查最突出的特征,例如特征,其中漂移表现出来,因此可以用来做出明智的决策,例如在线学习算法的训练集的有效更新,并执行学习模型的精确调整。在本文中,我们提出了一个通用的理论框架,将漂移定位降低到一个监督机器学习问题。我们构建了一种新的漂移定位方法,并通过与文献中其他方法的比较,证明了我们的理论和算法的有效性。
{"title":"Localization of Concept Drift: Identifying the Drifting Datapoints","authors":"Fabian Hinder, Valerie Vaquet, Johannes Brinkrolf, André Artelt, Barbara Hammer","doi":"10.1109/IJCNN55064.2022.9892374","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892374","url":null,"abstract":"The notion of concept drift refers to the phenomenon that the distribution which is underlying the observed data changes over time. As a consequence machine learning models may become inaccurate and need adjustment. While there do exist methods to detect concept drift, to find change points in data streams, or to adjust models in the presence of observed drift, the problem of localizing drift, i.e. identifying it in data space, is yet widely unsolved - in particular from a formal perspective. This problem however is of importance, since it enables an inspection of the most prominent characteristics, e.g. features, where drift manifests itself and can therefore be used to make informed decisions, e.g. efficient updates of the training set of online learning algorithms, and perform precise adjustments of the learning model. In this paper we present a general theoretical framework that reduces drift localization to a supervised machine learning problem. We construct a new method for drift localization thereon and demonstrate the usefulness of our theory and the performance of our algorithm by comparing it to other methods from the literature.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134240096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Hearables: Artefact removal in Ear-EEG for continuous 24/7 monitoring 可听设备:去除耳内伪影,实现24/7连续监测
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892675
Edoardo Occhipinti, H. Davies, Ghena Hammour, Danilo P. Mandic
Ear-worn devices offer the opportunity to measure vital signals in a 24/7 fashion, without the need of a clinician. These devices are however prone to motion artefacts, so that entire epochs of artefact-corrupt recordings are routinely discarded. This work aims at reducing the impact of artefacts introduced by a series of common real life daily activities such as talking, chewing, and walking while recording Electroencephalogram (EEG) from the ear canal. The approach used employs multiple external sensors, such as microphones and an accelerometer as means to capture the artefact. The proposed algorithm is a combination of Noise-Assisted Multivariate Empirical Mode Decomposition (NA-MEMD) with Adaptive Noise Cancellation (ANC), where each pair (EEG and motion sensors) of Intrinsic Mode Functions (IMFs) within NA-MEMD is fed independently to multiple Normalised Least Mean Square (NLMS) adaptive filters. The resulting denoised IMFs are then added up again to reconstruct the denoised EEG signal. Results across multiple subjects show that the so denoised EEG signals have reduced power in the frequency range occupied by artefacts. Also, different sensors provide different denoising performance in the tested artefacts, with the microphones being more sensitive to artefacts which cause internal motion within the ear-canal, such as chewing, and the accelerometer being more suitable for artefacts which come from full body movements of the subjects, such as walking.
耳戴式设备提供了在不需要临床医生的情况下全天候测量生命信号的机会。然而,这些设备容易产生运动伪影,因此,整个时代的伪影损坏的录音通常被丢弃。这项工作旨在通过记录耳道脑电图(EEG)来减少由一系列常见的现实生活日常活动(如说话,咀嚼和行走)引入的人工制品的影响。所使用的方法采用多个外部传感器,如麦克风和加速度计作为捕获人工制品的手段。所提出的算法是噪声辅助多元经验模态分解(NA-MEMD)和自适应噪声消除(ANC)的结合,其中NA-MEMD内的每对(EEG和运动传感器)内禀模态函数(IMFs)被独立地馈送到多个归一化最小均方(NLMS)自适应滤波器。然后将得到的去噪后的imf再次相加以重建去噪后的脑电信号。多受试者的实验结果表明,去噪后的脑电图信号在被伪信号占据的频率范围内功率降低。此外,不同的传感器在测试的伪影中提供了不同的去噪性能,麦克风对引起耳道内部运动的伪影更敏感,比如咀嚼,加速度计更适合于来自受试者全身运动的伪影,比如走路。
{"title":"Hearables: Artefact removal in Ear-EEG for continuous 24/7 monitoring","authors":"Edoardo Occhipinti, H. Davies, Ghena Hammour, Danilo P. Mandic","doi":"10.1109/IJCNN55064.2022.9892675","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892675","url":null,"abstract":"Ear-worn devices offer the opportunity to measure vital signals in a 24/7 fashion, without the need of a clinician. These devices are however prone to motion artefacts, so that entire epochs of artefact-corrupt recordings are routinely discarded. This work aims at reducing the impact of artefacts introduced by a series of common real life daily activities such as talking, chewing, and walking while recording Electroencephalogram (EEG) from the ear canal. The approach used employs multiple external sensors, such as microphones and an accelerometer as means to capture the artefact. The proposed algorithm is a combination of Noise-Assisted Multivariate Empirical Mode Decomposition (NA-MEMD) with Adaptive Noise Cancellation (ANC), where each pair (EEG and motion sensors) of Intrinsic Mode Functions (IMFs) within NA-MEMD is fed independently to multiple Normalised Least Mean Square (NLMS) adaptive filters. The resulting denoised IMFs are then added up again to reconstruct the denoised EEG signal. Results across multiple subjects show that the so denoised EEG signals have reduced power in the frequency range occupied by artefacts. Also, different sensors provide different denoising performance in the tested artefacts, with the microphones being more sensitive to artefacts which cause internal motion within the ear-canal, such as chewing, and the accelerometer being more suitable for artefacts which come from full body movements of the subjects, such as walking.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134423544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A hybrid algorithm for fuzzy clustering based on global and local membership degree 基于全局和局部隶属度的模糊聚类混合算法
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892394
Bruno A. Pimentel, Jadson Crislan Santos Costa
The clustering task has challenges that change according to the data, thus different algorithms have been proposed where each one has a bias on the data. In the fuzzy clustering approach, the most popular algorithm is the Fuzzy C-Means (FCM), which uses a global view of variables to calculate the degree of membership. On the other hand, the Multivariate Fuzzy C-Means (MFCM) uses a local view of variables to calculate the degree of membership. In this work, we proposed a new hybrid algorithm to use a combined local and global view approaches. For this, a new objective function based on the hybridization parameter is introduced. The experiments show the robustness and superiority of the proposed algorithm in real and synthetic datasets in most of the analyzed scenarios.
聚类任务具有根据数据变化的挑战,因此提出了不同的算法,其中每个算法对数据都有偏差。在模糊聚类方法中,最流行的算法是模糊c均值(FCM),它使用全局变量视图来计算隶属度。另一方面,多元模糊c均值(Multivariate Fuzzy C-Means, MFCM)使用变量的局部视图来计算隶属度。在这项工作中,我们提出了一种新的混合算法,使用局部和全局视图相结合的方法。为此,引入了一种新的基于杂交参数的目标函数。实验结果表明,在大多数分析场景下,该算法在真实数据集和合成数据集上都具有鲁棒性和优越性。
{"title":"A hybrid algorithm for fuzzy clustering based on global and local membership degree","authors":"Bruno A. Pimentel, Jadson Crislan Santos Costa","doi":"10.1109/IJCNN55064.2022.9892394","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892394","url":null,"abstract":"The clustering task has challenges that change according to the data, thus different algorithms have been proposed where each one has a bias on the data. In the fuzzy clustering approach, the most popular algorithm is the Fuzzy C-Means (FCM), which uses a global view of variables to calculate the degree of membership. On the other hand, the Multivariate Fuzzy C-Means (MFCM) uses a local view of variables to calculate the degree of membership. In this work, we proposed a new hybrid algorithm to use a combined local and global view approaches. For this, a new objective function based on the hybridization parameter is introduced. The experiments show the robustness and superiority of the proposed algorithm in real and synthetic datasets in most of the analyzed scenarios.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134004289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KC2UM: Knowledge-Conversation Cyclic Utilization Mechanism for Knowledge-Grounded Dialogue Generation KC2UM:基于知识的对话生成的知识会话循环利用机制
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892149
Yajing Sun, Yue Hu, Luxi Xing, Wei Peng, Yuqiang Xie, Xingsheng Zhang
End-to-End open-domain dialogue systems suffer from the issues of generating inconsistent and repetitive responses. Existing dialogue models pay attention to unilaterally incorporating personalized knowledge into the dialogue to enhance the quality of generated response. However, they ignore that incorporating the personality-related information from dialogue history into personalized knowledge can boost the subsequent dialogue quality. In this paper, A Knowledge-Conversation Cyclic Utilization Mechanism (KC2UM) is proposed to enhance the dialogue quality. Specifically, A novel cyclic interaction module is designed to iteratively incorporate personalized knowledge into each turn conversation and capture the personality-related conversation information to enhance personalized knowledge semantic representation. We represent the knowledge with semantic and utilization representations to keep track of the personalized knowledge utilization. Experiments on two knowledge-grounded dialogue datasets show that our approach manages to select knowledge more accurately and generates more informative responses.
端到端开放域对话系统存在产生不一致和重复响应的问题。现有的对话模型注重单方面地将个性化的知识融入到对话中,以提高生成响应的质量。然而,他们忽略了将对话历史中的个性相关信息纳入个性化知识中可以提高后续对话的质量。本文提出了一种知识会话循环利用机制(KC2UM)来提高对话质量。具体而言,设计了一种新的循环交互模块,将个性化知识迭代地融入到每一轮会话中,并捕获与个性相关的会话信息,增强个性化知识的语义表示。我们用语义表示和利用表示来表示知识,以跟踪个性化的知识利用情况。在两个基于知识的对话数据集上的实验表明,我们的方法能够更准确地选择知识并产生更丰富的信息响应。
{"title":"KC2UM: Knowledge-Conversation Cyclic Utilization Mechanism for Knowledge-Grounded Dialogue Generation","authors":"Yajing Sun, Yue Hu, Luxi Xing, Wei Peng, Yuqiang Xie, Xingsheng Zhang","doi":"10.1109/IJCNN55064.2022.9892149","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892149","url":null,"abstract":"End-to-End open-domain dialogue systems suffer from the issues of generating inconsistent and repetitive responses. Existing dialogue models pay attention to unilaterally incorporating personalized knowledge into the dialogue to enhance the quality of generated response. However, they ignore that incorporating the personality-related information from dialogue history into personalized knowledge can boost the subsequent dialogue quality. In this paper, A Knowledge-Conversation Cyclic Utilization Mechanism (KC2UM) is proposed to enhance the dialogue quality. Specifically, A novel cyclic interaction module is designed to iteratively incorporate personalized knowledge into each turn conversation and capture the personality-related conversation information to enhance personalized knowledge semantic representation. We represent the knowledge with semantic and utilization representations to keep track of the personalized knowledge utilization. Experiments on two knowledge-grounded dialogue datasets show that our approach manages to select knowledge more accurately and generates more informative responses.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134181061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rethinking the Feature Iteration Process of Graph Convolution Networks 图卷积网络特征迭代过程的再思考
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892737
Bisheng Tang, Xiaojun Chen, Dakui Wang, Zhendong Zhao
Node classification is a fundamental research problem in graph neural networks(GNNs), which uses node's feature and label to capture node embedding in a low dimension. The existing graph node classification approaches mainly focus on GNNs from global and local perspectives. The relevant research is relatively insufficient for the micro perspective, which refers to the feature itself. In this paper, we prove that deeper GCNs' features will be updated with the same coefficient in the same dimension, limiting deeper GCNs' expression. To overcome the limits of the deeper GCN model, we propose a zero feature (k-ZF) method to train GCNs. Specifically, k-ZF randomly sets the initial k feature value to zero, acting as a data rectifier and augmenter, and is also a skill equipped with GCNs models and other GCNs skills. Extensive experiments based on three public datasets show that k-ZF significantly improves GCNs in the feature aspect and achieves competitive accuracy.
节点分类是图神经网络(gnn)的一个基础研究问题,它利用节点的特征和标签来捕获低维节点嵌入。现有的图节点分类方法主要从全局和局部两个角度对gnn进行分类。微观视角是指特征本身,相关研究相对不足。在本文中,我们证明了更深的GCNs的特征将在相同的维度上使用相同的系数进行更新,从而限制了更深的GCNs的表达。为了克服深层GCN模型的局限性,我们提出了一种零特征(k-ZF)方法来训练GCN。具体来说,k- zf将初始k个特征值随机设置为零,起到数据整流器和增强器的作用,也是一种具备GCNs模型和其他GCNs技能的技能。基于三个公开数据集的大量实验表明,k-ZF在特征方面显著改善了GCNs,达到了相当的准确率。
{"title":"Rethinking the Feature Iteration Process of Graph Convolution Networks","authors":"Bisheng Tang, Xiaojun Chen, Dakui Wang, Zhendong Zhao","doi":"10.1109/IJCNN55064.2022.9892737","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892737","url":null,"abstract":"Node classification is a fundamental research problem in graph neural networks(GNNs), which uses node's feature and label to capture node embedding in a low dimension. The existing graph node classification approaches mainly focus on GNNs from global and local perspectives. The relevant research is relatively insufficient for the micro perspective, which refers to the feature itself. In this paper, we prove that deeper GCNs' features will be updated with the same coefficient in the same dimension, limiting deeper GCNs' expression. To overcome the limits of the deeper GCN model, we propose a zero feature (k-ZF) method to train GCNs. Specifically, k-ZF randomly sets the initial k feature value to zero, acting as a data rectifier and augmenter, and is also a skill equipped with GCNs models and other GCNs skills. Extensive experiments based on three public datasets show that k-ZF significantly improves GCNs in the feature aspect and achieves competitive accuracy.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"123 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134312002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting Human-Object Interactions in Egocentric Videos 在以自我为中心的视频中预测人-物交互
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892910
Manuel Benavent-Lledó, Sergiu Oprea, John Alejandro Castro-Vargas, David Mulero-Pérez, J. G. Rodríguez
Egocentric videos provide a rich source of hand-object interactions that support action recognition. However, prior to action recognition, one may need to detect the presence of hands and objects in the scene. In this work, we propose an action estimation architecture based on the simultaneous detection of the hands and objects in the scene. For the hand and object detection, we have adapted well known YOLO architecture, leveraging its inference speed and accuracy. We experimentally determined the best performing architecture for our task. After obtaining the hand and object bounding boxes, we select the most likely objects to interact with, i.e., the closest objects to a hand. The rough estimation of the closest objects to a hand is a direct approach to determine hand-object interaction. After identifying the scene and alongside a set of per-object and global actions, we could determine the most suitable action we are performing in each context.
以自我为中心的视频提供了丰富的手-物交互来源,支持动作识别。然而,在动作识别之前,人们可能需要检测场景中手和物体的存在。在这项工作中,我们提出了一种基于同时检测场景中的手和物体的动作估计架构。对于手和物体检测,我们采用了众所周知的YOLO架构,利用其推理速度和准确性。我们通过实验确定了最适合我们任务的架构。在获得手和物体边界框之后,我们选择最可能与之交互的物体,即离手最近的物体。粗略估计离手最近的物体是确定手-物体相互作用的直接方法。在确定了场景和一组对象和全局操作之后,我们可以确定在每个环境中执行的最合适的操作。
{"title":"Predicting Human-Object Interactions in Egocentric Videos","authors":"Manuel Benavent-Lledó, Sergiu Oprea, John Alejandro Castro-Vargas, David Mulero-Pérez, J. G. Rodríguez","doi":"10.1109/IJCNN55064.2022.9892910","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892910","url":null,"abstract":"Egocentric videos provide a rich source of hand-object interactions that support action recognition. However, prior to action recognition, one may need to detect the presence of hands and objects in the scene. In this work, we propose an action estimation architecture based on the simultaneous detection of the hands and objects in the scene. For the hand and object detection, we have adapted well known YOLO architecture, leveraging its inference speed and accuracy. We experimentally determined the best performing architecture for our task. After obtaining the hand and object bounding boxes, we select the most likely objects to interact with, i.e., the closest objects to a hand. The rough estimation of the closest objects to a hand is a direct approach to determine hand-object interaction. After identifying the scene and alongside a set of per-object and global actions, we could determine the most suitable action we are performing in each context.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132961379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Contrastive Learning Based Visual Representation Enhancement for Multimodal Machine Translation 基于对比学习的多模态机器翻译视觉表示增强
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892312
Shike Wang, Wen Zhang, Wenyu Guo, Dong Yu, Pengyuan Liu
Multimodal machine translation (MMT) is a task that incorporates extra image modality with text to translate. Previous works have worked on the interaction between two modalities and investigated the need of visual modality. However, few works focus on the models with better and more effective visual representation as input. We argue that the performance of MMT systems will get improved when better visual representation inputs into the systems. To investigate the thought, we introduce mT-ICL, a multimodal Transformer model with image contrastive learning. The contrastive objective is optimized to enhance the representation ability of the image encoder so that the encoder can generate better and more adaptive visual representation. Experiments show that our mT-ICL significantly outperforms the strong baseline and achieves the new SOTA on most of test sets of English-to-German and English-to-French. Further analysis reveals that visual modality works more than a regularization method under contrastive learning framework.
多模态机器翻译(MMT)是一种将额外的图像模态与文本结合起来进行翻译的任务。之前的作品研究了两种形态之间的相互作用,并研究了视觉形态的需求。然而,很少有作品关注具有更好和更有效的视觉表现的模型作为输入。我们认为,当更好的视觉表现输入到系统中时,MMT系统的性能将得到改善。为了研究这一思想,我们引入了mT-ICL,一种具有图像对比学习的多模态Transformer模型。对对比物镜进行优化,增强图像编码器的表示能力,使编码器能够产生更好、更自适应的视觉表示。实验表明,我们的mT-ICL显著优于强基线,在大多数英语-德语和英语-法语的测试集上实现了新的SOTA。进一步分析表明,在对比学习框架下,视觉模态比正则化方法更有效。
{"title":"Contrastive Learning Based Visual Representation Enhancement for Multimodal Machine Translation","authors":"Shike Wang, Wen Zhang, Wenyu Guo, Dong Yu, Pengyuan Liu","doi":"10.1109/IJCNN55064.2022.9892312","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892312","url":null,"abstract":"Multimodal machine translation (MMT) is a task that incorporates extra image modality with text to translate. Previous works have worked on the interaction between two modalities and investigated the need of visual modality. However, few works focus on the models with better and more effective visual representation as input. We argue that the performance of MMT systems will get improved when better visual representation inputs into the systems. To investigate the thought, we introduce mT-ICL, a multimodal Transformer model with image contrastive learning. The contrastive objective is optimized to enhance the representation ability of the image encoder so that the encoder can generate better and more adaptive visual representation. Experiments show that our mT-ICL significantly outperforms the strong baseline and achieves the new SOTA on most of test sets of English-to-German and English-to-French. Further analysis reveals that visual modality works more than a regularization method under contrastive learning framework.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132615682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 International Joint Conference on Neural Networks (IJCNN)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1