首页 > 最新文献

2022 International Joint Conference on Neural Networks (IJCNN)最新文献

英文 中文
Preserving Data Manifold Structure in Latent Space for Exploration through Network Geodesics 基于网络测地线的隐空间数据流形结构保护
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9891993
Sanjukta Krishnagopal, J. Bedrossian
While variational autoencoders have been successful in several tasks, the use of conventional priors are limited in their ability to encode the underlying structure of input data. We introduce an Encoded Prior Sliced Wasserstein AutoEncoder wherein an additional prior-encoder network learns a geometry and topology preserving embedding of any data manifold, thus improving the structure of latent space. The autoencoder and prior-encoder networks are iteratively trained using the Sliced Wasserstein distance, which facilitates the learning of nonstandard complex priors. We then introduce a graph-based algorithm to explore the learned manifold by traversing latent space through network-geodesics that lie along the manifold and hence are more realistic compared to conventional Euclidean interpolation. Specifically, we identify network-geodesics by maximizing the density of samples along the path while minimizing total energy. We use the 3D-spiral data to show that the prior encodes the geometry underlying the data unlike conventional autoencoders, and to demonstrate the exploration of the embedded data manifold through the network algorithm. We apply our framework to artificial as well as image datasets to demonstrate the advantages of learning improved latent structure, outlier generation, and geodesic interpolation.
虽然变分自编码器在一些任务中取得了成功,但传统先验的使用在编码输入数据的底层结构方面受到限制。我们引入了一种编码的先验切片Wasserstein自动编码器,其中一个额外的先验编码器网络学习几何和拓扑,保持任何数据流形的嵌入,从而改善潜在空间的结构。使用切片沃瑟斯坦距离迭代训练自编码器和先验编码器网络,这有利于非标准复杂先验的学习。然后,我们引入了一种基于图的算法,通过沿着流形的网络测地线遍历潜在空间来探索学习到的流形,因此与传统的欧几里得插值相比,它更现实。具体来说,我们通过最大化沿路径的样本密度,同时最小化总能量来识别网络测地线。我们使用3d螺旋数据来表明,与传统的自编码器不同,先验编码数据底层的几何形状,并通过网络算法展示对嵌入式数据流形的探索。我们将我们的框架应用于人工数据集和图像数据集,以展示学习改进的潜在结构、离群值生成和测地线插值的优势。
{"title":"Preserving Data Manifold Structure in Latent Space for Exploration through Network Geodesics","authors":"Sanjukta Krishnagopal, J. Bedrossian","doi":"10.1109/IJCNN55064.2022.9891993","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9891993","url":null,"abstract":"While variational autoencoders have been successful in several tasks, the use of conventional priors are limited in their ability to encode the underlying structure of input data. We introduce an Encoded Prior Sliced Wasserstein AutoEncoder wherein an additional prior-encoder network learns a geometry and topology preserving embedding of any data manifold, thus improving the structure of latent space. The autoencoder and prior-encoder networks are iteratively trained using the Sliced Wasserstein distance, which facilitates the learning of nonstandard complex priors. We then introduce a graph-based algorithm to explore the learned manifold by traversing latent space through network-geodesics that lie along the manifold and hence are more realistic compared to conventional Euclidean interpolation. Specifically, we identify network-geodesics by maximizing the density of samples along the path while minimizing total energy. We use the 3D-spiral data to show that the prior encodes the geometry underlying the data unlike conventional autoencoders, and to demonstrate the exploration of the embedded data manifold through the network algorithm. We apply our framework to artificial as well as image datasets to demonstrate the advantages of learning improved latent structure, outlier generation, and geodesic interpolation.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123415435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Towards Well-Being Management with Automated Qualitative Data Analysis 迈向福祉管理与自动化定性数据分析
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892930
Fousiya Saleem, Mohammad Hamdan, A. Zalzala
This paper reports on using qualitative data analysis to understand aspects of the well-being of dwellers in underserved communities, by applying machine learning algorithms to identify specific themes from unstructured interview data. The work involved data translation, transcription, pre-processing as well as developing Word2Vec and FastText algorithms and ultimately a combined analysis engine. The reported experiments are conducted on field data captured from communities in India, hence offering a unique opportunity to examine automated context-based qualitative data analysis. The approach is proven feasible despite the dominant limitations on technology infrastructure and community awareness. The machine learning results identify themes from the interview data within minutes as opposed to hours of manual investigations through conventional qualitative analysis techniques. The outcomes from the analysis engine can be used for creating a grounded theory for further studies, hence facilitating an evidence-based approach to the evaluation of underserved communities.
本文报告了通过应用机器学习算法从非结构化访谈数据中识别特定主题,使用定性数据分析来了解服务不足社区居民福祉的各个方面。这项工作包括数据翻译、转录、预处理,以及开发Word2Vec和FastText算法,并最终开发出一个组合分析引擎。报告的实验是根据从印度社区获取的实地数据进行的,因此提供了一个独特的机会来检查基于上下文的自动化定性数据分析。尽管在技术基础设施和社区意识方面存在主要限制,但这种方法被证明是可行的。机器学习结果可以在几分钟内从访谈数据中识别主题,而不是通过传统的定性分析技术进行数小时的人工调查。分析引擎的结果可用于为进一步研究创建有根据的理论,从而促进以证据为基础的方法来评估服务不足的社区。
{"title":"Towards Well-Being Management with Automated Qualitative Data Analysis","authors":"Fousiya Saleem, Mohammad Hamdan, A. Zalzala","doi":"10.1109/IJCNN55064.2022.9892930","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892930","url":null,"abstract":"This paper reports on using qualitative data analysis to understand aspects of the well-being of dwellers in underserved communities, by applying machine learning algorithms to identify specific themes from unstructured interview data. The work involved data translation, transcription, pre-processing as well as developing Word2Vec and FastText algorithms and ultimately a combined analysis engine. The reported experiments are conducted on field data captured from communities in India, hence offering a unique opportunity to examine automated context-based qualitative data analysis. The approach is proven feasible despite the dominant limitations on technology infrastructure and community awareness. The machine learning results identify themes from the interview data within minutes as opposed to hours of manual investigations through conventional qualitative analysis techniques. The outcomes from the analysis engine can be used for creating a grounded theory for further studies, hence facilitating an evidence-based approach to the evaluation of underserved communities.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125418188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contrastive Learning in Wavelet Domain for Image Dehazing 小波域对比学习用于图像去雾
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892193
Yunru Bai, C. Yuan
Image dehazing remains a challenging problem because it is hard to restore a clean scene from a severely degraded hazy image. However, existing learning-based dehazing methods mostly ignore the fact that the interference of haze to an image is mainly concentrated in the low-frequency components. If all image components are processed indiscriminately, it is difficult to achieve a good restoration and accurate details cannot be guaranteed. In order to process the hazy images hierarchically, we propose a low-frequency sub-band contrastive regularization (LSCR) in the wavelet domain to ensure that the components of the restored image mainly affected by haze are pulled closer to the clear image and pushed far away from the hazy image. In addition, a high-frequency sub-band loss is also introduced to make high-frequency components of the restored image consistent with the clear image. Our method can better restore the haze-free image and achieve more accurate and rich details. The extensive experiments on synthetic and real-world datasets verify that the proposed method outperforms previous approaches.
图像去雾仍然是一个具有挑战性的问题,因为很难从严重退化的模糊图像中恢复干净的场景。然而,现有的基于学习的去雾方法大多忽略了雾霾对图像的干扰主要集中在低频分量这一事实。如果对所有图像分量进行不加选择地处理,很难得到很好的复原效果,也不能保证细节的准确。为了对雾霾图像进行分层处理,我们提出了一种小波域低频子带对比正则化(LSCR)方法,确保恢复图像中主要受雾霾影响的分量被拉向清晰图像,而远离雾霾图像。此外,还引入了高频子带损耗,使恢复图像的高频成分与清晰图像一致。该方法可以更好地恢复无雾图像,实现更准确、更丰富的细节。在合成数据集和实际数据集上的大量实验验证了所提出的方法优于以前的方法。
{"title":"Contrastive Learning in Wavelet Domain for Image Dehazing","authors":"Yunru Bai, C. Yuan","doi":"10.1109/IJCNN55064.2022.9892193","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892193","url":null,"abstract":"Image dehazing remains a challenging problem because it is hard to restore a clean scene from a severely degraded hazy image. However, existing learning-based dehazing methods mostly ignore the fact that the interference of haze to an image is mainly concentrated in the low-frequency components. If all image components are processed indiscriminately, it is difficult to achieve a good restoration and accurate details cannot be guaranteed. In order to process the hazy images hierarchically, we propose a low-frequency sub-band contrastive regularization (LSCR) in the wavelet domain to ensure that the components of the restored image mainly affected by haze are pulled closer to the clear image and pushed far away from the hazy image. In addition, a high-frequency sub-band loss is also introduced to make high-frequency components of the restored image consistent with the clear image. Our method can better restore the haze-free image and achieve more accurate and rich details. The extensive experiments on synthetic and real-world datasets verify that the proposed method outperforms previous approaches.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125590319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncertainty Aware Model Integration on Reinforcement Learning 基于强化学习的不确定性感知模型集成
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892778
Takashi Nagata, Jinwei Xing, Tsutomu Kumazawa, E. Neftci
Model-based reinforcement learning is an effective approach to reducing sample complexity by adding more data from the model. Dyna is a well-known architecture that contains model-based reinforcement learning and integrates learning from interactions with an environment and a model of the environment. Although the model can greatly help to speed up the agent's learning, acquiring an accurate model is a hard problem in spite of the recent great success of function approximation using neural networks. A wrong model causes degradation of the agent's performance and raises another question: to which extent should an agent rely on the model to update its policy? In this paper, we propose to use the confidence of the model simulations to the integrated learning process so that the agent avoids updating its policy based on uncertain simulations by the model. To obtain confidence, we apply the Monte Carlo dropout technique to the state transition model. We show that this approach contributes to improving early-stage training, thus helping speed up the agent to reach reasonable performance. We conduct experiments on simulated robotic locomotion tasks to demonstrate the effectiveness of our approach.
基于模型的强化学习是一种通过从模型中添加更多数据来降低样本复杂度的有效方法。Dyna是一个著名的架构,它包含基于模型的强化学习,并集成了从与环境和环境模型的交互中学习。尽管该模型可以极大地加快智能体的学习速度,但尽管近年来使用神经网络的函数逼近取得了巨大的成功,但获取准确的模型仍然是一个难题。一个错误的模型会导致代理的性能下降,并提出另一个问题:代理应该在多大程度上依赖模型来更新其策略?在本文中,我们提出将模型模拟的置信度用于集成学习过程,以避免智能体根据模型的不确定模拟更新其策略。为了获得置信度,我们将蒙特卡罗dropout技术应用于状态转移模型。我们表明,这种方法有助于改善早期训练,从而有助于加速智能体达到合理的性能。我们进行了模拟机器人运动任务的实验,以证明我们的方法的有效性。
{"title":"Uncertainty Aware Model Integration on Reinforcement Learning","authors":"Takashi Nagata, Jinwei Xing, Tsutomu Kumazawa, E. Neftci","doi":"10.1109/IJCNN55064.2022.9892778","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892778","url":null,"abstract":"Model-based reinforcement learning is an effective approach to reducing sample complexity by adding more data from the model. Dyna is a well-known architecture that contains model-based reinforcement learning and integrates learning from interactions with an environment and a model of the environment. Although the model can greatly help to speed up the agent's learning, acquiring an accurate model is a hard problem in spite of the recent great success of function approximation using neural networks. A wrong model causes degradation of the agent's performance and raises another question: to which extent should an agent rely on the model to update its policy? In this paper, we propose to use the confidence of the model simulations to the integrated learning process so that the agent avoids updating its policy based on uncertain simulations by the model. To obtain confidence, we apply the Monte Carlo dropout technique to the state transition model. We show that this approach contributes to improving early-stage training, thus helping speed up the agent to reach reasonable performance. We conduct experiments on simulated robotic locomotion tasks to demonstrate the effectiveness of our approach.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115258662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
End-to-End Event Factuality Identification with Cross-Lingual Information 跨语言信息的端到端事件事实识别
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892869
Jin Cao, Zhong Qian, Peifeng Li
Event factuality is a description of the real situation of events in text. Event Factuality Identification (EFI) is the basic task of many related applications in the field of natural language processing. At present, most studies about EFI are carried out with the annotated event mentions, which is not applicable for practical application, and ignores the opinion of different event sources on event factuality. Moreover, previous work did not use cross-lingual information for EFI. We propose an end-to-end joint model JESF, which uses Bert to encode sentences and uses lingual feature to enrich the semantic representation of sentences, and then use BiLSTM to capture the serialized semantic features of sentences; Then, the multi-head attention is used to learn the event characteristics and identify the event mentions; After that, use multi-head attention to identify the event source; Finally, GCNs is used to capture the syntactic and semantic features, mult-head attention is used to capture the semantic features of sentences, event and event source features are integrated to identify event factuality. Especially, we use different cross-lingual related methods to learn supplementary sematic features from aligned Chinese sentences. The experimental results on FactBank show that JESF is effective and the Chinese information is helpful for English EFI, and the more effective method is to use Chinese cue as features for EFI.
事件真实性是文本中对事件真实情况的描述。事件事实识别(EFI)是自然语言处理领域中许多相关应用的基础任务。目前,大多数关于EFI的研究都是在标注事件提及的情况下进行的,这并不适用于实际应用,而且忽略了不同事件来源对事件真实性的看法。此外,以前的工作没有使用跨语言信息的EFI。提出了一种端到端联合模型JESF,利用Bert对句子进行编码,利用语言特征丰富句子的语义表示,然后利用BiLSTM捕获句子的序列化语义特征;然后,利用多头注意学习事件特征,识别事件提及;之后,利用多头关注识别事件源;最后,利用GCNs捕获句子的句法和语义特征,利用多头注意捕获句子的语义特征,结合事件和事件源特征识别事件真实性。特别是,我们使用不同的跨语言相关方法从对齐的汉语句子中学习补充语义特征。FactBank上的实验结果表明,JESF是有效的,中文信息对英语EFI有帮助,更有效的方法是使用中文提示作为EFI的特征。
{"title":"End-to-End Event Factuality Identification with Cross-Lingual Information","authors":"Jin Cao, Zhong Qian, Peifeng Li","doi":"10.1109/IJCNN55064.2022.9892869","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892869","url":null,"abstract":"Event factuality is a description of the real situation of events in text. Event Factuality Identification (EFI) is the basic task of many related applications in the field of natural language processing. At present, most studies about EFI are carried out with the annotated event mentions, which is not applicable for practical application, and ignores the opinion of different event sources on event factuality. Moreover, previous work did not use cross-lingual information for EFI. We propose an end-to-end joint model JESF, which uses Bert to encode sentences and uses lingual feature to enrich the semantic representation of sentences, and then use BiLSTM to capture the serialized semantic features of sentences; Then, the multi-head attention is used to learn the event characteristics and identify the event mentions; After that, use multi-head attention to identify the event source; Finally, GCNs is used to capture the syntactic and semantic features, mult-head attention is used to capture the semantic features of sentences, event and event source features are integrated to identify event factuality. Especially, we use different cross-lingual related methods to learn supplementary sematic features from aligned Chinese sentences. The experimental results on FactBank show that JESF is effective and the Chinese information is helpful for English EFI, and the more effective method is to use Chinese cue as features for EFI.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116000110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Binary-perspective Asymmetrical Twin Gain: a Novel Evaluation Method for Question Generation 二视角非对称双增益:一种新的问题生成评价方法
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892106
Yulan Su, Yu Hong, Hongyu Zhu, Minhan Xu, Yifan Fan, Min Zhang
We propose a novel evaluation method for Question Generation (QG) task. It is designed to verify the quality of the generated questions in terms of different references, including not only the manually-written questions (i.e., ground truth) but also their variants. Back translation is utilized to obtain the variants, and accordingly, they generally appear as paraphrases of the ground-truth examples. In particular, an Asymmetrical Twin Gain (ATG) is proposed for binary-perspective evaluation using the existing metrics, such as BLEU and ROUGE-L, respectively. It enables both the metrics to be observed from two perspectives, including the consistency between QG results and ground-truth examples, as well as that of variants. The experiments on the publicly-available benchmark SQuAD demonstrate the reliability of ATG. More importantly, ATG is proven effective for indicating the stable QG performance. It is noteworthy that the proposed binary-perspective evaluation is explored for assisting the conventional evaluation methods, instead of replacing them. The contribute can be identified as the additional insight into the robustness of QG when some slightly-different references (e.g., paraphrases) are offered for evaluation. All the models and source codes in the experiments will be made publicly available to support reproducible research.
提出了一种新的QG任务评价方法。它的目的是根据不同的参考来验证生成的问题的质量,不仅包括手工编写的问题(即基础真理),还包括它们的变体。反译是用来获得变体的,因此,它们通常是对基本真理例子的释义。特别地,提出了一种不对称双增益(ATG),用于二视角评估,分别使用现有的指标,如BLEU和ROUGE-L。它允许从两个角度观察指标,包括QG结果和基础真值示例之间的一致性,以及变体之间的一致性。在公开可用的基准测试SQuAD上的实验验证了ATG的可靠性。更重要的是,ATG被证明是有效的,表明稳定的QG性能。值得注意的是,本文提出的二元视角评价方法是为了辅助传统评价方法,而不是取代传统评价方法。当提供一些稍微不同的参考(例如,释义)进行评估时,贡献可以被识别为对QG稳健性的额外洞察。实验中的所有模型和源代码都将公开,以支持可重复的研究。
{"title":"Binary-perspective Asymmetrical Twin Gain: a Novel Evaluation Method for Question Generation","authors":"Yulan Su, Yu Hong, Hongyu Zhu, Minhan Xu, Yifan Fan, Min Zhang","doi":"10.1109/IJCNN55064.2022.9892106","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892106","url":null,"abstract":"We propose a novel evaluation method for Question Generation (QG) task. It is designed to verify the quality of the generated questions in terms of different references, including not only the manually-written questions (i.e., ground truth) but also their variants. Back translation is utilized to obtain the variants, and accordingly, they generally appear as paraphrases of the ground-truth examples. In particular, an Asymmetrical Twin Gain (ATG) is proposed for binary-perspective evaluation using the existing metrics, such as BLEU and ROUGE-L, respectively. It enables both the metrics to be observed from two perspectives, including the consistency between QG results and ground-truth examples, as well as that of variants. The experiments on the publicly-available benchmark SQuAD demonstrate the reliability of ATG. More importantly, ATG is proven effective for indicating the stable QG performance. It is noteworthy that the proposed binary-perspective evaluation is explored for assisting the conventional evaluation methods, instead of replacing them. The contribute can be identified as the additional insight into the robustness of QG when some slightly-different references (e.g., paraphrases) are offered for evaluation. All the models and source codes in the experiments will be made publicly available to support reproducible research.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122306763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Boosting Initial Population in Multiobjective Feature Selection with Knowledge-Based Partitioning 基于知识划分的多目标特征选择初始种群提升
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892123
Ayça Deniz, Hakan Ezgi Kiziloz
The quality of features is one of the main factors that affect classification performance. Feature selection aims to remove irrelevant and redundant features from data in order to increase classification accuracy. However, identifying these features is not a trivial task due to a large search space. Evolutionary algorithms have been proven to be effective in many optimization problems, including feature selection. These algorithms require an initial population to start their search mechanism, and a poor initial population may cause getting stuck in local optima. Diversifying the initial population is known as an effective approach to overcome this issue; yet, it may not suffice as the search space grows exponentially with increasing feature sizes. In this study, we propose an enhanced initial population strategy to boost the performance of the feature selection task. In our proposed method, we ensure the diversity of the initial population by partitioning the candidate solutions according to their selected number of features. In addition, we adjust the chances of features being selected into a candidate solution regarding their information gain values, which enables wise selection of features among a vast search space. We conduct extensive experiments on many benchmark datasets retrieved from UCI Machine Learning Repository. Moreover, we apply our algorithm on a real-world, large-scale dataset, i.e., Stanford Sentiment Treebank. We observe significant improvements after the comparisons with three off-the-shelf initialization strategies.
特征的质量是影响分类性能的主要因素之一。特征选择旨在从数据中去除不相关和冗余的特征,以提高分类精度。然而,由于搜索空间很大,识别这些特性并不是一项简单的任务。进化算法已被证明在许多优化问题中是有效的,包括特征选择。这些算法需要初始种群来启动它们的搜索机制,而糟糕的初始种群可能会导致陷入局部最优状态。使初始种群多样化是克服这一问题的有效方法;然而,随着特征尺寸的增加,搜索空间呈指数级增长,这可能还不够。在本研究中,我们提出了一种增强的初始种群策略来提高特征选择任务的性能。在我们提出的方法中,我们通过根据所选择的特征数量划分候选解来确保初始总体的多样性。此外,我们根据特征的信息增益值调整特征被选择到候选解决方案的机会,从而能够在巨大的搜索空间中明智地选择特征。我们对从UCI机器学习存储库检索的许多基准数据集进行了广泛的实验。此外,我们将算法应用于真实世界的大规模数据集,即斯坦福情感树库。在与三种现成的初始化策略进行比较后,我们观察到显著的改进。
{"title":"Boosting Initial Population in Multiobjective Feature Selection with Knowledge-Based Partitioning","authors":"Ayça Deniz, Hakan Ezgi Kiziloz","doi":"10.1109/IJCNN55064.2022.9892123","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892123","url":null,"abstract":"The quality of features is one of the main factors that affect classification performance. Feature selection aims to remove irrelevant and redundant features from data in order to increase classification accuracy. However, identifying these features is not a trivial task due to a large search space. Evolutionary algorithms have been proven to be effective in many optimization problems, including feature selection. These algorithms require an initial population to start their search mechanism, and a poor initial population may cause getting stuck in local optima. Diversifying the initial population is known as an effective approach to overcome this issue; yet, it may not suffice as the search space grows exponentially with increasing feature sizes. In this study, we propose an enhanced initial population strategy to boost the performance of the feature selection task. In our proposed method, we ensure the diversity of the initial population by partitioning the candidate solutions according to their selected number of features. In addition, we adjust the chances of features being selected into a candidate solution regarding their information gain values, which enables wise selection of features among a vast search space. We conduct extensive experiments on many benchmark datasets retrieved from UCI Machine Learning Repository. Moreover, we apply our algorithm on a real-world, large-scale dataset, i.e., Stanford Sentiment Treebank. We observe significant improvements after the comparisons with three off-the-shelf initialization strategies.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122348032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Scopic Simulation for People Flow Feature Extraction Based on Topological Mapping 基于拓扑映射的多尺度人流特征提取仿真
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892568
Kodai Kaneko, N. Kubota
In recent years, advances in information and communication technology have led to the research and development of cyber-physical systems and digital twin that simulate various events in real space in cyber space. In the field of human flow simulation, it is possible to predict and analyze the flow of people in various spaces, from indoor to outdoor, and use this information to create spaces that promote human activities, such as searching for optimal layouts and alleviating congestion by distributing traffic lines. In this paper, we propose to use multi-scopic simulation to simulate human flow. Next, using the human flow data measured by the simulation, we extract and analyze the features by topological mapping. Finally, we discuss the effectiveness of the proposed method through some simulation results.
近年来,随着信息通信技术的进步,在网络空间中模拟真实空间中各种事件的网络物理系统和数字孪生系统得到了研究和发展。在人流模拟领域,可以预测和分析从室内到室外各种空间的人流,并利用这些信息创造促进人类活动的空间,如寻找最优布局,通过分配交通线路来缓解拥堵。在本文中,我们提出使用多尺度模拟来模拟人的流动。其次,利用仿真测得的人流量数据,通过拓扑映射提取特征并进行特征分析。最后,通过仿真结果验证了该方法的有效性。
{"title":"Multi-Scopic Simulation for People Flow Feature Extraction Based on Topological Mapping","authors":"Kodai Kaneko, N. Kubota","doi":"10.1109/IJCNN55064.2022.9892568","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892568","url":null,"abstract":"In recent years, advances in information and communication technology have led to the research and development of cyber-physical systems and digital twin that simulate various events in real space in cyber space. In the field of human flow simulation, it is possible to predict and analyze the flow of people in various spaces, from indoor to outdoor, and use this information to create spaces that promote human activities, such as searching for optimal layouts and alleviating congestion by distributing traffic lines. In this paper, we propose to use multi-scopic simulation to simulate human flow. Next, using the human flow data measured by the simulation, we extract and analyze the features by topological mapping. Finally, we discuss the effectiveness of the proposed method through some simulation results.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122557707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human Interaction Recognition with Skeletal Attention and Shift Graph Convolution 基于骨骼注意和移位图卷积的人机交互识别
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892292
Jin Zhou, Zhenhua Wang, Jiajun Meng, Sheng Liu, Jianhua Zhang, Shengyong Chen
Human interaction recognition has wide applications including intelligent surveillance, intelligent transportation and the analysis of sports videos. In recent years, benefiting from the development of action recognition based on deep learning, the performance of human interaction recognition has been boosted. This paper tackles two vital issues in recognizing human interactions, namely target missing and inadequate feature expression. To this end, we first design a data preprocessing method using skeleton estimation and multi-object tracking, which effectively reduces the chance of missing detection. Second, we propose a two-stream network composing of an appearance branch and a pose branch. The appearance branch extracts features enhanced via part affinity maps and part confidences maps, while the pose branch trains a customized Shift-GCN to extract skeletal features from people-pairs. Appearance and pose features are then fused to generate a more powerful representation of human interactions. Extensive experiments on two existing benchmarks, UT and BIT-Interaction, as well as a new dataset crafted by us, namely Campus-Interaction (CI), demonstrate the superior performance of the proposed approach over the state-of-the-arts.
人机交互识别在智能监控、智能交通、体育视频分析等领域有着广泛的应用。近年来,得益于基于深度学习的动作识别的发展,人类交互识别的性能得到了提升。本文解决了人类交互识别中的两个关键问题,即目标缺失和特征表达不充分。为此,我们首先设计了一种基于骨架估计和多目标跟踪的数据预处理方法,有效降低了缺失检测的概率。其次,我们提出了一个由外观分支和姿态分支组成的双流网络。外观分支提取通过部分亲和图和部分置信度图增强的特征,而姿态分支训练自定义Shift-GCN从人对中提取骨骼特征。然后将外表和姿势特征融合在一起,生成更强大的人类互动表现。在两个现有基准(UT和BIT-Interaction)以及我们制作的新数据集(即Campus-Interaction (CI))上进行的广泛实验表明,所提出的方法优于最先进的方法。
{"title":"Human Interaction Recognition with Skeletal Attention and Shift Graph Convolution","authors":"Jin Zhou, Zhenhua Wang, Jiajun Meng, Sheng Liu, Jianhua Zhang, Shengyong Chen","doi":"10.1109/IJCNN55064.2022.9892292","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892292","url":null,"abstract":"Human interaction recognition has wide applications including intelligent surveillance, intelligent transportation and the analysis of sports videos. In recent years, benefiting from the development of action recognition based on deep learning, the performance of human interaction recognition has been boosted. This paper tackles two vital issues in recognizing human interactions, namely target missing and inadequate feature expression. To this end, we first design a data preprocessing method using skeleton estimation and multi-object tracking, which effectively reduces the chance of missing detection. Second, we propose a two-stream network composing of an appearance branch and a pose branch. The appearance branch extracts features enhanced via part affinity maps and part confidences maps, while the pose branch trains a customized Shift-GCN to extract skeletal features from people-pairs. Appearance and pose features are then fused to generate a more powerful representation of human interactions. Extensive experiments on two existing benchmarks, UT and BIT-Interaction, as well as a new dataset crafted by us, namely Campus-Interaction (CI), demonstrate the superior performance of the proposed approach over the state-of-the-arts.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122531816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Feedforward Neural Network Reconstructed from High-order Quantum Systems 基于高阶量子系统重构的前馈神经网络
Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892720
Junwei Zhang, Zhao Li, Hao Peng, Ming Li, Xiaofen Wang
Neural Networks (NNs) are widely used because of their superior feature extraction capabilities, among which Feedforward Neural Network (FNN) is used as the basic model for theoretical research. Recently, Quantum Neural Networks (QNNs) based on quantum mechanics have received extensive attention due to their ability to mine quantum correlations and parallel computing. Since two classical bits are required to simulate one qubit (i.e., quantum bit) on a classical computer, it brings challenges for simulating complex quantum operations or building large-scale QNNs on a classical computer. Hardy et al. extended the classical and quantum probability theories to the Generalized Probability Theory (GPT), so it is possible to construct high-order quantum systems. This paper regards the entire feature extraction and integration process of FNN as the evolution process of the high-order quantum system, and then leverages quantum coherence to describe the complex relationship between the features extracted by each layer of the network model. Intuitively, we reconstruct FNN to change the general vector processed by each layer into the state vector of the high-order quantum system. The experimental results on four mainstream datasets show that FNN reconstructed from the high-order quantum system is significantly better than the classical counterpart.
神经网络(Neural Network, NNs)因其优越的特征提取能力而得到广泛应用,其中前馈神经网络(Feedforward Neural Network, FNN)是理论研究的基础模型。近年来,基于量子力学的量子神经网络(Quantum Neural Networks, QNNs)因其具有挖掘量子相关性和并行计算的能力而受到广泛关注。由于在经典计算机上模拟一个量子位(即量子比特)需要两个经典比特,这给在经典计算机上模拟复杂的量子运算或构建大规模qnn带来了挑战。Hardy等人将经典概率论和量子概率论扩展到广义概率论(GPT),从而使构建高阶量子系统成为可能。本文将FNN的整个特征提取和集成过程看作是高阶量子系统的演化过程,然后利用量子相干性来描述网络模型各层提取的特征之间的复杂关系。直观地,我们重构FNN,将每一层处理后的一般向量转化为高阶量子系统的状态向量。在四个主流数据集上的实验结果表明,由高阶量子系统重构的FNN明显优于经典的FNN。
{"title":"Feedforward Neural Network Reconstructed from High-order Quantum Systems","authors":"Junwei Zhang, Zhao Li, Hao Peng, Ming Li, Xiaofen Wang","doi":"10.1109/IJCNN55064.2022.9892720","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892720","url":null,"abstract":"Neural Networks (NNs) are widely used because of their superior feature extraction capabilities, among which Feedforward Neural Network (FNN) is used as the basic model for theoretical research. Recently, Quantum Neural Networks (QNNs) based on quantum mechanics have received extensive attention due to their ability to mine quantum correlations and parallel computing. Since two classical bits are required to simulate one qubit (i.e., quantum bit) on a classical computer, it brings challenges for simulating complex quantum operations or building large-scale QNNs on a classical computer. Hardy et al. extended the classical and quantum probability theories to the Generalized Probability Theory (GPT), so it is possible to construct high-order quantum systems. This paper regards the entire feature extraction and integration process of FNN as the evolution process of the high-order quantum system, and then leverages quantum coherence to describe the complex relationship between the features extracted by each layer of the network model. Intuitively, we reconstruct FNN to change the general vector processed by each layer into the state vector of the high-order quantum system. The experimental results on four mainstream datasets show that FNN reconstructed from the high-order quantum system is significantly better than the classical counterpart.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114267708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2022 International Joint Conference on Neural Networks (IJCNN)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1