首页 > 最新文献

2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)最新文献

英文 中文
Graph Structural Attention and Increased Global Attention for Image Captioning 图结构关注和图像标题的全局关注
Pub Date : 2021-12-03 DOI: 10.1109/ICICIP53388.2021.9642211
Tian Zheng, Wenhua Qian, Rencan Nie, Jinde Cao, Dan Xu
Attention mechanism plays a significant role in the current encoder-decoder framework of image captioning. Nevertheless, many attention mechanisms only fuse textual feature and image feature once, failing to adequately integrate the feature between context and image. Furthermore, many image captioning networks based on scene graphs only consider the node information but ignore the structure, which is insufficient in grasping the spatial object relationship. To address the above problems, we propose structural attention and increased global attention. Two attentions select critical image features from image detail and global image. The increased global attention, focusing on global image features, enhances integration between text and image via fusing detailed image features into global attention. To better describe the relationship among image objects, our network allows for both the node information by content attention and the structure information by structural attention. Structural attention computes the similarity between the structure information of scene graph and local attention, building the image objects relationship differing from content attention. We evaluate the performance of our image captioning network in MS COCO and Visual Genome datasets. The results of the experiments show that our method achieves superior performance compared with the existing methods.
注意机制在当前的图像字幕编码器-解码器框架中起着重要的作用。然而,许多注意机制只将文本特征和图像特征融合一次,未能充分整合语境和图像之间的特征。此外,许多基于场景图的图像字幕网络只考虑节点信息而忽略了结构,对空间对象关系的把握不足。为解决上述问题,我们建议加强结构性关注和全球关注。两个关注点分别从图像细节和全局图像中选择图像的关键特征。全球关注的增加,聚焦于图像的全局特征,通过将图像的细节特征融合到全局关注中,增强了文本和图像之间的融合。为了更好地描述图像对象之间的关系,我们的网络允许通过内容关注来获取节点信息,也允许通过结构关注来获取结构信息。结构注意计算场景图的结构信息与局部注意之间的相似度,建立不同于内容注意的图像对象关系。我们在MS COCO和Visual Genome数据集上评估了我们的图像字幕网络的性能。实验结果表明,与现有方法相比,我们的方法取得了更好的性能。
{"title":"Graph Structural Attention and Increased Global Attention for Image Captioning","authors":"Tian Zheng, Wenhua Qian, Rencan Nie, Jinde Cao, Dan Xu","doi":"10.1109/ICICIP53388.2021.9642211","DOIUrl":"https://doi.org/10.1109/ICICIP53388.2021.9642211","url":null,"abstract":"Attention mechanism plays a significant role in the current encoder-decoder framework of image captioning. Nevertheless, many attention mechanisms only fuse textual feature and image feature once, failing to adequately integrate the feature between context and image. Furthermore, many image captioning networks based on scene graphs only consider the node information but ignore the structure, which is insufficient in grasping the spatial object relationship. To address the above problems, we propose structural attention and increased global attention. Two attentions select critical image features from image detail and global image. The increased global attention, focusing on global image features, enhances integration between text and image via fusing detailed image features into global attention. To better describe the relationship among image objects, our network allows for both the node information by content attention and the structure information by structural attention. Structural attention computes the similarity between the structure information of scene graph and local attention, building the image objects relationship differing from content attention. We evaluate the performance of our image captioning network in MS COCO and Visual Genome datasets. The results of the experiments show that our method achieves superior performance compared with the existing methods.","PeriodicalId":435799,"journal":{"name":"2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125108092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved Extreme Learning Machine Based on Deep Learning and Its Application in Handwritten Digits Recognition 基于深度学习的改进极限学习机及其在手写数字识别中的应用
Pub Date : 2021-12-03 DOI: 10.1109/ICICIP53388.2021.9642213
Xiao Xiao, Bolin Liao, Qiuqing Long, Yongjun He, J. Li, Luyang Han
Traditional extreme learning machine (ELM) requires a large number of hidden layer neurons in its applications, and the ability to process high-dimensional big data samples is weak. In response to the above problems, this paper proposes an improved extreme learning machine algorithm based on deep learning. This algorithm combines the double pseudo-inverse extreme learning machine (DPELM) algorithm, which has high classification accuracy and simple network structure, with the denoising autoencoder (DAE) which can extract more essential data features. Among them, DAE is used to extract the features of the data that needs to be recognized, and the DPELM mainly plays as a classifier to quickly classify and recognize the extracted features. Experimental results show that in the recognition of handwritten digits, the double pseudo-inverse extreme learning machine based on denoising autoencoder (DAE-DPELM) algorithm needs only a small number of hidden layer neurons. In addition, compared with the traditional ELM algorithm and DAE-ELM algorithm, DAE-DPELM algorithm has a higher classification accuracy.
传统的极限学习机(ELM)在应用中需要大量的隐层神经元,处理高维大数据样本的能力较弱。针对上述问题,本文提出了一种基于深度学习的改进极限学习机算法。该算法将分类精度高、网络结构简单的双伪逆极值学习机(DPELM)算法与能够提取更多基本数据特征的去噪自编码器(DAE)算法相结合。其中,DAE用于提取需要识别的数据的特征,DPELM主要作为分类器对提取的特征进行快速分类和识别。实验结果表明,在手写体数字识别中,基于去噪自编码器(DAE-DPELM)算法的双伪逆极值学习机只需要少量的隐层神经元。此外,与传统ELM算法和DAE-ELM算法相比,DAE-DPELM算法具有更高的分类精度。
{"title":"Improved Extreme Learning Machine Based on Deep Learning and Its Application in Handwritten Digits Recognition","authors":"Xiao Xiao, Bolin Liao, Qiuqing Long, Yongjun He, J. Li, Luyang Han","doi":"10.1109/ICICIP53388.2021.9642213","DOIUrl":"https://doi.org/10.1109/ICICIP53388.2021.9642213","url":null,"abstract":"Traditional extreme learning machine (ELM) requires a large number of hidden layer neurons in its applications, and the ability to process high-dimensional big data samples is weak. In response to the above problems, this paper proposes an improved extreme learning machine algorithm based on deep learning. This algorithm combines the double pseudo-inverse extreme learning machine (DPELM) algorithm, which has high classification accuracy and simple network structure, with the denoising autoencoder (DAE) which can extract more essential data features. Among them, DAE is used to extract the features of the data that needs to be recognized, and the DPELM mainly plays as a classifier to quickly classify and recognize the extracted features. Experimental results show that in the recognition of handwritten digits, the double pseudo-inverse extreme learning machine based on denoising autoencoder (DAE-DPELM) algorithm needs only a small number of hidden layer neurons. In addition, compared with the traditional ELM algorithm and DAE-ELM algorithm, DAE-DPELM algorithm has a higher classification accuracy.","PeriodicalId":435799,"journal":{"name":"2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123764473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Action-Transformer for Action Recognition in Short Videos 短视频中动作识别的动作转换器
Pub Date : 2021-12-03 DOI: 10.1109/ICICIP53388.2021.9642184
Yumeng Cai, Guoyong Cai, Jin Cai
Action recognition methods are mostly based on a 3-Dimensional (3D) Convolution Network which have some limitations in practice, e.g. redundant parameters, big memory consumed and low performance. In this paper, a new convolution-free model called action-transformer is proposed to address the mentioned problems. The model proposed is mainly composed of three modules: spatial-temporal transformation module, hybrid feature attention module, and residual-transformer module. The spatial-temporal transformation module is designed to map the split short video into spatial and temporal features. The hybrid feature attention module is designed to extract the fine-grained features from the spatial and temporal features and produce the hybrid features. The residual-transformer module is designed with the combination of the attention, feed-forward network, and the residual mechanism to extract local and global features from the hybrid features. The model is tested on the HMDB51 and UCFIOI data set, and the result shows that the memory, the parameters used by the proposed model are less than those models mentioned in the literature, and it achieves better performance too.
动作识别方法大多基于三维卷积网络,在实际应用中存在参数冗余、内存消耗大、性能低等局限性。为了解决上述问题,本文提出了一种新的无卷积模型——动作转换器。该模型主要由三个模块组成:时空变换模块、混合特征注意模块和残差变压器模块。时空变换模块是将分割后的短视频映射为时空特征。混合特征注意模块旨在从时空特征中提取细粒度特征并生成混合特征。残差变压器模块将注意力、前馈网络和残差机制相结合,从混合特征中提取局部和全局特征。在HMDB51和UCFIOI数据集上对该模型进行了测试,结果表明,该模型所使用的内存和参数比文献中提到的模型要少,并且取得了更好的性能。
{"title":"Action-Transformer for Action Recognition in Short Videos","authors":"Yumeng Cai, Guoyong Cai, Jin Cai","doi":"10.1109/ICICIP53388.2021.9642184","DOIUrl":"https://doi.org/10.1109/ICICIP53388.2021.9642184","url":null,"abstract":"Action recognition methods are mostly based on a 3-Dimensional (3D) Convolution Network which have some limitations in practice, e.g. redundant parameters, big memory consumed and low performance. In this paper, a new convolution-free model called action-transformer is proposed to address the mentioned problems. The model proposed is mainly composed of three modules: spatial-temporal transformation module, hybrid feature attention module, and residual-transformer module. The spatial-temporal transformation module is designed to map the split short video into spatial and temporal features. The hybrid feature attention module is designed to extract the fine-grained features from the spatial and temporal features and produce the hybrid features. The residual-transformer module is designed with the combination of the attention, feed-forward network, and the residual mechanism to extract local and global features from the hybrid features. The model is tested on the HMDB51 and UCFIOI data set, and the result shows that the memory, the parameters used by the proposed model are less than those models mentioned in the literature, and it achieves better performance too.","PeriodicalId":435799,"journal":{"name":"2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133805967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Autonomous underwater vehicles (AUVs) path planning based on Deep Reinforcement Learning 基于深度强化学习的自主水下航行器路径规划
Pub Date : 2021-12-03 DOI: 10.1109/ICICIP53388.2021.9642175
Zhaolun Li, Xiao-peng Luo
For autonomous underwater vehicles (AUVs), autonomous navigation in an unknown underwater environment is still a difficult problem. In recent years, people have proposed some machine learning-based methods to solve this problem, but the existing methods still cannot meet the complex and changeable underwater environment. This paper conducts technical research on the path planning of autonomous underwater vehicles, combines deep learning and reinforcement learning, uses WL interpolation surface to model the seabed, and proposes a path planning model for autonomous underwater vehicles based on deep reinforcement learning. And train the path planning model in the simulation environment, and finally achieve the goal of path planning for the underwater robot in the complex and changeable underwater environment.
对于自主水下航行器(auv)来说,未知水下环境下的自主导航仍然是一个难题。近年来,人们提出了一些基于机器学习的方法来解决这一问题,但现有的方法仍然不能满足复杂多变的水下环境。本文对自主潜航器路径规划进行了技术研究,将深度学习与强化学习相结合,利用WL插值曲面对海底进行建模,提出了一种基于深度强化学习的自主潜航器路径规划模型。并在仿真环境中训练路径规划模型,最终实现水下机器人在复杂多变的水下环境中进行路径规划的目标。
{"title":"Autonomous underwater vehicles (AUVs) path planning based on Deep Reinforcement Learning","authors":"Zhaolun Li, Xiao-peng Luo","doi":"10.1109/ICICIP53388.2021.9642175","DOIUrl":"https://doi.org/10.1109/ICICIP53388.2021.9642175","url":null,"abstract":"For autonomous underwater vehicles (AUVs), autonomous navigation in an unknown underwater environment is still a difficult problem. In recent years, people have proposed some machine learning-based methods to solve this problem, but the existing methods still cannot meet the complex and changeable underwater environment. This paper conducts technical research on the path planning of autonomous underwater vehicles, combines deep learning and reinforcement learning, uses WL interpolation surface to model the seabed, and proposes a path planning model for autonomous underwater vehicles based on deep reinforcement learning. And train the path planning model in the simulation environment, and finally achieve the goal of path planning for the underwater robot in the complex and changeable underwater environment.","PeriodicalId":435799,"journal":{"name":"2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128358018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
An improved global k-means clustering algorithm 一种改进的全局k-means聚类算法
Pub Date : 2021-12-03 DOI: 10.1109/ICICIP53388.2021.9642224
Lu Wang, Xiaoyun Zhang, Huidong Wang, Chuanzheng Bai
K-means(KM) clustering algorithm is well known for its simplicity and efficiency. However, the clustering effect is greatly influenced by the selection of initial centers. To solve this problem, one of the improved algorithms is global k-means (GKM) which performs the clustering process in an incremental manner. This incremental manner makes GKM get rid of the influence of initial points selection and reach the global optimum or near global optimum results. However, GKM requires high computational cost. Therefore, an improved global k-means (IGKM) algorithm is proposed using a new guarantee reduction to reduce the computational load of GKM. Centroid theorem is introduced to reduce the computational time further. Simulation results on 14 datasets demonstrate that our IGKM algorithm can obtain better clustering results and requires less running time.
K-means(KM)聚类算法以其简单、高效而著称。然而,初始中心的选择对聚类效果有很大影响。为了解决这一问题,一种改进的算法是全局k-均值(GKM)算法,它以增量的方式执行聚类过程。这种增量方式使GKM摆脱了初始点选择的影响,达到全局最优或接近全局最优结果。然而,GKM需要很高的计算成本。为此,提出了一种改进的全局k-均值(IGKM)算法,采用新的保证约简来降低全局k-均值算法的计算量。引入质心定理,进一步缩短了计算时间。在14个数据集上的仿真结果表明,IGKM算法可以获得较好的聚类结果,并且需要较少的运行时间。
{"title":"An improved global k-means clustering algorithm","authors":"Lu Wang, Xiaoyun Zhang, Huidong Wang, Chuanzheng Bai","doi":"10.1109/ICICIP53388.2021.9642224","DOIUrl":"https://doi.org/10.1109/ICICIP53388.2021.9642224","url":null,"abstract":"K-means(KM) clustering algorithm is well known for its simplicity and efficiency. However, the clustering effect is greatly influenced by the selection of initial centers. To solve this problem, one of the improved algorithms is global k-means (GKM) which performs the clustering process in an incremental manner. This incremental manner makes GKM get rid of the influence of initial points selection and reach the global optimum or near global optimum results. However, GKM requires high computational cost. Therefore, an improved global k-means (IGKM) algorithm is proposed using a new guarantee reduction to reduce the computational load of GKM. Centroid theorem is introduced to reduce the computational time further. Simulation results on 14 datasets demonstrate that our IGKM algorithm can obtain better clustering results and requires less running time.","PeriodicalId":435799,"journal":{"name":"2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128467565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic-Consistent Deep Quantization for Cross-modal Retrieval 跨模态检索的语义一致深度量化
Pub Date : 2021-12-03 DOI: 10.1109/ICICIP53388.2021.9642180
Liya Ma, N. Zhang, Kuang-I Shu, Xitao Zou
With making up for the deficiency of the constraint representation capability of hashing codes for high-dimensional data, the quantization method has been found to generally perform better in cross-modal similarity retrieval research. However, in current quantization approaches, the codebook, as the most critical basis for quantization, is still in a passive status and detached from the learning framework. To improve the initiative of codebook, we propose a semantic-consistent deep quantization (SCDQ), which is the first scheme to integrate quantization into deep network learning in an end-to-end fashion. Specifically, two classifiers following the deep representation learning networks are formulated to produce the class-wise abstract patterns with the help of label alignment. Meanwhile, our approach learns a collaborative codebook for both modalities, which embeds bimodality semantic consistent information in codewords and bridges the relationship between the patterns in classifiers and codewords in codebook. By designing a novel algorithm architecture and codebook update strategy, SCDQ enables effective and efficient cross-modal retrieval in an asymmetric way. Extensive experiments on two benchmark datasets demonstrate that SCDQ yields optimal cross-modal retrieval performance and outperforms several state of-the-art cross-modal retrieval methods.
量化方法弥补了哈希码对高维数据约束表示能力的不足,在跨模态相似性检索研究中普遍表现较好。然而,在目前的量化方法中,码本作为量化最关键的基础,仍然处于被动状态,脱离了学习框架。为了提高码本的主动性,我们提出了语义一致深度量化(SCDQ),这是第一个将量化以端到端方式集成到深度网络学习中的方案。具体来说,两个分类器遵循深度表示学习网络,在标签对齐的帮助下产生类智能抽象模式。同时,我们的方法学习了一个两模态的协作码本,它在码字中嵌入了双模态语义一致信息,并在分类器模式和码字之间架起了桥梁。通过设计一种新颖的算法架构和码本更新策略,SCDQ能够以不对称的方式实现高效的跨模态检索。在两个基准数据集上的大量实验表明,SCDQ产生了最佳的跨模态检索性能,并且优于几种最先进的跨模态检索方法。
{"title":"Semantic-Consistent Deep Quantization for Cross-modal Retrieval","authors":"Liya Ma, N. Zhang, Kuang-I Shu, Xitao Zou","doi":"10.1109/ICICIP53388.2021.9642180","DOIUrl":"https://doi.org/10.1109/ICICIP53388.2021.9642180","url":null,"abstract":"With making up for the deficiency of the constraint representation capability of hashing codes for high-dimensional data, the quantization method has been found to generally perform better in cross-modal similarity retrieval research. However, in current quantization approaches, the codebook, as the most critical basis for quantization, is still in a passive status and detached from the learning framework. To improve the initiative of codebook, we propose a semantic-consistent deep quantization (SCDQ), which is the first scheme to integrate quantization into deep network learning in an end-to-end fashion. Specifically, two classifiers following the deep representation learning networks are formulated to produce the class-wise abstract patterns with the help of label alignment. Meanwhile, our approach learns a collaborative codebook for both modalities, which embeds bimodality semantic consistent information in codewords and bridges the relationship between the patterns in classifiers and codewords in codebook. By designing a novel algorithm architecture and codebook update strategy, SCDQ enables effective and efficient cross-modal retrieval in an asymmetric way. Extensive experiments on two benchmark datasets demonstrate that SCDQ yields optimal cross-modal retrieval performance and outperforms several state of-the-art cross-modal retrieval methods.","PeriodicalId":435799,"journal":{"name":"2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"154 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134212842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Continuous ZND (Zhang Neural Dynamics) Model for Generalized Sinkhorn Scaling of Time-Varying Matrix 时变矩阵广义Sinkhorn标度的连续ZND(张神经动力学)模型
Pub Date : 2021-12-03 DOI: 10.1109/ICICIP53388.2021.9642201
Jianzhen Xiao, Canhui Chen, Yunong Zhang
In this paper, we first propose a Zhang neural dynamics (ZND) model for the generalized Sinkhorn scaling of time-varying matrix. Specifically, by using the dimensional reduction technique, a continuous-time ZND model of time-varying matrix scaling is proposed and analyzed. In addition, the corresponding theoretical proofs are given, which prove the theoretical validity of the proposed ZND model. Moreover, two numerical experiments containing a square case and a rectangle case are also conducted. Numerical experiments and results substantiate the effectiveness and accuracy of the proposed ZND model.
本文首先提出了时变矩阵广义Sinkhorn标度的张神经动力学(ZND)模型。具体来说,利用降维技术,提出并分析了时变矩阵标度的连续时间ZND模型。并给出了相应的理论证明,证明了所提出的ZND模型的理论有效性。此外,还进行了方形和矩形两种情况下的数值实验。数值实验和结果验证了该模型的有效性和准确性。
{"title":"Continuous ZND (Zhang Neural Dynamics) Model for Generalized Sinkhorn Scaling of Time-Varying Matrix","authors":"Jianzhen Xiao, Canhui Chen, Yunong Zhang","doi":"10.1109/ICICIP53388.2021.9642201","DOIUrl":"https://doi.org/10.1109/ICICIP53388.2021.9642201","url":null,"abstract":"In this paper, we first propose a Zhang neural dynamics (ZND) model for the generalized Sinkhorn scaling of time-varying matrix. Specifically, by using the dimensional reduction technique, a continuous-time ZND model of time-varying matrix scaling is proposed and analyzed. In addition, the corresponding theoretical proofs are given, which prove the theoretical validity of the proposed ZND model. Moreover, two numerical experiments containing a square case and a rectangle case are also conducted. Numerical experiments and results substantiate the effectiveness and accuracy of the proposed ZND model.","PeriodicalId":435799,"journal":{"name":"2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134474940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Pulsar Identification Based on Variational Autoencoder and Residual Network 基于变分自编码器和残差网络的脉冲星识别
Pub Date : 2021-12-03 DOI: 10.1109/ICICIP53388.2021.9642198
Guiru Liu, Yefan Li, Zelun Bao, Qian Yin, Ping Guo
In modern astronomy, pulsar identification is a vital task to help researchers discovering new pulsars. With the great progress of modern radio telescopes improves, the amount of pulsar data collected increases exponentially, which causes the traditional pulsar identification approaches to be not enough to tackle such a large dataset. At present, many pulsar identification methods achieve promising performance based on deep neural networks. However, those neural-network-based methods still face the sample imbalance problem, which limits their performance. To be specific, the pulsar sample imbalance problem is that only an extremely limited number of real pulsar samples exist in dataset. To alleviate the problem and enhance the pulsar identification performance, we present a novel method under the framework of synergetic learning systems which includes the variational autoencoder and residual network. In this work, the variational autoencoder is used to generate some high-quality pulsar samples for training procedure to mitigate the pulsar sample imbalance problem, and then we present a residual-network-based model to promote pulsar candidate identification performance. Extensive experiments on two pulsar datasets demonstrate that our framework not only alleviates the imbalance problem, but also improves the accuracy of pulsar identification.
在现代天文学中,脉冲星识别是帮助研究人员发现新的脉冲星的重要任务。随着现代射电望远镜技术的不断进步,采集到的脉冲星数据量呈指数级增长,传统的脉冲星识别方法已不足以处理如此庞大的数据集。目前,许多基于深度神经网络的脉冲星识别方法都取得了很好的效果。然而,这些基于神经网络的方法仍然面临着样本不平衡的问题,这限制了它们的性能。具体来说,脉冲星样本不平衡问题是指数据集中存在的真实脉冲星样本数量极其有限。为了解决这一问题,提高脉冲星的识别性能,我们提出了一种基于变分自编码器和残差网络的协同学习系统。本文首先利用变分自编码器生成高质量脉冲星样本进行训练,以缓解脉冲星样本不平衡的问题,然后提出基于残差网络的脉冲星候选识别模型,提高脉冲星候选识别的性能。在两个脉冲星数据集上的大量实验表明,该框架不仅缓解了不平衡问题,而且提高了脉冲星识别的精度。
{"title":"Pulsar Identification Based on Variational Autoencoder and Residual Network","authors":"Guiru Liu, Yefan Li, Zelun Bao, Qian Yin, Ping Guo","doi":"10.1109/ICICIP53388.2021.9642198","DOIUrl":"https://doi.org/10.1109/ICICIP53388.2021.9642198","url":null,"abstract":"In modern astronomy, pulsar identification is a vital task to help researchers discovering new pulsars. With the great progress of modern radio telescopes improves, the amount of pulsar data collected increases exponentially, which causes the traditional pulsar identification approaches to be not enough to tackle such a large dataset. At present, many pulsar identification methods achieve promising performance based on deep neural networks. However, those neural-network-based methods still face the sample imbalance problem, which limits their performance. To be specific, the pulsar sample imbalance problem is that only an extremely limited number of real pulsar samples exist in dataset. To alleviate the problem and enhance the pulsar identification performance, we present a novel method under the framework of synergetic learning systems which includes the variational autoencoder and residual network. In this work, the variational autoencoder is used to generate some high-quality pulsar samples for training procedure to mitigate the pulsar sample imbalance problem, and then we present a residual-network-based model to promote pulsar candidate identification performance. Extensive experiments on two pulsar datasets demonstrate that our framework not only alleviates the imbalance problem, but also improves the accuracy of pulsar identification.","PeriodicalId":435799,"journal":{"name":"2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132890915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An Improved Form-Finding Method for Calculating Force Density with Group Theory 用群理论计算力密度的一种改进找形方法
Pub Date : 2021-12-03 DOI: 10.1109/ICICIP53388.2021.9642188
Taotao Heng, Liming Zhao, Keping Liu, Jiang Yi, Xiao-jun Duan, Zhongbo Sun
A form-finding method for symmetric tensegrity structure is proposed based on the eigenvalue minimization problem of force density matrix in this paper. The topology is the only premise condition about the structure. The problem to solve force density in the self-equilibrium tensegrity structure is transformed into a linear optimization problem, which the force density matrix under the rank deficiency condition. The constraints of the objective function can be established by the characteristics of member forces and the group theory. Then the nodal coordinates can be determined by eigenvalue decomposition once the force densities is obtained. In order to to show the efficiency of the proposed method, several simulations of tensegrity structures which include plane and spatial are demonstrated. It can be found that the form-finding process of symmetric tensegrity structure in the proposed method has the characteristics of rapid speed and high precision.
本文提出了一种基于力密度矩阵特征值最小化问题的对称张拉整体结构寻形方法。拓扑结构是结构的唯一前提条件。将求解自平衡张拉整体结构的力密度问题转化为求解秩亏条件下的力密度矩阵的线性优化问题。目标函数的约束可以通过构件力的特性和群论来确定。得到力密度后,通过特征值分解确定节点坐标。为了证明该方法的有效性,对平面和空间张拉整体结构进行了仿真。结果表明,该方法对对称张拉整体结构的找形过程具有速度快、精度高等特点。
{"title":"An Improved Form-Finding Method for Calculating Force Density with Group Theory","authors":"Taotao Heng, Liming Zhao, Keping Liu, Jiang Yi, Xiao-jun Duan, Zhongbo Sun","doi":"10.1109/ICICIP53388.2021.9642188","DOIUrl":"https://doi.org/10.1109/ICICIP53388.2021.9642188","url":null,"abstract":"A form-finding method for symmetric tensegrity structure is proposed based on the eigenvalue minimization problem of force density matrix in this paper. The topology is the only premise condition about the structure. The problem to solve force density in the self-equilibrium tensegrity structure is transformed into a linear optimization problem, which the force density matrix under the rank deficiency condition. The constraints of the objective function can be established by the characteristics of member forces and the group theory. Then the nodal coordinates can be determined by eigenvalue decomposition once the force densities is obtained. In order to to show the efficiency of the proposed method, several simulations of tensegrity structures which include plane and spatial are demonstrated. It can be found that the form-finding process of symmetric tensegrity structure in the proposed method has the characteristics of rapid speed and high precision.","PeriodicalId":435799,"journal":{"name":"2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121781361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
DCTCN: Deep Complex Temporal Convolutional Network for Real Time Speech Enhancement 用于实时语音增强的深度复杂时间卷积网络
Pub Date : 2021-12-03 DOI: 10.1109/ICICIP53388.2021.9642159
Huanbin Zou, Jie Zhu
Recently, deep learning-based speech enhancement approaches have been researched extensively. Most methods focus on reconstructing the target clean speech’s magnitude spectrum from noisy speech’s magnitude spectrum, and then combine the noisy speech’s phase spectrum to synthesize the waveform. In this paper, we propose a complex network-based model called Deep Complex Temporal Convolutional Network (DCTCN) to estimate the complex-valued short-time Fourier transform (STFT) of target speech from noisy speech. We design a temporal convolutional network (TCN) block based on complex dilated causal convolution. In our proposed DCTCN, we achieve an outstanding denoising performance with a low complexity of 1.33M parameters. The experiments are conducted on the DNS Challenge dataset, and the results show that complex operations and TCN blocks have significant positive effects in noise suppression.
近年来,基于深度学习的语音增强方法得到了广泛的研究。大多数方法都是将噪声语音的幅值谱重构为目标干净语音的幅值谱,然后将噪声语音的相位谱结合起来合成波形。本文提出了一种基于复杂网络的深度复杂时间卷积网络(DCTCN)模型,用于从噪声语音中估计目标语音的复值短时傅里叶变换(STFT)。我们设计了一个基于复扩展因果卷积的时间卷积网络(TCN)块。在我们提出的DCTCN中,我们以1.33M的低复杂度实现了出色的去噪性能。在DNS Challenge数据集上进行了实验,结果表明,复杂操作和TCN块对噪声抑制有显著的积极作用。
{"title":"DCTCN: Deep Complex Temporal Convolutional Network for Real Time Speech Enhancement","authors":"Huanbin Zou, Jie Zhu","doi":"10.1109/ICICIP53388.2021.9642159","DOIUrl":"https://doi.org/10.1109/ICICIP53388.2021.9642159","url":null,"abstract":"Recently, deep learning-based speech enhancement approaches have been researched extensively. Most methods focus on reconstructing the target clean speech’s magnitude spectrum from noisy speech’s magnitude spectrum, and then combine the noisy speech’s phase spectrum to synthesize the waveform. In this paper, we propose a complex network-based model called Deep Complex Temporal Convolutional Network (DCTCN) to estimate the complex-valued short-time Fourier transform (STFT) of target speech from noisy speech. We design a temporal convolutional network (TCN) block based on complex dilated causal convolution. In our proposed DCTCN, we achieve an outstanding denoising performance with a low complexity of 1.33M parameters. The experiments are conducted on the DNS Challenge dataset, and the results show that complex operations and TCN blocks have significant positive effects in noise suppression.","PeriodicalId":435799,"journal":{"name":"2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125880629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2021 11th International Conference on Intelligent Control and Information Processing (ICICIP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1