首页 > 最新文献

2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)最新文献

英文 中文
Distributed Learning in Trusted Execution Environment: A Case Study of Federated Learning in SGX 可信执行环境下的分布式学习:SGX中联邦学习的案例研究
Pub Date : 2021-11-17 DOI: 10.1109/IC-NIDC54101.2021.9660433
Tianxing Xu, Konglin Zhu, A. Andrzejak, Lin Zhang
Federated Learning (FL) is a distributed machine learning paradigm to solve isolated data island problems under privacy constraints. Recent works reveal that FL still exists security problems in which attackers can infer private data from gradients. In this paper, we propose a distributed FL framework in Trusted Execution Environment (TEE) to protect gradients in the perspective of hardware. We use trusted Software Guard eXtensions (SGX) as an instance to implement the FL, and proposed an SGX-FL framework. Firstly, to break through the limitation of physical memory space in SGX and meanwhile preserve the privacy, we leverage a gradient filtering mechanism to obtain the “important” gradients which preserve the utmost data privacy and put them into SGX. Secondly, to enhance the global adhesion of gradients so that the important gradients can be aggregated at maximum, a grouping method is carried out to put the most appropriate number of members into one group. Finally, to keep the accuracy of the FL model, the secondary gradients of group members and aggregated important gradients are simultaneously uploaded to the server and the computation procedure is validated by the integrity method of SGX. The evaluation results show that the proposed SGX-FL reduces the computation cost by 19 times compared with the existing approaches.
联邦学习(FL)是一种分布式机器学习范式,用于解决隐私约束下的孤立数据岛问题。最近的研究表明,FL仍然存在安全问题,攻击者可以从梯度中推断私人数据。本文提出了一种基于可信执行环境(TEE)的分布式FL框架,从硬件角度保护梯度。以可信软件保护扩展(SGX)为例,提出了一个可信软件保护扩展框架。首先,为了突破SGX物理内存空间的限制,在保护隐私的同时,我们利用梯度过滤机制,获得保护最大数据隐私的“重要”梯度,并将其放入SGX中。其次,为了增强梯度的全局粘附性,最大限度地聚集重要梯度,采用分组方法,将最合适数量的梯度成员归为一组;最后,为了保证FL模型的准确性,将组成员的次级梯度和聚合的重要梯度同时上传到服务器,并通过SGX的完整性方法对计算过程进行验证。评估结果表明,与现有方法相比,所提出的SGX-FL算法的计算量减少了19倍。
{"title":"Distributed Learning in Trusted Execution Environment: A Case Study of Federated Learning in SGX","authors":"Tianxing Xu, Konglin Zhu, A. Andrzejak, Lin Zhang","doi":"10.1109/IC-NIDC54101.2021.9660433","DOIUrl":"https://doi.org/10.1109/IC-NIDC54101.2021.9660433","url":null,"abstract":"Federated Learning (FL) is a distributed machine learning paradigm to solve isolated data island problems under privacy constraints. Recent works reveal that FL still exists security problems in which attackers can infer private data from gradients. In this paper, we propose a distributed FL framework in Trusted Execution Environment (TEE) to protect gradients in the perspective of hardware. We use trusted Software Guard eXtensions (SGX) as an instance to implement the FL, and proposed an SGX-FL framework. Firstly, to break through the limitation of physical memory space in SGX and meanwhile preserve the privacy, we leverage a gradient filtering mechanism to obtain the “important” gradients which preserve the utmost data privacy and put them into SGX. Secondly, to enhance the global adhesion of gradients so that the important gradients can be aggregated at maximum, a grouping method is carried out to put the most appropriate number of members into one group. Finally, to keep the accuracy of the FL model, the secondary gradients of group members and aggregated important gradients are simultaneously uploaded to the server and the computation procedure is validated by the integrity method of SGX. The evaluation results show that the proposed SGX-FL reduces the computation cost by 19 times compared with the existing approaches.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"232 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115266790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Construction of Error Correcting Output Codes for Robust Deep Neural Networks Based on Label Grouping Scheme 基于标签分组方案的鲁棒深度神经网络纠错输出码构建
Pub Date : 2021-11-17 DOI: 10.1109/IC-NIDC54101.2021.9660486
Hwiyoung Youn, Soonhee Kwon, Hyunhee Lee, Jiho Kim, Songnam Hong, Dong-joon Shin
Error-Correcting Output Codes (ECOCs) have been proposed to construct multi-class classifiers using simple binary classifiers. Recently, the principle of ECOCs has been employed for improving the robustness of deep classifiers. In this paper, a novel ECOC framework is developed by presenting a novel label grouping and code-construction method. The proposed label grouping is based on linear discriminant analysis (LDA) similarity. Via simulations, it is demonstrated that deep classifiers trained with the proposed ECOC yield better classification performance on pure data and better adversarial robustness than the state-of-the-art deep neural classifiers using ECOCs.
纠错输出码(Error-Correcting Output Codes, ecoc)是一种利用简单的二值分类器来构造多类分类器的方法。近年来,深度分类器的鲁棒性得到了广泛的应用。本文通过提出一种新的标签分组和编码构造方法,提出了一种新的ECOC框架。提出的标签分组基于线性判别分析(LDA)相似度。通过仿真证明,与使用ECOC的最先进的深度神经分类器相比,使用ECOC训练的深度分类器在纯数据上具有更好的分类性能和更好的对抗鲁棒性。
{"title":"Construction of Error Correcting Output Codes for Robust Deep Neural Networks Based on Label Grouping Scheme","authors":"Hwiyoung Youn, Soonhee Kwon, Hyunhee Lee, Jiho Kim, Songnam Hong, Dong-joon Shin","doi":"10.1109/IC-NIDC54101.2021.9660486","DOIUrl":"https://doi.org/10.1109/IC-NIDC54101.2021.9660486","url":null,"abstract":"Error-Correcting Output Codes (ECOCs) have been proposed to construct multi-class classifiers using simple binary classifiers. Recently, the principle of ECOCs has been employed for improving the robustness of deep classifiers. In this paper, a novel ECOC framework is developed by presenting a novel label grouping and code-construction method. The proposed label grouping is based on linear discriminant analysis (LDA) similarity. Via simulations, it is demonstrated that deep classifiers trained with the proposed ECOC yield better classification performance on pure data and better adversarial robustness than the state-of-the-art deep neural classifiers using ECOCs.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122552993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Spectrum Sensing Method Based on CNN-LSTM Deep Neural Network 基于CNN-LSTM深度神经网络的频谱感知方法
Pub Date : 2021-11-17 DOI: 10.1109/IC-NIDC54101.2021.9660470
Shujian Zhang, Zhan Xu, Lu Tian, Xiaolong Yang
Spectrum sensing can effectively improve the low utilization of spectrum resources and is one of the crucial components of cognitive radio networks. This paper proposes a Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) network cascaded spectrum sensing model. The model uses CNN to analyze the Short-Time Fourier transform spectrogram of the blind signal. Then the generated feature vector or feature map is passed to the LSTM according to the timestamp. Finally, it detects a signal in a specific spectrum and classifies the signal type to identify multiple signals accurately. The neural network model improves the detection probability by simultaneously acquiring the spatial and temporal characteristics of the blind signal. The experimental results show that the method in this paper can detect a variety of signals with higher detection probability within a wide range of SNR, especially under the condition of low SNR.
频谱感知可以有效改善频谱资源利用率低的现状,是认知无线电网络的重要组成部分之一。提出了一种卷积神经网络(CNN)和长短期记忆(LSTM)网络级联的频谱感知模型。该模型利用CNN对盲信号的短时傅里叶变换谱图进行分析。然后根据时间戳将生成的特征向量或特征映射传递给LSTM。最后,对特定频谱中的信号进行检测,并对信号类型进行分类,以准确识别多个信号。该神经网络模型通过同时获取盲信号的时空特征来提高检测概率。实验结果表明,本文方法可以在较宽的信噪比范围内,特别是在低信噪比条件下,以较高的检测概率检测多种信号。
{"title":"A Spectrum Sensing Method Based on CNN-LSTM Deep Neural Network","authors":"Shujian Zhang, Zhan Xu, Lu Tian, Xiaolong Yang","doi":"10.1109/IC-NIDC54101.2021.9660470","DOIUrl":"https://doi.org/10.1109/IC-NIDC54101.2021.9660470","url":null,"abstract":"Spectrum sensing can effectively improve the low utilization of spectrum resources and is one of the crucial components of cognitive radio networks. This paper proposes a Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) network cascaded spectrum sensing model. The model uses CNN to analyze the Short-Time Fourier transform spectrogram of the blind signal. Then the generated feature vector or feature map is passed to the LSTM according to the timestamp. Finally, it detects a signal in a specific spectrum and classifies the signal type to identify multiple signals accurately. The neural network model improves the detection probability by simultaneously acquiring the spatial and temporal characteristics of the blind signal. The experimental results show that the method in this paper can detect a variety of signals with higher detection probability within a wide range of SNR, especially under the condition of low SNR.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122587725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Forward Hybrid Routing Algorithm by Small Node Group in Underwater Acoustic Communication 水声通信中的小节点群动态前向混合路由算法
Pub Date : 2021-11-17 DOI: 10.1109/IC-NIDC54101.2021.9660594
Zengjun Niu, K. Niu, Zhiqiang He
Underwater acoustic network plays an important role in various operations in the marine environment. It is very necessary to design a routing protocol that can adapt to the underwater environment with high time-varying and Doppler frequency shift. In this paper, we present a dynamic forward hybrid routing algorithm based on the small node group (SG-DFHR), to achieve a higher packet delivery ratio and lower energy consumption. The working process of SG-DFHR can be mainly divided into two stages. First, the randomly deployed underwater nodes are divided into several groups comprised of three nodes, which respectively serve as the master, secondary and ordinary node. Then a hybrid routing strategy is implemented for multi-hop transmission. In which, the master node uses the multicast for data packet transmission, while the rest nodes use the unicast method. Furthermore, in order to adapt to the dynamic changes of underwater network, we design a node group inspect and update strategy. The simulation and theoretical analysis show that our algorithm has superior performance over the ALRP, DCK-S-BEAR and SUN. Compared with the previous algorithms, under the premise of no significant increase in delay, the energy consumption and packet delivery performances of SG-DFHR are significantly improved. SG-DFHR achieves effective tradeoff among multiple performance metrics.
水声网络在海洋环境的各种作业中起着重要的作用。设计一种能够适应时变和多普勒频移的水下环境的路由协议是十分必要的。本文提出了一种基于小节点群(SG-DFHR)的动态前向混合路由算法,以实现更高的分组传送率和更低的能耗。SG-DFHR的工作过程主要分为两个阶段。首先,将随机部署的水下节点分成若干组,每组3个节点,分别作为主节点、辅助节点和普通节点。然后实现了多跳传输的混合路由策略。其中主节点采用组播方式传输数据包,其余节点采用单播方式传输数据包。此外,为了适应水下网络的动态变化,设计了节点群检测与更新策略。仿真和理论分析表明,该算法优于ALRP、DCK-S-BEAR和SUN算法。与之前的算法相比,SG-DFHR算法在不显著增加时延的前提下,能耗和分组传输性能均有显著提高。SG-DFHR在多个性能指标之间实现了有效的权衡。
{"title":"Dynamic Forward Hybrid Routing Algorithm by Small Node Group in Underwater Acoustic Communication","authors":"Zengjun Niu, K. Niu, Zhiqiang He","doi":"10.1109/IC-NIDC54101.2021.9660594","DOIUrl":"https://doi.org/10.1109/IC-NIDC54101.2021.9660594","url":null,"abstract":"Underwater acoustic network plays an important role in various operations in the marine environment. It is very necessary to design a routing protocol that can adapt to the underwater environment with high time-varying and Doppler frequency shift. In this paper, we present a dynamic forward hybrid routing algorithm based on the small node group (SG-DFHR), to achieve a higher packet delivery ratio and lower energy consumption. The working process of SG-DFHR can be mainly divided into two stages. First, the randomly deployed underwater nodes are divided into several groups comprised of three nodes, which respectively serve as the master, secondary and ordinary node. Then a hybrid routing strategy is implemented for multi-hop transmission. In which, the master node uses the multicast for data packet transmission, while the rest nodes use the unicast method. Furthermore, in order to adapt to the dynamic changes of underwater network, we design a node group inspect and update strategy. The simulation and theoretical analysis show that our algorithm has superior performance over the ALRP, DCK-S-BEAR and SUN. Compared with the previous algorithms, under the premise of no significant increase in delay, the energy consumption and packet delivery performances of SG-DFHR are significantly improved. SG-DFHR achieves effective tradeoff among multiple performance metrics.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114922688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Video Summarization Based on Fusing Features and Shot Segmentation 基于融合特征和镜头分割的视频摘要
Pub Date : 2021-11-17 DOI: 10.1109/IC-NIDC54101.2021.9660579
Xuming Feng, Yaping Zhu, Cheng Yang
Video summarization is a technique that creates short summaries from original videos while retaining the main representative information. Traditional video summarization models based on deep learning mostly use frames as the basic processing unit, which cannot handle long videos due to hardware limitations. In this paper, we compress the frame-level features into shot-level features using a feature extractor based on Convolutional Neural Network (CNN), which can improve the training accuracy and reduce computation. At the same time, we propose a feature fusion algorithm based on the capsule network, which combines the RGB features and Light Flow features of the video into the deep features with adaptive weights to enhance the original video features. Experiment results on two benchmark datasets (TVsum and SumMe) validate the effectiveness of our method.
视频摘要是一种从原始视频中创建简短摘要,同时保留主要代表性信息的技术。传统的基于深度学习的视频摘要模型多采用帧作为基本处理单元,由于硬件的限制,无法处理长视频。本文利用基于卷积神经网络(CNN)的特征提取器将帧级特征压缩为镜头级特征,提高了训练精度,减少了计算量。同时,我们提出了一种基于胶囊网络的特征融合算法,将视频的RGB特征和Light Flow特征结合到深度特征中,利用自适应权值增强原始视频特征。在两个基准数据集(TVsum和SumMe)上的实验结果验证了该方法的有效性。
{"title":"Video Summarization Based on Fusing Features and Shot Segmentation","authors":"Xuming Feng, Yaping Zhu, Cheng Yang","doi":"10.1109/IC-NIDC54101.2021.9660579","DOIUrl":"https://doi.org/10.1109/IC-NIDC54101.2021.9660579","url":null,"abstract":"Video summarization is a technique that creates short summaries from original videos while retaining the main representative information. Traditional video summarization models based on deep learning mostly use frames as the basic processing unit, which cannot handle long videos due to hardware limitations. In this paper, we compress the frame-level features into shot-level features using a feature extractor based on Convolutional Neural Network (CNN), which can improve the training accuracy and reduce computation. At the same time, we propose a feature fusion algorithm based on the capsule network, which combines the RGB features and Light Flow features of the video into the deep features with adaptive weights to enhance the original video features. Experiment results on two benchmark datasets (TVsum and SumMe) validate the effectiveness of our method.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131042036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Economic Development Analysis of the Belt and Road Regions Based on Automatic Interpretation of Remote Sensing Images 基于遥感影像自动解译的“一带一路”区域经济发展分析
Pub Date : 2021-11-17 DOI: 10.1109/IC-NIDC54101.2021.9660561
Xinzhu Qiu, Yunzhe Wang, Jingyi Cao, Guannan Xu, Yanan You, Junlong Ren
The Belt and Road (B&R) initiative is proposed to promote common development among countries along the B&R. In recent years, although the B&R has contributed to the regions along the route, it is always a controversial topic in the international community. A number of scholars have done a set of research works to analyze the effects of the B&R projects based on traditional economic methods. However, the drawbacks of subjectivity and delay reduce the conviction of the analysis results. In this paper, we leverage the objectivity and real-time features of remote sensing (RS) images to analyze the effects of the B&R project. Our research takes Voi town along the Mongolia-Nairobi Railway as the representative city. In addition, in order to prove the causal relationship between the B&R and economic development, we select the Taveta town as the comparison city. The semantic segmentation based on deep learning is applied to the multi-temporal RS images, to retrieve the economic development by automatically recognizing houses. On this basis, the construction and development of both the studied region and the comparison are quantitatively analyzed by meshing analysis and standard deviation elliptic methods. For overcoming the shortages of the conventional algorithms, a novel segmentation network based on the attention mechanism is proposed. The evaluation proves the semantic segmentation results can fully support the follow-up data analysis. In addition, the analysis results show that our work is a convincing initiative to reveal the values of the B&R projects for economic developments in the B&R-related regions.
“一带一路”倡议旨在促进沿线国家共同发展。近年来,“一带一路”虽然对沿线地区做出了贡献,但在国际社会一直是一个有争议的话题。一些学者基于传统的经济学方法对“一带一路”项目的影响进行了一系列的研究。然而,分析结果的主观性和滞后性降低了分析结果的可信度。在本文中,我们利用遥感(RS)图像的客观性和实时性特征来分析贝加莱项目的效果。本研究以蒙内铁路沿线的Voi镇为代表城市。此外,为了证明“一带一路”与经济发展之间的因果关系,我们选择了塔维塔镇作为比较城市。将基于深度学习的语义分割技术应用于多时相遥感图像,通过自动识别房屋来检索经济发展状况。在此基础上,通过网格分析和标准差椭圆法对研究区域的建设和发展以及对比进行了定量分析。针对传统分割算法的不足,提出了一种基于注意力机制的分割网络。评价结果表明,语义分割的结果可以完全支持后续的数据分析。此外,分析结果表明,我们的工作是一个令人信服的倡议,揭示了“一带一路”项目对“一带一路”相关地区经济发展的价值。
{"title":"Economic Development Analysis of the Belt and Road Regions Based on Automatic Interpretation of Remote Sensing Images","authors":"Xinzhu Qiu, Yunzhe Wang, Jingyi Cao, Guannan Xu, Yanan You, Junlong Ren","doi":"10.1109/IC-NIDC54101.2021.9660561","DOIUrl":"https://doi.org/10.1109/IC-NIDC54101.2021.9660561","url":null,"abstract":"The Belt and Road (B&R) initiative is proposed to promote common development among countries along the B&R. In recent years, although the B&R has contributed to the regions along the route, it is always a controversial topic in the international community. A number of scholars have done a set of research works to analyze the effects of the B&R projects based on traditional economic methods. However, the drawbacks of subjectivity and delay reduce the conviction of the analysis results. In this paper, we leverage the objectivity and real-time features of remote sensing (RS) images to analyze the effects of the B&R project. Our research takes Voi town along the Mongolia-Nairobi Railway as the representative city. In addition, in order to prove the causal relationship between the B&R and economic development, we select the Taveta town as the comparison city. The semantic segmentation based on deep learning is applied to the multi-temporal RS images, to retrieve the economic development by automatically recognizing houses. On this basis, the construction and development of both the studied region and the comparison are quantitatively analyzed by meshing analysis and standard deviation elliptic methods. For overcoming the shortages of the conventional algorithms, a novel segmentation network based on the attention mechanism is proposed. The evaluation proves the semantic segmentation results can fully support the follow-up data analysis. In addition, the analysis results show that our work is a convincing initiative to reveal the values of the B&R projects for economic developments in the B&R-related regions.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132725693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speech Separation Based on DPTNet with Sparse Attention 基于DPTNet的稀疏注意语音分离
Pub Date : 2021-11-17 DOI: 10.1109/IC-NIDC54101.2021.9660488
Beom Jun Woo, H. Kim, Jeunghun Kim, N. Kim
This paper presents a sparse attention-based speech separation algorithm separating and generating clean speech from mixed audio containing speech from multiple speakers. Recent development of deep learning has enabled several speech separation models to generate clean speech audios. Especially speech separation models based on transformer show high performance due to their ability to learn long term dependencies compared with other neural network structures. However, as a transformer with self-attention falls short of catching short-term dependencies, we adopt sparse attention structure to the original transformer-based speech separation model. We show that the model with sparse attention outperforms the original full attention method.
本文提出了一种基于稀疏注意力的语音分离算法,从包含多个说话人语音的混合音频中分离并生成干净的语音。深度学习的最新发展使几种语音分离模型能够生成干净的语音音频。特别是基于变压器的语音分离模型,由于其学习长期依赖关系的能力,与其他神经网络结构相比,表现出了较高的性能。然而,由于具有自注意的变压器无法捕捉短期依赖关系,我们对原有的基于变压器的语音分离模型采用了稀疏注意结构。结果表明,稀疏注意模型优于原全注意模型。
{"title":"Speech Separation Based on DPTNet with Sparse Attention","authors":"Beom Jun Woo, H. Kim, Jeunghun Kim, N. Kim","doi":"10.1109/IC-NIDC54101.2021.9660488","DOIUrl":"https://doi.org/10.1109/IC-NIDC54101.2021.9660488","url":null,"abstract":"This paper presents a sparse attention-based speech separation algorithm separating and generating clean speech from mixed audio containing speech from multiple speakers. Recent development of deep learning has enabled several speech separation models to generate clean speech audios. Especially speech separation models based on transformer show high performance due to their ability to learn long term dependencies compared with other neural network structures. However, as a transformer with self-attention falls short of catching short-term dependencies, we adopt sparse attention structure to the original transformer-based speech separation model. We show that the model with sparse attention outperforms the original full attention method.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122368474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Auxiliary Glasses Designed for Visually Impaired People with Color Blindness and Regional Visual Impairments 专为色盲和区域视觉障碍的视障人士设计的辅助眼镜
Pub Date : 2021-11-17 DOI: 10.1109/IC-NIDC54101.2021.9660527
Zexuan Liu, Chuang Zhang, Ming Wu, Shizun Wang
Visually impaired people are numerous in society and in a vulnerable position, and social concerns to these disadvantaged groups should be increasing. This paper introduces a pair of AR glasses with corresponding app designed and developed for color-blind people and people with regional visual impairments. The content captured by the camera is recolored by color rotation and displayed on the glasses, so that the color-blind people can better distinguish colors. We also creatively put forward the idea of transfer of the visual field, which assists the visually impaired in the area to perceive the scene ahead. Result from 24 volunteers proves that the pair of glasses can greatly assist the life of visually impaired people.
视障者在社会上数量众多,处于弱势地位,社会对这些弱势群体的关注应该增加。本文介绍了一款为色盲和区域视觉障碍人士设计开发的AR眼镜及其相应的app。摄像头捕捉到的内容通过颜色旋转重新上色,显示在眼镜上,使色盲的人能更好地分辨颜色。我们还创造性地提出了视野转移的想法,帮助该地区的视障人士感知前方的场景。24名志愿者的测试结果证明,这副眼镜对视障人士的生活有很大的帮助。
{"title":"Auxiliary Glasses Designed for Visually Impaired People with Color Blindness and Regional Visual Impairments","authors":"Zexuan Liu, Chuang Zhang, Ming Wu, Shizun Wang","doi":"10.1109/IC-NIDC54101.2021.9660527","DOIUrl":"https://doi.org/10.1109/IC-NIDC54101.2021.9660527","url":null,"abstract":"Visually impaired people are numerous in society and in a vulnerable position, and social concerns to these disadvantaged groups should be increasing. This paper introduces a pair of AR glasses with corresponding app designed and developed for color-blind people and people with regional visual impairments. The content captured by the camera is recolored by color rotation and displayed on the glasses, so that the color-blind people can better distinguish colors. We also creatively put forward the idea of transfer of the visual field, which assists the visually impaired in the area to perceive the scene ahead. Result from 24 volunteers proves that the pair of glasses can greatly assist the life of visually impaired people.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115835916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and Implementation of Human Intention Estimation System Based on Eye Movement Tracking 基于眼动跟踪的人类意图估计系统的设计与实现
Pub Date : 2021-11-17 DOI: 10.1109/IC-NIDC54101.2021.9660411
Jiaming Yang, Shouzhi Yu, Mingwei Ding
Eyes are the window of the mind, and eye movement contains a lot of effective information. With the breakthrough of artificial intelligence and human-computer interaction, eye movement tracking technology has gradually moved to the foreground in recent years. This paper takes medical health as a typical scenario, carries out innovative research around the potential application of eye movement tracking technology in patient communication and human-computer interaction, proposes and implements an intention estimation system based on eye movement tracking. Based on the physiology, medicine and ergonomics of eye movement, the system makes original innovations, extracts a set of eye movement parameters with high recognition accuracy and high expression efficiency, summarizes eye movement instructions that can meet the application of silent environment, and finally forms a series of embryonic applications related to interactive instructions.
眼睛是心灵的窗口,眼动包含着很多有效的信息。随着人工智能和人机交互的突破,眼动追踪技术近年来逐渐走向前台。本文以医疗健康为典型场景,围绕眼动追踪技术在患者交流和人机交互中的潜在应用进行创新研究,提出并实现了一种基于眼动追踪的意图估计系统。该系统基于眼动的生理学、医学和工效学原理,进行原创性创新,提取出一组识别精度高、表达效率高的眼动参数,总结出能够满足静音环境应用的眼动指令,最终形成了一系列与交互指令相关的应用雏形。
{"title":"Design and Implementation of Human Intention Estimation System Based on Eye Movement Tracking","authors":"Jiaming Yang, Shouzhi Yu, Mingwei Ding","doi":"10.1109/IC-NIDC54101.2021.9660411","DOIUrl":"https://doi.org/10.1109/IC-NIDC54101.2021.9660411","url":null,"abstract":"Eyes are the window of the mind, and eye movement contains a lot of effective information. With the breakthrough of artificial intelligence and human-computer interaction, eye movement tracking technology has gradually moved to the foreground in recent years. This paper takes medical health as a typical scenario, carries out innovative research around the potential application of eye movement tracking technology in patient communication and human-computer interaction, proposes and implements an intention estimation system based on eye movement tracking. Based on the physiology, medicine and ergonomics of eye movement, the system makes original innovations, extracts a set of eye movement parameters with high recognition accuracy and high expression efficiency, summarizes eye movement instructions that can meet the application of silent environment, and finally forms a series of embryonic applications related to interactive instructions.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117080446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High Figure of Merit Magnetic Field Sensor Based on Photonic Crystal Slab Supporting Quasi Bound States in The Continuum 基于连续介质中支持准束缚态的光子晶体板的高品质磁场传感器
Pub Date : 2021-11-17 DOI: 10.1109/IC-NIDC54101.2021.9660425
Zhe Han, Chao Wang, Zixing Gou, Huiping Tian
In this paper, a magnetic field sensor (MFS) with high figure of merit (FOM) is theoretically proposed, which is based on photonic crystal slab (PhCS) covered by magnetic fluid film (MFF). The PhCS consists of a two dimensionally periodic nanohole array introduced into a silicon slab. The large-sized nanohole is used to increase the area of light-matter interaction. By slightly breaking the symmetry of nanoholes, quasi bound states in the continuum (BIC) with Fano line shape is excited in the PhCS, which is sensitive to external magnetic field and has a high Q-factor. The effect of MFF thickness on the magnetic field sensitivity is investigated. Furthermore, high resonance amplitude of 0.97 and low limit of detection (LOD) of 6.1×10−5 T are achieved. Compared with the researches lately published, the sensor exhibits high Q-factor and high sensitivity. Therefore, we believe the proposed sensor will contribute to the lab-on-chip magnetic field detection system design.
本文从理论上提出了一种高品质因数磁场传感器(MFS),该传感器基于磁流体膜(MFF)覆盖的光子晶体板(PhCS)。PhCS由引入硅板的二维周期性纳米孔阵列组成。大尺寸的纳米孔用于增加光-物质相互作用的面积。通过轻微破坏纳米孔的对称性,在PhCS中激发具有Fano线形状的连续介质(BIC)准束缚态,该连续介质对外部磁场敏感,具有高q因子。研究了MFF厚度对磁场灵敏度的影响。此外,还实现了0.97的高共振幅值和6.1×10−5 T的低检出限。与已有的研究成果相比,该传感器具有高q因子和高灵敏度。因此,我们相信所提出的传感器将有助于芯片上实验室磁场检测系统的设计。
{"title":"High Figure of Merit Magnetic Field Sensor Based on Photonic Crystal Slab Supporting Quasi Bound States in The Continuum","authors":"Zhe Han, Chao Wang, Zixing Gou, Huiping Tian","doi":"10.1109/IC-NIDC54101.2021.9660425","DOIUrl":"https://doi.org/10.1109/IC-NIDC54101.2021.9660425","url":null,"abstract":"In this paper, a magnetic field sensor (MFS) with high figure of merit (FOM) is theoretically proposed, which is based on photonic crystal slab (PhCS) covered by magnetic fluid film (MFF). The PhCS consists of a two dimensionally periodic nanohole array introduced into a silicon slab. The large-sized nanohole is used to increase the area of light-matter interaction. By slightly breaking the symmetry of nanoholes, quasi bound states in the continuum (BIC) with Fano line shape is excited in the PhCS, which is sensitive to external magnetic field and has a high Q-factor. The effect of MFF thickness on the magnetic field sensitivity is investigated. Furthermore, high resonance amplitude of 0.97 and low limit of detection (LOD) of 6.1×10−5 T are achieved. Compared with the researches lately published, the sensor exhibits high Q-factor and high sensitivity. Therefore, we believe the proposed sensor will contribute to the lab-on-chip magnetic field detection system design.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115599468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1