首页 > 最新文献

2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)最新文献

英文 中文
An Audio-Visual System for Sound Noise Reduction Based on Deep Neural Networks 基于深度神经网络的声音降噪视听系统
Pub Date : 2021-12-29 DOI: 10.1109/ICSPIS54653.2021.9729351
Seyedeh Sogand Hashemi, M. Asadi, M. Aghabozorgi
Audio noise has no unique definition, but in general, it includes background and environmental sounds such as objects movements, animal sounds, and etc. These sounds distract listeners and lead to loss of main content. Noise reduction is a process for removing such these unwanted sounds and extracts clear noise-free sound of an audio source. All proposed methods for this problem deal with some challenges such as residual noise, low speed performance, ambiguity in separation. In this paper an automated system is proposed to eliminate noise signal from noisy audio of an audio-visual data. This system utilizes audio and visual features of main sound source (musical instruments) to feed its two internal DNN based models: a) object detection and b) sound separation model. First, an object detection model which is designed by transfer learning method is used to identify sound source in video frames. Then based on detected source, a specific sound separation model is applied to noisy signal and extracts the noise-free audio signal. Audio and visual features play a complementary role in noise reduction process and its positive effect is obvious in obtained results. The experimental results indicate that under the noisy environment, especially in real-time applications, the proposed noise reduction scheme improves the quality of the extracted noise-free sound in comparison with other algorithms.
音频噪音没有独特的定义,但一般来说,它包括背景和环境声音,如物体运动,动物声音等。这些声音会分散听者的注意力,导致丢失主要内容。降噪是去除这些不需要的声音并提取音频源的清晰无噪声声音的过程。针对这一问题提出的方法都面临着一些挑战,如残余噪声、低速度性能、分离模糊等。本文提出了一种消除视听数据中噪声信号的自动化系统。该系统利用主要声源(乐器)的音频和视觉特征来馈送其两个内部基于DNN的模型:a)目标检测和b)声音分离模型。首先,利用迁移学习方法设计的目标检测模型对视频帧中的声源进行识别。然后根据检测到的声源,对噪声信号应用特定的声音分离模型,提取出无噪声的音频信号。音频和视觉特征在降噪过程中起着互补的作用,所获得的结果表明其积极作用是明显的。实验结果表明,在噪声环境下,特别是在实时应用中,与其他算法相比,所提出的降噪方案提高了提取的无噪声声音的质量。
{"title":"An Audio-Visual System for Sound Noise Reduction Based on Deep Neural Networks","authors":"Seyedeh Sogand Hashemi, M. Asadi, M. Aghabozorgi","doi":"10.1109/ICSPIS54653.2021.9729351","DOIUrl":"https://doi.org/10.1109/ICSPIS54653.2021.9729351","url":null,"abstract":"Audio noise has no unique definition, but in general, it includes background and environmental sounds such as objects movements, animal sounds, and etc. These sounds distract listeners and lead to loss of main content. Noise reduction is a process for removing such these unwanted sounds and extracts clear noise-free sound of an audio source. All proposed methods for this problem deal with some challenges such as residual noise, low speed performance, ambiguity in separation. In this paper an automated system is proposed to eliminate noise signal from noisy audio of an audio-visual data. This system utilizes audio and visual features of main sound source (musical instruments) to feed its two internal DNN based models: a) object detection and b) sound separation model. First, an object detection model which is designed by transfer learning method is used to identify sound source in video frames. Then based on detected source, a specific sound separation model is applied to noisy signal and extracts the noise-free audio signal. Audio and visual features play a complementary role in noise reduction process and its positive effect is obvious in obtained results. The experimental results indicate that under the noisy environment, especially in real-time applications, the proposed noise reduction scheme improves the quality of the extracted noise-free sound in comparison with other algorithms.","PeriodicalId":286966,"journal":{"name":"2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127838824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Group Signature Based Federated Learning in Smart Grids 基于群签名的智能电网联邦学习
Pub Date : 2021-12-29 DOI: 10.1109/ICSPIS54653.2021.9729381
Sneha Kanchan, Ajit Kumar, A. Saqib, B. Choi
Smart Grids are the need of today's energy distribution system, which maintains a systematic communication between suppliers and consumers. Often these grids need to communicate to the Human Machine Interface (HMI) server regarding their findings of the customer needs and availability. However, some external entities might compromise the HMI server, which tends to misuse smart grids' personal information. Hence, the grids should not reveal their or their customer's identity to the server. Federated Learning (FL) can solve this situation where the data from various smart grids can be collected without disclosing the grid's identity. We have proposed a group signature-based federated signature-based in which grid components sign with the group signature instead of their personal signatures. We have verified the security of our algorithm with the Automated Validation of Internet Security Protocols and Applications (AVISPA) simulator.
智能电网是当今能源分配系统的需要,它在供应商和消费者之间保持系统的通信。通常,这些网格需要与人机界面(HMI)服务器进行通信,以了解客户需求和可用性的发现。然而,一些外部实体可能会危及HMI服务器,这往往会滥用智能电网的个人信息。因此,网格不应该向服务器透露它们或它们的客户的身份。联邦学习(FL)可以解决这种情况,即可以在不泄露网格身份的情况下收集来自各种智能网格的数据。我们提出了一种基于组签名的联邦签名,其中网格组件使用组签名而不是个人签名进行签名。我们已经用互联网安全协议和应用程序的自动验证(AVISPA)模拟器验证了我们算法的安全性。
{"title":"Group Signature Based Federated Learning in Smart Grids","authors":"Sneha Kanchan, Ajit Kumar, A. Saqib, B. Choi","doi":"10.1109/ICSPIS54653.2021.9729381","DOIUrl":"https://doi.org/10.1109/ICSPIS54653.2021.9729381","url":null,"abstract":"Smart Grids are the need of today's energy distribution system, which maintains a systematic communication between suppliers and consumers. Often these grids need to communicate to the Human Machine Interface (HMI) server regarding their findings of the customer needs and availability. However, some external entities might compromise the HMI server, which tends to misuse smart grids' personal information. Hence, the grids should not reveal their or their customer's identity to the server. Federated Learning (FL) can solve this situation where the data from various smart grids can be collected without disclosing the grid's identity. We have proposed a group signature-based federated signature-based in which grid components sign with the group signature instead of their personal signatures. We have verified the security of our algorithm with the Automated Validation of Internet Security Protocols and Applications (AVISPA) simulator.","PeriodicalId":286966,"journal":{"name":"2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130811864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
WhisperNet: Deep Siamese Network For Emotion and Speech Tempo Invariant Visual-Only Lip-Based Biometric WhisperNet:深度暹罗网络的情感和语音节奏不变的视觉仅基于嘴唇的生物识别
Pub Date : 2021-12-29 DOI: 10.1109/ICSPIS54653.2021.9729394
Abdollah Zakeri, H. Hassanpour
In the recent decade, the field of biometrics was revolutionized thanks to the rise of deep learning. Many improvements were done on old biometric methods which reduced the security concerns. Before biometric people verification methods like facial recognition, an imposter could access people's vital information simply by finding out their password via installing a key-logger on their system. Thanks to deep learning, safer biometric approaches to person verification and person re-identification like visual authentication and audio-visual authentication were made possible and applicable on many devices like smartphones and laptops. Unfortunately, facial recognition is considered to be a threat to personal privacy by some people. Additionally, biometric methods that use the audio modality are not always applicable due to reasons like audio noise present in the environment. Lip-based biometric authentication (LBBA) is the process of authenticating a person using a video of their lips' movement while talking. In order to solve the mentioned concerns about other biometric authentication methods, we can use a visual-only LBBA method. Since people might have different emotional states that could potentially affect their utterance and speech tempo, the audio-only LBBA method must be able to produce an emotional and speech tempo invariant embedding of the input utterance video. In this article, we proposed a network inspired by the Siamese architecture that learned to produce emotion and speech tempo invariant representations of the input utterance videos. In order to train and test our proposed network, we used the CREMA-D dataset and achieved 95.41 % accuracy on the validation set.
近十年来,由于深度学习的兴起,生物识别领域发生了革命性的变化。对旧的生物识别方法进行了许多改进,减少了安全问题。在面部识别等生物识别技术出现之前,冒名顶替者只需在人们的系统上安装键盘记录器,找出密码,就能获取人们的重要信息。由于深度学习,更安全的生物识别方法可以用于人员验证和人员再识别,如视觉认证和视听认证,并适用于智能手机和笔记本电脑等许多设备。不幸的是,面部识别被一些人认为是对个人隐私的威胁。此外,由于环境中存在音频噪声等原因,使用音频模态的生物识别方法并不总是适用。基于嘴唇的生物识别认证(LBBA)是一种利用说话时嘴唇运动的视频来验证一个人身份的过程。为了解决上述对其他生物识别认证方法的担忧,我们可以使用仅视觉的LBBA方法。由于人们可能有不同的情绪状态,这可能会影响他们的话语和语音节奏,因此纯音频LBBA方法必须能够对输入的话语视频产生情绪和语音节奏不变的嵌入。在本文中,我们提出了一个受Siamese架构启发的网络,该网络学会了对输入的话语视频产生情感和语音节奏不变的表示。为了训练和测试我们提出的网络,我们使用CREMA-D数据集,在验证集上达到95.41%的准确率。
{"title":"WhisperNet: Deep Siamese Network For Emotion and Speech Tempo Invariant Visual-Only Lip-Based Biometric","authors":"Abdollah Zakeri, H. Hassanpour","doi":"10.1109/ICSPIS54653.2021.9729394","DOIUrl":"https://doi.org/10.1109/ICSPIS54653.2021.9729394","url":null,"abstract":"In the recent decade, the field of biometrics was revolutionized thanks to the rise of deep learning. Many improvements were done on old biometric methods which reduced the security concerns. Before biometric people verification methods like facial recognition, an imposter could access people's vital information simply by finding out their password via installing a key-logger on their system. Thanks to deep learning, safer biometric approaches to person verification and person re-identification like visual authentication and audio-visual authentication were made possible and applicable on many devices like smartphones and laptops. Unfortunately, facial recognition is considered to be a threat to personal privacy by some people. Additionally, biometric methods that use the audio modality are not always applicable due to reasons like audio noise present in the environment. Lip-based biometric authentication (LBBA) is the process of authenticating a person using a video of their lips' movement while talking. In order to solve the mentioned concerns about other biometric authentication methods, we can use a visual-only LBBA method. Since people might have different emotional states that could potentially affect their utterance and speech tempo, the audio-only LBBA method must be able to produce an emotional and speech tempo invariant embedding of the input utterance video. In this article, we proposed a network inspired by the Siamese architecture that learned to produce emotion and speech tempo invariant representations of the input utterance videos. In order to train and test our proposed network, we used the CREMA-D dataset and achieved 95.41 % accuracy on the validation set.","PeriodicalId":286966,"journal":{"name":"2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130595483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Intelligent Filtering of Graph Shells in the Problem of Influence Maximization Based on the Independent Cascade Model 影响最大化问题中基于独立级联模型的图壳智能过滤
Pub Date : 2021-12-29 DOI: 10.1109/ICSPIS54653.2021.9729376
F. Kazemzadeh, Amir Karian, A. Safaei, M. Mirzarezaee
In social networks, the problem of influence maximization seeks for a solution to find individuals or nodes in different communities so that they can diffuse information influence among a wide range of other nodes. The proposed algorithms for influence maximization problem have many drawbacks. For example, the computational overhead is very high and also the seed nodes is not selected optimally. For this reason, the influence does not spread totally in the social network.for solving the problem, This paper provides the SFIM algorithm and uses the idea of layering community nodes and identifying valuable layers to limit the search space. The operation is continued only on nodes of valuable layers, which significantly reduces the algorithm's runtime. Then, the best set of influential nodes with the highest accuracy is found by considering the main criteria of centrality topology such as harmonic and degree. Accuracy in selecting a node is one of the most important needs of the problem that is best answered. Moreover, different experiments and datasets indicate that this algorithm can provide the best efficiency required to solve the problem compared to other algorithms.
在社交网络中,影响力最大化问题寻求在不同社区中找到个体或节点的解决方案,使其能够将信息影响力扩散到更大范围的其他节点。所提出的求解影响最大化问题的算法存在许多缺陷。例如,计算开销非常高,而且种子节点的选择也不是最优的。因此,影响在社交网络中并没有完全传播。为了解决这一问题,本文提出了SFIM算法,并采用分层社区节点和识别有价值层的思想来限制搜索空间。仅在有价值层的节点上继续操作,大大减少了算法的运行时间。然后,综合考虑中心性拓扑的谐波和度等主要准则,找到精度最高的最佳影响节点集;选择节点的准确性是得到最佳答案的问题最重要的需求之一。此外,不同的实验和数据集表明,与其他算法相比,该算法可以提供解决问题所需的最佳效率。
{"title":"Intelligent Filtering of Graph Shells in the Problem of Influence Maximization Based on the Independent Cascade Model","authors":"F. Kazemzadeh, Amir Karian, A. Safaei, M. Mirzarezaee","doi":"10.1109/ICSPIS54653.2021.9729376","DOIUrl":"https://doi.org/10.1109/ICSPIS54653.2021.9729376","url":null,"abstract":"In social networks, the problem of influence maximization seeks for a solution to find individuals or nodes in different communities so that they can diffuse information influence among a wide range of other nodes. The proposed algorithms for influence maximization problem have many drawbacks. For example, the computational overhead is very high and also the seed nodes is not selected optimally. For this reason, the influence does not spread totally in the social network.for solving the problem, This paper provides the SFIM algorithm and uses the idea of layering community nodes and identifying valuable layers to limit the search space. The operation is continued only on nodes of valuable layers, which significantly reduces the algorithm's runtime. Then, the best set of influential nodes with the highest accuracy is found by considering the main criteria of centrality topology such as harmonic and degree. Accuracy in selecting a node is one of the most important needs of the problem that is best answered. Moreover, different experiments and datasets indicate that this algorithm can provide the best efficiency required to solve the problem compared to other algorithms.","PeriodicalId":286966,"journal":{"name":"2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)","volume":"67 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124247341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Ascertainment of Appropriate GRC Structure for Two Area Thermal System under Seagull Optimization based 2DOF-PID Controller 基于海鸥优化的2DOF-PID控制器下两区热系统GRC结构的确定
Pub Date : 2021-12-29 DOI: 10.1109/ICSPIS54653.2021.9729332
C. S. Kalyan, Khamruddin Syed, B. S. Goud, Ch. Rami Reddy, Hossein Shahinzadeh, G. Gharehpetian
This paper attempted to assess the performances of two simulation models of generation rate constraint (GRC) for two area thermal systems to achieve optimal load frequency control (LFC). The dynamical models of GRC investigated in this work are coined as open loop and closed loop GRC which are extensively utilized by the researchers without providing specific analysis for their selection and suitability. This paper facilitates the selection of appropriate and most effective GRC structures based on dynamical analysis about the thermal system to obtain LFC optimally. Two area thermal system has been examined with different GRC models in the platform of MATLAB/SIMULINK under supervision of two degrees of freedom (DOF)-PID (2DOF-PID) controller optimized with seagull optimization algorithm (SOA). Simulation results demonstrate the most suitable GRC model for the thermal system to obtain optimal LFC.
为了实现最优负荷频率控制(LFC),本文试图对两种区域热系统的发电速率约束(GRC)仿真模型进行性能评估。本文研究的GRC动力学模型被称为开环GRC和闭环GRC,这些模型被研究人员广泛使用,但没有对其选择和适用性进行具体分析。在热系统动力学分析的基础上,选择最合适、最有效的GRC结构,以获得最优的LFC。在采用海鸥优化算法(SOA)优化的二自由度(DOF)-PID (2DOF-PID)控制器的监督下,在MATLAB/SIMULINK平台上对不同GRC模型下的两区域热系统进行了研究。仿真结果表明,最适合热系统的GRC模型可以获得最优的LFC。
{"title":"Ascertainment of Appropriate GRC Structure for Two Area Thermal System under Seagull Optimization based 2DOF-PID Controller","authors":"C. S. Kalyan, Khamruddin Syed, B. S. Goud, Ch. Rami Reddy, Hossein Shahinzadeh, G. Gharehpetian","doi":"10.1109/ICSPIS54653.2021.9729332","DOIUrl":"https://doi.org/10.1109/ICSPIS54653.2021.9729332","url":null,"abstract":"This paper attempted to assess the performances of two simulation models of generation rate constraint (GRC) for two area thermal systems to achieve optimal load frequency control (LFC). The dynamical models of GRC investigated in this work are coined as open loop and closed loop GRC which are extensively utilized by the researchers without providing specific analysis for their selection and suitability. This paper facilitates the selection of appropriate and most effective GRC structures based on dynamical analysis about the thermal system to obtain LFC optimally. Two area thermal system has been examined with different GRC models in the platform of MATLAB/SIMULINK under supervision of two degrees of freedom (DOF)-PID (2DOF-PID) controller optimized with seagull optimization algorithm (SOA). Simulation results demonstrate the most suitable GRC model for the thermal system to obtain optimal LFC.","PeriodicalId":286966,"journal":{"name":"2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)","volume":"99 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134083260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Web Content Extraction by Weighing the Fundamental Contextual Rules 权衡基本上下文规则的Web内容提取
Pub Date : 2021-12-29 DOI: 10.1109/ICSPIS54653.2021.9729342
Mahdi Mohammadi, M. Shayegan, Nima Latifi
Nowadays, data access, data sharing, data extraction and data usage have become a vital issue for technology experts. With the rapid growth of content on the Web, humans need new and up-to-date approaches for data extraction from the Web. However, there is much useless and unrelated information such as navigation panel, content table, propaganda, service catalogue, and menus in these pages. Thus, the web content is considered useful (original) and useless (secondary) content. Most receivers and final users search for useful content. This research presents a new approach to extract useful content from the Web. For this purpose, child nodes are selected as the original content by weighing the fundamental contextual rules method to DOM Tree's nodes. Overall, after standardizing web page and developing DOM Tree, the best child node of the parent node are selected according to a weighing algorithm; then, the best path and the best sample node are selected. The presented solution applied on several datasets shows high accuracy rate such as Precision, Recall and F factor are 0.992, 0.983 and 0.988, respectively.
如今,数据访问、数据共享、数据提取和数据使用已经成为技术专家面临的重要问题。随着Web上内容的快速增长,人们需要新的和最新的方法来从Web中提取数据。但是,这些页面中有很多无用和不相关的信息,如导航面板、内容表、宣传、服务目录、菜单等。因此,网络内容被认为是有用的(原始的)和无用的(次要的)内容。大多数接收者和最终用户搜索有用的内容。本研究提出了一种从网络中提取有用内容的新方法。为此,通过将基本上下文规则方法与DOM Tree的节点进行权衡,选择子节点作为原始内容。总体而言,在对网页进行标准化和开发DOM树后,根据权重算法选择父节点的最佳子节点;然后,选择最佳路径和最佳样本节点。该方法在多个数据集上的应用表明,准确率(Precision)、召回率(Recall)和F因子(F factor)分别达到0.992、0.983和0.988。
{"title":"Web Content Extraction by Weighing the Fundamental Contextual Rules","authors":"Mahdi Mohammadi, M. Shayegan, Nima Latifi","doi":"10.1109/ICSPIS54653.2021.9729342","DOIUrl":"https://doi.org/10.1109/ICSPIS54653.2021.9729342","url":null,"abstract":"Nowadays, data access, data sharing, data extraction and data usage have become a vital issue for technology experts. With the rapid growth of content on the Web, humans need new and up-to-date approaches for data extraction from the Web. However, there is much useless and unrelated information such as navigation panel, content table, propaganda, service catalogue, and menus in these pages. Thus, the web content is considered useful (original) and useless (secondary) content. Most receivers and final users search for useful content. This research presents a new approach to extract useful content from the Web. For this purpose, child nodes are selected as the original content by weighing the fundamental contextual rules method to DOM Tree's nodes. Overall, after standardizing web page and developing DOM Tree, the best child node of the parent node are selected according to a weighing algorithm; then, the best path and the best sample node are selected. The presented solution applied on several datasets shows high accuracy rate such as Precision, Recall and F factor are 0.992, 0.983 and 0.988, respectively.","PeriodicalId":286966,"journal":{"name":"2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130433588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Designing the Communication Infrastructures for Democratizing the Coverage Time of Connected Vehicles 实现车联网覆盖时间民主化的通信基础设施设计
Pub Date : 2021-12-29 DOI: 10.1109/ICSPIS54653.2021.9729383
Somayeh Mokhtari, C. M. Silva, J. Nogueira
Vehicular communication can be highly improved by providing network access points distributed along with the road network. Such access points for vehicles are commonly referred to as roadside units and provide a backbone integrating the whole vehicular network. However, to avoid wasting resources and maximizing network efficiency, the locations where these units are installed need special attention in such networks' design. In this work, we propose two novel strategies (Partial Time Information (PTI) and Maximum Coverage Time (MCT)) to deploy a predefined number of roadside units seeking to maximize the number of distinct vehicles crossing covered areas during a given time threshold. Instead of relying on the full trajectory of vehicles, which may incur privacy issues, the PTI strategy utilizes duplicate coverage time ratios between urban regions to infer the best locations for deploying the roadside units, while the MCT operates in the lack of mobility information. As a baseline, we consider the FPF strategy, which regards a Markovian approach for the flow of vehicles. Simulation results based on the data traffic set of Cologne, Germany demonstrate that the proposed approaches increase the vehicle to infrastructure connection time in comparison to FPF.
通过提供与道路网络一起分布的网络接入点,可以大大改善车辆通信。这种车辆接入点通常被称为路边单元,并提供集成整个车辆网络的主干。然而,为了避免资源浪费和最大化网络效率,在此类网络的设计中需要特别注意这些单元的安装位置。在这项工作中,我们提出了两种新颖的策略(部分时间信息(PTI)和最大覆盖时间(MCT))来部署预定义数量的路边单元,以寻求在给定的时间阈值内最大化穿越覆盖区域的不同车辆数量。PTI策略不依赖于车辆的完整轨迹(这可能会导致隐私问题),而是利用城市区域之间重复的覆盖时间比来推断部署路边单元的最佳位置,而MCT则在缺乏移动信息的情况下运行。作为基准,我们考虑了FPF策略,该策略考虑了车辆流的马尔可夫方法。基于德国科隆数据交通集的仿真结果表明,与FPF相比,所提出的方法增加了车辆与基础设施的连接时间。
{"title":"Designing the Communication Infrastructures for Democratizing the Coverage Time of Connected Vehicles","authors":"Somayeh Mokhtari, C. M. Silva, J. Nogueira","doi":"10.1109/ICSPIS54653.2021.9729383","DOIUrl":"https://doi.org/10.1109/ICSPIS54653.2021.9729383","url":null,"abstract":"Vehicular communication can be highly improved by providing network access points distributed along with the road network. Such access points for vehicles are commonly referred to as roadside units and provide a backbone integrating the whole vehicular network. However, to avoid wasting resources and maximizing network efficiency, the locations where these units are installed need special attention in such networks' design. In this work, we propose two novel strategies (Partial Time Information (PTI) and Maximum Coverage Time (MCT)) to deploy a predefined number of roadside units seeking to maximize the number of distinct vehicles crossing covered areas during a given time threshold. Instead of relying on the full trajectory of vehicles, which may incur privacy issues, the PTI strategy utilizes duplicate coverage time ratios between urban regions to infer the best locations for deploying the roadside units, while the MCT operates in the lack of mobility information. As a baseline, we consider the FPF strategy, which regards a Markovian approach for the flow of vehicles. Simulation results based on the data traffic set of Cologne, Germany demonstrate that the proposed approaches increase the vehicle to infrastructure connection time in comparison to FPF.","PeriodicalId":286966,"journal":{"name":"2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134517684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Successive Wavenumber Filtering Approach for Defect Detection in CFRP using Wavefield Scanning 波场扫描CFRP缺陷检测的连续波数滤波方法
Pub Date : 2021-12-29 DOI: 10.1109/ICSPIS54653.2021.9729336
Erfan Basiri, Reza P. R. Hasanzadeh, M. Kersemans
Owing to the high sensitivity of carbon fiber reinforced polymer (CFRP) to internal damages, defect detection through Non-destructive testing (NDT) is deemed an essential task. One of the common methods in NDT to achieve this aim is measuring and analyzing the full-field guided waves propagation in CFRP plates. Scattered waves corresponding to deep defects are usually obscured by other waves due to their weak amplitude. A successful method to highlight these waves is to use wavenumber filtering (WF). However, WF suffers from the assumption that the optimal frequency range of excitation signal is known beforehand, which is not always available. Another drawback is that when more than one type of guided waves mode exist, this method is not capable of highlighting desirable waves or vibrations sufficiently. In this paper, full wavefield images are first constituted by exciting the guided waves via broadband chirp signal and registering them with scanning laser Doppler vibrometery (SLDV). Then, a successive wavenumber filtering (SWF) approach is introduced, which efficiently removes undesirable higher order guided wave modes, and removes the need to know a priori the optimal excitation frequency. Moreover, it is quantitatively and qualitatively shown that the proposed approach could lead to better discrimination between damaged and healthy area than conventional WF.
由于碳纤维增强聚合物(CFRP)对内部损伤的高度敏感性,通过无损检测(NDT)进行缺陷检测被认为是一项必不可少的任务。在无损检测中实现这一目标的常用方法之一是测量和分析CFRP板中的全场导波传播。深缺陷对应的散射波由于振幅较弱,通常被其他波掩盖。突出显示这些波的一个成功方法是使用波数滤波(WF)。然而,WF的缺点是假设激励信号的最佳频率范围是事先已知的,这并不总是可行的。另一个缺点是,当存在多种导波模式时,这种方法不能充分突出所需的波或振动。本文首先利用宽带啁啾信号对导波进行激励,并用扫描激光多普勒测振仪(SLDV)对其进行配准,形成了全波场图像。然后,引入连续波数滤波(SWF)方法,有效地去除不需要的高阶导波模式,并且无需先验地知道最优激励频率。此外,定量和定性结果表明,该方法比传统WF方法能更好地区分受损区和健康区。
{"title":"A Successive Wavenumber Filtering Approach for Defect Detection in CFRP using Wavefield Scanning","authors":"Erfan Basiri, Reza P. R. Hasanzadeh, M. Kersemans","doi":"10.1109/ICSPIS54653.2021.9729336","DOIUrl":"https://doi.org/10.1109/ICSPIS54653.2021.9729336","url":null,"abstract":"Owing to the high sensitivity of carbon fiber reinforced polymer (CFRP) to internal damages, defect detection through Non-destructive testing (NDT) is deemed an essential task. One of the common methods in NDT to achieve this aim is measuring and analyzing the full-field guided waves propagation in CFRP plates. Scattered waves corresponding to deep defects are usually obscured by other waves due to their weak amplitude. A successful method to highlight these waves is to use wavenumber filtering (WF). However, WF suffers from the assumption that the optimal frequency range of excitation signal is known beforehand, which is not always available. Another drawback is that when more than one type of guided waves mode exist, this method is not capable of highlighting desirable waves or vibrations sufficiently. In this paper, full wavefield images are first constituted by exciting the guided waves via broadband chirp signal and registering them with scanning laser Doppler vibrometery (SLDV). Then, a successive wavenumber filtering (SWF) approach is introduced, which efficiently removes undesirable higher order guided wave modes, and removes the need to know a priori the optimal excitation frequency. Moreover, it is quantitatively and qualitatively shown that the proposed approach could lead to better discrimination between damaged and healthy area than conventional WF.","PeriodicalId":286966,"journal":{"name":"2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128049697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid Deep Learning Method Based on LSTM-Autoencoder Network for Household Short-term Load Forecasting 基于lstm -自编码器网络的家庭短期负荷预测混合深度学习方法
Pub Date : 2021-12-29 DOI: 10.1109/ICSPIS54653.2021.9729378
Arghavan Irankhah, Sahar Rezazadeh, M. Moghaddam, Sara Ershadi-Nasab
Energy prediction is an essential task in smart homes for demand-side management and energy consumption reduction. Therefore, an intelligent forecasting model is necessary for predicting demand-side energy in residential buildings. Recent studies have shown that deep learning networks have higher performance than traditional machine learning methods in short-term load forecasting. In this paper, a new hybrid network is proposed that consists of Auto-Encoder LSTM layer, Bi-LSTM layer, stack of LSTM layer, and finally Fully connected layer. The experiments are conducted on an individual household electric power consumption dataset and the results demonstrate that the proposed network has the smallest value in terms of root mean square error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE) in comparison with other state-of-the-art approaches.
能源预测是智能家居需求侧管理和降低能耗的重要任务。因此,住宅建筑需求侧能源预测需要智能预测模型。最近的研究表明,深度学习网络在短期负荷预测方面比传统的机器学习方法具有更高的性能。本文提出了一种新的混合网络,它由自编码器LSTM层、双LSTM层、LSTM层堆栈和完全连接层组成。实验在单个家庭电力消耗数据集上进行,结果表明,与其他最先进的方法相比,所提出的网络在均方根误差(RMSE)、平均绝对误差(MAE)和平均绝对百分比误差(MAPE)方面具有最小的值。
{"title":"Hybrid Deep Learning Method Based on LSTM-Autoencoder Network for Household Short-term Load Forecasting","authors":"Arghavan Irankhah, Sahar Rezazadeh, M. Moghaddam, Sara Ershadi-Nasab","doi":"10.1109/ICSPIS54653.2021.9729378","DOIUrl":"https://doi.org/10.1109/ICSPIS54653.2021.9729378","url":null,"abstract":"Energy prediction is an essential task in smart homes for demand-side management and energy consumption reduction. Therefore, an intelligent forecasting model is necessary for predicting demand-side energy in residential buildings. Recent studies have shown that deep learning networks have higher performance than traditional machine learning methods in short-term load forecasting. In this paper, a new hybrid network is proposed that consists of Auto-Encoder LSTM layer, Bi-LSTM layer, stack of LSTM layer, and finally Fully connected layer. The experiments are conducted on an individual household electric power consumption dataset and the results demonstrate that the proposed network has the smallest value in terms of root mean square error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE) in comparison with other state-of-the-art approaches.","PeriodicalId":286966,"journal":{"name":"2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)","volume":"06 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127273991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Countermeasure Based on CQT Spectrogram for Deepfake Speech Detection 基于CQT谱图的深度假语音检测对策
Pub Date : 2021-12-29 DOI: 10.1109/ICSPIS54653.2021.9729387
Pedram Abdzadeh Ziabary, H. Veisi
Nowadays, biometrics like face, voice, fingerprint, and iris are widely used for the identity authentication of individuals. Automatic Speaker Verification (ASV) systems aim to verify the speaker's authenticity, but recent research has shown that they are vulnerable to various types of attacks. A large number of Text-To-Speech (TTS) and Voice Conversion (VC) methods are being used to create the so-called synthetic or deepfake speech. In recent years, numerous works have been proposed to improve the spoofing detection ability to protect ASV systems against these attacks. This work proposes a synthetic speech detection system, which uses the spectrogram of Constant Q Transform (CQT) as its input features. The CQT spectrogram provides a constant Q factor in different frequency regions similar to the human perception system. Also, compared with Short-Term Fourier Transform (STFT), CQT provides higher time resolution at higher frequencies and higher frequency resolution at lower frequencies. Additionally, the CQT spectrogram has brought us low input feature dimensions, which aids with reducing needed computation time. The Constant Q Cepstral Coefficients (CQCC) features, driven from cepstral analysis of the CQT, have been employed in some recent works for voice spoofing detection. However, to the best of our knowledge, ours is the first work using CQT magnitude and power spectrogram directly for voice spoofing detection. We also use a combination of self-attended ResNet and one class learning to provide our model the robustness against unseen attacks. Finally, it is observed that even though using input features with relatively lower dimensions and reducing computation time, we can still obtain EER 3.53% and min t-DCF 0.10 on ASVspoof 2019 Logical Access (LA) dataset, which places our model among the top performers in this field.
如今,人脸、声音、指纹、虹膜等生物识别技术被广泛用于个人身份认证。自动说话人验证(ASV)系统旨在验证说话人的真实性,但最近的研究表明,它们很容易受到各种类型的攻击。大量的文本到语音(TTS)和语音转换(VC)方法被用来创造所谓的合成或深度假语音。近年来,人们提出了许多工作来提高欺骗检测能力,以保护自动驾驶汽车系统免受这些攻击。本文提出了一种以恒Q变换(CQT)谱图作为输入特征的合成语音检测系统。CQT频谱图提供了一个常数的Q因子在不同的频率区域类似于人类感知系统。此外,与短时傅里叶变换(STFT)相比,CQT在较高频率下提供更高的时间分辨率,在较低频率下提供更高的频率分辨率。此外,CQT谱图为我们带来了低输入特征维数,这有助于减少所需的计算时间。恒定Q倒谱系数(CQCC)特征是由CQT的倒谱分析驱动的,近年来已被用于语音欺骗检测。然而,据我们所知,我们是第一个直接使用CQT幅度和功率谱进行语音欺骗检测的工作。我们还使用了自参加ResNet和一个类学习的组合,以提供我们的模型对看不见的攻击的鲁棒性。最后,我们观察到,即使使用相对较低维度的输入特征并减少计算时间,我们仍然可以在ASVspoof 2019逻辑访问(LA)数据集上获得3.53%的EER和0.10的最小t-DCF,这使得我们的模型在该领域中表现最好。
{"title":"A Countermeasure Based on CQT Spectrogram for Deepfake Speech Detection","authors":"Pedram Abdzadeh Ziabary, H. Veisi","doi":"10.1109/ICSPIS54653.2021.9729387","DOIUrl":"https://doi.org/10.1109/ICSPIS54653.2021.9729387","url":null,"abstract":"Nowadays, biometrics like face, voice, fingerprint, and iris are widely used for the identity authentication of individuals. Automatic Speaker Verification (ASV) systems aim to verify the speaker's authenticity, but recent research has shown that they are vulnerable to various types of attacks. A large number of Text-To-Speech (TTS) and Voice Conversion (VC) methods are being used to create the so-called synthetic or deepfake speech. In recent years, numerous works have been proposed to improve the spoofing detection ability to protect ASV systems against these attacks. This work proposes a synthetic speech detection system, which uses the spectrogram of Constant Q Transform (CQT) as its input features. The CQT spectrogram provides a constant Q factor in different frequency regions similar to the human perception system. Also, compared with Short-Term Fourier Transform (STFT), CQT provides higher time resolution at higher frequencies and higher frequency resolution at lower frequencies. Additionally, the CQT spectrogram has brought us low input feature dimensions, which aids with reducing needed computation time. The Constant Q Cepstral Coefficients (CQCC) features, driven from cepstral analysis of the CQT, have been employed in some recent works for voice spoofing detection. However, to the best of our knowledge, ours is the first work using CQT magnitude and power spectrogram directly for voice spoofing detection. We also use a combination of self-attended ResNet and one class learning to provide our model the robustness against unseen attacks. Finally, it is observed that even though using input features with relatively lower dimensions and reducing computation time, we can still obtain EER 3.53% and min t-DCF 0.10 on ASVspoof 2019 Logical Access (LA) dataset, which places our model among the top performers in this field.","PeriodicalId":286966,"journal":{"name":"2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126366022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1