首页 > 最新文献

2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)最新文献

英文 中文
Rāga Recognition in Indian Carnatic Music Using Transfer Learning Rāga用迁移学习识别印度卡纳蒂克音乐
Pub Date : 2021-11-18 DOI: 10.1109/ICMSS53060.2021.9673599
S. Sreejith, R. Rajan
An important aspect of Indian Classical music (ICM) is rāga, which serves as a melodic framework for both traditions of classical music. In this work, we propose a CNN-based sliding window analysis on Mel-spectrogram for rāga recognition in Carnatic music. The important contribution of the work is that the proposed method neither requires pitch ex-traction nor metadata for the estimation of rāga. CNN learns the representation of rāga from the patterns in the Mel-spectrogram during training through a sliding-window analysis. We train and test the network on sliced-Mel-spectrogram of the original audio while the final decision is made on the audio as a whole. The performance is evaluated on 10 rāga s from the CompMusic dataset and the Ramanarunachalam carnatic youtube collection. Out of the two proposed schemes, aggregation-based VGG16 model reports a macro F1 measure of 0.61, which is comparable to the result obtained for the base-line sequence classification model.
印度古典音乐(ICM)的一个重要方面是rāga,它作为两个传统古典音乐的旋律框架。在这项工作中,我们提出了一种基于cnn的mel谱图滑动窗口分析,用于卡纳蒂克音乐中的rāga识别。这项工作的重要贡献在于,所提出的方法既不需要提取音高,也不需要元数据来估计rāga。CNN在训练过程中通过滑动窗口分析从mel谱图中的模式中学习rāga的表示。我们在原始音频的切片梅尔谱图上训练和测试网络,而最终决定是在整个音频上做出的。性能是在CompMusic数据集和Raman**am carnatic youtube集合的10 rāga s上进行评估的。在两种方案中,基于聚合的VGG16模型的宏观F1测度为0.61,与基线序列分类模型的结果相当。
{"title":"Rāga Recognition in Indian Carnatic Music Using Transfer Learning","authors":"S. Sreejith, R. Rajan","doi":"10.1109/ICMSS53060.2021.9673599","DOIUrl":"https://doi.org/10.1109/ICMSS53060.2021.9673599","url":null,"abstract":"An important aspect of Indian Classical music (ICM) is rāga, which serves as a melodic framework for both traditions of classical music. In this work, we propose a CNN-based sliding window analysis on Mel-spectrogram for rāga recognition in Carnatic music. The important contribution of the work is that the proposed method neither requires pitch ex-traction nor metadata for the estimation of rāga. CNN learns the representation of rāga from the patterns in the Mel-spectrogram during training through a sliding-window analysis. We train and test the network on sliced-Mel-spectrogram of the original audio while the final decision is made on the audio as a whole. The performance is evaluated on 10 rāga s from the CompMusic dataset and the Ramanarunachalam carnatic youtube collection. Out of the two proposed schemes, aggregation-based VGG16 model reports a macro F1 measure of 0.61, which is comparable to the result obtained for the base-line sequence classification model.","PeriodicalId":274597,"journal":{"name":"2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132800547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Review On Speech Authentication And Speaker Verification Methods 语音认证和说话人验证方法综述
Pub Date : 2021-11-18 DOI: 10.1109/ICMSS53060.2021.9673603
Jayant Gambhir, Vaishali Patil
This paper gives a brief review of the techniques available for speech authentication based on edit detection and speaker verification related to application in forensics. Many past research includes the traditional methods like ENF, DFT, STFT based analysis for speech authentication while some advanced emerging trends like SVM, Ensemble learning and Convolution Neural network have been employed for speech authentication in different application. Similarly, for speaker verification, features such as LPC, MFCC, i-vector, x-vector and techniques like Neural Network, Deep learning etc have been researched. This paper summaries the various experimental results and their merits and demerits from the research papers surveyed. This review is expected to guide towards making an unbiased decision in choosing most promising method for authentication of Speaker and Speech data in forensic application.
本文简要回顾了基于编辑检测和说话人验证的语音认证技术在法医学中的应用。许多过去的研究包括基于ENF、DFT、STFT分析的传统方法用于语音认证,而一些先进的新兴趋势如SVM、Ensemble learning和卷积神经网络被用于语音认证的不同应用。同样,对于说话人验证,研究了LPC、MFCC、i向量、x向量等特征以及神经网络、深度学习等技术。本文从所调查的研究论文中总结了各种实验结果及其优缺点。本综述旨在指导在法医应用中选择最有前途的说话人和语音数据认证方法时做出公正的决定。
{"title":"A Review On Speech Authentication And Speaker Verification Methods","authors":"Jayant Gambhir, Vaishali Patil","doi":"10.1109/ICMSS53060.2021.9673603","DOIUrl":"https://doi.org/10.1109/ICMSS53060.2021.9673603","url":null,"abstract":"This paper gives a brief review of the techniques available for speech authentication based on edit detection and speaker verification related to application in forensics. Many past research includes the traditional methods like ENF, DFT, STFT based analysis for speech authentication while some advanced emerging trends like SVM, Ensemble learning and Convolution Neural network have been employed for speech authentication in different application. Similarly, for speaker verification, features such as LPC, MFCC, i-vector, x-vector and techniques like Neural Network, Deep learning etc have been researched. This paper summaries the various experimental results and their merits and demerits from the research papers surveyed. This review is expected to guide towards making an unbiased decision in choosing most promising method for authentication of Speaker and Speech data in forensic application.","PeriodicalId":274597,"journal":{"name":"2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134403082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advanced Intrusion Detection Using Deep Learning-LSTM Network On Cloud Environment 云环境下基于深度学习- lstm网络的高级入侵检测
Pub Date : 2021-11-18 DOI: 10.1109/ICMSS53060.2021.9673607
P. Jisna, T. Jarin, P. Praveen
Cloud Computing is a favored choice of any IT organization in the current context since that provides flexibility and pay-per-use service to the users. Moreover, due to its open and inclusive architecture which is accessible to attackers. Security and privacy are a big roadblock to its success. For any IT organization, intrusion detection systems are essential to the detection and endurance of effective detection system against attacker aggressive attacks. To recognize minor occurrences and become significant breaches, a fully managed intrusion detection system is required. The most prevalent approach for intrusion detection on the cloud is the Intrusion Detection System (IDS). This research introduces a cloud-based deep learning-LSTM IDS model and evaluates it to a hybrid Stacked Contractive Auto Encoder (SCAE) + Support Vector Machine (SVM) IDS model. Deep learning algorithms like basic machine learning can be built to conduct attack detection and classification simultaneously. Also examine the detection methodologies used by certain existing intrusion detection systems. On two well-known Intrusion Detection datasets (KDD Cup 99 and NSL-KDD), our strategy outperforms current methods in terms of accurate detection.
在当前环境中,云计算是任何IT组织的首选,因为它为用户提供了灵活性和按使用付费的服务。此外,由于其开放和包容的架构,攻击者可以访问。安全和隐私是其成功的一大障碍。对于任何IT组织来说,入侵检测系统对于有效检测系统抵御攻击者的侵略性攻击至关重要。为了识别轻微的入侵事件并将其演变为重大的入侵,需要一个全面管理的入侵检测系统。云上最流行的入侵检测方法是入侵检测系统(IDS)。本研究引入了一种基于云的深度学习- lstm IDS模型,并将其评估为堆叠收缩自动编码器(SCAE) +支持向量机(SVM)混合IDS模型。像基础机器学习这样的深度学习算法可以同时进行攻击检测和分类。同时研究某些现有入侵检测系统所使用的检测方法。在两个著名的入侵检测数据集(KDD Cup 99和NSL-KDD)上,我们的策略在准确检测方面优于当前的方法。
{"title":"Advanced Intrusion Detection Using Deep Learning-LSTM Network On Cloud Environment","authors":"P. Jisna, T. Jarin, P. Praveen","doi":"10.1109/ICMSS53060.2021.9673607","DOIUrl":"https://doi.org/10.1109/ICMSS53060.2021.9673607","url":null,"abstract":"Cloud Computing is a favored choice of any IT organization in the current context since that provides flexibility and pay-per-use service to the users. Moreover, due to its open and inclusive architecture which is accessible to attackers. Security and privacy are a big roadblock to its success. For any IT organization, intrusion detection systems are essential to the detection and endurance of effective detection system against attacker aggressive attacks. To recognize minor occurrences and become significant breaches, a fully managed intrusion detection system is required. The most prevalent approach for intrusion detection on the cloud is the Intrusion Detection System (IDS). This research introduces a cloud-based deep learning-LSTM IDS model and evaluates it to a hybrid Stacked Contractive Auto Encoder (SCAE) + Support Vector Machine (SVM) IDS model. Deep learning algorithms like basic machine learning can be built to conduct attack detection and classification simultaneously. Also examine the detection methodologies used by certain existing intrusion detection systems. On two well-known Intrusion Detection datasets (KDD Cup 99 and NSL-KDD), our strategy outperforms current methods in terms of accurate detection.","PeriodicalId":274597,"journal":{"name":"2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133655423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
High Quality Design-Verification of APB based Switch Interface 基于APB的交换机接口的高质量设计验证
Pub Date : 2021-11-18 DOI: 10.1109/ICMSS53060.2021.9673640
Raghuttam K Kulkarni, Deepali Koppad
Design re-usability and Verification re-usability are two major aspects which helps in reducing the time required for the design to reach the market by satisfying the necessary design requirements. Intellectual property (IP's) involved in the design must be re-usable as a golden-reference model. Re-using the pre-silicon validation environment over various design domains saves the time required for the design to reach the market furthermore and helps in improving the overall Design-Verification Standards Verification environment re-use for different applications with different interface is done by developing a wrapper around the design. Increase in the design complexity has led to the development of more robust and reusable verification environment like UVM and OVM methodologies. This paper focuses on carrying a high-quality Design-Verification process by using UVM methodology and perform organized effort in doing considerable amount of work in Design-Verification domain by considering a switch interfaced with APB protocol.
设计可重用性和验证可重用性是两个主要方面,它们有助于通过满足必要的设计需求来减少设计进入市场所需的时间。设计中涉及的知识产权(IP)必须可作为黄金参考模型重用。在不同的设计领域中重用预硅验证环境,进一步节省了设计进入市场所需的时间,并有助于改进整体设计验证标准。对于具有不同接口的不同应用程序,验证环境的重用是通过在设计周围开发包装器来完成的。设计复杂性的增加导致了更健壮和可重用的验证环境的开发,如UVM和OVM方法。本文着重于通过使用UVM方法进行高质量的设计验证过程,并通过考虑与APB协议接口的交换机,在设计验证领域进行大量的工作。
{"title":"High Quality Design-Verification of APB based Switch Interface","authors":"Raghuttam K Kulkarni, Deepali Koppad","doi":"10.1109/ICMSS53060.2021.9673640","DOIUrl":"https://doi.org/10.1109/ICMSS53060.2021.9673640","url":null,"abstract":"Design re-usability and Verification re-usability are two major aspects which helps in reducing the time required for the design to reach the market by satisfying the necessary design requirements. Intellectual property (IP's) involved in the design must be re-usable as a golden-reference model. Re-using the pre-silicon validation environment over various design domains saves the time required for the design to reach the market furthermore and helps in improving the overall Design-Verification Standards Verification environment re-use for different applications with different interface is done by developing a wrapper around the design. Increase in the design complexity has led to the development of more robust and reusable verification environment like UVM and OVM methodologies. This paper focuses on carrying a high-quality Design-Verification process by using UVM methodology and perform organized effort in doing considerable amount of work in Design-Verification domain by considering a switch interfaced with APB protocol.","PeriodicalId":274597,"journal":{"name":"2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121835562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implementation of Hack ALU using Quantum Dot Cellular Automata 利用量子点元胞自动机实现黑客ALU
Pub Date : 2021-11-18 DOI: 10.1109/ICMSS53060.2021.9673636
A. A. Aadhithya, J. Akshaya, K. G. Devi, B. Nithesh, G. J. Lal, K. P. Soman
Quantum Dot Cellular Automata (QCA) technology is one of the emerging next-generation nano-scale technologies, to subdue the limitations of existing CMOS technologies. As researchers continue to work hard to find an alternative to CMOS technology, QCA provides a solution for a faster computer with a smaller size and low power consumption. This paper implements an operation-rich “HACK ALU”, capable of performing many operations. Hack AL U is a versatile design that implements 64 op-erations with just 6 control bits along with novel implementations of adders and multiplexers. This ALU has been designed using a bottom-up approach by beginning the design by constructing the basic gates and progressing to the construction of adders, and multiplexers. The scope of the research is to provide an efficient ALU design that can be integrated in the CPU design. The present work has been implemented and tested on the QCA designer software. Experimental evaluation shows that logical and arithmetic operations are consistent in the proposed design.
量子点元胞自动机(QCA)技术是克服现有CMOS技术局限性的新一代纳米级技术之一。随着研究人员继续努力寻找CMOS技术的替代品,QCA为更快的计算机提供了一个更小尺寸和低功耗的解决方案。本文实现了一个操作丰富的“HACK ALU”,能够执行多种操作。Hack AL U是一个通用的设计,实现64个操作,只有6个控制位,以及新颖的加法器和多路复用器实现。该ALU采用自下而上的方法设计,从构建基本门开始设计,然后进行加法器和多路复用器的构建。研究的范围是提供一种可以集成在CPU设计中的高效ALU设计。本工作已在QCA设计软件上进行了实现和测试。实验评估表明,所提出的设计逻辑运算和算术运算是一致的。
{"title":"Implementation of Hack ALU using Quantum Dot Cellular Automata","authors":"A. A. Aadhithya, J. Akshaya, K. G. Devi, B. Nithesh, G. J. Lal, K. P. Soman","doi":"10.1109/ICMSS53060.2021.9673636","DOIUrl":"https://doi.org/10.1109/ICMSS53060.2021.9673636","url":null,"abstract":"Quantum Dot Cellular Automata (QCA) technology is one of the emerging next-generation nano-scale technologies, to subdue the limitations of existing CMOS technologies. As researchers continue to work hard to find an alternative to CMOS technology, QCA provides a solution for a faster computer with a smaller size and low power consumption. This paper implements an operation-rich “HACK ALU”, capable of performing many operations. Hack AL U is a versatile design that implements 64 op-erations with just 6 control bits along with novel implementations of adders and multiplexers. This ALU has been designed using a bottom-up approach by beginning the design by constructing the basic gates and progressing to the construction of adders, and multiplexers. The scope of the research is to provide an efficient ALU design that can be integrated in the CPU design. The present work has been implemented and tested on the QCA designer software. Experimental evaluation shows that logical and arithmetic operations are consistent in the proposed design.","PeriodicalId":274597,"journal":{"name":"2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123443373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of Parkinson's Disease Using Vocal Features: An Eigen Approach 使用声音特征检测帕金森病:一种特征方法
Pub Date : 2021-11-18 DOI: 10.1109/ICMSS53060.2021.9673634
Nakul S Pramod, L. Sajitha, Swathy Mohanlal, K. Thameem, S. M. Anzar
Parkinson's disease (PD) is a degenerative disorder of the human central nervous system that causes tremors and affects movement. Symptoms usually appear gradually over time. Researchers are seeking biomarkers for Parkinson's disease in the hopes of allowing for earlier detection and more tailored treatments to slow the disease's progression. Existing methods of diagnosis include Blood tests, MRI scans, and PET scans. However, these are highly time and resource-consuming. PD also shows an amble change in the voice patterns of a person. Hence, acoustic analysis of voice signals can indicate the progression of PD. This can be analysed using a trained classifier model, which provides an easy diagnosis of the disease. This paper analyses the performance of AI-ML models viz- Linear regression, Support Vector Machine (SVM), K-Nearest Neighbourhood (KNN), Ran-dom Forest, and XG Boost for the detection of Parkinson's disease using vocal feature sets. Experimental evaluations show that the Random Forest model produced an impressive accuracy of 100%. The classification algorithms' accuracy, precision, recall, F1-score, and Mathews Correlation Coefficient (MCC) are all examined. The Random Forest classifiers are 100% accurate, with an accuracy of 1.000, recall of 1.000, F1-score of 1.000, and MCC of 1.000. Implementing dimensionality reduction using the Eigen approach (Principal Component Analysis) and the dataset combination are the critical reasons for the reported high accuracy. The potential of this methodology is prominent as it can be used to diagnose various other diseases, such as asthma, cancer, and Alzheimer's disease.
帕金森氏症(PD)是一种人类中枢神经系统的退行性疾病,会引起震颤并影响运动。症状通常随着时间的推移逐渐出现。研究人员正在寻找帕金森氏症的生物标志物,希望能够更早发现和更有针对性的治疗,以减缓疾病的进展。现有的诊断方法包括血液检查、核磁共振扫描和PET扫描。然而,这些都非常耗时和消耗资源。PD还显示出一个人的声音模式有轻微的变化。因此,声音信号的声学分析可以指示PD的进展。这可以使用训练过的分类器模型进行分析,从而提供对疾病的简单诊断。本文分析了线性回归、支持向量机(SVM)、k近邻(KNN)、随机森林(random Forest)和XG Boost等AI-ML模型在使用声音特征集检测帕金森病中的性能。实验评估表明,随机森林模型产生了令人印象深刻的100%的准确率。对分类算法的正确率、精密度、查全率、f1得分和马修斯相关系数(MCC)进行了检验。随机森林分类器的准确率为100%,准确率为1.000,召回率为1.000,f1分数为1.000,MCC为1.000。使用特征方法(主成分分析)和数据集组合实现降维是报道的高精度的关键原因。这种方法的潜力是突出的,因为它可以用来诊断各种其他疾病,如哮喘、癌症和阿尔茨海默病。
{"title":"Detection of Parkinson's Disease Using Vocal Features: An Eigen Approach","authors":"Nakul S Pramod, L. Sajitha, Swathy Mohanlal, K. Thameem, S. M. Anzar","doi":"10.1109/ICMSS53060.2021.9673634","DOIUrl":"https://doi.org/10.1109/ICMSS53060.2021.9673634","url":null,"abstract":"Parkinson's disease (PD) is a degenerative disorder of the human central nervous system that causes tremors and affects movement. Symptoms usually appear gradually over time. Researchers are seeking biomarkers for Parkinson's disease in the hopes of allowing for earlier detection and more tailored treatments to slow the disease's progression. Existing methods of diagnosis include Blood tests, MRI scans, and PET scans. However, these are highly time and resource-consuming. PD also shows an amble change in the voice patterns of a person. Hence, acoustic analysis of voice signals can indicate the progression of PD. This can be analysed using a trained classifier model, which provides an easy diagnosis of the disease. This paper analyses the performance of AI-ML models viz- Linear regression, Support Vector Machine (SVM), K-Nearest Neighbourhood (KNN), Ran-dom Forest, and XG Boost for the detection of Parkinson's disease using vocal feature sets. Experimental evaluations show that the Random Forest model produced an impressive accuracy of 100%. The classification algorithms' accuracy, precision, recall, F1-score, and Mathews Correlation Coefficient (MCC) are all examined. The Random Forest classifiers are 100% accurate, with an accuracy of 1.000, recall of 1.000, F1-score of 1.000, and MCC of 1.000. Implementing dimensionality reduction using the Eigen approach (Principal Component Analysis) and the dataset combination are the critical reasons for the reported high accuracy. The potential of this methodology is prominent as it can be used to diagnose various other diseases, such as asthma, cancer, and Alzheimer's disease.","PeriodicalId":274597,"journal":{"name":"2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125531900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Non-uniform Region Based Features for Automatic Language Identification 基于非统一区域特征的语言自动识别
Pub Date : 2021-11-18 DOI: 10.1109/ICMSS53060.2021.9673629
Greeshma Unnikrishnan, A. George, L. Mary
An audio utterance can be identified as being spoken in a particular language by using automatic language identification (LID). Each language has its own phoneme set. Hence combination of these phonemes governed by phonotactics will help in distinguishing languages. In this work, we propose an automatic language identification system utilizing features derived from non-uniform speech regions to represent phonotac-tic differences among 4 Indian languages, namely Malayalam, Marathi, Assamese, and Kannada. For this, broad phoneme labels, namely approximant (A), closure (C), fricatives (F), nasals (N), plosive/stop (P), voiced stop (B), vowels (V), and silence (S) are obtained automatically by a broad phoneme classifier (BPC). It is a DNN-based classifier which uses hand-crafted features and Mel-frequency cepstral coefficients (MFCC). In order to automatically segment speech to smaller regions, first it is chopped at every silence regions using the labels obtained from BPC. Later, it is split again at the end of each vowel. Hence, small non-uniform regions are obtained which contain phoneme combinations that may be specific to the language of the utterance. From each region, only a fixed number of frames containing certain combination of phonemes are selected. A DNN classifier is trained using 13-dimensional MFCC features of 12 fixed frames of non-uniform regions for performing LID. An average accuracy of 97.03% is obtained for test utterances of 10 sec duration belonging to 4 languages.
通过使用自动语言识别(LID),可以将音频话语识别为以特定语言说话。每种语言都有自己的音素集。因此,这些由语音策略控制的音素组合将有助于区分语言。在这项工作中,我们提出了一个自动语言识别系统,利用来自非均匀语音区域的特征来表示4种印度语言(马拉雅拉姆语、马拉地语、阿萨姆语和卡纳达语)之间的语音差异。为此,广义音素标签,即近音(A)、闭音(C)、摩擦音(F)、鼻音(N)、爆音/顿音(P)、浊音顿音(B)、元音(V)和静音(S)是由广义音素分类器(BPC)自动获得的。它是一种基于dnn的分类器,使用手工制作的特征和mel频率倒谱系数(MFCC)。为了将语音自动分割成更小的区域,首先使用从BPC中获得的标签在每个沉默区域进行切分。之后,它在每个元音的末尾再次分裂。因此,获得了小的非均匀区域,其中包含可能特定于话语语言的音素组合。从每个区域中,只选择固定数量的包含特定音素组合的帧。利用12个非均匀区域的固定帧的13维MFCC特征训练DNN分类器进行LID。对4种语言的时长为10秒的测试话语,平均准确率为97.03%。
{"title":"Non-uniform Region Based Features for Automatic Language Identification","authors":"Greeshma Unnikrishnan, A. George, L. Mary","doi":"10.1109/ICMSS53060.2021.9673629","DOIUrl":"https://doi.org/10.1109/ICMSS53060.2021.9673629","url":null,"abstract":"An audio utterance can be identified as being spoken in a particular language by using automatic language identification (LID). Each language has its own phoneme set. Hence combination of these phonemes governed by phonotactics will help in distinguishing languages. In this work, we propose an automatic language identification system utilizing features derived from non-uniform speech regions to represent phonotac-tic differences among 4 Indian languages, namely Malayalam, Marathi, Assamese, and Kannada. For this, broad phoneme labels, namely approximant (A), closure (C), fricatives (F), nasals (N), plosive/stop (P), voiced stop (B), vowels (V), and silence (S) are obtained automatically by a broad phoneme classifier (BPC). It is a DNN-based classifier which uses hand-crafted features and Mel-frequency cepstral coefficients (MFCC). In order to automatically segment speech to smaller regions, first it is chopped at every silence regions using the labels obtained from BPC. Later, it is split again at the end of each vowel. Hence, small non-uniform regions are obtained which contain phoneme combinations that may be specific to the language of the utterance. From each region, only a fixed number of frames containing certain combination of phonemes are selected. A DNN classifier is trained using 13-dimensional MFCC features of 12 fixed frames of non-uniform regions for performing LID. An average accuracy of 97.03% is obtained for test utterances of 10 sec duration belonging to 4 languages.","PeriodicalId":274597,"journal":{"name":"2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114376805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Renewable Energy Generation-Solar and Wind, Challenges and Policies adopted in Karnataka, India 可再生能源发电——太阳能和风能,印度卡纳塔克邦的挑战和政策
Pub Date : 2021-11-18 DOI: 10.1109/ICMSS53060.2021.9673625
Zahira Tabassum, B.S. Chandrashekhar Shastry
Enhanced use of renewabIes in the power system will not only affect the energy source, but also the nature of how electricity grids are run. Smarter grids will not only be more efficient and make the transition to a more sustainable future easier, but they will also have the potential to alter the institutional connections that exist between generators, customers, and transmission and distribution corporations. India is one of the world's fastest-growing energy consumers, and it is working hard to boost renewable energy production. This research looks into the renewable energy scenario of Karnataka State. Karnataka is among the top five states of India in renewable energy generation. The research highlights the Karnataka government's strategies for increasing renewable energy's inclusion in the energy mix, as well as the obstacles and potential solutions for increasing renewable energy deployment across the state are presented. This research can be used as a guide for policymakers and researchers in other states and countries who seek to increase the use of renewable energy.
在电力系统中加强可再生能源的使用不仅会影响能源,还会影响电网运行的性质。智能电网不仅会提高效率,使向更可持续的未来过渡更容易,而且还将有可能改变存在于发电机、客户和输配电公司之间的制度联系。印度是世界上增长最快的能源消费国之一,它正在努力促进可再生能源的生产。本研究着眼于卡纳塔克邦的可再生能源情景。卡纳塔克邦是印度可再生能源发电量最大的五个邦之一。该研究强调了卡纳塔克邦政府在能源结构中增加可再生能源的战略,以及在全州范围内增加可再生能源部署的障碍和潜在解决方案。这项研究可以作为其他州和国家寻求增加可再生能源使用的政策制定者和研究人员的指南。
{"title":"Renewable Energy Generation-Solar and Wind, Challenges and Policies adopted in Karnataka, India","authors":"Zahira Tabassum, B.S. Chandrashekhar Shastry","doi":"10.1109/ICMSS53060.2021.9673625","DOIUrl":"https://doi.org/10.1109/ICMSS53060.2021.9673625","url":null,"abstract":"Enhanced use of renewabIes in the power system will not only affect the energy source, but also the nature of how electricity grids are run. Smarter grids will not only be more efficient and make the transition to a more sustainable future easier, but they will also have the potential to alter the institutional connections that exist between generators, customers, and transmission and distribution corporations. India is one of the world's fastest-growing energy consumers, and it is working hard to boost renewable energy production. This research looks into the renewable energy scenario of Karnataka State. Karnataka is among the top five states of India in renewable energy generation. The research highlights the Karnataka government's strategies for increasing renewable energy's inclusion in the energy mix, as well as the obstacles and potential solutions for increasing renewable energy deployment across the state are presented. This research can be used as a guide for policymakers and researchers in other states and countries who seek to increase the use of renewable energy.","PeriodicalId":274597,"journal":{"name":"2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117184110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Direction Of Arrival Estimation using Microphone Array 基于麦克风阵列的到达方向估计
Pub Date : 2021-11-18 DOI: 10.1109/ICMSS53060.2021.9673617
C. Junu Jahana, M. Sinith, PP Lalu
Human beings can tell the direction of sound by utilising both their ears. They may instinctively determine the di-rection of sound by combining the somewhat varied impulses that arrive at our ears. Similarly, an array of microphones connected to a computer may be used to create a sound localisation system. The basic idea behind utilising microphone arrays to estimate Direction Of Arrival (DOA) is to leverage phase information in signals picked up by spatially separated sensors (microphones). The acoustic signals arrive to the microphones with temporal delays when they are spatially distant. These time-delays are determined by the signal's DOA for a known array geometry. The audio signal is recorded using a miniDSP UMA-16 microphone array with plug and play USB audio connection. For linear arrays, the angle between the array's orientation and the sound source is calculated here. Given that the sound signal arrives at each microphone at various times, corresponding to different propagation paths, it's reasonable to infer that the recorded signals in each microphone have a Time Difference Of Arrival (TDOA), which is an important factor in microphone array processing. With the aid of the UMA16 microphone array, more visible and accurate sound source localization was achievable at lower sound power intensities, which was considered to be a significant innovation in the field of sound source localization. In addition, the DOA was calculated using an SVM classifier, which can categorise audio signals in coarse as left, right or front, and the performance metrics including accuracy, specificity and sensitivity are analysed.
人类可以用两只耳朵来辨别声音的方向。它们可能本能地通过结合到达我们耳朵的不同脉冲来确定声音的方向。类似地,连接到计算机上的麦克风阵列可用于创建声音定位系统。利用麦克风阵列估计到达方向(DOA)的基本思想是利用空间分离传感器(麦克风)拾取的信号中的相位信息。当空间距离较远时,声信号到达麦克风时具有时间延迟。这些时间延迟是由已知阵列几何形状的信号的DOA决定的。音频信号记录使用miniDSP UMA-16麦克风阵列与即插即用USB音频连接。对于线性阵列,此处计算阵列方向与声源之间的夹角。考虑到声音信号到达每个麦克风的时间不同,对应不同的传播路径,可以合理推断每个麦克风中记录的信号都有一个到达时差(Time Difference Of Arrival, TDOA),这是麦克风阵列处理中的一个重要因素。借助UMA16麦克风阵列,可以在较低的声功率强度下实现更清晰、更精确的声源定位,这被认为是声源定位领域的重大创新。此外,利用支持向量机分类器对粗音频信号进行左、右、前三种分类,并对其精度、特异性和灵敏度等性能指标进行了分析。
{"title":"Direction Of Arrival Estimation using Microphone Array","authors":"C. Junu Jahana, M. Sinith, PP Lalu","doi":"10.1109/ICMSS53060.2021.9673617","DOIUrl":"https://doi.org/10.1109/ICMSS53060.2021.9673617","url":null,"abstract":"Human beings can tell the direction of sound by utilising both their ears. They may instinctively determine the di-rection of sound by combining the somewhat varied impulses that arrive at our ears. Similarly, an array of microphones connected to a computer may be used to create a sound localisation system. The basic idea behind utilising microphone arrays to estimate Direction Of Arrival (DOA) is to leverage phase information in signals picked up by spatially separated sensors (microphones). The acoustic signals arrive to the microphones with temporal delays when they are spatially distant. These time-delays are determined by the signal's DOA for a known array geometry. The audio signal is recorded using a miniDSP UMA-16 microphone array with plug and play USB audio connection. For linear arrays, the angle between the array's orientation and the sound source is calculated here. Given that the sound signal arrives at each microphone at various times, corresponding to different propagation paths, it's reasonable to infer that the recorded signals in each microphone have a Time Difference Of Arrival (TDOA), which is an important factor in microphone array processing. With the aid of the UMA16 microphone array, more visible and accurate sound source localization was achievable at lower sound power intensities, which was considered to be a significant innovation in the field of sound source localization. In addition, the DOA was calculated using an SVM classifier, which can categorise audio signals in coarse as left, right or front, and the performance metrics including accuracy, specificity and sensitivity are analysed.","PeriodicalId":274597,"journal":{"name":"2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132079590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Colour Image Encryption Algorithm With Quantum Coding 基于量子编码的彩色图像加密算法
Pub Date : 2021-11-18 DOI: 10.1109/ICMSS53060.2021.9673627
Amani Theramban, Renjith V. Ravi
In image processing, quantum mechanics has wide applications. This paper proposes a lower-complexity encryption scheme based on Novel Enhanced Quantum Representation (NEQR) and bit-level adjacent exchange operations. The chaotic henon map and quantum coding were used to create this work. The image is further diffused by XORing key streams from the henon map. This scheme further undergoes a scrambling procedure using the key generated from SHA-256. The encryption procedure done step by step is applied separately on three channels R, G, and B, of a color image. Finally, performance analysis and simulation results confirm that the encryption scheme is robust and secure.
在图像处理中,量子力学有着广泛的应用。本文提出了一种基于新型增强量子表示(NEQR)和位级相邻交换操作的低复杂度加密方案。混沌henon映射和量子编码被用于创建这项工作。通过XORing来自henon映射的键流进一步扩散图像。该方案进一步使用SHA-256生成的密钥进行置乱过程。一步一步完成的加密过程分别应用于彩色图像的三个通道R、G和B。性能分析和仿真结果验证了该加密方案的鲁棒性和安全性。
{"title":"Colour Image Encryption Algorithm With Quantum Coding","authors":"Amani Theramban, Renjith V. Ravi","doi":"10.1109/ICMSS53060.2021.9673627","DOIUrl":"https://doi.org/10.1109/ICMSS53060.2021.9673627","url":null,"abstract":"In image processing, quantum mechanics has wide applications. This paper proposes a lower-complexity encryption scheme based on Novel Enhanced Quantum Representation (NEQR) and bit-level adjacent exchange operations. The chaotic henon map and quantum coding were used to create this work. The image is further diffused by XORing key streams from the henon map. This scheme further undergoes a scrambling procedure using the key generated from SHA-256. The encryption procedure done step by step is applied separately on three channels R, G, and B, of a color image. Finally, performance analysis and simulation results confirm that the encryption scheme is robust and secure.","PeriodicalId":274597,"journal":{"name":"2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129481938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2021 Fourth International Conference on Microelectronics, Signals & Systems (ICMSS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1