首页 > 最新文献

2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)最新文献

英文 中文
A Comprehensive Review on Asphalt Pavement Distress Detection and Assessment based on Artificial Intelligence 基于人工智能的沥青路面破损检测与评估综述
Rakshitha R, S. S
Road transportation system facilitates the movement of people, goods and contributes to the national economy. This pavement network keeps on growing as years pass on. The failure of pavement may be due to heavy traffic, sunlight exposure, the seasonal changes causes unequal expansion and contraction of the surface, water intrusion and quality of construction material. Hence there is a high demand for effective pavement maintenance and rehabilitation in early stages. A lot of research is actively being conducted on detection and assessment of pavement distress. The manual approach depends on expert knowledge, consumes lot of time and it lacks the objectivity for quantification, then the automated distress detection using 3D laser technology uses hardware equipment which requires huge budget investment, hence AI based pavement distress detection and quantification methods are proposed as replacement. This paper presents a review of papers from the repositories like Google Scholar, Scopus, MDPI, ASCE (American Society of Civil Engineers) library, Hindawi of past eight years based on image processing and deep learning techniques that performs the task of detection and quantification of distress type, and advantages and disadvantages of the existing system are outlined and accounts for the interdisciplinary research to provide information for civil and the computer science enthusiast to know about the research challenges in this field needed for future research.
道路运输系统方便了人员、货物的流动,对国民经济作出了贡献。随着时间的流逝,这个路面网络不断扩大。路面的破坏可能是由于繁忙的交通、阳光照射、季节变化引起的表面不均匀膨胀和收缩、水侵入和施工材料的质量。因此,在早期阶段对有效的路面维修和修复有很高的需求。对于路面损伤的检测与评估,人们进行了大量的研究。人工方法依赖专家知识,耗时长,且缺乏量化的客观性,而采用三维激光技术的路面破损自动检测使用硬件设备,预算投入巨大,因此提出基于人工智能的路面破损检测与量化方法作为替代。本文综述了b谷歌Scholar、Scopus、MDPI、ASCE(美国土木工程师学会)图书馆、Hindawi等知识库中过去8年的基于图像处理和深度学习技术的论文,这些技术执行了检测和量化遇灾类型的任务。并对现有系统的优缺点进行了概述和说明,为民用和计算机科学爱好者了解该领域未来研究需要面临的研究挑战提供信息。
{"title":"A Comprehensive Review on Asphalt Pavement Distress Detection and Assessment based on Artificial Intelligence","authors":"Rakshitha R, S. S","doi":"10.1109/UPCON56432.2022.9986460","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986460","url":null,"abstract":"Road transportation system facilitates the movement of people, goods and contributes to the national economy. This pavement network keeps on growing as years pass on. The failure of pavement may be due to heavy traffic, sunlight exposure, the seasonal changes causes unequal expansion and contraction of the surface, water intrusion and quality of construction material. Hence there is a high demand for effective pavement maintenance and rehabilitation in early stages. A lot of research is actively being conducted on detection and assessment of pavement distress. The manual approach depends on expert knowledge, consumes lot of time and it lacks the objectivity for quantification, then the automated distress detection using 3D laser technology uses hardware equipment which requires huge budget investment, hence AI based pavement distress detection and quantification methods are proposed as replacement. This paper presents a review of papers from the repositories like Google Scholar, Scopus, MDPI, ASCE (American Society of Civil Engineers) library, Hindawi of past eight years based on image processing and deep learning techniques that performs the task of detection and quantification of distress type, and advantages and disadvantages of the existing system are outlined and accounts for the interdisciplinary research to provide information for civil and the computer science enthusiast to know about the research challenges in this field needed for future research.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127400358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BEMSS- Blockwise Encryption based Multi Secret Sharing scheme for Securing Visual Content 基于块加密的多秘密共享视觉内容安全方案
Parul Saini, K. Kumar, Shamal Kashid, Abhishek Dhiman, Alok Negi
In recent years, the significant research area in the domain of multimedia security is image information security. This can be provided by converting it into a secret code through the process of encryption or hiding its true significance or a combination of both. It is required in many domains like defence, military, banking, education, etc, to keep these sensitive images from being hacked or destroyed. In this paper, BEMSS, an (n,n) multi-secret image encryption with a secret sharing scheme is proposed using blockwise encryption. In the proposed technique two-level encryption, modulo and inter blockwise encryption, are performed to the n secret images, and a two-level share generation process is performed to the encrypted images. BEMSS recovers n lossless the secret images by modulo and blockwise decryption process. Experimental results obtained show the effectiveness of the proposed BEMSS which provide a computationally efficient security scheme.
近年来,多媒体安全领域的重要研究方向是图像信息安全。这可以通过通过加密过程将其转换为密码或隐藏其真正意义或两者的组合来提供。在许多领域,如国防、军事、银行、教育等,都需要防止这些敏感图像被黑客攻击或破坏。本文提出了一种基于块加密的(n,n)多秘密图像加密共享方案BEMSS。在该技术中,对n个秘密图像进行了模加密和块间加密两级加密,并对加密图像进行了两级共享生成过程。BEMSS通过模解密和分组解密的方式对秘密图像进行无损恢复。实验结果表明了该方法的有效性,提供了一种计算效率高的安全方案。
{"title":"BEMSS- Blockwise Encryption based Multi Secret Sharing scheme for Securing Visual Content","authors":"Parul Saini, K. Kumar, Shamal Kashid, Abhishek Dhiman, Alok Negi","doi":"10.1109/UPCON56432.2022.9986417","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986417","url":null,"abstract":"In recent years, the significant research area in the domain of multimedia security is image information security. This can be provided by converting it into a secret code through the process of encryption or hiding its true significance or a combination of both. It is required in many domains like defence, military, banking, education, etc, to keep these sensitive images from being hacked or destroyed. In this paper, BEMSS, an (n,n) multi-secret image encryption with a secret sharing scheme is proposed using blockwise encryption. In the proposed technique two-level encryption, modulo and inter blockwise encryption, are performed to the n secret images, and a two-level share generation process is performed to the encrypted images. BEMSS recovers n lossless the secret images by modulo and blockwise decryption process. Experimental results obtained show the effectiveness of the proposed BEMSS which provide a computationally efficient security scheme.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128893094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Mel Spectrogram Based Automatic Speaker Verification Using GMM-UBM 基于Mel谱图的GMM-UBM自动说话人验证
T. Kumar, Ramesh Kumar Bhukya
Speech recognition refers to the technology that enables machines to recognize persons using their speech utterances. An automatic speaker verification (ASV) is included in one of the challenging task in speech community. The ASV system works based on the speaker recognition claimed against the model. In this paper, the system works as a text-independent speaker verification (TISV) and is outlined to verify the speaker using his/her voice samples. We followed two approaches, first approach is Gaussian Mixture Model (GMM) method is used to create speaker modeling and the second approach are GMMs created from training dataset, with Universal Background Model (UBM) used for adaptation of the dataset, well known approach for speaker verification (SV). GMM-UBMs are designed as well classifier for decision making. In both the approaches, the training is performed by the Expectation Maximization (EM) and Maximum A Posteriori (MAP) adaptation for better models respectively. The NIST 2003 database is evaluated using adapted GMM-UBM following NIST 2003 speaker recognition evaluation protocol and the relative performance improvement in the SV system using GMM and GMM-UBM in terms of EER are 9.43% and 8.88%.
语音识别是指使机器能够通过人的语音来识别人的技术。自动说话人验证(ASV)是语音社区中具有挑战性的任务之一。ASV系统的工作原理是基于对模型的说话人识别。在本文中,该系统作为文本无关的说话人验证(TISV),并概述了使用说话人的语音样本来验证说话人。我们采用了两种方法,第一种方法是使用高斯混合模型(GMM)方法创建说话人建模,第二种方法是使用通用背景模型(UBM)从训练数据集创建GMM,用于自适应数据集,这是一种众所周知的说话人验证方法(SV)。GMM-UBMs也被设计为决策的分类器。在这两种方法中,分别通过期望最大化(EM)和最大后验A (MAP)自适应对较好的模型进行训练。根据NIST 2003说话人识别评估协议,使用改进的GMM- ubm对NIST 2003数据库进行评估,使用GMM和GMM- ubm的SV系统在EER方面的相对性能提高为9.43%和8.88%。
{"title":"Mel Spectrogram Based Automatic Speaker Verification Using GMM-UBM","authors":"T. Kumar, Ramesh Kumar Bhukya","doi":"10.1109/UPCON56432.2022.9986424","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986424","url":null,"abstract":"Speech recognition refers to the technology that enables machines to recognize persons using their speech utterances. An automatic speaker verification (ASV) is included in one of the challenging task in speech community. The ASV system works based on the speaker recognition claimed against the model. In this paper, the system works as a text-independent speaker verification (TISV) and is outlined to verify the speaker using his/her voice samples. We followed two approaches, first approach is Gaussian Mixture Model (GMM) method is used to create speaker modeling and the second approach are GMMs created from training dataset, with Universal Background Model (UBM) used for adaptation of the dataset, well known approach for speaker verification (SV). GMM-UBMs are designed as well classifier for decision making. In both the approaches, the training is performed by the Expectation Maximization (EM) and Maximum A Posteriori (MAP) adaptation for better models respectively. The NIST 2003 database is evaluated using adapted GMM-UBM following NIST 2003 speaker recognition evaluation protocol and the relative performance improvement in the SV system using GMM and GMM-UBM in terms of EER are 9.43% and 8.88%.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"72 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132078877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic EEG Based Emotion Recognition Using Extreme Learning Machine 基于极限学习机的自动EEG情绪识别
Nalini Pusarla, Ashutosh Kumar Singh, S. Tripathi
Emotion is very essential natural feeling of humans. Emotion recognition is often used in brain-computer interface devices to assist impaired people. Electroencephalogram (EEG) signal is essential for identifying emotional states since it reacts instantly to every variation in the individual's brain. In this work, the usefulness of the tunable-Q wavelet transform (TQWT) for classifying various emotions in EEG signals is studied. TQWT breaks the EEG signals into sub-bands and extracts statistical momemts from the sub-bands. The extracted moments are features which are fed to classifier named extreme learning machine, which classifies different emotions. In comparison to other existing approaches, the experimental results of the proposed technique acheived improved emotion recognition performance on open-source datasets, SEED, SEED-IV, and DEAP. The maximum accuracy obtained with the proposed emotion recognition system is 95.2%, 95%, and 93.8% using SEED, SEED-IV, and DEAP databases, respectively, which is higher compared to the state-of-art methods.
情感是人类非常重要的自然情感。情感识别通常用于脑机接口设备,以帮助残疾人。脑电图(EEG)信号对识别情绪状态至关重要,因为它对个体大脑中的每一个变化都能迅速做出反应。在这项工作中,研究了可调q小波变换(TQWT)对脑电信号中各种情绪分类的有效性。TQWT将脑电信号分解成子带,从子带中提取统计矩。提取的矩作为特征输入到极端学习机分类器中,对不同的情绪进行分类。与其他现有方法相比,该方法在开源数据集SEED、SEED- iv和DEAP上取得了更好的情绪识别性能。使用SEED、SEED- iv和DEAP数据库,所提出的情绪识别系统获得的最大准确率分别为95.2%、95%和93.8%,高于目前的方法。
{"title":"Automatic EEG Based Emotion Recognition Using Extreme Learning Machine","authors":"Nalini Pusarla, Ashutosh Kumar Singh, S. Tripathi","doi":"10.1109/UPCON56432.2022.9986366","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986366","url":null,"abstract":"Emotion is very essential natural feeling of humans. Emotion recognition is often used in brain-computer interface devices to assist impaired people. Electroencephalogram (EEG) signal is essential for identifying emotional states since it reacts instantly to every variation in the individual's brain. In this work, the usefulness of the tunable-Q wavelet transform (TQWT) for classifying various emotions in EEG signals is studied. TQWT breaks the EEG signals into sub-bands and extracts statistical momemts from the sub-bands. The extracted moments are features which are fed to classifier named extreme learning machine, which classifies different emotions. In comparison to other existing approaches, the experimental results of the proposed technique acheived improved emotion recognition performance on open-source datasets, SEED, SEED-IV, and DEAP. The maximum accuracy obtained with the proposed emotion recognition system is 95.2%, 95%, and 93.8% using SEED, SEED-IV, and DEAP databases, respectively, which is higher compared to the state-of-art methods.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125573339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mains Interface Circuit Design for Traveling Wave Tube Amplifier 行波管放大器主接口电路设计
Kailash Kushwaha, R. Singh
This paper describes the design of a power supply intended for a traveling wave tube (TWT) power supply. For the faithful operation of the TWT amplifier, constant DC voltages are required for the operation of the collector supply, heater supply, helix supply, and control grid. In this work, the mains interface circuit for a TWT amplifier is designed and simulated using Proteus 8.9. For the removal of the ripples in the main power supply, a two-inductor based choke filter is connected in series to get ripple-free signal. The mains interface power supply generates three DC output voltages typically 300V, 150V, and +12V.
本文介绍了一种行波管(TWT)电源的设计。为了使行波管放大器可靠地工作,集电极、加热器、螺旋电源和控制电网的工作都需要恒定的直流电压。本文设计了一种行波管放大器的主接口电路,并利用proteus8.9对其进行了仿真。为了消除主电源中的纹波,串联了一个基于双电感的扼流圈滤波器,以获得无纹波信号。市电接口电源有三个直流输出电压:300V、150V、+12V。
{"title":"Mains Interface Circuit Design for Traveling Wave Tube Amplifier","authors":"Kailash Kushwaha, R. Singh","doi":"10.1109/UPCON56432.2022.9986386","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986386","url":null,"abstract":"This paper describes the design of a power supply intended for a traveling wave tube (TWT) power supply. For the faithful operation of the TWT amplifier, constant DC voltages are required for the operation of the collector supply, heater supply, helix supply, and control grid. In this work, the mains interface circuit for a TWT amplifier is designed and simulated using Proteus 8.9. For the removal of the ripples in the main power supply, a two-inductor based choke filter is connected in series to get ripple-free signal. The mains interface power supply generates three DC output voltages typically 300V, 150V, and +12V.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114179803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Passive Technique for Detecting Islanding Using Voltage Sequence Component 一种利用电压序列分量检测孤岛的被动技术
Indradeo Pratap Bharti, Prince Kumar, N. Singh, O. Gupta, N. Choudhary, Ashutosh Kumar Singh
Islanding is a well-known phenomenon in Distributed Generation (DG) in which the microgrid continues to provide power to an isolated load area even after it is disconnected from the utility grid. Fast detection of islanding is very important to maintain stability in voltage and frequency, restrict the disturbances within permissible limits, and prevent faults and cascade tripping in the grid. In case of delayed islanding detection, the system may have increased harmonics, distorted voltage profile during the transition, faults, and service interruption during critical loads. The main disadvantage of passive detection approaches is their large non-detection zone. Due to external signal injection at a common coupling point, active islanding reduces power quality and system performance. The suggested passive islanding detection approach is based on measuring the negative sequence voltage rate of change (ROCONSV) at the point of common connection (PCC). The negative sequence voltage is calculated using symmetrical component transformation and the derivative is then compared to predefined threshold values. Based on the system decision, the islanding trip signal is released for the circuit breaker at the microgrid. The proposed method is simple, reliable, and fast without the use of any external signal injection and power quality degradation.
孤岛现象是分布式发电(DG)中一个众所周知的现象,即微电网在与公用电网断开连接后仍继续向孤立的负载区供电。孤岛的快速检测对于保持电网电压和频率的稳定,将干扰限制在允许的范围内,防止电网故障和串级跳闸具有重要意义。在孤岛检测延迟的情况下,系统可能会出现谐波增加、转换过程中的电压畸变、故障和临界负载期间的业务中断等问题。被动检测方法的主要缺点是其较大的非检测区域。由于外部信号注入在公共耦合点,有源孤岛降低了电能质量和系统性能。建议的被动孤岛检测方法是基于在公共连接点(PCC)测量负序电压变化率(roconv)。使用对称分量变换计算负序电压,然后将导数与预定义的阈值进行比较。根据系统决策,释放孤岛跳闸信号给微网断路器。该方法简单、可靠、快速,不使用任何外部信号注入和电能质量下降。
{"title":"A Passive Technique for Detecting Islanding Using Voltage Sequence Component","authors":"Indradeo Pratap Bharti, Prince Kumar, N. Singh, O. Gupta, N. Choudhary, Ashutosh Kumar Singh","doi":"10.1109/UPCON56432.2022.9986462","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986462","url":null,"abstract":"Islanding is a well-known phenomenon in Distributed Generation (DG) in which the microgrid continues to provide power to an isolated load area even after it is disconnected from the utility grid. Fast detection of islanding is very important to maintain stability in voltage and frequency, restrict the disturbances within permissible limits, and prevent faults and cascade tripping in the grid. In case of delayed islanding detection, the system may have increased harmonics, distorted voltage profile during the transition, faults, and service interruption during critical loads. The main disadvantage of passive detection approaches is their large non-detection zone. Due to external signal injection at a common coupling point, active islanding reduces power quality and system performance. The suggested passive islanding detection approach is based on measuring the negative sequence voltage rate of change (ROCONSV) at the point of common connection (PCC). The negative sequence voltage is calculated using symmetrical component transformation and the derivative is then compared to predefined threshold values. Based on the system decision, the islanding trip signal is released for the circuit breaker at the microgrid. The proposed method is simple, reliable, and fast without the use of any external signal injection and power quality degradation.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114205701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sound Event Detection using Federated Learning 基于联邦学习的声音事件检测
M. K. Maurya, Mandeep Kumar, Manish Kumar
The study of sound event detection (SED) in environmental environments has gained popularity recently. However, significant logistical and privacy concerns exist because huge amounts of (private) home or urban audio data are needed. Federated learning (FL), which effectively distributes these duties, is a viable way to use enormous amounts of data without raising privacy issues. Although FL has recently gained much attention, only a few studies have been done on FL for SED. In this paper, we attempted FL for SED to fill this gap and encourage further study. This paper demonstrated the experiments on the URBAN and MNIST datasets to better understand the impact of data heterogeneity, optimizer, client participation, and communication round. Additionally, we run baseline outcomes for deep neural network designs on the datasets in an FL context. The CNN-M model is used for training and testing purposes; two datasets, namely URBAN and MNIST audio datasets, are used.
环境声事件检测(SED)的研究近年来得到了广泛的关注。然而,由于需要大量(私人)家庭或城市音频数据,因此存在重大的后勤和隐私问题。联邦学习(FL)有效地分配了这些职责,是一种使用大量数据而不会引起隐私问题的可行方法。虽然近年来FL得到了广泛的关注,但关于FL治疗SED的研究却很少。在本文中,我们尝试将FL用于SED,以填补这一空白,并鼓励进一步的研究。本文演示了URBAN和MNIST数据集上的实验,以更好地理解数据异构、优化器、客户端参与和通信回合的影响。此外,我们在FL环境下的数据集上运行深度神经网络设计的基线结果。CNN-M模型用于训练和测试目的;使用两个数据集,即URBAN和MNIST音频数据集。
{"title":"Sound Event Detection using Federated Learning","authors":"M. K. Maurya, Mandeep Kumar, Manish Kumar","doi":"10.1109/UPCON56432.2022.9986444","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986444","url":null,"abstract":"The study of sound event detection (SED) in environmental environments has gained popularity recently. However, significant logistical and privacy concerns exist because huge amounts of (private) home or urban audio data are needed. Federated learning (FL), which effectively distributes these duties, is a viable way to use enormous amounts of data without raising privacy issues. Although FL has recently gained much attention, only a few studies have been done on FL for SED. In this paper, we attempted FL for SED to fill this gap and encourage further study. This paper demonstrated the experiments on the URBAN and MNIST datasets to better understand the impact of data heterogeneity, optimizer, client participation, and communication round. Additionally, we run baseline outcomes for deep neural network designs on the datasets in an FL context. The CNN-M model is used for training and testing purposes; two datasets, namely URBAN and MNIST audio datasets, are used.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"326-327 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121095872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
COVID-19 Detection From Audio Signals Using LR-MLP-RF-GMM Classifiers 利用LR-MLP-RF-GMM分类器从音频信号中检测COVID-19
P. Kumawat, Utkarsh, Aditya Chikhale, Ramesh Kumar Bhukya
The COVID-19 pandemic bestows global challenges surpassing boundaries of country, religion race, and economy. Testing of COVID-19 patients conditions are remains a challenging task due to the lack of adequate medical supplies, well-trained personnel and conducting reverse transcription polymerase chain reaction (RT-PCR) testing is expensive, long-drown-out process violates social distancing. In this direction, we used microbiologically confirmed COVID-19 dataset based on cough recordings from Coswara dataset. The Coswara dataset is also one of the open challenge dataset for researchers to investigate sound recordings of the Coswara dataset, collected from COVID-19 infected and non-COVID-19 individuals, for classification between Positive and Negative detection. These COVID-19 recordings were collected from multiple countries, through the provided crowd-sourcing website. Here, our work mainly focuses on cough sound based recordings. The dataset is released open access. We developed an acoustic biosignature feature extractors to screen for potential problems from cough recordings, and provide personalized advice to a particular patient's state to monitor his suitable condition in real-time. In our work, cough sound recordings are converted into Mel Frequency Cepstral Coefficients (MFCCs) and passed through a Gaussian Mixture Model (GMM) based pattern recognition, decision making based on a binary pre-screening diagnostic. When validated with infected and non-infected patients, for a two-class classification, using a Coswara dataset. The GMM is applied for developing a model for detection of biomarker based detection and achieves COVID-19 and non-COVID-19 patients accuracy of 73.22% based on the Coswara dataset and also compared with existing classifiers.
新冠肺炎疫情带来了超越国家、宗教、种族、经济界限的全球性挑战。检测COVID-19患者的病情仍然是一项具有挑战性的任务,因为缺乏足够的医疗用品,训练有素的人员,进行逆转录聚合酶链反应(RT-PCR)检测昂贵,长时间的过程违反了社交距离。在这个方向上,我们使用了基于Coswara数据集咳嗽记录的微生物学证实的COVID-19数据集。Coswara数据集也是开放挑战数据集之一,供研究人员调查从COVID-19感染和非COVID-19个体收集的Coswara数据集的录音,用于区分阳性和阴性检测。这些COVID-19录音是通过提供的众包网站从多个国家收集的。在这里,我们的工作主要集中在咳嗽声的录音。数据集是开放获取的。我们开发了一种声学生物签名特征提取器,用于筛选咳嗽录音中的潜在问题,并针对特定患者的状态提供个性化建议,实时监测其合适的状态。在我们的工作中,咳嗽录音被转换成Mel频率倒谱系数(MFCCs),并通过基于高斯混合模型(GMM)的模式识别,基于二元预筛选诊断的决策制定。当对感染和非感染患者进行验证时,使用Coswara数据集进行两类分类。GMM用于开发基于生物标志物的检测模型,基于Coswara数据集并与现有分类器进行比较,实现了COVID-19和非COVID-19患者的准确率为73.22%。
{"title":"COVID-19 Detection From Audio Signals Using LR-MLP-RF-GMM Classifiers","authors":"P. Kumawat, Utkarsh, Aditya Chikhale, Ramesh Kumar Bhukya","doi":"10.1109/UPCON56432.2022.9986431","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986431","url":null,"abstract":"The COVID-19 pandemic bestows global challenges surpassing boundaries of country, religion race, and economy. Testing of COVID-19 patients conditions are remains a challenging task due to the lack of adequate medical supplies, well-trained personnel and conducting reverse transcription polymerase chain reaction (RT-PCR) testing is expensive, long-drown-out process violates social distancing. In this direction, we used microbiologically confirmed COVID-19 dataset based on cough recordings from Coswara dataset. The Coswara dataset is also one of the open challenge dataset for researchers to investigate sound recordings of the Coswara dataset, collected from COVID-19 infected and non-COVID-19 individuals, for classification between Positive and Negative detection. These COVID-19 recordings were collected from multiple countries, through the provided crowd-sourcing website. Here, our work mainly focuses on cough sound based recordings. The dataset is released open access. We developed an acoustic biosignature feature extractors to screen for potential problems from cough recordings, and provide personalized advice to a particular patient's state to monitor his suitable condition in real-time. In our work, cough sound recordings are converted into Mel Frequency Cepstral Coefficients (MFCCs) and passed through a Gaussian Mixture Model (GMM) based pattern recognition, decision making based on a binary pre-screening diagnostic. When validated with infected and non-infected patients, for a two-class classification, using a Coswara dataset. The GMM is applied for developing a model for detection of biomarker based detection and achieves COVID-19 and non-COVID-19 patients accuracy of 73.22% based on the Coswara dataset and also compared with existing classifiers.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115273243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Improved Feature Selection Algorithm for Autism Detection 一种改进的自闭症检测特征选择算法
Uday Singh, Shailendra Shukla, M. M. Gore
Autism Spectrum Disorder (ASD) is one of the most common acute neurodevelopmental disorders. It is associated with the development of the brain. ASD severely affects a child's physical and mental health. ASD detection at an early age is challenging as its symptoms come after two years. Each ASD patient has a different set of symptoms (features). In recent years, machine learning has offered a new potential solution for the detection of Autism. The effectiveness of the machine learning models depends on the dataset's features. This paper proposes a feature selection algorithm (which is based on feature correlation and ranking) for early ASD detection on the clinical ASD dataset. The performance of the feature selection algorithm is compared with different machine learning algorithms (LR, GBC, AdaBoost, and DT). The result shows that 5 out of the 30 features with a Logistic Regression model are sufficient to detect Autism with 98.18% accuracy, 98.16% sensitivity, and 98.16% precision. The result also shows that the Gradient Boost achieves 98.18% accuracy with 5 features, and the AdaBoost achieves 97.10% accuracy with 5 features.
自闭症谱系障碍(ASD)是最常见的急性神经发育障碍之一。它与大脑的发育有关。自闭症谱系障碍严重影响儿童的身心健康。早期发现ASD是具有挑战性的,因为它的症状是在两年后出现的。每个ASD患者都有一系列不同的症状(特征)。近年来,机器学习为自闭症的检测提供了一种新的潜在解决方案。机器学习模型的有效性取决于数据集的特征。针对临床ASD数据集,提出了一种基于特征相关性和排序的ASD早期检测特征选择算法。将特征选择算法与不同的机器学习算法(LR、GBC、AdaBoost和DT)的性能进行了比较。结果表明,Logistic回归模型的30个特征中有5个足以检测自闭症,准确率为98.18%,灵敏度为98.16%,精度为98.16%。结果还表明,梯度Boost在5个特征下的准确率达到98.18%,AdaBoost在5个特征下的准确率达到97.10%。
{"title":"An Improved Feature Selection Algorithm for Autism Detection","authors":"Uday Singh, Shailendra Shukla, M. M. Gore","doi":"10.1109/UPCON56432.2022.9986364","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986364","url":null,"abstract":"Autism Spectrum Disorder (ASD) is one of the most common acute neurodevelopmental disorders. It is associated with the development of the brain. ASD severely affects a child's physical and mental health. ASD detection at an early age is challenging as its symptoms come after two years. Each ASD patient has a different set of symptoms (features). In recent years, machine learning has offered a new potential solution for the detection of Autism. The effectiveness of the machine learning models depends on the dataset's features. This paper proposes a feature selection algorithm (which is based on feature correlation and ranking) for early ASD detection on the clinical ASD dataset. The performance of the feature selection algorithm is compared with different machine learning algorithms (LR, GBC, AdaBoost, and DT). The result shows that 5 out of the 30 features with a Logistic Regression model are sufficient to detect Autism with 98.18% accuracy, 98.16% sensitivity, and 98.16% precision. The result also shows that the Gradient Boost achieves 98.18% accuracy with 5 features, and the AdaBoost achieves 97.10% accuracy with 5 features.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121392290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Unified Framework for Covariance Adaptation with Multiple Source Domains 多源域协方差自适应的统一框架
Priyam Bajpai, R. Sanodiya
This paper addresses the problem of unsupervised domain adaptation in a setup where a single source is not sufficient for training the model. In this situation, a hybrid, multi-source driven training dataset is used. This calls for the need of an effective method to align the geometrically quasi-related source domains which would help prepare a better ground for aligning the unlabeled target dataset. We propose a robust framework that helps in better domain adaptation by reducing the probabilistic and subspace shift between the domains without compromising with their distributional information, and diminishing the distance of the between-class and within-class scatter of the domains collectively. The algorithm generates pseudo-labels after each iteration to update its objective function, thus helping it to perform better than conventional methods. The proposed framework tackles non-linear divergence by projecting the features into the kernel space. Computational experiments and their analysis show that the proposed algorithm performs better than other state-of-the-art domain adaptation methods on various visual recognition tasks.
本文解决了在单个源不足以训练模型的情况下的无监督域自适应问题。在这种情况下,使用混合的、多源驱动的训练数据集。这就需要一种有效的方法来对齐几何准相关的源域,这将有助于为对齐未标记的目标数据集准备更好的基础。我们提出了一个鲁棒框架,通过减少域之间的概率和子空间移动而不影响其分布信息,并减少域的类间和类内分散的距离,有助于更好地进行域适应。该算法在每次迭代后生成伪标签来更新其目标函数,从而使其比传统方法性能更好。提出的框架通过将特征投影到核空间来解决非线性发散问题。计算实验和分析表明,该算法在各种视觉识别任务上的表现优于其他领域自适应方法。
{"title":"A Unified Framework for Covariance Adaptation with Multiple Source Domains","authors":"Priyam Bajpai, R. Sanodiya","doi":"10.1109/UPCON56432.2022.9986432","DOIUrl":"https://doi.org/10.1109/UPCON56432.2022.9986432","url":null,"abstract":"This paper addresses the problem of unsupervised domain adaptation in a setup where a single source is not sufficient for training the model. In this situation, a hybrid, multi-source driven training dataset is used. This calls for the need of an effective method to align the geometrically quasi-related source domains which would help prepare a better ground for aligning the unlabeled target dataset. We propose a robust framework that helps in better domain adaptation by reducing the probabilistic and subspace shift between the domains without compromising with their distributional information, and diminishing the distance of the between-class and within-class scatter of the domains collectively. The algorithm generates pseudo-labels after each iteration to update its objective function, thus helping it to perform better than conventional methods. The proposed framework tackles non-linear divergence by projecting the features into the kernel space. Computational experiments and their analysis show that the proposed algorithm performs better than other state-of-the-art domain adaptation methods on various visual recognition tasks.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114262602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1