首页 > 最新文献

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning最新文献

英文 中文
Global-Local Framework for Medical Image Segmentation with Intra-class Imbalance Problem 一类内不平衡问题下医学图像分割的全局-局部框架
Yifan Zhou, Bing Yang, Xiaolu Lin, Risa Higashita, Jiang Liu
Deep learning methods have been demonstrated effective in medical image segmentation tasks. The results are affected by data imbalance problems. The inter-class imbalance is often considered, while the intra-class imbalance is not. The intra-class imbalance usually occurs in medical images due to external influences such as noise interference and changes in camera angle, resulting in insufficient discriminative representations within classes. Deep learning methods are easy to segment regions without complex textures and varied appearances. They are susceptible to the intra-class imbalance problem in medical images. In this paper, we propose a two-stage global-local framework to solve the intra-class imbalance problem and increase segmentation accuracy. The framework consists of (1) an auxiliary task network(ATN), (2) a local patch network(LPN), and (3) a fusion module. The ATN has a shared encoder and two separate decoders that perform global segmentation and key points localization. The key points guide to generating the fuzzy patches for the LPN. The LPN focuses on challenging patches to get a more accurate result. The fusion module generates the final output according to the global and local segmentation results. Furthermore, we have performed experiments on a private iris dataset with 290 images and a public CAMUS dataset with 1800 images. Our method achieves an IoU of 0.9280 on the iris dataset and an IoU of 0.8511 on the CAMUS dataset. The results on both datasets show that our method achieves superior performance over U-Net, CE-Net, and U-Net++.
深度学习方法已被证明在医学图像分割任务中是有效的。结果受到数据不平衡问题的影响。阶级间的不平衡经常被考虑,而阶级内的不平衡却没有被考虑。在医学图像中,由于噪声干扰、摄像机角度变化等外界影响,往往会出现类内失衡,导致类内的判别表征不足。深度学习方法很容易分割区域,没有复杂的纹理和变化的外观。他们容易受到医学图像的类内失衡问题的影响。在本文中,我们提出了一个两阶段的全局-局部框架来解决类内不平衡问题,提高分割精度。该框架由(1)辅助任务网络(ATN)、(2)局部补丁网络(LPN)和(3)融合模块组成。ATN有一个共享编码器和两个独立的解码器,执行全局分割和关键点定位。重点指导生成LPN的模糊补丁。LPN专注于挑战补丁以获得更准确的结果。融合模块根据全局和局部分割结果生成最终输出。此外,我们还在包含290张图像的私有虹膜数据集和包含1800张图像的公共CAMUS数据集上进行了实验。我们的方法在iris数据集上实现了0.9280的IoU,在CAMUS数据集上实现了0.8511的IoU。在两个数据集上的结果表明,我们的方法比U-Net、CE-Net和U-Net++具有更好的性能。
{"title":"Global-Local Framework for Medical Image Segmentation with Intra-class Imbalance Problem","authors":"Yifan Zhou, Bing Yang, Xiaolu Lin, Risa Higashita, Jiang Liu","doi":"10.1145/3590003.3590071","DOIUrl":"https://doi.org/10.1145/3590003.3590071","url":null,"abstract":"Deep learning methods have been demonstrated effective in medical image segmentation tasks. The results are affected by data imbalance problems. The inter-class imbalance is often considered, while the intra-class imbalance is not. The intra-class imbalance usually occurs in medical images due to external influences such as noise interference and changes in camera angle, resulting in insufficient discriminative representations within classes. Deep learning methods are easy to segment regions without complex textures and varied appearances. They are susceptible to the intra-class imbalance problem in medical images. In this paper, we propose a two-stage global-local framework to solve the intra-class imbalance problem and increase segmentation accuracy. The framework consists of (1) an auxiliary task network(ATN), (2) a local patch network(LPN), and (3) a fusion module. The ATN has a shared encoder and two separate decoders that perform global segmentation and key points localization. The key points guide to generating the fuzzy patches for the LPN. The LPN focuses on challenging patches to get a more accurate result. The fusion module generates the final output according to the global and local segmentation results. Furthermore, we have performed experiments on a private iris dataset with 290 images and a public CAMUS dataset with 1800 images. Our method achieves an IoU of 0.9280 on the iris dataset and an IoU of 0.8511 on the CAMUS dataset. The results on both datasets show that our method achieves superior performance over U-Net, CE-Net, and U-Net++.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"106 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132347222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-dimensional analysis of urban shrinkage problem in Liaoning Province based on multi-index system, grey correlation analysis and BP neural network with particle swarm optimization 基于多指标体系、灰色关联分析和粒子群优化BP神经网络的辽宁省城市收缩问题多维分析
Zhenyu Fang, Jun Yu Li, Junyu Xiong, Xin Wang
The rapid development of urbanization in modern China is accompanied by the increasingly serious problem of urban shrinkage. To provide an effective analytical model for the urban shrinkage problem, this paper takes Liaoning Province, which is one of the typical provinces with a serious urban shrinkage issue in China, as an example. Based on the data from 30 cities in Liaoning Province in recent years, this paper constructs a multi-index system for shrinking cities to evaluate and classify the shrinkage degree of 30 cities. The grey relation analysis model is also used to quantitatively analyze the influence of various factors on the shrinking city population, while the back-propagation neural network algorithm model optimized with particle swarm optimization is also applied to predict the development trend of shrinking cities. The results present the shrinking properties of 30 cities and correlations between different city indicators, as well as the predictive development trend of the shrinking city.
中国近代城市化快速发展的同时,城市萎缩问题也日益严重。为了给城市收缩问题提供一个有效的分析模型,本文以中国城市收缩问题严重的典型省份之一辽宁省为例。本文以辽宁省30个城市近年来的收缩城市数据为基础,构建了收缩城市多指标体系,对30个城市的收缩程度进行了评价和分类。运用灰色关联分析模型定量分析各因素对城市人口萎缩的影响,运用粒子群优化的反向传播神经网络算法模型预测城市人口萎缩的发展趋势。研究结果揭示了30个城市的收缩特征和不同城市指标之间的相关性,并预测了收缩城市的发展趋势。
{"title":"Multi-dimensional analysis of urban shrinkage problem in Liaoning Province based on multi-index system, grey correlation analysis and BP neural network with particle swarm optimization","authors":"Zhenyu Fang, Jun Yu Li, Junyu Xiong, Xin Wang","doi":"10.1145/3590003.3590016","DOIUrl":"https://doi.org/10.1145/3590003.3590016","url":null,"abstract":"The rapid development of urbanization in modern China is accompanied by the increasingly serious problem of urban shrinkage. To provide an effective analytical model for the urban shrinkage problem, this paper takes Liaoning Province, which is one of the typical provinces with a serious urban shrinkage issue in China, as an example. Based on the data from 30 cities in Liaoning Province in recent years, this paper constructs a multi-index system for shrinking cities to evaluate and classify the shrinkage degree of 30 cities. The grey relation analysis model is also used to quantitatively analyze the influence of various factors on the shrinking city population, while the back-propagation neural network algorithm model optimized with particle swarm optimization is also applied to predict the development trend of shrinking cities. The results present the shrinking properties of 30 cities and correlations between different city indicators, as well as the predictive development trend of the shrinking city.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133833060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Twitter stance detection using deep learning model with FastText Embedding 使用快速文本嵌入的深度学习模型进行Twitter姿态检测
Yongqing Deng, Yongzhong Huang
The interactivity of social media platforms allows a large number of users to comment on different political or social issues to express their views, and identifying users' stances from online comment texts helps the government to monitor public opinion more effectively. The automatic recognition of stance information in comment text has become a new research hotspot in the field of natural language processing. Most of the existing text stance analysis corpus focuses on political topics in European and American countries, and high-quality stance analysis corpus research on political topics in Southeast Asian countries is relatively scarce. In order to stimulate this research direction, this paper provides a dataset about the 2022 Philippine presidential election, which annotates the stance information of the two popular presidential candidates and provides reliable data support for subsequent stance analysis model research. Next, we build a stance detection model of hybrid deep neural networks based on BiLSTM, CNN, and Attention, and we demonstrate its effectiveness on multiple datasets and obtain the best results on the SemEval-2016 dataset. In addition, we compare FastText and Word2Vec, two pre-trained word embeddings for word encoding, and discuss which word embedding is preferred in stance detection tasks. This result shows that the stance analysis model proposed in this paper can be effectively applied to Twitter text stance data.
社交媒体平台的互动性使得大量用户可以对不同的政治或社会问题发表评论,表达自己的观点,从在线评论文本中识别用户的立场有助于政府更有效地监控民意。评论文本中立场信息的自动识别已成为自然语言处理领域的一个新的研究热点。现有的文本立场分析语料库大多集中在欧美国家的政治话题上,而高质量的东南亚国家政治话题立场分析语料库研究相对较少。为了激发这一研究方向,本文提供了一个关于2022年菲律宾总统选举的数据集,该数据集注释了两位热门总统候选人的立场信息,为后续的立场分析模型研究提供可靠的数据支持。接下来,我们建立了基于BiLSTM、CNN和Attention的混合深度神经网络的姿态检测模型,并在多个数据集上验证了其有效性,在SemEval-2016数据集上获得了最佳结果。此外,我们比较了FastText和Word2Vec这两种用于单词编码的预训练词嵌入,并讨论了哪种词嵌入更适合用于姿态检测任务。结果表明,本文提出的姿态分析模型可以有效地应用于Twitter文本姿态数据。
{"title":"Twitter stance detection using deep learning model with FastText Embedding","authors":"Yongqing Deng, Yongzhong Huang","doi":"10.1145/3590003.3590102","DOIUrl":"https://doi.org/10.1145/3590003.3590102","url":null,"abstract":"The interactivity of social media platforms allows a large number of users to comment on different political or social issues to express their views, and identifying users' stances from online comment texts helps the government to monitor public opinion more effectively. The automatic recognition of stance information in comment text has become a new research hotspot in the field of natural language processing. Most of the existing text stance analysis corpus focuses on political topics in European and American countries, and high-quality stance analysis corpus research on political topics in Southeast Asian countries is relatively scarce. In order to stimulate this research direction, this paper provides a dataset about the 2022 Philippine presidential election, which annotates the stance information of the two popular presidential candidates and provides reliable data support for subsequent stance analysis model research. Next, we build a stance detection model of hybrid deep neural networks based on BiLSTM, CNN, and Attention, and we demonstrate its effectiveness on multiple datasets and obtain the best results on the SemEval-2016 dataset. In addition, we compare FastText and Word2Vec, two pre-trained word embeddings for word encoding, and discuss which word embedding is preferred in stance detection tasks. This result shows that the stance analysis model proposed in this paper can be effectively applied to Twitter text stance data.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"179 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116143744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated Learning-Based Intrusion Detection Method for Smart Grid 基于联邦学习的智能电网入侵检测方法
Dong Bin, Xin Li, Chunyan Yang, Songming Han, Ying Ling
Power systems have revealed serious security problems in the process of gradual opening, and intrusion detection as an important security defense measure can detect potential intrusions in a timely manner. In the big data environment of electric power, there are information silos between different electric power data owners, and in order to obtain intrusion detection models with better performance, traditional methods need to fuse data from all parties, which often brings difficulties in information security and data privacy protection. In this paper, we propose a distributed intrusion detection framework based on federated learning and apply it to network traffic data analysis. The framework aims to ensure the information security of each local power data while establishing a collection of decentralized data and completing the joint training of models from multiple data sources. The experimental results show that the scheme achieves 98.1% accuracy on the simulated data set, which is better than other commonly used intrusion detection algorithms. In addition, the method well ensures the security and privacy of data because the data are not interoperable among each participant under the federated learning mechanism.
电力系统在逐步开放的过程中暴露出了严重的安全问题,入侵检测作为一种重要的安全防御措施,可以及时发现潜在的入侵。在电力大数据环境中,不同电力数据所有者之间存在信息孤岛,传统方法为了获得性能更好的入侵检测模型,需要融合各方数据,这往往给信息安全和数据隐私保护带来困难。本文提出了一种基于联邦学习的分布式入侵检测框架,并将其应用于网络流量数据分析。该框架旨在确保各个地方电力数据的信息安全,同时建立一个分散的数据集合,完成多个数据源模型的联合训练。实验结果表明,该方案在模拟数据集上的准确率达到了98.1%,优于其他常用的入侵检测算法。此外,由于在联邦学习机制下各参与者之间的数据不能互操作,该方法很好地保证了数据的安全性和隐私性。
{"title":"Federated Learning-Based Intrusion Detection Method for Smart Grid","authors":"Dong Bin, Xin Li, Chunyan Yang, Songming Han, Ying Ling","doi":"10.1145/3590003.3590060","DOIUrl":"https://doi.org/10.1145/3590003.3590060","url":null,"abstract":"Power systems have revealed serious security problems in the process of gradual opening, and intrusion detection as an important security defense measure can detect potential intrusions in a timely manner. In the big data environment of electric power, there are information silos between different electric power data owners, and in order to obtain intrusion detection models with better performance, traditional methods need to fuse data from all parties, which often brings difficulties in information security and data privacy protection. In this paper, we propose a distributed intrusion detection framework based on federated learning and apply it to network traffic data analysis. The framework aims to ensure the information security of each local power data while establishing a collection of decentralized data and completing the joint training of models from multiple data sources. The experimental results show that the scheme achieves 98.1% accuracy on the simulated data set, which is better than other commonly used intrusion detection algorithms. In addition, the method well ensures the security and privacy of data because the data are not interoperable among each participant under the federated learning mechanism.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115152566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Health monitoring system for elderly people based on Raspberry Pi 基于树莓派的老年人健康监测系统
Qingsong Peng
We present a comprehensive overview of the application of Raspberry Pi in the field of health monitoring for elderly people with disabilities. Firstly we discuss the advantages of using artificial intelligence technology for health monitoring of elderly people, and the significance of using information technology devices to achieve health monitoring for the elderly, while keeping the cost of the devices low. And then we examine the development of Raspberry Pi and its advantages for health monitoring of elderly people with disabilities, such as its low cost, portability, and ease of use. After that we outline the methods of collecting data for health monitoring of elderly people, such as using sensors to measure heart rate, oxygen levels, and blood pressure, and integrating these sensors into a single device. We also discuss the implementation of a Raspberry Pi-based health monitoring system for elderly people, and the ways in which health data can be utilized to optimize the performance of the system. The work provides useful insights for those who are interested in using Raspberry Pi for health monitoring applications for elderly people with disabilities.
我们全面概述了树莓派在残疾老年人健康监测领域的应用。首先讨论了利用人工智能技术进行老年人健康监测的优势,以及利用信息技术设备实现老年人健康监测的意义,同时保持设备的低成本。然后我们研究了树莓派的发展及其对残疾老年人健康监测的优势,如其低成本,便携性和易用性。然后,我们概述了收集老年人健康监测数据的方法,例如使用传感器测量心率,氧气水平和血压,并将这些传感器集成到一个设备中。我们还讨论了一个基于树莓派的老年人健康监测系统的实现,以及如何利用健康数据来优化系统的性能。这项工作为那些有兴趣使用树莓派用于残疾老年人健康监测应用程序的人提供了有用的见解。
{"title":"Health monitoring system for elderly people based on Raspberry Pi","authors":"Qingsong Peng","doi":"10.1145/3590003.3590057","DOIUrl":"https://doi.org/10.1145/3590003.3590057","url":null,"abstract":"We present a comprehensive overview of the application of Raspberry Pi in the field of health monitoring for elderly people with disabilities. Firstly we discuss the advantages of using artificial intelligence technology for health monitoring of elderly people, and the significance of using information technology devices to achieve health monitoring for the elderly, while keeping the cost of the devices low. And then we examine the development of Raspberry Pi and its advantages for health monitoring of elderly people with disabilities, such as its low cost, portability, and ease of use. After that we outline the methods of collecting data for health monitoring of elderly people, such as using sensors to measure heart rate, oxygen levels, and blood pressure, and integrating these sensors into a single device. We also discuss the implementation of a Raspberry Pi-based health monitoring system for elderly people, and the ways in which health data can be utilized to optimize the performance of the system. The work provides useful insights for those who are interested in using Raspberry Pi for health monitoring applications for elderly people with disabilities.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123338827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Histo-Puzzle Network for Weakly Supervised Semantic Segmentation of Histological Tissue Type 用于组织类型弱监督语义分割的组织谜题网络
Tengyun Ma, Guotian He, Lin Chen, Yuanchang Lin
Digital pathological images with a large range of Histological Tissue Types (HTTs) contain more sophisticated contours than natural images. In recent years, deep learning algorithms have been widely applied to assist HTT analysis in a weakly-supervised manner by exploiting the class activation maps (CAM). However, the previous methods tend to confusedly activate the most discriminative regions of feature maps, resulting in incomplete segmented contour. This paper proposes a Histo-Puzzle network to improve the HTTs classification and segmentation based on patch-level self-supervised learning. Specifically, our model separates the HTT images into tiled patches by a puzzle module. Then we train a classifier on the supervision of reconstructed CAMs and image-level labels simultaneously. Experiments are conducted on the digital pathology database with 51 hierarchical HTTs. The experimental results show that our proposed method outperforms previous state-of-the-art methods on segmentation tasks of morphological and functional types.
具有大范围组织学组织类型(HTTs)的数字病理图像包含比自然图像更复杂的轮廓。近年来,深度学习算法已被广泛应用于利用类激活图(CAM)以弱监督的方式辅助HTT分析。然而,以往的方法往往会混淆激活特征图中最具判别性的区域,导致轮廓分割不完整。本文提出了一种基于补丁级自监督学习的组织拼图网络来改进http分类和分割。具体来说,我们的模型通过拼图模块将HTT图像分成平铺块。然后,我们训练一个分类器,同时对重构的cam和图像级标签进行监督。实验在51个分层html的数字病理数据库上进行。实验结果表明,我们提出的方法在形态学和功能类型的分割任务上优于现有的最先进的方法。
{"title":"A Histo-Puzzle Network for Weakly Supervised Semantic Segmentation of Histological Tissue Type","authors":"Tengyun Ma, Guotian He, Lin Chen, Yuanchang Lin","doi":"10.1145/3590003.3590095","DOIUrl":"https://doi.org/10.1145/3590003.3590095","url":null,"abstract":"Digital pathological images with a large range of Histological Tissue Types (HTTs) contain more sophisticated contours than natural images. In recent years, deep learning algorithms have been widely applied to assist HTT analysis in a weakly-supervised manner by exploiting the class activation maps (CAM). However, the previous methods tend to confusedly activate the most discriminative regions of feature maps, resulting in incomplete segmented contour. This paper proposes a Histo-Puzzle network to improve the HTTs classification and segmentation based on patch-level self-supervised learning. Specifically, our model separates the HTT images into tiled patches by a puzzle module. Then we train a classifier on the supervision of reconstructed CAMs and image-level labels simultaneously. Experiments are conducted on the digital pathology database with 51 hierarchical HTTs. The experimental results show that our proposed method outperforms previous state-of-the-art methods on segmentation tasks of morphological and functional types.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"67 8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116006241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Emotion Recognition Method Based On Feature Fusion and Self-Supervised Learning 基于特征融合和自监督学习的情绪识别方法
Xuan-Nam Cao, Ming Sun
Emotional diseases being represented in many kinds of human mental and cardiac problems, demanding requirements are imposed on accurate emotion recognition. Deep learning methods have gained widespread application in the field of emotion recognition, utilizing physiological signals. However, many existing methods rely solely on deep features, which can be difficult to interpret and may not provide a comprehensive understanding of physiological signals. To address this issue, we propose a novel emotion recognition method based on feature fusion and self-supervised learning. This approach combines shallow features and deep learning features, resulting in a more holistic and interpretable approach to analyzing physiological signals. In addition, we transferred the self-supervised learning method from processing images to signals, which learns sophisticated and informative features from unlabeled signal data. Our experimental results are conducted on WESAD, a publicly available dataset and the proposed model shows significant improvement in performance, which confirms the superiority of our proposed method compared to state-of-the-art methods.
情绪疾病是人类精神和心脏疾病的多种表现形式,对准确的情绪识别提出了很高的要求。深度学习方法利用生理信号在情绪识别领域得到了广泛的应用。然而,许多现有的方法仅仅依赖于深层特征,这可能难以解释,并且可能无法提供对生理信号的全面理解。为了解决这一问题,我们提出了一种基于特征融合和自监督学习的情感识别方法。这种方法结合了浅特征和深度学习特征,产生了一种更全面和可解释的方法来分析生理信号。此外,我们将自监督学习方法从处理图像转移到信号,从未标记的信号数据中学习复杂且信息丰富的特征。我们的实验结果是在WESAD(一个公开可用的数据集)上进行的,所提出的模型在性能上显示出显着的改进,这证实了我们所提出的方法与最先进的方法相比的优越性。
{"title":"An Emotion Recognition Method Based On Feature Fusion and Self-Supervised Learning","authors":"Xuan-Nam Cao, Ming Sun","doi":"10.1145/3590003.3590041","DOIUrl":"https://doi.org/10.1145/3590003.3590041","url":null,"abstract":"Emotional diseases being represented in many kinds of human mental and cardiac problems, demanding requirements are imposed on accurate emotion recognition. Deep learning methods have gained widespread application in the field of emotion recognition, utilizing physiological signals. However, many existing methods rely solely on deep features, which can be difficult to interpret and may not provide a comprehensive understanding of physiological signals. To address this issue, we propose a novel emotion recognition method based on feature fusion and self-supervised learning. This approach combines shallow features and deep learning features, resulting in a more holistic and interpretable approach to analyzing physiological signals. In addition, we transferred the self-supervised learning method from processing images to signals, which learns sophisticated and informative features from unlabeled signal data. Our experimental results are conducted on WESAD, a publicly available dataset and the proposed model shows significant improvement in performance, which confirms the superiority of our proposed method compared to state-of-the-art methods.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126580949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting Arbitrary-oriented Objects in Remote Sensing Imagery with Segmentation-Aware Mask 基于分割感知掩模的遥感图像任意方向目标检测
Jiali Wei, Bo Hua, Fei Gao, Huan Zhang, Jiangwei Fan, Shuran Zhang
Arbitrary-Oriented object detection in remote sensing images is a hot topic in recent years. Currently, most arbitrary-oriented object detectors adopt the oriented bounding box (OBB) to represent targets in remote sensing imagery. However, OBB representation suffers from suboptimal regression problems caused by the ambiguity of the angle definition. In this paper, we propose a novel framework to Learning Segmentation-aware Mask for arbitrary-oriented object Detection (LSM-Det) in remote sensing imagery. LSM-Det predicts the mask of the object, and then converts the mask prediction into a minimum external OBB to achieve arbitrary-oriented object detection. Moreover, we designed a segmentation-aware branch to select high-quality predictions via the output matching score. Our method achieves superior performance on multiple remote sensing datasets. Code and models are available to facilitate related research.
面向任意目标的遥感图像检测是近年来研究的热点。目前,遥感图像中的任意方向目标检测器大多采用定向边界框(OBB)来表示目标。然而,OBB表示存在由于角度定义不明确而导致的次优回归问题。在本文中,我们提出了一种新的框架来学习用于遥感图像中任意目标检测(LSM-Det)的分割感知掩码。LSM-Det预测目标的掩码,然后将掩码预测转化为最小的外部OBB,实现任意方向的目标检测。此外,我们设计了一个分割感知分支,通过输出匹配分数来选择高质量的预测。该方法在多遥感数据集上具有优异的性能。代码和模型可用于促进相关研究。
{"title":"Detecting Arbitrary-oriented Objects in Remote Sensing Imagery with Segmentation-Aware Mask","authors":"Jiali Wei, Bo Hua, Fei Gao, Huan Zhang, Jiangwei Fan, Shuran Zhang","doi":"10.1145/3590003.3590032","DOIUrl":"https://doi.org/10.1145/3590003.3590032","url":null,"abstract":"Arbitrary-Oriented object detection in remote sensing images is a hot topic in recent years. Currently, most arbitrary-oriented object detectors adopt the oriented bounding box (OBB) to represent targets in remote sensing imagery. However, OBB representation suffers from suboptimal regression problems caused by the ambiguity of the angle definition. In this paper, we propose a novel framework to Learning Segmentation-aware Mask for arbitrary-oriented object Detection (LSM-Det) in remote sensing imagery. LSM-Det predicts the mask of the object, and then converts the mask prediction into a minimum external OBB to achieve arbitrary-oriented object detection. Moreover, we designed a segmentation-aware branch to select high-quality predictions via the output matching score. Our method achieves superior performance on multiple remote sensing datasets. Code and models are available to facilitate related research.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130503626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RhySpeech: A Deployable Rhythmic Text-to-Speech Based on Feed-Forward Transformer for Reading Disabilities RhySpeech:一种可部署的基于前馈变压器的有节奏文本到语音的阅读障碍
Yi-Hsien Lin
Dyslexia was first proposed in 1877, but this century-old problem still troubles many people today [1]. Dyslexia is marked by difficulty in reading despite having normal or superior conditions in their environment and intellectual ability, is curable using multi-sensory learning, which involves providing audio stimulus, sometimes generated from expressive text-to-speech. However, such generated audio lacks rhythmic features, marked by inadequate insertion of pauses. In response to such technological difficulty, this paper proposes RhySpeech, which models rhythm using feed-forward transformer neural networks and an LRV (Latent Rhythm Vector). The LRV receives input from the pitch, energy, and duration features encoded using a Transformers network along with the numeric encoding of the previous 16 phonemes, which together build a strong sense of context for the pause prediction. This LRV is trained to generate adequate lengths and positions of pa uses, allowing the synthesized audio to have more accurate pausing
诵读困难症最早是在1877年提出的,但这个长达一个世纪的问题至今仍困扰着许多人[1]。阅读障碍的特点是,尽管他们的环境和智力能力正常或优越,但阅读困难,可以通过多感官学习来治愈,这种学习包括提供音频刺激,有时是由表达性的文本到语音产生的。然而,这种生成的音频缺乏节奏特征,其特点是插入的停顿不足。针对这种技术困难,本文提出了RhySpeech,它使用前馈变压器神经网络和LRV (Latent rhythm Vector)来建模节奏。LRV接收来自音高、能量和持续时间特征的输入,这些特征使用transformer网络编码,以及前16个音素的数字编码,它们一起为暂停预测构建了强大的上下文感。这个LRV训练产生足够的长度和位置的pa使用,允许合成音频有更准确的暂停
{"title":"RhySpeech: A Deployable Rhythmic Text-to-Speech Based on Feed-Forward Transformer for Reading Disabilities","authors":"Yi-Hsien Lin","doi":"10.1145/3590003.3590062","DOIUrl":"https://doi.org/10.1145/3590003.3590062","url":null,"abstract":"Dyslexia was first proposed in 1877, but this century-old problem still troubles many people today [1]. Dyslexia is marked by difficulty in reading despite having normal or superior conditions in their environment and intellectual ability, is curable using multi-sensory learning, which involves providing audio stimulus, sometimes generated from expressive text-to-speech. However, such generated audio lacks rhythmic features, marked by inadequate insertion of pauses. In response to such technological difficulty, this paper proposes RhySpeech, which models rhythm using feed-forward transformer neural networks and an LRV (Latent Rhythm Vector). The LRV receives input from the pitch, energy, and duration features encoded using a Transformers network along with the numeric encoding of the previous 16 phonemes, which together build a strong sense of context for the pause prediction. This LRV is trained to generate adequate lengths and positions of pa uses, allowing the synthesized audio to have more accurate pausing","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128448014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gaussian-guided character erasure for data augment of industrial characters 工业字符数据扩充的高斯制导字符擦除
Hongchao Gao, Chao Yao, Zhennan Wang
The application of scene text erasure technology in privacy protection, camera-based virtual reality translation and image editing has attracted more and more research interests. Recent efforts on scene text erasing have shown promising results. We utilize text removal methods as a component of industrial characters generation procedure to generate large-scale synthetic character images so as to mitigate the issue of insufficient samples in the recognition task of industrial characters. Existing character erasure models has achieved good performance in natural scenes. However, in industrial scenes, these erasure networks are easily affected by salient no-character regions leading to the attention shift. To overcome this limitation, we proposed a character erasure network based on attention mechanism which embed an additional region awareness layer to guide attention to the correct character regions. Meanwhile, we devise a gaussian heat map supervision method for learning additional region awareness layer. The experiments show that the proposed method performs favourably on four industrial character datasets.
场景文本擦除技术在隐私保护、基于摄像机的虚拟现实翻译和图像编辑等方面的应用引起了越来越多的研究兴趣。最近在场景文本擦除方面的努力已经显示出可喜的结果。我们利用文本去除方法作为工业字符生成过程的组成部分,生成大规模的合成字符图像,以缓解工业字符识别任务中样本不足的问题。现有的字符擦除模型在自然场景中取得了较好的效果。然而,在工业场景中,这些擦除网络容易受到显著的无字符区域的影响,导致注意力转移。为了克服这一限制,我们提出了一种基于注意机制的字符擦除网络,该网络嵌入了一个额外的区域感知层,将注意力引导到正确的字符区域。同时,我们设计了一种高斯热图监督方法来学习额外的区域感知层。实验结果表明,该方法在四种工业特征数据集上具有良好的性能。
{"title":"Gaussian-guided character erasure for data augment of industrial characters","authors":"Hongchao Gao, Chao Yao, Zhennan Wang","doi":"10.1145/3590003.3590077","DOIUrl":"https://doi.org/10.1145/3590003.3590077","url":null,"abstract":"The application of scene text erasure technology in privacy protection, camera-based virtual reality translation and image editing has attracted more and more research interests. Recent efforts on scene text erasing have shown promising results. We utilize text removal methods as a component of industrial characters generation procedure to generate large-scale synthetic character images so as to mitigate the issue of insufficient samples in the recognition task of industrial characters. Existing character erasure models has achieved good performance in natural scenes. However, in industrial scenes, these erasure networks are easily affected by salient no-character regions leading to the attention shift. To overcome this limitation, we proposed a character erasure network based on attention mechanism which embed an additional region awareness layer to guide attention to the correct character regions. Meanwhile, we devise a gaussian heat map supervision method for learning additional region awareness layer. The experiments show that the proposed method performs favourably on four industrial character datasets.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129126890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1