首页 > 最新文献

2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)最新文献

英文 中文
Quantum Embeddings of Classical Data for Quantum Machine Learning 量子机器学习经典数据的量子嵌入
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10138000
G. Luca, Yinong Chen
A major area of research in the field of quantum machine learning is the analysis of the loss landscape, particularly of variational quantum algorithms. These works often provide bounds and generalizations for various ansatzes and quantum embedding strategies. These analyses include approaches such as the Hessian and Fisher information matrices as well as generalized trigonometric polynomials. However, many such reviews often rely on a rotational encoding in practice or focus on few different approaches. The goal of this work is to statistically analyze experimental results from a quantum machine learning model that employs various different quantum embedding approaches, including those covered in related work, as well as the effect of measurement basis on the model.
量子机器学习领域的一个主要研究领域是对损失情况的分析,特别是变分量子算法。这些工作通常为各种分析和量子嵌入策略提供了界限和概括。这些分析包括诸如Hessian和Fisher信息矩阵以及广义三角多项式等方法。然而,许多这样的评论往往依赖于实践中的旋转编码或关注少数不同的方法。本工作的目标是统计分析量子机器学习模型的实验结果,该模型采用了各种不同的量子嵌入方法,包括相关工作中涉及的方法,以及测量基础对模型的影响。
{"title":"Quantum Embeddings of Classical Data for Quantum Machine Learning","authors":"G. Luca, Yinong Chen","doi":"10.1109/ACAIT56212.2022.10138000","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10138000","url":null,"abstract":"A major area of research in the field of quantum machine learning is the analysis of the loss landscape, particularly of variational quantum algorithms. These works often provide bounds and generalizations for various ansatzes and quantum embedding strategies. These analyses include approaches such as the Hessian and Fisher information matrices as well as generalized trigonometric polynomials. However, many such reviews often rely on a rotational encoding in practice or focus on few different approaches. The goal of this work is to statistically analyze experimental results from a quantum machine learning model that employs various different quantum embedding approaches, including those covered in related work, as well as the effect of measurement basis on the model.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131115163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep Learning Model Research for Cortical Bone Separation in Chest CT Spine Imaging 胸部CT脊柱成像皮质骨分离的深度学习模型研究
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137862
Haitao Yu, Juntao Zeng, Xiaofeng Xie
Osteoporosis is a global skeletal disease which will seriously affect the human life. The early diagnosis of osteoporosis by using bone mineral density (BMD) examination can help to decrease the probability of osteoporosis. In the development of computer aided diagnosis, the calculation of BMD can be achieved by deep learning model in CT, without using the specially measuring devices. In this paper, we used a 3D-Unet model to segment the cortical and cancellous bone in the spine and perform quantitative analysis. After that, the three-dimensional visualization of cortical and cancellous bone was reconstructed, and the BMD value and other information were calculated to help doctors to predict the risk of osteoporosis. The expeirmental result shown that the proposed method achieve high performance in segementation and quantization.
骨质疏松症是一种严重影响人类生活的全球性骨骼疾病。骨质疏松症的早期诊断通过骨密度检查有助于降低骨质疏松症的发生概率。在计算机辅助诊断的发展中,BMD的计算可以通过CT的深度学习模型来实现,而不需要使用专门的测量设备。在本文中,我们使用3D-Unet模型对脊柱皮质骨和松质骨进行分割并进行定量分析。然后重建皮质骨和松质骨的三维可视化,并计算BMD值等信息,帮助医生预测骨质疏松的风险。实验结果表明,该方法在分割和量化方面取得了较好的效果。
{"title":"Deep Learning Model Research for Cortical Bone Separation in Chest CT Spine Imaging","authors":"Haitao Yu, Juntao Zeng, Xiaofeng Xie","doi":"10.1109/ACAIT56212.2022.10137862","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137862","url":null,"abstract":"Osteoporosis is a global skeletal disease which will seriously affect the human life. The early diagnosis of osteoporosis by using bone mineral density (BMD) examination can help to decrease the probability of osteoporosis. In the development of computer aided diagnosis, the calculation of BMD can be achieved by deep learning model in CT, without using the specially measuring devices. In this paper, we used a 3D-Unet model to segment the cortical and cancellous bone in the spine and perform quantitative analysis. After that, the three-dimensional visualization of cortical and cancellous bone was reconstructed, and the BMD value and other information were calculated to help doctors to predict the risk of osteoporosis. The expeirmental result shown that the proposed method achieve high performance in segementation and quantization.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131206141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Scale Dense Feature Fusion Based Loess Landslide Recognition 基于多尺度密集特征融合的黄土滑坡识别
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10138001
Kaiyue Sun, Qiaoming Li, Wenlong Wang, P. Zhang, Zhantu Li, Xingnan Zhao, Zeqi Li
Loess landslide geological disasters are widely distributed in Northwest China, but there are few relevant attention and researches. Landslide recognition can provide information help for landslide disaster management and risk management. Previous works of landslide recognition of remote sensing images based on deep learning, due to the lack of high resolution multi-source datasets, the boundary of landslide recognition is missing and not obvious and the identification accuracy is not ideal. In this work, a multi-scale dense feature fusion loess landslide recognition network (MDFF) was proposed and an open dataset of loess landslide samples (MSLLD) based on GF-2 images and DEM was constructed, which has spectral and topographic information. The MDFF network retains different levels of features by means of dense connection mechanism to make up for the loss of detailed features, the dense connected dilated convolution layer is introduced into the network to capture the different scale features of landslide images, expand the receptive field and avoid convolution degradation. When testing different networks on MSLLD, the proposed network achieves the most advanced performance, mIoU and F1-score were 82.31 % and 84.59% respectively, indicating that the proposed network can effectively recognize landslides, which is of great value for the investigation and analysis of loess landslide disasters.
黄土滑坡地质灾害在西北地区分布广泛,但相关的关注和研究却很少。滑坡识别可以为滑坡灾害管理和风险管理提供信息帮助。以往基于深度学习的滑坡遥感图像识别工作,由于缺乏高分辨率多源数据集,滑坡识别边界缺失且不明显,识别精度不理想。本文提出了一种多尺度密集特征融合的黄土滑坡识别网络(MDFF),并基于GF-2图像和DEM构建了具有光谱和地形信息的黄土滑坡样本开放数据集(MSLLD)。MDFF网络通过密集连接机制保留不同层次的特征,弥补细节特征的缺失,在网络中引入密集连接的扩展卷积层,捕捉滑坡图像的不同尺度特征,扩大接收野,避免卷积退化。在MSLLD上对不同网络进行测试时,所提网络的性能最先进,mIoU和f1得分分别为82.31%和84.59%,表明所提网络能够有效识别滑坡,对黄土滑坡灾害的调查分析具有重要价值。
{"title":"Multi-Scale Dense Feature Fusion Based Loess Landslide Recognition","authors":"Kaiyue Sun, Qiaoming Li, Wenlong Wang, P. Zhang, Zhantu Li, Xingnan Zhao, Zeqi Li","doi":"10.1109/ACAIT56212.2022.10138001","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10138001","url":null,"abstract":"Loess landslide geological disasters are widely distributed in Northwest China, but there are few relevant attention and researches. Landslide recognition can provide information help for landslide disaster management and risk management. Previous works of landslide recognition of remote sensing images based on deep learning, due to the lack of high resolution multi-source datasets, the boundary of landslide recognition is missing and not obvious and the identification accuracy is not ideal. In this work, a multi-scale dense feature fusion loess landslide recognition network (MDFF) was proposed and an open dataset of loess landslide samples (MSLLD) based on GF-2 images and DEM was constructed, which has spectral and topographic information. The MDFF network retains different levels of features by means of dense connection mechanism to make up for the loss of detailed features, the dense connected dilated convolution layer is introduced into the network to capture the different scale features of landslide images, expand the receptive field and avoid convolution degradation. When testing different networks on MSLLD, the proposed network achieves the most advanced performance, mIoU and F1-score were 82.31 % and 84.59% respectively, indicating that the proposed network can effectively recognize landslides, which is of great value for the investigation and analysis of loess landslide disasters.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133298458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IPGD: A Dataset for Robotic Inside-Propped Grasp Detection IPGD:机器人内支撑抓取检测数据集
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137845
Xuefeng Liu, Guangjian Zhang
Grasping skills are the basic skills required by robots in many practical applications. Recent research on robotic grasping detection generally focuses on grasping poses similar to human grasping. However, this grasping pose is not suitable for all grasping scenarios in practical applications. Therefore, this paper uses a new inside-propped grasping pose to label a large number of images with inside-propped grasping potential. In this way, an inside-propped grasp dataset is completed. Based on this dataset, this paper constructs a generative deep neural network for the inside-propped grasping prediction. The experimental results show that the success rate of the inside-propped grasping prediction network is 65.59%, and the average prediction time is 82ms, which has achieved good results in accuracy and real-time performance.
抓取技能是机器人在许多实际应用中需要掌握的基本技能。近年来对机器人抓取检测的研究主要集中在类似人类抓取的抓取姿态上。然而,这种抓取姿势并不适用于实际应用中的所有抓取场景。因此,本文采用一种新的内支撑抓取姿态对大量具有内支撑抓取势的图像进行标注。这样,就完成了一个内部支撑的抓取数据集。在此基础上,构建了生成式深度神经网络进行内支撑抓取预测。实验结果表明,内支撑抓取预测网络的成功率为65.59%,平均预测时间为82ms,在准确率和实时性方面都取得了较好的效果。
{"title":"IPGD: A Dataset for Robotic Inside-Propped Grasp Detection","authors":"Xuefeng Liu, Guangjian Zhang","doi":"10.1109/ACAIT56212.2022.10137845","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137845","url":null,"abstract":"Grasping skills are the basic skills required by robots in many practical applications. Recent research on robotic grasping detection generally focuses on grasping poses similar to human grasping. However, this grasping pose is not suitable for all grasping scenarios in practical applications. Therefore, this paper uses a new inside-propped grasping pose to label a large number of images with inside-propped grasping potential. In this way, an inside-propped grasp dataset is completed. Based on this dataset, this paper constructs a generative deep neural network for the inside-propped grasping prediction. The experimental results show that the success rate of the inside-propped grasping prediction network is 65.59%, and the average prediction time is 82ms, which has achieved good results in accuracy and real-time performance.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134265270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic SLAM Based on Compensated Segmentation and Geometric Constraints in Dynamic Environments 动态环境下基于补偿分割和几何约束的语义SLAM
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137941
Baofu Fang, Shuai Zhou, Hao Wang
Most of the existing slam algorithms are designed based on the assumption of a static environment, this strong assumption limits the practical application of most slam systems. The main reason is that moving objects will cause feature mismatch in the pose estimation process, which in turn affects the accuracy of localization and mapping. In this paper, we propose a SLAM algorithm in a dynamic environment. First, we use the BlendMask network to detect potential moving objects to generate masks for dynamic objects. The geometrically constrained joint optical flow method is used to detect dynamic feature points. Secondly, aiming at the failure of semantic segmentation network segmentation, a missed detection compensation algorithm based on the invariance of adjacent frame speed is proposed. Finally, a keyframe selection strategy is proposed to construct a semantic octree graph containing only static objects. We evaluate our algorithm on TUM RGB-D and real scene datasets. The experimental results show that the algorithm has high accuracy and real-time performance.
现有的slam算法大多是基于静态环境的假设来设计的,这种强假设限制了大多数slam系统的实际应用。主要原因是运动物体在姿态估计过程中会引起特征失配,进而影响定位和映射的精度。本文提出了一种动态环境下的SLAM算法。首先,我们使用BlendMask网络检测潜在的移动对象,为动态对象生成蒙版。采用几何约束联合光流法检测动态特征点。其次,针对语义分割网络分割失败的问题,提出了一种基于相邻帧速度不变性的缺失检测补偿算法。最后,提出了一种关键帧选择策略来构造一个只包含静态对象的语义八叉树图。我们在TUM RGB-D和真实场景数据集上评估了我们的算法。实验结果表明,该算法具有较高的精度和实时性。
{"title":"Semantic SLAM Based on Compensated Segmentation and Geometric Constraints in Dynamic Environments","authors":"Baofu Fang, Shuai Zhou, Hao Wang","doi":"10.1109/ACAIT56212.2022.10137941","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137941","url":null,"abstract":"Most of the existing slam algorithms are designed based on the assumption of a static environment, this strong assumption limits the practical application of most slam systems. The main reason is that moving objects will cause feature mismatch in the pose estimation process, which in turn affects the accuracy of localization and mapping. In this paper, we propose a SLAM algorithm in a dynamic environment. First, we use the BlendMask network to detect potential moving objects to generate masks for dynamic objects. The geometrically constrained joint optical flow method is used to detect dynamic feature points. Secondly, aiming at the failure of semantic segmentation network segmentation, a missed detection compensation algorithm based on the invariance of adjacent frame speed is proposed. Finally, a keyframe selection strategy is proposed to construct a semantic octree graph containing only static objects. We evaluate our algorithm on TUM RGB-D and real scene datasets. The experimental results show that the algorithm has high accuracy and real-time performance.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125332183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis and Research on Electric Heating Risk Early Warning Based on Embedded Feature Selection and DBSCAN Adaptive Clustering 基于嵌入式特征选择和DBSCAN自适应聚类的电加热风险预警分析与研究
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137835
Hui Xu, Lu Zhang, Longfei Ma, Xianglong Li, Siyue Lu, Shaokun Chen, Yifeng Ding, Wenbin Zhou
With the gradual expansion of the scale of”coal to electricity” users, the number of complaints about the heating effect, electricity safety and heating equipment safety guarantee in the clean heating operation process continues increasing. It is not possible to warn of possible emergencies in advance, and can be only remedied afterwards, completely in a passive response state. Therefore, rapid and accurate positioning of the key link is an urgent problem to be solved, but also the key to improve user satisfaction. Aimed at above problems, this paper established an automatic, information and intelligent electric heating risk warning mechanism. Based on the embedded feature selection algorithm and the DBSCAN adaptive clustering algorithm, a standardized vocabulary of customer appeals and complaint topics were constructed, combined with user historical electricity consumption data, and through the monitoring and matching of risk topics, an early warning model of customer electric heating abnormal risks was established. The model proposed in the article has strong practicability and provides strong support for lean management on the grid side, precise positioning of problems on the operation and maintenance side, government-side management decisionmaking and user satisfaction, and can promote safe, reliable and economical operation of the grid.
随着“煤改电”用户规模的逐步扩大,清洁供热运行过程中关于供热效果、用电安全、供热设备安全保障等方面的投诉不断增多。对可能发生的突发事件无法提前预警,只能事后补救,完全处于被动应对状态。因此,快速准确定位关键环节是亟待解决的问题,也是提高用户满意度的关键。针对上述问题,本文建立了自动、信息化、智能化的电加热风险预警机制。基于嵌入式特征选择算法和DBSCAN自适应聚类算法,构建了标准化的客户申诉和投诉主题词汇表,结合用户历史用电量数据,通过对风险主题的监测和匹配,建立了客户电加热异常风险预警模型。本文提出的模型具有较强的实用性,为电网侧精益管理、运维侧问题精准定位、政府侧管理决策和用户满意度提供有力支持,能够促进电网安全、可靠、经济运行。
{"title":"Analysis and Research on Electric Heating Risk Early Warning Based on Embedded Feature Selection and DBSCAN Adaptive Clustering","authors":"Hui Xu, Lu Zhang, Longfei Ma, Xianglong Li, Siyue Lu, Shaokun Chen, Yifeng Ding, Wenbin Zhou","doi":"10.1109/ACAIT56212.2022.10137835","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137835","url":null,"abstract":"With the gradual expansion of the scale of”coal to electricity” users, the number of complaints about the heating effect, electricity safety and heating equipment safety guarantee in the clean heating operation process continues increasing. It is not possible to warn of possible emergencies in advance, and can be only remedied afterwards, completely in a passive response state. Therefore, rapid and accurate positioning of the key link is an urgent problem to be solved, but also the key to improve user satisfaction. Aimed at above problems, this paper established an automatic, information and intelligent electric heating risk warning mechanism. Based on the embedded feature selection algorithm and the DBSCAN adaptive clustering algorithm, a standardized vocabulary of customer appeals and complaint topics were constructed, combined with user historical electricity consumption data, and through the monitoring and matching of risk topics, an early warning model of customer electric heating abnormal risks was established. The model proposed in the article has strong practicability and provides strong support for lean management on the grid side, precise positioning of problems on the operation and maintenance side, government-side management decisionmaking and user satisfaction, and can promote safe, reliable and economical operation of the grid.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132522586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adversarial Meta Learning Improves Low-Resource Speech Recognition 对抗性元学习改进低资源语音识别
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137854
Yaqi Chen, Dan Qu, Wenlin Zhang, Fen Yu, Haotong Zhang, Xukui Yang
Low-resource automatic speech recognition is a chal- lenging task. To solve this issue, multilingual meta-learning learns a better model initialization from many source language tasks, allowing for rapid adaption to the target language. However, due to the lack of limitations on multilingual pre-training, the shared semantic space of different languages is difficult to learn. In this work, we propose an adversarial meta-learning training approach to solve this problem. By using the adversarial auxiliary aim of language identification in the meta-learning algorithm, it will guide the model encoder to generate language-independent embedding features, which can improve model generalization. And we use Wasserstein distance and temporal normalization to optimize our adversarial training, making the training more stable and easier. The approach is evaluated on the IARPA BABEL. The results reveal that our approach only requires half as many meta learning training epochs to attain comparable multilingual pre-training performance. It also outperforms the meta learning in all target languages fine-tuning and achieves comparable performance in small data scales. Specially, it can reduce CER from 71% to 62% with fine-tuning 25% of Vietnamese data. Finally, we show why our approach is superior than others by using t-SNE.
低资源自动语音识别是一项具有挑战性的任务。为了解决这个问题,多语言元学习从许多源语言任务中学习更好的模型初始化,从而允许快速适应目标语言。然而,由于多语言预训练缺乏局限性,不同语言之间的共享语义空间难以学习。在这项工作中,我们提出了一种对抗性元学习训练方法来解决这个问题。在元学习算法中利用语言识别的对抗性辅助目标,引导模型编码器生成与语言无关的嵌入特征,提高模型泛化能力。我们使用Wasserstein距离和时间归一化来优化我们的对抗性训练,使训练更加稳定和简单。该方法在IARPA BABEL上进行了评估。结果表明,我们的方法只需要一半的元学习训练时间就可以获得相当的多语言预训练性能。它在所有目标语言的微调中都优于元学习,并且在小数据规模上也达到了相当的性能。特别是,它可以通过微调25%的越南数据将CER从71%降低到62%。最后,我们通过使用t-SNE来说明为什么我们的方法优于其他方法。
{"title":"Adversarial Meta Learning Improves Low-Resource Speech Recognition","authors":"Yaqi Chen, Dan Qu, Wenlin Zhang, Fen Yu, Haotong Zhang, Xukui Yang","doi":"10.1109/ACAIT56212.2022.10137854","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137854","url":null,"abstract":"Low-resource automatic speech recognition is a chal- lenging task. To solve this issue, multilingual meta-learning learns a better model initialization from many source language tasks, allowing for rapid adaption to the target language. However, due to the lack of limitations on multilingual pre-training, the shared semantic space of different languages is difficult to learn. In this work, we propose an adversarial meta-learning training approach to solve this problem. By using the adversarial auxiliary aim of language identification in the meta-learning algorithm, it will guide the model encoder to generate language-independent embedding features, which can improve model generalization. And we use Wasserstein distance and temporal normalization to optimize our adversarial training, making the training more stable and easier. The approach is evaluated on the IARPA BABEL. The results reveal that our approach only requires half as many meta learning training epochs to attain comparable multilingual pre-training performance. It also outperforms the meta learning in all target languages fine-tuning and achieves comparable performance in small data scales. Specially, it can reduce CER from 71% to 62% with fine-tuning 25% of Vietnamese data. Finally, we show why our approach is superior than others by using t-SNE.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133881718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Object Tracking Method Combined with Lightweight Hybrid Attention Siamese Network 结合轻量级混合注意力连体网络的目标跟踪方法
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137999
Ruoyu Lou, Wu Yang, Yingjiang Li, Ling Lu
Aiming at the problem that the target tracking method of deep learning has a large number of model parameters and insufficient real-time performance, it is difficult to apply to mobile terminals or embedded devices with insufficient computing power. A lightweight hybrid attention-based twin network tracking algorithm is proposed. Firstly, based on MobileNetv3-Large network, group convolution and channel rearrangement are performed; then, in view of the problem that traditional attention mechanism only considers a single scope, this paper proposes a lightweight group-gated mixed attention (Group-gated mixed attention, GG); finally, GG is embedded in the Siamese network structure of this paper and the hierarchical feature fusion strategy is used to improve the tracking accuracy. Experiments show that the parameters of the proposed GG decrease by 26.2% compared with CBAM, decrease by 6.50% compared with SE, and increase Top-1 by 2.59% and 2.68% respectively; the experiments on the OTB100 and VOT2018 datasets demonstrate that the proposed algorithm is comparable to traditional tracking Compared with the algorithm, the accuracy and real-time performance have great advantages.
针对深度学习的目标跟踪方法模型参数较多,实时性不足的问题,难以应用于计算能力不足的移动终端或嵌入式设备。提出了一种轻量级的基于混合注意力的双网络跟踪算法。首先,基于MobileNetv3-Large网络,进行群卷积和信道重排;然后,针对传统注意机制只考虑单一范围的问题,提出了一种轻量级的群控混合注意(group-gated mixed attention, GG);最后,将GG嵌入到本文的Siamese网络结构中,并采用分层特征融合策略提高跟踪精度。实验表明,所提GG的参数比CBAM降低26.2%,比SE降低6.50%,Top-1分别提高2.59%和2.68%;在OTB100和VOT2018数据集上的实验表明,该算法与传统的跟踪算法相比,精度和实时性都有很大的优势。
{"title":"Object Tracking Method Combined with Lightweight Hybrid Attention Siamese Network","authors":"Ruoyu Lou, Wu Yang, Yingjiang Li, Ling Lu","doi":"10.1109/ACAIT56212.2022.10137999","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137999","url":null,"abstract":"Aiming at the problem that the target tracking method of deep learning has a large number of model parameters and insufficient real-time performance, it is difficult to apply to mobile terminals or embedded devices with insufficient computing power. A lightweight hybrid attention-based twin network tracking algorithm is proposed. Firstly, based on MobileNetv3-Large network, group convolution and channel rearrangement are performed; then, in view of the problem that traditional attention mechanism only considers a single scope, this paper proposes a lightweight group-gated mixed attention (Group-gated mixed attention, GG); finally, GG is embedded in the Siamese network structure of this paper and the hierarchical feature fusion strategy is used to improve the tracking accuracy. Experiments show that the parameters of the proposed GG decrease by 26.2% compared with CBAM, decrease by 6.50% compared with SE, and increase Top-1 by 2.59% and 2.68% respectively; the experiments on the OTB100 and VOT2018 datasets demonstrate that the proposed algorithm is comparable to traditional tracking Compared with the algorithm, the accuracy and real-time performance have great advantages.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133565316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Identification of Financial Abnormal Fluctuations in Pledged Repurchase Transactions Based on Machine Learning 基于机器学习的质押回购交易中财务异常波动识别研究
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137951
Zhijian Xu
In order to improve the financial evaluation ability of pledged repo transactions, a method of identifying abnormal financial fluctuations of pledged repo transactions based on machine learning is proposed. Using the method of market risk identification, the pledge risk index system evaluation model for the financial evaluation of pledge type repo transactions is constructed. The balance of the capital flow channel of the pledge type repo financial system is controlled by using machine learning algorithm. Combined with machine learning to extract the abnormal fluctuation characteristics of the pledge type repo financial system, the fuzzy classification learning model of the data structure of the pledge type repo financial system is constructed. Spatial resampling method is used to reconstruct the abnormal financial volatility of pledge repurchase transactions and mining association rules. Clustering and matching the abnormal feature spectrum of the structural data of the financial system of pledge repurchase transactions by using machine learning algorithms. The model adopts the evaluation method of fluctuation synergy parameter. An adaptive learning algorithm is used to identify the abnormal financial fluctuations of pledge repurchase transactions. The simulation results show that this method has good clustering characteristics in identifying the abnormal financial fluctuations of pledge type repo transactions, effectively reducing the capital loss of the financial system structure of pledge type repo transactions, and improving the risk management ability.
为了提高质押回购交易的财务评估能力,提出了一种基于机器学习的质押回购交易异常财务波动识别方法。运用市场风险识别的方法,构建质押型回购交易财务评价的质押风险指标体系评价模型。利用机器学习算法控制质押式回购金融系统资金流动通道的平衡。结合机器学习提取质押式回购金融系统的异常波动特征,构建质押式回购金融系统数据结构的模糊分类学习模型。利用空间重采样方法重构质押回购交易的异常金融波动,挖掘关联规则。利用机器学习算法对质押回购交易金融系统结构数据的异常特征谱进行聚类和匹配。该模型采用波动协同参数评价方法。采用自适应学习算法识别质押回购交易中的异常财务波动。仿真结果表明,该方法在识别质押式回购交易异常金融波动方面具有良好的聚类特性,有效降低质押式回购交易金融体系结构的资金损失,提高风险管理能力。
{"title":"Research on Identification of Financial Abnormal Fluctuations in Pledged Repurchase Transactions Based on Machine Learning","authors":"Zhijian Xu","doi":"10.1109/ACAIT56212.2022.10137951","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137951","url":null,"abstract":"In order to improve the financial evaluation ability of pledged repo transactions, a method of identifying abnormal financial fluctuations of pledged repo transactions based on machine learning is proposed. Using the method of market risk identification, the pledge risk index system evaluation model for the financial evaluation of pledge type repo transactions is constructed. The balance of the capital flow channel of the pledge type repo financial system is controlled by using machine learning algorithm. Combined with machine learning to extract the abnormal fluctuation characteristics of the pledge type repo financial system, the fuzzy classification learning model of the data structure of the pledge type repo financial system is constructed. Spatial resampling method is used to reconstruct the abnormal financial volatility of pledge repurchase transactions and mining association rules. Clustering and matching the abnormal feature spectrum of the structural data of the financial system of pledge repurchase transactions by using machine learning algorithms. The model adopts the evaluation method of fluctuation synergy parameter. An adaptive learning algorithm is used to identify the abnormal financial fluctuations of pledge repurchase transactions. The simulation results show that this method has good clustering characteristics in identifying the abnormal financial fluctuations of pledge type repo transactions, effectively reducing the capital loss of the financial system structure of pledge type repo transactions, and improving the risk management ability.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117137966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Error Correction Method of Business English Translation Based on Convolutional Neural Network 基于卷积神经网络的商务英语翻译纠错方法
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137929
Dengyi Xiao
In order to correct business English translation errors, this paper puts forward a method of business English translation error correction based on convolutional neural network and English pronunciation feature recognition. The blind convolution network spectrum parameter detection method is used to detect the pronunciation spectrum features of business English translation, and the scalar time series of pronunciation output audio parameter sequence and translated text semantic feature sequence are established. Combined with the noise intensity detection and signal scale decomposition method of business English translation pronunciation audio time series, the detailed signal energy parameters of business English translation pronunciation audio time series are extracted, and the convolution neural network classification method is used to classify the features. The interference component of single audio feature sequence of English translation pronunciation is removed by high-frequency wavelet threshold detection, and the modulation and demodulation of single audio feature sequence of English translation pronunciation are realized by using translation dictionary set and semantic context matching. The spectral analysis and error correction model of business English translation pronunciation audio time series is established, and the output stability of business English translation pronunciation audio time series is detected by threshold detection on each scale. According to the difference between output signal and pronunciation standard signal, the accuracy of English translator is detected and identified. The simulation results show that the accuracy of business English translation error correction with this method is high, the detection performance is good, and the output accuracy of English translators is improved.
为了纠正商务英语翻译错误,本文提出了一种基于卷积神经网络和英语发音特征识别的商务英语翻译纠错方法。采用盲卷积网络频谱参数检测方法检测商务英语翻译的发音频谱特征,建立发音输出音频参数序列的标量时间序列和译文语义特征序列。结合商务英语翻译语音音频时间序列的噪声强度检测和信号尺度分解方法,提取商务英语翻译语音音频时间序列的详细信号能量参数,并采用卷积神经网络分类方法对特征进行分类。通过高频小波阈值检测去除英语翻译语音单音频特征序列的干扰分量,利用翻译字典集和语义上下文匹配实现英语翻译语音单音频特征序列的调制解调。建立了商务英语翻译语音音频时间序列的频谱分析与纠错模型,并在各尺度上通过阈值检测检测商务英语翻译语音音频时间序列的输出稳定性。根据输出信号与发音标准信号的差异,检测和识别英语翻译器的准确性。仿真结果表明,用该方法进行商务英语翻译纠错准确率高,检测性能好,提高了英语翻译人员的输出精度。
{"title":"Error Correction Method of Business English Translation Based on Convolutional Neural Network","authors":"Dengyi Xiao","doi":"10.1109/ACAIT56212.2022.10137929","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137929","url":null,"abstract":"In order to correct business English translation errors, this paper puts forward a method of business English translation error correction based on convolutional neural network and English pronunciation feature recognition. The blind convolution network spectrum parameter detection method is used to detect the pronunciation spectrum features of business English translation, and the scalar time series of pronunciation output audio parameter sequence and translated text semantic feature sequence are established. Combined with the noise intensity detection and signal scale decomposition method of business English translation pronunciation audio time series, the detailed signal energy parameters of business English translation pronunciation audio time series are extracted, and the convolution neural network classification method is used to classify the features. The interference component of single audio feature sequence of English translation pronunciation is removed by high-frequency wavelet threshold detection, and the modulation and demodulation of single audio feature sequence of English translation pronunciation are realized by using translation dictionary set and semantic context matching. The spectral analysis and error correction model of business English translation pronunciation audio time series is established, and the output stability of business English translation pronunciation audio time series is detected by threshold detection on each scale. According to the difference between output signal and pronunciation standard signal, the accuracy of English translator is detected and identified. The simulation results show that the accuracy of business English translation error correction with this method is high, the detection performance is good, and the output accuracy of English translators is improved.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114996089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1