首页 > 最新文献

Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms最新文献

英文 中文
Household Energy Consumption Prediction: A Deep Neuroevolution Approach 家庭能源消费预测:一种深度神经进化方法
Alexander Soudaei, Jianhua Zhang, Mohamed Elmi, Mikael Tsechoev, Zishan Khan, Ahmed Osman
Accurate energy consumption prediction can provide insights to make better informed decisions on energy purchase and generation. It also can prevent overloading and make it possible to store energy more efficiently. In this work, we propose a new deep learning model to predict the household energy consumption. In the new model, we employ differential evolution (DE) algorithm to automatically determine the optimal architecture of the deep neural network. The energy prediction results are presented and analyzed to show the effectiveness of the deep neuroevolution model constructed.
准确的能源消耗预测可以为在能源购买和发电方面做出更明智的决策提供见解。它还可以防止超载,使更有效地储存能量成为可能。在这项工作中,我们提出了一个新的深度学习模型来预测家庭能源消耗。在新模型中,我们采用差分进化(DE)算法自动确定深度神经网络的最优结构。最后给出了能量预测结果并对其进行了分析,验证了所构建的深度神经进化模型的有效性。
{"title":"Household Energy Consumption Prediction: A Deep Neuroevolution Approach","authors":"Alexander Soudaei, Jianhua Zhang, Mohamed Elmi, Mikael Tsechoev, Zishan Khan, Ahmed Osman","doi":"10.1145/3611450.3611474","DOIUrl":"https://doi.org/10.1145/3611450.3611474","url":null,"abstract":"Accurate energy consumption prediction can provide insights to make better informed decisions on energy purchase and generation. It also can prevent overloading and make it possible to store energy more efficiently. In this work, we propose a new deep learning model to predict the household energy consumption. In the new model, we employ differential evolution (DE) algorithm to automatically determine the optimal architecture of the deep neural network. The energy prediction results are presented and analyzed to show the effectiveness of the deep neuroevolution model constructed.","PeriodicalId":289906,"journal":{"name":"Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127487692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Control Method of SOFC-based DC Micro-grid to Avoid Fuel Starvation when External Load Power Increases 基于sofc的直流微电网外部负载增加时燃料短缺控制方法
Lin Zhang, Hongtu Xie, Shifei Li, Chengsheng Zhang, Di Zhang, Wenhui Tang, Zhaojian Zhang
At present, the direct current (DC) micro-grid based on the solid oxide fuel cell (SOFC) can supply the power to the external load independently. Despite an adequate and steady supply of the electricity to the external load, the high efficiency and avoiding fuel starvation is other points for the attention. In this paper, a control method of the SOFC-based DC micro-grid has been proposed, which can avoid the fuel starvation when the external load power increases This method adopts the optimal operating points (OOPs) to obtain the maximum efficiency, and then a novel time-delay control algorithm based on the system electric current is designed to avoid the fuel starvation. All simulation results demonstrate that the proposed method is feasible, which can effectively solve the fuel starvation problem. What's more, the output efficiency can be up to 40%, which can get the high efficiency of the power supply. The works in this paper can provide the reference for other similar systems to solve the fuel starvation problem.
目前,基于固体氧化物燃料电池(SOFC)的直流微电网可以独立向外部负载供电。尽管外部负载的充足和稳定的电力供应,高效率和避免燃料短缺是其他值得注意的点。本文提出了一种基于sofc的直流微电网的控制方法,该方法采用最优工作点(OOPs)来获得最大的效率,并设计了一种基于系统电流的新型延时控制算法,以避免外部负载功率增加时的燃料短缺。仿真结果表明,该方法是可行的,可以有效地解决燃油短缺问题。更重要的是,输出效率可达40%,可以获得高效率的电源。本文的工作可为其他类似系统解决缺油问题提供参考。
{"title":"A Control Method of SOFC-based DC Micro-grid to Avoid Fuel Starvation when External Load Power Increases","authors":"Lin Zhang, Hongtu Xie, Shifei Li, Chengsheng Zhang, Di Zhang, Wenhui Tang, Zhaojian Zhang","doi":"10.1145/3611450.3611455","DOIUrl":"https://doi.org/10.1145/3611450.3611455","url":null,"abstract":"At present, the direct current (DC) micro-grid based on the solid oxide fuel cell (SOFC) can supply the power to the external load independently. Despite an adequate and steady supply of the electricity to the external load, the high efficiency and avoiding fuel starvation is other points for the attention. In this paper, a control method of the SOFC-based DC micro-grid has been proposed, which can avoid the fuel starvation when the external load power increases This method adopts the optimal operating points (OOPs) to obtain the maximum efficiency, and then a novel time-delay control algorithm based on the system electric current is designed to avoid the fuel starvation. All simulation results demonstrate that the proposed method is feasible, which can effectively solve the fuel starvation problem. What's more, the output efficiency can be up to 40%, which can get the high efficiency of the power supply. The works in this paper can provide the reference for other similar systems to solve the fuel starvation problem.","PeriodicalId":289906,"journal":{"name":"Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125451322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A solution of TSP based on the improved ant colony optimization 基于改进蚁群优化的TSP求解方法
Hengyu Nie, Meijuan Li, X. Chen, Zaihui Cui
In recent years, self-driving delivery vehicles have been used more and more widely. The route planning of courier vehicles can be abstracted as the traveling salesman problem (TSP). For the courier vehicle path optimization problem, an improved population-based ant colony optimization algorithm (IPACO) is proposed. The ant colony optimization algorithm (ACO) is a swarm intelligent bionic algorithm with the advantages of positive feedback, robustness, and easy combination with other algorithms, but it also has the problems of low solution accuracy and easy to fall into local optimality. In order to avoid these problems, the 2-opt local optimization operator is combined in the algorithm search process to improve the diversity of the population. In addition, the property that the simulated annealing algorithm probabilistically accepts relatively poor solutions is used to optimize the optimal ants during the iterative process. Finally, some TSPLIB examples are selected to verify the performance of the algorithm, and the fast adaptation capability of the algorithm under the change of path node weights is verified by simulation.
近年来,自动驾驶送货车辆的应用越来越广泛。快递车辆的路线规划问题可以抽象为旅行商问题(TSP)。针对快递车辆路径优化问题,提出了一种改进的基于群体的蚁群优化算法(IPACO)。蚁群优化算法(ant colony optimization algorithm, ACO)是一种群体智能仿生算法,具有正反馈、鲁棒性好、易于与其他算法结合等优点,但也存在求解精度低、容易陷入局部最优的问题。为了避免这些问题,在算法的搜索过程中结合了2-opt局部优化算子,提高了种群的多样性。此外,利用模拟退火算法概率接受较差解的特性,在迭代过程中对最优蚂蚁进行优化。最后,选取一些TSPLIB实例验证了算法的性能,并通过仿真验证了算法在路径节点权值变化下的快速自适应能力。
{"title":"A solution of TSP based on the improved ant colony optimization","authors":"Hengyu Nie, Meijuan Li, X. Chen, Zaihui Cui","doi":"10.1145/3611450.3611465","DOIUrl":"https://doi.org/10.1145/3611450.3611465","url":null,"abstract":"In recent years, self-driving delivery vehicles have been used more and more widely. The route planning of courier vehicles can be abstracted as the traveling salesman problem (TSP). For the courier vehicle path optimization problem, an improved population-based ant colony optimization algorithm (IPACO) is proposed. The ant colony optimization algorithm (ACO) is a swarm intelligent bionic algorithm with the advantages of positive feedback, robustness, and easy combination with other algorithms, but it also has the problems of low solution accuracy and easy to fall into local optimality. In order to avoid these problems, the 2-opt local optimization operator is combined in the algorithm search process to improve the diversity of the population. In addition, the property that the simulated annealing algorithm probabilistically accepts relatively poor solutions is used to optimize the optimal ants during the iterative process. Finally, some TSPLIB examples are selected to verify the performance of the algorithm, and the fast adaptation capability of the algorithm under the change of path node weights is verified by simulation.","PeriodicalId":289906,"journal":{"name":"Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126278946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Evidential Classifier with Multiple Pre-trained Language Models for Nested Named Entity Recognition 基于多个预训练语言模型的嵌套命名实体识别证据分类器
Haitao Liu, Jihua Song, Weiming Peng
Nested named entity recognition (NER) is an important and challenging task in information extraction. One effective approach is to detect regions in sentences that are later classified by neural networks. Since pre-trained language models (PLMs) were proposed, nested NER models have benefited a lot from them. However, it is common that only one PLM is utilized for a given model, and the performance varies with different PLMs. We note that there exist some conflicting predictions which lead to the final variation. Thus, there is still room for investigation as to whether a model could achieve even better performance by conducting a comprehensive analysis of results from various PLMs. In this paper, we propose an evidential classifier with multiple PLMs for nested NER. First, the well-known deep exhaustive model is trained separately with different PLMs, whose predictions are then treated as pieces of evidence that can be represented in the framework of Dempster-Shafer theory. Finally, the pooled evidence is obtained using a combination rule, based on which the inference is performed. Experiments are conducted on the GENIA dataset, and detailed analysis demonstrates the merits of our model.
嵌套命名实体识别(NER)是信息抽取中的一项重要且具有挑战性的任务。一种有效的方法是检测句子中的区域,然后用神经网络进行分类。自预训练语言模型(plm)提出以来,嵌套NER模型从中获益良多。然而,对于给定的模型,通常只使用一个PLM,并且性能随不同的PLM而变化。我们注意到,存在一些相互矛盾的预测,导致最终的变化。因此,对于一个模型是否可以通过对各种plm的结果进行综合分析来获得更好的性能,仍然有研究的余地。在本文中,我们提出了一个具有多个plm的证据分类器用于嵌套NER。首先,用不同的plm分别训练著名的深度穷举模型,然后将其预测作为可以在Dempster-Shafer理论框架中表示的证据片段。最后,使用组合规则获得汇集的证据,并在此基础上进行推理。在GENIA数据集上进行了实验,详细的分析证明了该模型的优点。
{"title":"An Evidential Classifier with Multiple Pre-trained Language Models for Nested Named Entity Recognition","authors":"Haitao Liu, Jihua Song, Weiming Peng","doi":"10.1145/3611450.3611477","DOIUrl":"https://doi.org/10.1145/3611450.3611477","url":null,"abstract":"Nested named entity recognition (NER) is an important and challenging task in information extraction. One effective approach is to detect regions in sentences that are later classified by neural networks. Since pre-trained language models (PLMs) were proposed, nested NER models have benefited a lot from them. However, it is common that only one PLM is utilized for a given model, and the performance varies with different PLMs. We note that there exist some conflicting predictions which lead to the final variation. Thus, there is still room for investigation as to whether a model could achieve even better performance by conducting a comprehensive analysis of results from various PLMs. In this paper, we propose an evidential classifier with multiple PLMs for nested NER. First, the well-known deep exhaustive model is trained separately with different PLMs, whose predictions are then treated as pieces of evidence that can be represented in the framework of Dempster-Shafer theory. Finally, the pooled evidence is obtained using a combination rule, based on which the inference is performed. Experiments are conducted on the GENIA dataset, and detailed analysis demonstrates the merits of our model.","PeriodicalId":289906,"journal":{"name":"Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms","volume":"204 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130346736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Word-Constrained Response Generation for International Chinese Language Education based on Decoder Backward Attention 基于解码器后向注意的国际汉语教学词约束反应生成
Jingbo Sun, T. Song, Haitao Liu, Weiming Peng, Jihua Song
Dialogue systems are a valuable technology in the field of natural language processing to improve work, learning, and daily life. Currently, dialogue systems are employed as an educational technology for mentoring, evaluation, and personalized learning. To make dialogue teaching achieve the purpose of training vocabulary at the primary level of international Chinese learning education, we first collect the entire dialogue corpus from textbooks to create the dataset, and then we propose a dialogue response generation model, Seq2BF-Attention, containing a specific word based on the sequence to backward and forward sequences framework by adding an attention to enhance the modeling of dialogue posts. We also provide two decoder connection strategies, backward hidden connection and backward attention, to handle the problems of not sharing parameters and incoherent generation separately. It has been experimentally proven that our suggested models perform well in both the ICLE and Weibo datasets across all metrics.
对话系统是自然语言处理领域的一项有价值的技术,可以改善工作、学习和日常生活。目前,对话系统被用作指导、评估和个性化学习的教育技术。为了使对话教学达到国际汉语学习教育初级阶段词汇训练的目的,我们首先从教科书中收集完整的对话语料库来创建数据集,然后我们提出了一个对话响应生成模型Seq2BF-Attention,该模型包含了一个特定的基于序列到向后和向前序列的框架,通过添加一个关注来增强对话帖子的建模。我们还提供了向后隐藏连接和向后注意两种解码器连接策略,分别解决了参数不共享和不相干生成的问题。实验证明,我们建议的模型在ICLE和微博数据集的所有指标上都表现良好。
{"title":"Word-Constrained Response Generation for International Chinese Language Education based on Decoder Backward Attention","authors":"Jingbo Sun, T. Song, Haitao Liu, Weiming Peng, Jihua Song","doi":"10.1145/3611450.3611478","DOIUrl":"https://doi.org/10.1145/3611450.3611478","url":null,"abstract":"Dialogue systems are a valuable technology in the field of natural language processing to improve work, learning, and daily life. Currently, dialogue systems are employed as an educational technology for mentoring, evaluation, and personalized learning. To make dialogue teaching achieve the purpose of training vocabulary at the primary level of international Chinese learning education, we first collect the entire dialogue corpus from textbooks to create the dataset, and then we propose a dialogue response generation model, Seq2BF-Attention, containing a specific word based on the sequence to backward and forward sequences framework by adding an attention to enhance the modeling of dialogue posts. We also provide two decoder connection strategies, backward hidden connection and backward attention, to handle the problems of not sharing parameters and incoherent generation separately. It has been experimentally proven that our suggested models perform well in both the ICLE and Weibo datasets across all metrics.","PeriodicalId":289906,"journal":{"name":"Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121287095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Study on the fault diagnosis method of ship main engine unbalanced data based on improved DQN 基于改进DQN的船舶主机不平衡数据故障诊断方法研究
Meiwen Wang, Hui Cao, Guozhong Li
The Deep Q_Network (DQN) algorithm in reinforcement learning is introduced to main engine fault diagnosis to improve the accuracy and efficiency of fault diagnosis by using the optimized DQN network algorithm to compensate for the lack of data imbalance for unbalanced fault data that are close to the real situation. The optimization of the DQN network algorithm is reflected in three aspects: firstly, the ɛ-greedy algorithm is optimized using the Upper Confidence Bound (UCB) algorithm, which makes the algorithm achieve a better balance between experience and exploratory in the selection of fault types; secondly, the fully connected network of the basic DQN is optimized using the triple-formed layer CNN network layer is optimized to improve the algorithm operation efficiency; meanwhile, the reward function for unbalanced data is set according to the balance rate, and the problem of reward value bias and local optimum for small amount of data is considered, so that the optimized DQN network algorithm gets improved accuracy in fault diagnosis of unbalanced data. Finally, the optimized DQN network, the base DQN network, the DCNN, and the ResNet18 are run for diagnosis on the unbalanced data set. Compared with other algorithmic networks, the optimized DQN improved 5.18%∼18.58% in accuracy. The results show that the DQN algorithm model can be applied with main engine unbalanced data fault diagnosis, and the improved DQN algorithm achieves good results in the efficiency and stability of diagnosis.
将强化学习中的深度Q_Network (Deep Q_Network, DQN)算法引入到主机故障诊断中,利用优化后的DQN网络算法对不平衡的故障数据进行补偿,使不平衡的故障数据更接近真实情况,从而提高故障诊断的准确性和效率。DQN网络算法的优化体现在三个方面:首先,利用上置信度界(Upper Confidence Bound, UCB)算法对算法进行优化,使算法在故障类型选择上更好地平衡了经验与探索性;其次,对基本DQN的全连接网络进行三层优化,对CNN网络层进行优化,提高算法运行效率;同时,根据平衡率设置非平衡数据的奖励函数,并考虑了奖励值偏差和小数据局部最优的问题,使得优化后的DQN网络算法在非平衡数据故障诊断中的准确率得到了提高。最后,运行优化后的DQN网络、基本DQN网络、DCNN和ResNet18对不平衡数据集进行诊断。与其他算法网络相比,优化后的DQN的准确率提高了5.18% ~ 18.58%。结果表明,DQN算法模型可用于主机不平衡数据故障诊断,改进后的DQN算法在诊断效率和稳定性方面取得了较好的效果。
{"title":"Study on the fault diagnosis method of ship main engine unbalanced data based on improved DQN","authors":"Meiwen Wang, Hui Cao, Guozhong Li","doi":"10.1145/3611450.3611453","DOIUrl":"https://doi.org/10.1145/3611450.3611453","url":null,"abstract":"The Deep Q_Network (DQN) algorithm in reinforcement learning is introduced to main engine fault diagnosis to improve the accuracy and efficiency of fault diagnosis by using the optimized DQN network algorithm to compensate for the lack of data imbalance for unbalanced fault data that are close to the real situation. The optimization of the DQN network algorithm is reflected in three aspects: firstly, the ɛ-greedy algorithm is optimized using the Upper Confidence Bound (UCB) algorithm, which makes the algorithm achieve a better balance between experience and exploratory in the selection of fault types; secondly, the fully connected network of the basic DQN is optimized using the triple-formed layer CNN network layer is optimized to improve the algorithm operation efficiency; meanwhile, the reward function for unbalanced data is set according to the balance rate, and the problem of reward value bias and local optimum for small amount of data is considered, so that the optimized DQN network algorithm gets improved accuracy in fault diagnosis of unbalanced data. Finally, the optimized DQN network, the base DQN network, the DCNN, and the ResNet18 are run for diagnosis on the unbalanced data set. Compared with other algorithmic networks, the optimized DQN improved 5.18%∼18.58% in accuracy. The results show that the DQN algorithm model can be applied with main engine unbalanced data fault diagnosis, and the improved DQN algorithm achieves good results in the efficiency and stability of diagnosis.","PeriodicalId":289906,"journal":{"name":"Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117081856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning-based End-to-End Address Recognition Solution on Chinese Courier Order Forms 基于深度学习的中文快递订单端到端地址识别解决方案
Jiayi Zhang, Yue Liu
The courier industry in China has grown quickly due to the rise of online shopping. However, courier notes can unavoidably become smudged or damaged during the delivery process, making it difficult to read the printed Chinese address information or recognize the barcodes. To solve this problem, this paper proposes an end-to-end solution to recognize damaged Chinese addresses: the CRNN model is trained for address recognition for damaged Chinese courier orders using a large Chinese address dataset generated via data augmentation and manual collection. And an address association algorithm is proposed to reduce the recognition errors at the provincial and municipal levels of the addresses. By applying this algorithm, the final accuracy is increased by 2% to 98.7%.
由于网上购物的兴起,中国的快递业发展迅速。然而,快递单在递送过程中不可避免地会被弄脏或损坏,使打印的中文地址信息难以阅读或识别条形码。为了解决这一问题,本文提出了一种端到端识别损坏中文地址的解决方案:使用通过数据增强和人工收集生成的大型中文地址数据集训练CRNN模型来识别损坏的中文快递订单的地址。并提出了一种地址关联算法,以减少省、市级地址的识别误差。应用该算法,最终准确率提高了2%,达到98.7%。
{"title":"Deep Learning-based End-to-End Address Recognition Solution on Chinese Courier Order Forms","authors":"Jiayi Zhang, Yue Liu","doi":"10.1145/3611450.3611473","DOIUrl":"https://doi.org/10.1145/3611450.3611473","url":null,"abstract":"The courier industry in China has grown quickly due to the rise of online shopping. However, courier notes can unavoidably become smudged or damaged during the delivery process, making it difficult to read the printed Chinese address information or recognize the barcodes. To solve this problem, this paper proposes an end-to-end solution to recognize damaged Chinese addresses: the CRNN model is trained for address recognition for damaged Chinese courier orders using a large Chinese address dataset generated via data augmentation and manual collection. And an address association algorithm is proposed to reduce the recognition errors at the provincial and municipal levels of the addresses. By applying this algorithm, the final accuracy is increased by 2% to 98.7%.","PeriodicalId":289906,"journal":{"name":"Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125815138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Feature Cross-Lingual Transfer Learning Approach for Low-Resource Vietnamese Speech Synthesis 低资源越南语语音合成的多特征跨语言迁移学习方法
Zhi Qiao, Jian Yang, Zhan Wang
Abstract—Based on neural network end-to-end speech synthesis systems, high-quality speech can be synthesized when there is sufficient training data. However, it is difficult for languages with small datasets to synthesize speech with high quality and naturalness. Vietnamese is a tonal language, belonging to the Vietic branch of the Austroasiatic language family, which uses a spelling system. To improve the quality and naturalness of speech synthesis with limited dataset resources, we first use transfer learning to improve the acoustic model of Vietnamese by leveraging the similarities in pronunciation and grammar between Mandarin Chinese and Vietnamese. Secondly, based on the prosodic characteristics of Vietnamese, we use a "speech-text" alignment tool to extract prosodic boundary information and supplement it to the training text sequence. Using FastSpeech2 as the experimental baseline system, we designed and added a prosody embedding layer. The experimental results show that the model trained with prosodic markers has better prosody expression compared to the original text. Furthermore, compared to the baseline system, adding the prosody embedding layer improved the prosody expression of the synthesized speech and eliminated the need for marked text during speech synthesis.
摘要基于神经网络的端到端语音合成系统,在有足够训练数据的情况下可以合成高质量的语音。然而,对于小数据集的语言来说,很难合成高质量和自然的语音。越南语是一种声调语言,属于南亚语系的越南语分支,使用拼写系统。为了在有限的数据集资源下提高语音合成的质量和自然度,我们首先利用迁移学习来改进越南语的声学模型,利用普通话和越南语在发音和语法上的相似性。其次,根据越南语的韵律特征,使用“语音-文本”对齐工具提取韵律边界信息,并将其补充到训练文本序列中。以FastSpeech2为实验基准系统,设计并添加韵律嵌入层。实验结果表明,使用韵律标记训练的模型比原始文本具有更好的韵律表达。此外,与基线系统相比,加入韵律嵌入层改善了合成语音的韵律表达,消除了语音合成过程中对标记文本的需求。
{"title":"Multi-Feature Cross-Lingual Transfer Learning Approach for Low-Resource Vietnamese Speech Synthesis","authors":"Zhi Qiao, Jian Yang, Zhan Wang","doi":"10.1145/3611450.3611476","DOIUrl":"https://doi.org/10.1145/3611450.3611476","url":null,"abstract":"Abstract—Based on neural network end-to-end speech synthesis systems, high-quality speech can be synthesized when there is sufficient training data. However, it is difficult for languages with small datasets to synthesize speech with high quality and naturalness. Vietnamese is a tonal language, belonging to the Vietic branch of the Austroasiatic language family, which uses a spelling system. To improve the quality and naturalness of speech synthesis with limited dataset resources, we first use transfer learning to improve the acoustic model of Vietnamese by leveraging the similarities in pronunciation and grammar between Mandarin Chinese and Vietnamese. Secondly, based on the prosodic characteristics of Vietnamese, we use a \"speech-text\" alignment tool to extract prosodic boundary information and supplement it to the training text sequence. Using FastSpeech2 as the experimental baseline system, we designed and added a prosody embedding layer. The experimental results show that the model trained with prosodic markers has better prosody expression compared to the original text. Furthermore, compared to the baseline system, adding the prosody embedding layer improved the prosody expression of the synthesized speech and eliminated the need for marked text during speech synthesis.","PeriodicalId":289906,"journal":{"name":"Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129931659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms 2023第三届人工智能、自动化与算法国际会议论文集
{"title":"Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms","authors":"","doi":"10.1145/3611450","DOIUrl":"https://doi.org/10.1145/3611450","url":null,"abstract":"","PeriodicalId":289906,"journal":{"name":"Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms","volume":"76 12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126020383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2023 3rd International Conference on Artificial Intelligence, Automation and Algorithms
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1