Development of a Neural Network Library for Resource Constrained Speech Synthesis

Sujeendran Menon, Pawel Zarzycki, M. Ganzha, M. Paprzycki

2020 5th IEEE International Conference on Recent Advances and Innovations in Engineering (ICRAIE), December 2020. DOI: 10.1109/ICRAIE51050.2020.9358310
Machine learning frameworks such as TensorFlow and PyTorch rely on GPU hardware acceleration to deliver the required performance. Since GPUs need considerable power (and space) to operate, typical use cases involve high-performance servers, with the final deployment offered as a cloud service. To address the limitations of this approach, AI Accelerators have been proposed. In this context, we have designed and implemented a library of neural network algorithms that runs efficiently on "edge devices" equipped with AI Accelerators. Moreover, a unified interface is provided, allowing easy experimentation with different neural networks applied to the same dataset. Let us stress that we do not propose new algorithms; rather, we port known ones to resource-restricted edge devices. The context is provided by a speech synthesis application for edge devices, deployed on an NVIDIA Jetson Nano, which is to be used by social robots for real-time, off-cloud text-to-speech processing.
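The abstract does not show the library's actual API, but the "unified interface" idea (one common entry point so that different speech-synthesis networks can be swapped and compared on the same input) can be illustrated with a minimal, hypothetical Python sketch. All names below (SpeechSynthesizer, register, SineDemoSynthesizer) are illustrative assumptions, not the authors' library; a real model class would load trained weights and run inference on the Jetson Nano's accelerator.

# Hypothetical sketch of a "unified interface" for interchangeable
# text-to-speech models; not the authors' actual API.
from abc import ABC, abstractmethod
from typing import Dict, Type

import numpy as np


class SpeechSynthesizer(ABC):
    """Common interface that every ported network is assumed to implement."""

    sample_rate: int = 22050  # assumed output rate in Hz

    @abstractmethod
    def synthesize(self, text: str) -> np.ndarray:
        """Return a mono float32 waveform at self.sample_rate for `text`."""


# Registry so experiments can select a model by name at run time
# and apply different networks to the same dataset.
_MODELS: Dict[str, Type[SpeechSynthesizer]] = {}


def register(name: str):
    def wrap(cls: Type[SpeechSynthesizer]) -> Type[SpeechSynthesizer]:
        _MODELS[name] = cls
        return cls
    return wrap


@register("dummy-sine")
class SineDemoSynthesizer(SpeechSynthesizer):
    """Placeholder model: emits a short sine tone scaled to the text length.

    A real entry in the registry would wrap a neural vocoder or
    end-to-end TTS network ported to the edge device.
    """

    def synthesize(self, text: str) -> np.ndarray:
        duration = 0.05 * max(len(text), 1)  # seconds, toy heuristic
        t = np.linspace(0.0, duration,
                        int(self.sample_rate * duration), endpoint=False)
        return (0.1 * np.sin(2 * np.pi * 220.0 * t)).astype(np.float32)


if __name__ == "__main__":
    model = _MODELS["dummy-sine"]()
    wave = model.synthesize("Hello from the edge")
    print(f"Generated {wave.shape[0]} samples at {model.sample_rate} Hz")

The point of the registry pattern is that a benchmarking script only depends on the abstract interface, so swapping one network for another on the same dataset is a one-line change of the model name.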