具有自适应频谱的模拟人类语音聊天机器人

Gautam Chettiar, A. Shukla, Preet Nalwaya, K. Sethi, Surya Prakash
{"title":"具有自适应频谱的模拟人类语音聊天机器人","authors":"Gautam Chettiar, A. Shukla, Preet Nalwaya, K. Sethi, Surya Prakash","doi":"10.1109/ICCT56969.2023.10076120","DOIUrl":null,"url":null,"abstract":"Recent trends in artificial intelligence and natural language processing models have led to the generation of highly efficient and versatile intelligent chatbot models., which have the potential to supplant human speech to the level of conversational proficiency. The proposed method creates a chatbot model that trains itself on open conversation datasets and aims to impersonate without compromising the emotional sentiments in the voice. These datasets extract from the applications such as WhatsApp., Telegram., Messenger., or any other chatting platform. Datasets convert to a machine-readable format., which is dynamically updated in real-time during the conversation., and then using speech conversion algorithms convert the reply into the desired individual's voice. The proposed model's conversational ability depends on the amount of conversation data., which gives the output in the person's voice frequency. By using an NLP-based chatbot trained on personalized data using KNN., and handling misses by pipelining the chatbot inputs to the GPT-2 model., the model can generate human-like replies even if there is data insufficiency. The natural replies are complemented with matching human voice and tone characteristics by using the vocoder model., which matches the spectral characteristics of the target voice onto the required voice. This opens a plethora of commercial and therapeutic applications that provide excellent insights into implementing natural communication models for humanoid and robotics innovations.","PeriodicalId":128100,"journal":{"name":"2023 3rd International Conference on Intelligent Communication and Computational Techniques (ICCT)","volume":"177 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Impersonated Human Speech Chatbot with Adaptive Frequency Spectrum\",\"authors\":\"Gautam Chettiar, A. Shukla, Preet Nalwaya, K. Sethi, Surya Prakash\",\"doi\":\"10.1109/ICCT56969.2023.10076120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent trends in artificial intelligence and natural language processing models have led to the generation of highly efficient and versatile intelligent chatbot models., which have the potential to supplant human speech to the level of conversational proficiency. The proposed method creates a chatbot model that trains itself on open conversation datasets and aims to impersonate without compromising the emotional sentiments in the voice. These datasets extract from the applications such as WhatsApp., Telegram., Messenger., or any other chatting platform. Datasets convert to a machine-readable format., which is dynamically updated in real-time during the conversation., and then using speech conversion algorithms convert the reply into the desired individual's voice. The proposed model's conversational ability depends on the amount of conversation data., which gives the output in the person's voice frequency. By using an NLP-based chatbot trained on personalized data using KNN., and handling misses by pipelining the chatbot inputs to the GPT-2 model., the model can generate human-like replies even if there is data insufficiency. The natural replies are complemented with matching human voice and tone characteristics by using the vocoder model., which matches the spectral characteristics of the target voice onto the required voice. This opens a plethora of commercial and therapeutic applications that provide excellent insights into implementing natural communication models for humanoid and robotics innovations.\",\"PeriodicalId\":128100,\"journal\":{\"name\":\"2023 3rd International Conference on Intelligent Communication and Computational Techniques (ICCT)\",\"volume\":\"177 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 3rd International Conference on Intelligent Communication and Computational Techniques (ICCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCT56969.2023.10076120\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Intelligent Communication and Computational Techniques (ICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCT56969.2023.10076120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

人工智能和自然语言处理模型的最新趋势导致了高效、通用的智能聊天机器人模型的产生。它们有可能取代人类的语言,达到对话的熟练程度。该方法创建了一个聊天机器人模型,该模型在开放的对话数据集上进行自我训练,旨在在不影响语音情感的情况下进行模拟。这些数据集是从WhatsApp等应用程序中提取的。电报。信使。,或其他聊天平台。数据集转换为机器可读的格式。,在对话过程中实时动态更新。,然后使用语音转换算法将回复转换为所需个人的声音。该模型的会话能力取决于会话数据的数量。,它以人的声音频率输出。通过使用基于nlp的聊天机器人,使用KNN对个性化数据进行训练。,并通过将聊天机器人的输入流水线化到GPT-2模型来处理失误。,即使在数据不足的情况下,该模型也能生成类似人类的回答。通过使用声码器模型,将自然回复与匹配的人声和音调特征相辅相成。,将目标语音的频谱特征与所需语音相匹配。这打开了大量的商业和治疗应用,为实现人形和机器人创新的自然通信模型提供了极好的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Impersonated Human Speech Chatbot with Adaptive Frequency Spectrum
Recent trends in artificial intelligence and natural language processing models have led to the generation of highly efficient and versatile intelligent chatbot models., which have the potential to supplant human speech to the level of conversational proficiency. The proposed method creates a chatbot model that trains itself on open conversation datasets and aims to impersonate without compromising the emotional sentiments in the voice. These datasets extract from the applications such as WhatsApp., Telegram., Messenger., or any other chatting platform. Datasets convert to a machine-readable format., which is dynamically updated in real-time during the conversation., and then using speech conversion algorithms convert the reply into the desired individual's voice. The proposed model's conversational ability depends on the amount of conversation data., which gives the output in the person's voice frequency. By using an NLP-based chatbot trained on personalized data using KNN., and handling misses by pipelining the chatbot inputs to the GPT-2 model., the model can generate human-like replies even if there is data insufficiency. The natural replies are complemented with matching human voice and tone characteristics by using the vocoder model., which matches the spectral characteristics of the target voice onto the required voice. This opens a plethora of commercial and therapeutic applications that provide excellent insights into implementing natural communication models for humanoid and robotics innovations.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
About ICCT '23 A Novel Technique to Detect URL Phishing based on Feature Count Effectiveness of Anti-Spoofing Protocols for Email Authentication Optimal Predictive Maintenance Technique for Manufacturing Semiconductors using Machine Learning Development of Secure IoT Ecosystems for Healthcare
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1