Gautam Chettiar, A. Shukla, Preet Nalwaya, K. Sethi, Surya Prakash
{"title":"Impersonated Human Speech Chatbot with Adaptive Frequency Spectrum","authors":"Gautam Chettiar, A. Shukla, Preet Nalwaya, K. Sethi, Surya Prakash","doi":"10.1109/ICCT56969.2023.10076120","DOIUrl":null,"url":null,"abstract":"Recent trends in artificial intelligence and natural language processing models have led to the generation of highly efficient and versatile intelligent chatbot models., which have the potential to supplant human speech to the level of conversational proficiency. The proposed method creates a chatbot model that trains itself on open conversation datasets and aims to impersonate without compromising the emotional sentiments in the voice. These datasets extract from the applications such as WhatsApp., Telegram., Messenger., or any other chatting platform. Datasets convert to a machine-readable format., which is dynamically updated in real-time during the conversation., and then using speech conversion algorithms convert the reply into the desired individual's voice. The proposed model's conversational ability depends on the amount of conversation data., which gives the output in the person's voice frequency. By using an NLP-based chatbot trained on personalized data using KNN., and handling misses by pipelining the chatbot inputs to the GPT-2 model., the model can generate human-like replies even if there is data insufficiency. The natural replies are complemented with matching human voice and tone characteristics by using the vocoder model., which matches the spectral characteristics of the target voice onto the required voice. This opens a plethora of commercial and therapeutic applications that provide excellent insights into implementing natural communication models for humanoid and robotics innovations.","PeriodicalId":128100,"journal":{"name":"2023 3rd International Conference on Intelligent Communication and Computational Techniques (ICCT)","volume":"177 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Intelligent Communication and Computational Techniques (ICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCT56969.2023.10076120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Recent trends in artificial intelligence and natural language processing models have led to the generation of highly efficient and versatile intelligent chatbot models., which have the potential to supplant human speech to the level of conversational proficiency. The proposed method creates a chatbot model that trains itself on open conversation datasets and aims to impersonate without compromising the emotional sentiments in the voice. These datasets extract from the applications such as WhatsApp., Telegram., Messenger., or any other chatting platform. Datasets convert to a machine-readable format., which is dynamically updated in real-time during the conversation., and then using speech conversion algorithms convert the reply into the desired individual's voice. The proposed model's conversational ability depends on the amount of conversation data., which gives the output in the person's voice frequency. By using an NLP-based chatbot trained on personalized data using KNN., and handling misses by pipelining the chatbot inputs to the GPT-2 model., the model can generate human-like replies even if there is data insufficiency. The natural replies are complemented with matching human voice and tone characteristics by using the vocoder model., which matches the spectral characteristics of the target voice onto the required voice. This opens a plethora of commercial and therapeutic applications that provide excellent insights into implementing natural communication models for humanoid and robotics innovations.