{"title":"一个利用机器学习再现音频信息的模型","authors":"R. Yadav, R. Bharti, R. Nagar, Sanchit Kumar","doi":"10.1109/incet49848.2020.9154064","DOIUrl":null,"url":null,"abstract":"This model aims to develop an efficient way to recapitulate large audio messages or clips for valuable insights. With increase in utilization of audio/visual data day by day, there is a need to handle audio files more intelligently. In this document, a novel approach is presented to build a summarized audio for a given long audio file. This method is composed primarily of three modules namely: Conversion of Speech into Text, Text summarization, and lastly conversion of text into speech. Each module is fed by the output of another module except speech to text conversion where input is the given audio file for which summary has to be formed. The first step in audio recapitulation is conversion of given audio to text. This is made possible by sending asynchronous requests to Google Cloud speech API. The next module accomplishes its task of extracting important sentences from the transcript by using the Text Rank algorithm. The last module is to convert the summarized text generated from the output of text summarization module to an audio file. This whole method is given a suitable User Interface using flask and thus a web application is formed for helping users to interact with this model.","PeriodicalId":174411,"journal":{"name":"2020 International Conference for Emerging Technology (INCET)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Model For Recapitulating Audio Messages Using Machine Learning\",\"authors\":\"R. Yadav, R. Bharti, R. Nagar, Sanchit Kumar\",\"doi\":\"10.1109/incet49848.2020.9154064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This model aims to develop an efficient way to recapitulate large audio messages or clips for valuable insights. With increase in utilization of audio/visual data day by day, there is a need to handle audio files more intelligently. In this document, a novel approach is presented to build a summarized audio for a given long audio file. This method is composed primarily of three modules namely: Conversion of Speech into Text, Text summarization, and lastly conversion of text into speech. Each module is fed by the output of another module except speech to text conversion where input is the given audio file for which summary has to be formed. The first step in audio recapitulation is conversion of given audio to text. This is made possible by sending asynchronous requests to Google Cloud speech API. The next module accomplishes its task of extracting important sentences from the transcript by using the Text Rank algorithm. The last module is to convert the summarized text generated from the output of text summarization module to an audio file. This whole method is given a suitable User Interface using flask and thus a web application is formed for helping users to interact with this model.\",\"PeriodicalId\":174411,\"journal\":{\"name\":\"2020 International Conference for Emerging Technology (INCET)\",\"volume\":\"94 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference for Emerging Technology (INCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/incet49848.2020.9154064\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference for Emerging Technology (INCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/incet49848.2020.9154064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Model For Recapitulating Audio Messages Using Machine Learning
This model aims to develop an efficient way to recapitulate large audio messages or clips for valuable insights. With increase in utilization of audio/visual data day by day, there is a need to handle audio files more intelligently. In this document, a novel approach is presented to build a summarized audio for a given long audio file. This method is composed primarily of three modules namely: Conversion of Speech into Text, Text summarization, and lastly conversion of text into speech. Each module is fed by the output of another module except speech to text conversion where input is the given audio file for which summary has to be formed. The first step in audio recapitulation is conversion of given audio to text. This is made possible by sending asynchronous requests to Google Cloud speech API. The next module accomplishes its task of extracting important sentences from the transcript by using the Text Rank algorithm. The last module is to convert the summarized text generated from the output of text summarization module to an audio file. This whole method is given a suitable User Interface using flask and thus a web application is formed for helping users to interact with this model.