{"title":"基于Bi-LSTM RNN的缅甸对话行为识别","authors":"Sann Su Su Yee, K. Soe","doi":"10.1109/O-COCOSDA50338.2020.9295014","DOIUrl":null,"url":null,"abstract":"Spoken language understanding (SLU) is an essential element of any dialogue system to understand the language where dialogue act (DA) recognition is also critical aspects of pre-processing step for speech understanding and dialogue system. This paper proposes a deep learning-based DA model which use a deep recurrent neural network (RNN) with bi-directional long short-term memory (Bi-LSTM). The model mainly consists of a word-encode layer, a Bi-LSTM layer, and a softmax layer. For corpus preparation, we collected and annotated a large dialog act annotation corpus, which is called MmTravel (Myanmar Travel) corpus, on travel domain human-human conversations dataset (consists of 80k utterances). This paper reports analysis and comparison of proposed model Bi-LSTM with RNN, LSTM, and baseline SVM model. Experiments on the dataset is shown that our proposed DA model performs better than our previous work, support vector machine (SVM) models, which achieve an improvement of more than 2% accuracy increase in classification on the dataset.","PeriodicalId":385266,"journal":{"name":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","volume":"106 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Myanmar Dialogue Act Recognition Using Bi-LSTM RNN\",\"authors\":\"Sann Su Su Yee, K. Soe\",\"doi\":\"10.1109/O-COCOSDA50338.2020.9295014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Spoken language understanding (SLU) is an essential element of any dialogue system to understand the language where dialogue act (DA) recognition is also critical aspects of pre-processing step for speech understanding and dialogue system. This paper proposes a deep learning-based DA model which use a deep recurrent neural network (RNN) with bi-directional long short-term memory (Bi-LSTM). The model mainly consists of a word-encode layer, a Bi-LSTM layer, and a softmax layer. For corpus preparation, we collected and annotated a large dialog act annotation corpus, which is called MmTravel (Myanmar Travel) corpus, on travel domain human-human conversations dataset (consists of 80k utterances). This paper reports analysis and comparison of proposed model Bi-LSTM with RNN, LSTM, and baseline SVM model. Experiments on the dataset is shown that our proposed DA model performs better than our previous work, support vector machine (SVM) models, which achieve an improvement of more than 2% accuracy increase in classification on the dataset.\",\"PeriodicalId\":385266,\"journal\":{\"name\":\"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)\",\"volume\":\"106 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/O-COCOSDA50338.2020.9295014\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/O-COCOSDA50338.2020.9295014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Myanmar Dialogue Act Recognition Using Bi-LSTM RNN
Spoken language understanding (SLU) is an essential element of any dialogue system to understand the language where dialogue act (DA) recognition is also critical aspects of pre-processing step for speech understanding and dialogue system. This paper proposes a deep learning-based DA model which use a deep recurrent neural network (RNN) with bi-directional long short-term memory (Bi-LSTM). The model mainly consists of a word-encode layer, a Bi-LSTM layer, and a softmax layer. For corpus preparation, we collected and annotated a large dialog act annotation corpus, which is called MmTravel (Myanmar Travel) corpus, on travel domain human-human conversations dataset (consists of 80k utterances). This paper reports analysis and comparison of proposed model Bi-LSTM with RNN, LSTM, and baseline SVM model. Experiments on the dataset is shown that our proposed DA model performs better than our previous work, support vector machine (SVM) models, which achieve an improvement of more than 2% accuracy increase in classification on the dataset.