Natural Language Interface for Covid-19 Amharic Database Using LSTM Encoder Decoder Architecture with Attention
Ephrem Tadesse Degu, Rosa Tsegaye Aga
2021 International Conference on Information and Communication Technology for Development for Africa (ICT4DA), 2021-11-22
DOI: 10.1109/ict4da53266.2021.9671268
Abstract
The COVID-19 outbreak remains a challenge in many places because up-to-date information is lacking, particularly for people who speak and use under-represented local languages. Ethiopia is one example of a country where several indigenous languages are under-represented and under-resourced. An interactive interface that answers users' queries in their local language with organized information can therefore play a significant role. In this study, an attention-augmented encoder-decoder Long Short-Term Memory (LSTM) network model is proposed to provide adequate information about the pandemic to the people of Ethiopia in their local language, Amharic. The model converts Amharic COVID-19-related questions into the corresponding Structured Query Language (SQL) queries. The model retrieves information from an Amharic COVID-19 database that was developed for this study. The database contains frequently referenced COVID-19 attributes such as symptoms, prevention, and transmission, as well as frequently asked questions. In addition, a parallel Amharic question-SQL query dataset was prepared to evaluate the model. The LSTM network with the attention mechanism showed a clear, significant result. An interactive user interface was also developed. The interface uses the proposed model to provide information about the pandemic to people who ask questions in Amharic.
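To make the attention step concrete, the following is a minimal sketch of a single decoding step in an attention-augmented encoder-decoder. The abstract does not say which attention variant the authors use, so dot-product scoring is an assumption here, and the function names and toy vectors are hypothetical illustrations rather than values from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot_product_attention(decoder_state, encoder_states):
    """Return (weights, context) for one decoding step.

    decoder_state: list of floats, the current decoder hidden state.
    encoder_states: list of lists, one hidden state per source token.
    Dot-product scoring is an assumption; the paper may use another variant.
    """
    # Score each encoder state by its dot product with the decoder state.
    scores = [
        sum(d * e for d, e in zip(decoder_state, h)) for h in encoder_states
    ]
    # Normalize the scores into attention weights that sum to 1.
    weights = softmax(scores)
    # Context vector: attention-weighted sum of the encoder states.
    dim = len(decoder_state)
    context = [
        sum(w * h[i] for w, h in zip(weights, encoder_states))
        for i in range(dim)
    ]
    return weights, context


# Toy values (hypothetical): three source tokens with 2-dimensional states.
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
decoder_state = [1.0, 0.0]
weights, context = dot_product_attention(decoder_state, encoder_states)
```

In a full model the encoder states would come from an LSTM run over the tokenized Amharic question, and the context vector would be combined with the decoder state to predict the next SQL token. Those components are omitted here.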