G. Mahalakshmi, Makesh Narsimhan Sreedhar, Ravi Kiran Selvam, S. Sendhilkumar
{"title":"Exploiting Bi-LSTMs for Named Entity Recognition in Indian Culinary Science","authors":"G. Mahalakshmi, Makesh Narsimhan Sreedhar, Ravi Kiran Selvam, S. Sendhilkumar","doi":"10.2139/ssrn.3545088","DOIUrl":null,"url":null,"abstract":"This paper discusses the use of Bidirectional LSTMs for recognition of Named Entities over the Indian Recipe Blogs. Recipe posts from popular blogs including Hebbar's Kitchen are harvested and trained for recognizing NEs. Both the word embeddings and character embeddings are utilized as feature vectors for training the Bi-LSTM. CRF model is used for joint decoding of the labels. The system shows a development data F1 score of 92.87% and test data F1 score of 94.66%. The dataset used and meta-results obtained are released freely for research use.","PeriodicalId":395403,"journal":{"name":"Applied Communication eJournal","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Communication eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3545088","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper discusses the use of Bidirectional LSTMs for recognition of Named Entities over the Indian Recipe Blogs. Recipe posts from popular blogs including Hebbar's Kitchen are harvested and trained for recognizing NEs. Both the word embeddings and character embeddings are utilized as feature vectors for training the Bi-LSTM. CRF model is used for joint decoding of the labels. The system shows a development data F1 score of 92.87% and test data F1 score of 94.66%. The dataset used and meta-results obtained are released freely for research use.