A. Querido, Rita de Carvalho, J. Rodrigues, Steven Neale, Rita Valadas Pereira, P. Gomes, Catarina Correia, D. Amaral, A. Branco
{"title":"Named Entities in the QTLeap Corpus of Online Helpdesk Interactions","authors":"A. Querido, Rita de Carvalho, J. Rodrigues, Steven Neale, Rita Valadas Pereira, P. Gomes, Catarina Correia, D. Amaral, A. Branco","doi":"10.21747/2183-9077/RAPL2A20","DOIUrl":null,"url":null,"abstract":"In this paper we present the annotation of a corpus with named entities that are classified into semantic types and disambiguated by linking them to their corresponding entry in the Portuguese DBpedia. This corpus, QTLeap Corpus, is a multilingual collection of question and answer pairs from a chat-based helpdesk service for Information and Communication Technologies. The resulting annotated corpus is a gold-standard named entity annotated lexical resource that is useful in supporting the training and evaluation of named entity annotation and disambiguation tools for Portuguese.","PeriodicalId":313789,"journal":{"name":"Revista da Associação Portuguesa de Linguística","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista da Associação Portuguesa de Linguística","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21747/2183-9077/RAPL2A20","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper we present the annotation of a corpus with named entities that are classified into semantic types and disambiguated by linking them to their corresponding entry in the Portuguese DBpedia. This corpus, QTLeap Corpus, is a multilingual collection of question and answer pairs from a chat-based helpdesk service for Information and Communication Technologies. The resulting annotated corpus is a gold-standard named entity annotated lexical resource that is useful in supporting the training and evaluation of named entity annotation and disambiguation tools for Portuguese.