{"title":"Using the European Commission country recommendations to predict sovereign ratings: A topic modeling approach","authors":"Ivan Pastor Sanz","doi":"10.1016/j.eswax.2020.100026","DOIUrl":null,"url":null,"abstract":"<div><p>This paper examines the role of textual and unstructured data in the credit risk assessment of sovereigns. Specifically, in this paper, a novel approach to understand and predict sovereign ratings is proposed. For that purpose, information embedded in the annual country reports issued by the European Commission is used. The model employs a neural-network-based document embedding known as document to vector (Doc2Vec) to convert each country report into a numerical vector, which is then used as features into a logistic regression. The model is trained using information from 2011 to 2019 and it correctly predicts the 70.27% of country ratings in the test sample, improving slightly the results obtained using only macroeconomic variables.</p></div>","PeriodicalId":36838,"journal":{"name":"Expert Systems with Applications: X","volume":"5 ","pages":"Article 100026"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.eswax.2020.100026","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications: X","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590188520300056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 2
Abstract
This paper examines the role of textual and unstructured data in the credit risk assessment of sovereigns. Specifically, in this paper, a novel approach to understand and predict sovereign ratings is proposed. For that purpose, information embedded in the annual country reports issued by the European Commission is used. The model employs a neural-network-based document embedding known as document to vector (Doc2Vec) to convert each country report into a numerical vector, which is then used as features into a logistic regression. The model is trained using information from 2011 to 2019 and it correctly predicts the 70.27% of country ratings in the test sample, improving slightly the results obtained using only macroeconomic variables.