语气或术语:机器学习文本分析、特色词汇提取以及中国债券定价证据

IF 2.1 2区 经济学 Q2 BUSINESS, FINANCE Journal of Empirical Finance Pub Date : 2024-08-22 DOI:10.1016/j.jempfin.2024.101534
Yueqian Peng , Li Shi , Xiaojun Shi , Songtao Tan
{"title":"语气或术语:机器学习文本分析、特色词汇提取以及中国债券定价证据","authors":"Yueqian Peng ,&nbsp;Li Shi ,&nbsp;Xiaojun Shi ,&nbsp;Songtao Tan","doi":"10.1016/j.jempfin.2024.101534","DOIUrl":null,"url":null,"abstract":"<div><p>We apply the machine-learning technique proposed by Zhou et al. (2024) to analyze credit rating reports in China’s bond markets, identifying featured vocabulary and generating text analysis scores. Compared with the traditional bag-of-words text analysis, evidence suggests three advantages of machine-learning scoring. Firstly, it covers featured vocabulary that compensates for missing information; secondly, it reduces misclassification of words’ sentiments; moreover, it mitigates the problem of equal weighting inherent in the bag-of-words method. Our findings indicate that the featured vocabulary neglected in the bag-of-words method plays a crucial role in text analysis and significantly contributes to bond pricing. Additionally, we find that machine-learning text analysis can address AAA rating inflation within China’s bond markets to some extent. In contrast, the bag-of-words method exhibits limited efficacy in mitigating this issue.</p></div>","PeriodicalId":15704,"journal":{"name":"Journal of Empirical Finance","volume":"78 ","pages":"Article 101534"},"PeriodicalIF":2.1000,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tone or term: Machine-learning text analysis, featured vocabulary extraction, and evidence from bond pricing in China\",\"authors\":\"Yueqian Peng ,&nbsp;Li Shi ,&nbsp;Xiaojun Shi ,&nbsp;Songtao Tan\",\"doi\":\"10.1016/j.jempfin.2024.101534\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>We apply the machine-learning technique proposed by Zhou et al. (2024) to analyze credit rating reports in China’s bond markets, identifying featured vocabulary and generating text analysis scores. Compared with the traditional bag-of-words text analysis, evidence suggests three advantages of machine-learning scoring. Firstly, it covers featured vocabulary that compensates for missing information; secondly, it reduces misclassification of words’ sentiments; moreover, it mitigates the problem of equal weighting inherent in the bag-of-words method. Our findings indicate that the featured vocabulary neglected in the bag-of-words method plays a crucial role in text analysis and significantly contributes to bond pricing. Additionally, we find that machine-learning text analysis can address AAA rating inflation within China’s bond markets to some extent. In contrast, the bag-of-words method exhibits limited efficacy in mitigating this issue.</p></div>\",\"PeriodicalId\":15704,\"journal\":{\"name\":\"Journal of Empirical Finance\",\"volume\":\"78 \",\"pages\":\"Article 101534\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Empirical Finance\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0927539824000690\",\"RegionNum\":2,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BUSINESS, FINANCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Empirical Finance","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0927539824000690","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
引用次数: 0

摘要

我们将 Zhou 等人(2024 年)提出的机器学习技术用于分析中国债券市场的信用评级报告,识别特色词汇并生成文本分析评分。与传统的词袋文本分析相比,有证据表明机器学习评分有三个优势。首先,它涵盖了特色词汇,弥补了信息缺失;其次,它减少了对词语情感的错误分类;此外,它还缓解了词袋法固有的等权重问题。我们的研究结果表明,词袋法中被忽视的特征词汇在文本分析中起着至关重要的作用,对债券定价有很大的帮助。此外,我们还发现机器学习文本分析可以在一定程度上解决中国债券市场 AAA 评级虚高的问题。相比之下,词袋法在缓解这一问题方面效果有限。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Tone or term: Machine-learning text analysis, featured vocabulary extraction, and evidence from bond pricing in China

We apply the machine-learning technique proposed by Zhou et al. (2024) to analyze credit rating reports in China’s bond markets, identifying featured vocabulary and generating text analysis scores. Compared with the traditional bag-of-words text analysis, evidence suggests three advantages of machine-learning scoring. Firstly, it covers featured vocabulary that compensates for missing information; secondly, it reduces misclassification of words’ sentiments; moreover, it mitigates the problem of equal weighting inherent in the bag-of-words method. Our findings indicate that the featured vocabulary neglected in the bag-of-words method plays a crucial role in text analysis and significantly contributes to bond pricing. Additionally, we find that machine-learning text analysis can address AAA rating inflation within China’s bond markets to some extent. In contrast, the bag-of-words method exhibits limited efficacy in mitigating this issue.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.40
自引率
3.80%
发文量
59
期刊介绍: The Journal of Empirical Finance is a financial economics journal whose aim is to publish high quality articles in empirical finance. Empirical finance is interpreted broadly to include any type of empirical work in financial economics, financial econometrics, and also theoretical work with clear empirical implications, even when there is no empirical analysis. The Journal welcomes articles in all fields of finance, such as asset pricing, corporate finance, financial econometrics, banking, international finance, microstructure, behavioural finance, etc. The Editorial Team is willing to take risks on innovative research, controversial papers, and unusual approaches. We are also particularly interested in work produced by young scholars. The composition of the editorial board reflects such goals.
期刊最新文献
High-frequency realized stochastic volatility model Jump tail risk exposure and the cross-section of stock returns Time-varying variance decomposition of macro-finance term structure models Technological shocks and stock market volatility over a century Is firm-level political risk priced in the corporate bond market?
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1