Yong Zhuang , Feilong Wang , Dickson K.W. Chiu , Kevin K.W. Ho
{"title":"Leveraging large language models to examine the interaction between investor sentiment and stock performance","authors":"Yong Zhuang , Feilong Wang , Dickson K.W. Chiu , Kevin K.W. Ho","doi":"10.1016/j.engappai.2025.110602","DOIUrl":null,"url":null,"abstract":"<div><div>Understanding the relationship between investor sentiment and stock performance is crucial in dynamic financial markets. Existing researches often focus on financial news and stock prices, while studies on investor sentiment typically rely on traditional machine learning models that require extensive data labeling. Additionally, most researches focus on single stock indices, overlooking the impact of brand popularity. To address these gaps, this study proposes a novel framework to analyze the interaction between investor sentiment and stock performance, using Chinese Baijiu industry stocks as a case example. It further explores how brand popularity influences this relationship, offering insights for informed investment decisions through artificial intelligence technology. In this study, we leverage Generative Pre-trained Transformer 4 (GPT-4), a state-of-the-art black-box large language model, to process vast volumes of unstructured text data from stock forums. By employing in-context learning with human-labeled examples, GPT-4 generates weak labels that are subsequently used to fine-tune Large Language Model Meta AI (LLaMA), a smaller and more efficient open-source LLM from Meta AI, thereby enabling sentiment-driven decision-making in real-world scenarios. To construct a comprehensive sentiment indicator, we integrate both direct and indirect factors influencing sentiment and use principal component analysis to combine them effectively. To examine interaction between sentiment and stock yield, we apply the Granger causality test and vector autoregression models across stocks with different brand popularity levels. The results show that our framework achieves state-of-the-art performance investor sentiment analysis. Moreover, with brand popularity significantly amplifying the interaction between investor sentiment and stock yield, it leads to bidirectional Granger causality in highly popular brands.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"150 ","pages":"Article 110602"},"PeriodicalIF":8.0000,"publicationDate":"2025-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625006025","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/28 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Understanding the relationship between investor sentiment and stock performance is crucial in dynamic financial markets. Existing researches often focus on financial news and stock prices, while studies on investor sentiment typically rely on traditional machine learning models that require extensive data labeling. Additionally, most researches focus on single stock indices, overlooking the impact of brand popularity. To address these gaps, this study proposes a novel framework to analyze the interaction between investor sentiment and stock performance, using Chinese Baijiu industry stocks as a case example. It further explores how brand popularity influences this relationship, offering insights for informed investment decisions through artificial intelligence technology. In this study, we leverage Generative Pre-trained Transformer 4 (GPT-4), a state-of-the-art black-box large language model, to process vast volumes of unstructured text data from stock forums. By employing in-context learning with human-labeled examples, GPT-4 generates weak labels that are subsequently used to fine-tune Large Language Model Meta AI (LLaMA), a smaller and more efficient open-source LLM from Meta AI, thereby enabling sentiment-driven decision-making in real-world scenarios. To construct a comprehensive sentiment indicator, we integrate both direct and indirect factors influencing sentiment and use principal component analysis to combine them effectively. To examine interaction between sentiment and stock yield, we apply the Granger causality test and vector autoregression models across stocks with different brand popularity levels. The results show that our framework achieves state-of-the-art performance investor sentiment analysis. Moreover, with brand popularity significantly amplifying the interaction between investor sentiment and stock yield, it leads to bidirectional Granger causality in highly popular brands.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.