The development of an in-house accounting bot—an artificial intelligence (AI) assistant capable of generating internally structured bookkeeping double-entry posting schemes—is explored in this paper. The processes of curating a suitable dataset, selecting, and fine-tuning a seven-billion-parameter language model, categorized as a small language model (SLM) (SLMs typically refer to models with fewer than 10 billion parameters, whereas medium-sized models often have 14B parameters, and large-scale models exceed 70B), are described. A human-evaluated benchmark is also presented to assess model performance. To achieve efficient supervised fine-tuning (SFT), low-rank adaptation (LoRA) was employed, significantly reducing memory requirements by using a small set of trainable parameters while maintaining model expressiveness. The process of backpropagation was further optimized using Unsloth, a high-performance training framework designed for efficient video memory usage and flash attention mechanisms, which accelerates adaptation and reduces memory overhead. The model whose layers were updated is called QwenCoder2.5. It was selected with the presumption that it would be able to learn how to generate and examine bookkeeping patterns generated by accounting information system (AIS) over a 17-year history. This proof of concept aims to support researchers and practitioners exploring the integration of generative AI in accounting by providing insights into both the benefits and challenges of AI-driven automation in bookkeeping tasks. The study demonstrates how an SLM can be fine-tuned on a proprietary dataset of journal posting schemes to assist accountants, auditors, and financial analysts while also facilitating synthetic data generation. Challenges related to AI, data preprocessing, fine-tuning optimization, and evaluation methodology are introduced and examined.
{"title":"Developing an Accounting Virtual Assistant Through Supervised Fine-Tuning (SFT) of a Small Language Model (SLM)","authors":"Mario Zupan","doi":"10.1002/isaf.70011","DOIUrl":"https://doi.org/10.1002/isaf.70011","url":null,"abstract":"<p>The development of an in-house accounting bot—an artificial intelligence (AI) assistant capable of generating internally structured bookkeeping double-entry posting schemes—is explored in this paper. The processes of curating a suitable dataset, selecting, and fine-tuning a seven-billion-parameter language model, categorized as a small language model (SLM) (SLMs typically refer to models with fewer than 10 billion parameters, whereas medium-sized models often have 14B parameters, and large-scale models exceed 70B), are described. A human-evaluated benchmark is also presented to assess model performance. To achieve efficient supervised fine-tuning (SFT), low-rank adaptation (LoRA) was employed, significantly reducing memory requirements by using a small set of trainable parameters while maintaining model expressiveness. The process of backpropagation was further optimized using Unsloth, a high-performance training framework designed for efficient video memory usage and flash attention mechanisms, which accelerates adaptation and reduces memory overhead. The model whose layers were updated is called QwenCoder2.5. It was selected with the presumption that it would be able to learn how to generate and examine bookkeeping patterns generated by accounting information system (AIS) over a 17-year history. This proof of concept aims to support researchers and practitioners exploring the integration of generative AI in accounting by providing insights into both the benefits and challenges of AI-driven automation in bookkeeping tasks. The study demonstrates how an SLM can be fine-tuned on a proprietary dataset of journal posting schemes to assist accountants, auditors, and financial analysts while also facilitating synthetic data generation. Challenges related to AI, data preprocessing, fine-tuning optimization, and evaluation methodology are introduced and examined.</p>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/isaf.70011","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144716652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Increasing computing power and access to the internet have amplified the role of social media and online news media on financial market outcomes. However, these two sources of information are intertwined in such a way that information flows between them. As a result, sentiment expressed in one source can affect stock market outcomes through the other source. This study examines this interplay between news media sentiment, social media sentiment and stock returns within the Dow Jones constituent companies from 2016 to 2023. Leveraging an extensive dataset, we adopt an approach that combines causal mediation models with robust statistical techniques to establish the mediation effects of one sentiment proxy on the relationship between the other proxy and stock returns. We also use a range of other methods like path analysis, panel vector autoregression and causal forests for robustness. The study finds that news sentiment is more influential in directly affecting stock returns than Twitter sentiment while the latter is more influential indirectly when mediated by news sentiment.
{"title":"Social Media, Traditional News and Stock Returns: A Causal Mediation Analysis","authors":"Kingstone Nyakurukwa, Yudhvir Seetharam","doi":"10.1002/isaf.70012","DOIUrl":"https://doi.org/10.1002/isaf.70012","url":null,"abstract":"<p>Increasing computing power and access to the internet have amplified the role of social media and online news media on financial market outcomes. However, these two sources of information are intertwined in such a way that information flows between them. As a result, sentiment expressed in one source can affect stock market outcomes through the other source. This study examines this interplay between news media sentiment, social media sentiment and stock returns within the Dow Jones constituent companies from 2016 to 2023. Leveraging an extensive dataset, we adopt an approach that combines causal mediation models with robust statistical techniques to establish the mediation effects of one sentiment proxy on the relationship between the other proxy and stock returns. We also use a range of other methods like path analysis, panel vector autoregression and causal forests for robustness. The study finds that news sentiment is more influential in directly affecting stock returns than <i>Twitter</i> sentiment while the latter is more influential indirectly when mediated by news sentiment.</p>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/isaf.70012","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144647512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}