{"title":"Detecting Financial Statement Fraud through Multidimensional Analysis of Text Readability","authors":"Fang Yang, J. David, C. Chang","doi":"10.2308/jfar-2021-019","DOIUrl":null,"url":null,"abstract":"\n This study uses Coh-Metrix to analyze multiple dimensions of readability of the MD&A section of the SEC Form 10-K. We incorporate the five main Coh-Metrix components of text easability (word concreteness, syntactic simplicity, referential cohesion, deep cohesion, and narrativity) into a logistic model to test their predictive power for financial misreporting. We find that compared to the MD&As of nonfraud firms, the MD&As of fraud firms connect clauses and sentences less coherently, use more story-like language, and show a higher number of vague and abstract words. Thus, referential cohesion, narrativity, and word concreteness significantly enhance predictive ability in fraud detection. The Coh-Metrix readability measures enhance the linguistic complexity assessment beyond traditional readability measures, such as the Fog Index and the Flesch Indexes. Financial analysts and investors can utilize the Coh-Metrix readability measures to supplement traditional readability measures and common financial statement variables in predicting financial misreporting.\n Data Availability: Data are available from the public sources cited in the text.\n JEL Classifications: G32; K42; M41; M48.","PeriodicalId":149240,"journal":{"name":"Journal of Forensic Accounting Research","volume":"112 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Forensic Accounting Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2308/jfar-2021-019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This study uses Coh-Metrix to analyze multiple dimensions of readability of the MD&A section of the SEC Form 10-K. We incorporate the five main Coh-Metrix components of text easability (word concreteness, syntactic simplicity, referential cohesion, deep cohesion, and narrativity) into a logistic model to test their predictive power for financial misreporting. We find that compared to the MD&As of nonfraud firms, the MD&As of fraud firms connect clauses and sentences less coherently, use more story-like language, and show a higher number of vague and abstract words. Thus, referential cohesion, narrativity, and word concreteness significantly enhance predictive ability in fraud detection. The Coh-Metrix readability measures enhance the linguistic complexity assessment beyond traditional readability measures, such as the Fog Index and the Flesch Indexes. Financial analysts and investors can utilize the Coh-Metrix readability measures to supplement traditional readability measures and common financial statement variables in predicting financial misreporting.
Data Availability: Data are available from the public sources cited in the text.
JEL Classifications: G32; K42; M41; M48.
本研究采用Coh-Metrix对SEC Form 10-K中MD&A部分的可读性进行了多维度分析。我们将文本易用性的五个主要Coh-Metrix组成部分(单词具体性、句法简单性、指称衔接、深度衔接和叙事性)纳入逻辑模型,以测试它们对财务误报的预测能力。我们发现,与非欺诈公司的md&a相比,欺诈公司的md&a连接条款和句子的连贯性较差,使用更多的故事式语言,并且显示出更多的模糊和抽象词汇。因此,指称衔接、叙事性和词的具体性显著提高了欺诈检测的预测能力。Coh-Metrix可读性度量比传统的可读性度量(如Fog指数和Flesch指数)提高了语言复杂性评估。财务分析师和投资者可以利用Coh-Metrix可读性指标来补充传统的可读性指标和常见的财务报表变量来预测财务误报。数据可用性:数据可从文本中引用的公共来源获得。JEL分类:G32;K42;M41;M48。