{"title":"A hybrid method for taxonomy creation","authors":"Vasundhara Chakraborty, M. Vasarhelyi","doi":"10.4192/1577-8517-V17_2","DOIUrl":null,"url":null,"abstract":"Automatic extraction of information from the footnotes of financial statements can be particularly challenging due to a wide variation in filing structure and terminologies. Standardized text and use of data tagging can facilitate this process. This paper: (i) proposes and demonstrates a new hybrid method of taxonomy creation, using historical data; (ii) compares the taxonomy structure using the new method, with that of the existing XBRL US GAAP taxonomy; (iii) shows evidence of structural differences between the official XBRL US GAAP taxonomy and the new hybrid taxonomy and (iv) demonstrates how the tool so developed could be used for more exploratory research. Comparison of this new structure with that of the existing XBRL taxonomy structure reveals that its creation from historical data provides a greater level of aggregation compared to the XBRL US GAAP taxonomy for Pension footnotes.","PeriodicalId":404481,"journal":{"name":"The International Journal of Digital Accounting Research","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The International Journal of Digital Accounting Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4192/1577-8517-V17_2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Automatic extraction of information from the footnotes of financial statements can be particularly challenging due to a wide variation in filing structure and terminologies. Standardized text and use of data tagging can facilitate this process. This paper: (i) proposes and demonstrates a new hybrid method of taxonomy creation, using historical data; (ii) compares the taxonomy structure using the new method, with that of the existing XBRL US GAAP taxonomy; (iii) shows evidence of structural differences between the official XBRL US GAAP taxonomy and the new hybrid taxonomy and (iv) demonstrates how the tool so developed could be used for more exploratory research. Comparison of this new structure with that of the existing XBRL taxonomy structure reveals that its creation from historical data provides a greater level of aggregation compared to the XBRL US GAAP taxonomy for Pension footnotes.
由于归档结构和术语的广泛差异,从财务报表脚注中自动提取信息可能特别具有挑战性。标准化的文本和数据标记的使用可以促进这一过程。本文:(i)提出并论证了一种新的利用历史数据创建分类的混合方法;(ii)将使用新方法的分类结构与现有的XBRL US GAAP分类结构进行比较;(iii)展示了官方XBRL US GAAP分类法和新的混合分类法之间结构差异的证据;(iv)展示了如何将开发的工具用于更多的探索性研究。将这个新结构与现有XBRL分类法结构进行比较可以发现,与用于养老金脚注的XBRL US GAAP分类法相比,从历史数据创建的新结构提供了更高级别的聚合。