Toward Automating Shredding Nonprofit XML Files: The Case of IRS Form 990 Data

IF 7.3 2区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS European Journal of Information Systems Pub Date : 2022-08-03 DOI:10.2308/isys-2022-031
Husam A. Abu Khadra, D. Olsen
{"title":"Toward Automating Shredding Nonprofit XML Files: The Case of IRS Form 990 Data","authors":"Husam A. Abu Khadra, D. Olsen","doi":"10.2308/isys-2022-031","DOIUrl":null,"url":null,"abstract":"This paper presents and describes data for nonprofit IRS filings in the United States of America. The data contains 831 attributes and 1,102,884 records for the years 2016-2021. Among other items, the data include nonprofits’ comparative financial data, governance disclosures, and hired contractors, as well as management compensation, a detailed statement of revenue, statement of functional expenses, external audit, federal audit election, and reconciliation of net assets. The data is generated using Structured Query Language (SQL) self-developed code to convert the IRS form 990 Extensible Markup Language (XML) tax filing files to a dataset in Excel. This paper is the first to convert these XML files and provide much-needed open access to nonprofit data in a long format that is useful for researchers to conduct cross-sectional analysis. The 2,174 lines of source code that we developed, and a step-by-step guide are included in this paper.","PeriodicalId":50486,"journal":{"name":"European Journal of Information Systems","volume":"41 1","pages":"169-188"},"PeriodicalIF":7.3000,"publicationDate":"2022-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Information Systems","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.2308/isys-2022-031","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

This paper presents and describes data for nonprofit IRS filings in the United States of America. The data contains 831 attributes and 1,102,884 records for the years 2016-2021. Among other items, the data include nonprofits’ comparative financial data, governance disclosures, and hired contractors, as well as management compensation, a detailed statement of revenue, statement of functional expenses, external audit, federal audit election, and reconciliation of net assets. The data is generated using Structured Query Language (SQL) self-developed code to convert the IRS form 990 Extensible Markup Language (XML) tax filing files to a dataset in Excel. This paper is the first to convert these XML files and provide much-needed open access to nonprofit data in a long format that is useful for researchers to conduct cross-sectional analysis. The 2,174 lines of source code that we developed, and a step-by-step guide are included in this paper.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
非营利组织XML文件的自动化分解:美国国税局990表格数据的案例
本文提出并描述了在美国的非营利性国税局备案数据。该数据包含2016-2021年的831个属性和1,102,884条记录。在其他项目中,这些数据包括非营利组织的比较财务数据、治理披露和雇用的承包商,以及管理层薪酬、详细的收入报表、职能支出报表、外部审计、联邦审计选举和净资产对账。该数据使用SQL (Structured Query Language)自行开发的代码生成,将IRS form 990 XML (Extensible Markup Language)税务申报文件转换为Excel中的数据集。本文首次转换了这些XML文件,并提供了对非营利组织数据的长格式开放访问,这对研究人员进行横断面分析很有用。本文中包含了我们开发的2174行源代码和分步指南。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
European Journal of Information Systems
European Journal of Information Systems 工程技术-计算机:信息系统
CiteScore
23.10
自引率
4.20%
发文量
52
审稿时长
>12 weeks
期刊介绍: The European Journal of Information Systems offers a unique European perspective on the theory and practice of information systems for a global readership. We actively seek first-rate articles that offer a critical examination of information technology, covering its effects, development, implementation, strategy, management, and policy.
期刊最新文献
Unveiling motivational configurations in shaping meaningful engagement in green gamification Determinants of gamification effectiveness: perspectives of technology affordances and coping responses in the context of team-based gamified training Examining the impact of mobile gambling harm minimisation features: a dualistic model of passion perspective Achieving strategic alignment between business and information technology with information technology governance: the role of commitment to principles and Top Leadership Support Reducing the incidence of biased algorithmic decisions through feature importance transparency: an empirical study
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1