Toward Automating Shredding Nonprofit XML Files: The Case of IRS Form 990 Data

IF 7.3 | Zone 2, Management | Q1, Computer Science, Information Systems | European Journal of Information Systems | Pub Date: 2022-08-03 | DOI: 10.2308/isys-2022-031
Husam A. Abu Khadra, D. Olsen
Citations: 0

Abstract

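This paper presents and describes data for nonprofit IRS filings in the United States of America. The data contain 831 attributes and 1,102,884 records for the years 2016-2021. Among other items, the data include nonprofits' comparative financial data, governance disclosures, and hired contractors, as well as management compensation, a detailed statement of revenue, statement of functional expenses, external audit, federal audit election, and reconciliation of net assets. The data are generated using self-developed Structured Query Language (SQL) code that converts the IRS Form 990 Extensible Markup Language (XML) tax filing files into a dataset in Excel. This paper is the first to convert these XML files and provide much-needed open access to nonprofit data in a long format that is useful for researchers conducting cross-sectional analysis. The 2,174 lines of source code that we developed and a step-by-step guide are included in this paper.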
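To make the idea of "shredding" an XML filing into a long format concrete, here is a minimal sketch in Python. It is not the authors' SQL code, which this page does not reproduce. The sample document, its namespace, its element names, and the `shred` and `strip_ns` helpers are all hypothetical, chosen only for illustration. Each leaf element becomes one (record, attribute, value) row.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified filing. The namespace and element names
# are illustrative placeholders, not the real IRS Form 990 schema.
SAMPLE = """<Return xmlns="http://example.org/efile">
  <ReturnHeader>
    <TaxYr>2020</TaxYr>
    <Filer><EIN>000000000</EIN></Filer>
  </ReturnHeader>
  <ReturnData>
    <IRS990>
      <TotalRevenue>1500</TotalRevenue>
      <TotalExpenses>1200</TotalExpenses>
    </IRS990>
  </ReturnData>
</Return>"""


def strip_ns(tag):
    """Drop the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def shred(xml_text, record_id):
    """Flatten one XML document into long-format rows.

    Returns a list of (record_id, attribute_path, value) tuples,
    one per non-empty leaf element, in document order.
    """
    rows = []

    def walk(elem, parent_path):
        path = f"{parent_path}/{strip_ns(elem.tag)}"
        children = list(elem)
        if not children:
            text = (elem.text or "").strip()
            if text:
                rows.append((record_id, path, text))
        for child in children:
            walk(child, path)

    walk(ET.fromstring(xml_text), "")
    return rows


for row in shred(SAMPLE, "filing-1"):
    print(row)
```

In a long layout like this, every filing contributes one row per attribute, so filings with different sets of elements can be stacked in a single table without widening it.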
Source journal
European Journal of Information Systems (Engineering & Technology: Computer Science, Information Systems)
CiteScore: 23.10
Self-citation rate: 4.20%
Articles published: 52
Review time: >12 weeks
About the journal: The European Journal of Information Systems offers a unique European perspective on the theory and practice of information systems for a global readership. We actively seek first-rate articles that offer a critical examination of information technology, covering its effects, development, implementation, strategy, management, and policy.