Linking Inmates to Mental Health Services by Matching Records Between Independent Data Sets

Aditi Jain, Amelia Norman, L. Alonzi, Michael C. Smith, Neal Goodloe, K. P. White
{"title":"Linking Inmates to Mental Health Services by Matching Records Between Independent Data Sets","authors":"Aditi Jain, Amelia Norman, L. Alonzi, Michael C. Smith, Neal Goodloe, K. P. White","doi":"10.1109/sieds55548.2022.9799372","DOIUrl":null,"url":null,"abstract":"Officials in the United States correctional system have long been aware of the significant role that serious mental illness (SMI) plays in recidivism. In a 2011 study, Bronson reported that 68% of prison inmates with diagnosed SMI returned to custody at least once within 4 years, 8% higher than those without SMI [1]. This issue is especially prevalent in regional jails, where 63% of male inmates and 75% of female inmates in regional jails suffer from symptoms of serious mental illness every year, making immediate assistance to these individuals crucial [2]. In response, a team of University of Virginia (UVA) Systems Engineering students work in collaboration with an array of organizations in the Charlottesville-Albemarle region to identify and provide local jail inmates with the mental health services they need, and produce policy recommendations to improve conditions for individuals with SMI who are prone to exposure to the criminal justice system [3]. The current Capstone team consists of undergraduate UVA students who perform analysis using the data provided by the organizations, enabling the community to make informed decisions. However, these decisions are hindered because, since the data sets from different organizations are not linked with a unique identifier for individuals across the agencies that are responsible for the care and supervision of individuals suffering from SMI. This makes the matching of individuals between data sets difficult. This issue is exacerbated by recidivism, which results in multiple occurrences of similar (or identical) values, complicating typical record matching methods, which often rely on one-to-one matching methods. Moreover, the data include protected personal identifiers (PPI) and HIPPA protected data, which also restricts data sharing among the agencies. Thus, any effort to merge the data must adhere to applicable data security rules and non-disclosure agreements. To resolve these matching issues, we first condensed the reiterations of data within each dataset into one line per individual and included an internal consistency metric that reflects possible changes (i.e. preferred name, address, etc.) that could affect data matching. Then, we developed a matching algorithm using the Record Linkage package on Python that compares two data sets consisting of resident information from Region Ten Community Services (R10) and the Jail Management System (JMS) at the Albemarle-Charlottesville Regional Jail (ACRJ) [4]. As a result of this process, we identified over 95 additional matches and another 50 uncertain matches that required human spot-checking, which is an improvement of 10% to previous methods of record matching applied to the data set. Such results could have significant results to the Capstone team as well as to other fields of research, especially regarding medical, financial, or other forms of data that deal with changing data over time.","PeriodicalId":286724,"journal":{"name":"2022 Systems and Information Engineering Design Symposium (SIEDS)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Systems and Information Engineering Design Symposium (SIEDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/sieds55548.2022.9799372","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Officials in the United States correctional system have long been aware of the significant role that serious mental illness (SMI) plays in recidivism. In a 2011 study, Bronson reported that 68% of prison inmates with diagnosed SMI returned to custody at least once within 4 years, 8% higher than those without SMI [1]. This issue is especially prevalent in regional jails, where 63% of male inmates and 75% of female inmates in regional jails suffer from symptoms of serious mental illness every year, making immediate assistance to these individuals crucial [2]. In response, a team of University of Virginia (UVA) Systems Engineering students work in collaboration with an array of organizations in the Charlottesville-Albemarle region to identify and provide local jail inmates with the mental health services they need, and produce policy recommendations to improve conditions for individuals with SMI who are prone to exposure to the criminal justice system [3]. The current Capstone team consists of undergraduate UVA students who perform analysis using the data provided by the organizations, enabling the community to make informed decisions. However, these decisions are hindered because, since the data sets from different organizations are not linked with a unique identifier for individuals across the agencies that are responsible for the care and supervision of individuals suffering from SMI. This makes the matching of individuals between data sets difficult. This issue is exacerbated by recidivism, which results in multiple occurrences of similar (or identical) values, complicating typical record matching methods, which often rely on one-to-one matching methods. Moreover, the data include protected personal identifiers (PPI) and HIPPA protected data, which also restricts data sharing among the agencies. Thus, any effort to merge the data must adhere to applicable data security rules and non-disclosure agreements. To resolve these matching issues, we first condensed the reiterations of data within each dataset into one line per individual and included an internal consistency metric that reflects possible changes (i.e. preferred name, address, etc.) that could affect data matching. Then, we developed a matching algorithm using the Record Linkage package on Python that compares two data sets consisting of resident information from Region Ten Community Services (R10) and the Jail Management System (JMS) at the Albemarle-Charlottesville Regional Jail (ACRJ) [4]. As a result of this process, we identified over 95 additional matches and another 50 uncertain matches that required human spot-checking, which is an improvement of 10% to previous methods of record matching applied to the data set. Such results could have significant results to the Capstone team as well as to other fields of research, especially regarding medical, financial, or other forms of data that deal with changing data over time.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过在独立数据集之间匹配记录,将囚犯与心理健康服务联系起来
美国惩教系统的官员们早就意识到严重精神疾病(SMI)在累犯中所起的重要作用。在2011年的一项研究中,布朗森报告说,68%被诊断为重度精神分裂症的囚犯在4年内至少再次被拘留一次,比没有重度精神分裂症的囚犯高8%。这一问题在地区监狱中尤为普遍,每年有63%的男性囚犯和75%的女性囚犯患有严重的精神疾病症状,因此对这些人的立即援助至关重要。作为回应,弗吉尼亚大学(UVA)系统工程专业的学生团队与夏洛茨维尔-阿尔伯马尔地区的一系列组织合作,确定并为当地监狱囚犯提供他们所需的心理健康服务,并提出政策建议,以改善容易暴露于刑事司法系统的重度精神障碍患者的状况。目前的Capstone团队由弗吉尼亚大学的本科生组成,他们使用组织提供的数据进行分析,使社区能够做出明智的决策。然而,这些决定受到阻碍,因为来自不同组织的数据集没有与跨负责照顾和监督重度精神分裂症患者的机构的个人的唯一标识符联系起来。这使得数据集之间的个体匹配变得困难。累犯会导致多次出现相似(或相同)的值,从而使典型的记录匹配方法复杂化,这些方法通常依赖于一对一的匹配方法。此外,这些数据包括受保护的个人标识符(PPI)和HIPPA保护的数据,这也限制了机构之间的数据共享。因此,任何合并数据的努力都必须遵守适用的数据安全规则和保密协议。为了解决这些匹配问题,我们首先将每个数据集中的重复数据浓缩成一行,并包含一个内部一致性指标,反映可能影响数据匹配的可能变化(即首选名称,地址等)。然后,我们使用Python上的Record Linkage包开发了一个匹配算法,该算法比较了来自Albemarle-Charlottesville地区监狱(ACRJ)[4]的十区社区服务(R10)和监狱管理系统(JMS)的居民信息组成的两个数据集。这个过程的结果是,我们确定了超过95个额外的匹配和另外50个需要人工抽查的不确定匹配,这比以前应用于数据集的记录匹配方法提高了10%。这样的结果可能会对Capstone团队以及其他研究领域产生重大影响,特别是在医疗、金融或其他形式的数据方面,这些数据会随着时间的推移而变化。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The Linville Creek Bridge: A Case Study of Design Thinking in Structural Engineering Convergence Across Behavioral and Self-report Measures Evaluating Individuals' Trust in an Autonomous Golf Cart Investigating the Illicit Trade of Cultural Property with an Automated Data Pipeline Architecture Investigating Disinformation Through the Lens of Mass Media: A System Design Dynamic Coal Production Line: Plant Design and Analysis Tool
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1