人口普查记录连接的新方法。

IF 1.6 2区 历史学 Q1 HISTORY Historical Methods Pub Date : 2011-01-01 DOI:10.1080/01615440.2010.517152
Ron Goeken, Lap Huynh, Thomas Lenius, Rebecca Vick
{"title":"人口普查记录连接的新方法。","authors":"Ron Goeken,&nbsp;Lap Huynh,&nbsp;Thomas Lenius,&nbsp;Rebecca Vick","doi":"10.1080/01615440.2010.517152","DOIUrl":null,"url":null,"abstract":"<p><p>The Minnesota Population Center (MPC) has released linked datasets through its NAPP and IPUMS projects, making them readily accessible to researchers. Prior to the availability of complete count census microdata from the MPC, researchers applied various forms of record-linking software. This essay describes the techniques used in the MPC's linking program and briefly compares this technique with those used by other researchers. The key feature of the MPC linking method is the construction of cumulative name similarity scores, based on approximately 2.5 billion record comparisons; we also use support vector mechanics to classify potential links. This article explains modifications made for the final linked datasets and includes a discussion of the role of weighting variables when using linked data.</p>","PeriodicalId":45535,"journal":{"name":"Historical Methods","volume":null,"pages":null},"PeriodicalIF":1.6000,"publicationDate":"2011-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/01615440.2010.517152","citationCount":"62","resultStr":"{\"title\":\"New Methods of Census Record Linking.\",\"authors\":\"Ron Goeken,&nbsp;Lap Huynh,&nbsp;Thomas Lenius,&nbsp;Rebecca Vick\",\"doi\":\"10.1080/01615440.2010.517152\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The Minnesota Population Center (MPC) has released linked datasets through its NAPP and IPUMS projects, making them readily accessible to researchers. Prior to the availability of complete count census microdata from the MPC, researchers applied various forms of record-linking software. This essay describes the techniques used in the MPC's linking program and briefly compares this technique with those used by other researchers. The key feature of the MPC linking method is the construction of cumulative name similarity scores, based on approximately 2.5 billion record comparisons; we also use support vector mechanics to classify potential links. This article explains modifications made for the final linked datasets and includes a discussion of the role of weighting variables when using linked data.</p>\",\"PeriodicalId\":45535,\"journal\":{\"name\":\"Historical Methods\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2011-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1080/01615440.2010.517152\",\"citationCount\":\"62\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Historical Methods\",\"FirstCategoryId\":\"98\",\"ListUrlMain\":\"https://doi.org/10.1080/01615440.2010.517152\",\"RegionNum\":2,\"RegionCategory\":\"历史学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HISTORY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Historical Methods","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1080/01615440.2010.517152","RegionNum":2,"RegionCategory":"历史学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HISTORY","Score":null,"Total":0}
引用次数: 62

摘要

明尼苏达人口中心(MPC)通过其NAPP和IPUMS项目发布了相关的数据集,使研究人员可以很容易地访问它们。在MPC提供完整的人口普查微数据之前,研究人员应用了各种形式的记录链接软件。本文描述了MPC连接程序中使用的技术,并简要地将该技术与其他研究人员使用的技术进行了比较。MPC链接方法的关键特征是基于大约25亿条记录的比较,构建了累积的名称相似度分数;我们还使用支持向量力学对潜在链接进行分类。本文解释了对最终关联数据集所做的修改,并讨论了使用关联数据时权重变量的作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
New Methods of Census Record Linking.

The Minnesota Population Center (MPC) has released linked datasets through its NAPP and IPUMS projects, making them readily accessible to researchers. Prior to the availability of complete count census microdata from the MPC, researchers applied various forms of record-linking software. This essay describes the techniques used in the MPC's linking program and briefly compares this technique with those used by other researchers. The key feature of the MPC linking method is the construction of cumulative name similarity scores, based on approximately 2.5 billion record comparisons; we also use support vector mechanics to classify potential links. This article explains modifications made for the final linked datasets and includes a discussion of the role of weighting variables when using linked data.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Historical Methods
Historical Methods Multiple-
CiteScore
3.20
自引率
7.10%
发文量
13
期刊介绍: Historical Methodsreaches an international audience of social scientists concerned with historical problems. It explores interdisciplinary approaches to new data sources, new approaches to older questions and material, and practical discussions of computer and statistical methodology, data collection, and sampling procedures. The journal includes the following features: “Evidence Matters” emphasizes how to find, decipher, and analyze evidence whether or not that evidence is meant to be quantified. “Database Developments” announces major new public databases or large alterations in older ones, discusses innovative ways to organize them, and explains new ways of categorizing information.
期刊最新文献
A New Strategy for Linking U.S. Historical Censuses: A Case Study for the IPUMS Multigenerational Longitudinal Panel. Simple Strategies for Improving Inference with Linked Data: A Case Study of the 1850-1930 IPUMS Linked Representative Historical Samples. Reconstruction of Birth Histories for the Study of Fertility in the United States, 1830-1910. Introduction to Special Issues on Historical Record Linking. Linking the 1940 U.S. Census with Modern Data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1