Binary Classification: An Introductory Machine Learning Tutorial for Social Scientists

Vivian P. Ta, Leonardo Carrico, Arthur Bousquet
{"title":"Binary Classification: An Introductory Machine Learning Tutorial for Social Scientists","authors":"Vivian P. Ta, Leonardo Carrico, Arthur Bousquet","doi":"10.2458/jmmss.5186","DOIUrl":null,"url":null,"abstract":"A barrier that prevents many social scientists from pursuing big data research is the lack of technical training required to assemble and organize big data. In an effort to address this barrier, we provide an introductory tutorial into machine learning for social scientists by demonstrating the basic steps and fundamental concepts involved in binary classification. We first describe the data and libraries required for analysis. We then demonstrate data cleaning methods, feature engineering, the model-building process, model assessment, and feature importance. Last, we discuss the ways in which social scientists can use machine learning to complement inference-based approaches and how it can contribute to a richer understanding of social science.","PeriodicalId":90602,"journal":{"name":"Journal of methods and measurement in the social sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of methods and measurement in the social sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2458/jmmss.5186","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

A barrier that prevents many social scientists from pursuing big data research is the lack of technical training required to assemble and organize big data. In an effort to address this barrier, we provide an introductory tutorial into machine learning for social scientists by demonstrating the basic steps and fundamental concepts involved in binary classification. We first describe the data and libraries required for analysis. We then demonstrate data cleaning methods, feature engineering, the model-building process, model assessment, and feature importance. Last, we discuss the ways in which social scientists can use machine learning to complement inference-based approaches and how it can contribute to a richer understanding of social science.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
二元分类:面向社会科学家的机器学习入门教程
阻碍许多社会科学家从事大数据研究的一个障碍是缺乏收集和组织大数据所需的技术培训。为了解决这一障碍,我们通过演示二进制分类中涉及的基本步骤和基本概念,为社会科学家提供了机器学习的入门教程。我们首先描述分析所需的数据和库。然后,我们展示了数据清理方法、特征工程、模型构建过程、模型评估和特征重要性。最后,我们讨论了社会科学家如何使用机器学习来补充基于推理的方法,以及它如何有助于更丰富地理解社会科学。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
审稿时长
26 weeks
期刊最新文献
Invitation for COVID-19 Submissions Machine Learning Method for High-Dimensional Education Data Comparing human coding to two natural language processing algorithms in aspirations of people affected by Duchenne Muscular Dystrophy The Modern Biased Information Test: Proposing alternatives for implicit measures Binary Classification: An Introductory Machine Learning Tutorial for Social Scientists
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1