社会科学中的大数据方法

IF 1.4 3区 社会学 Q3 DEMOGRAPHY Mathematical Population Studies Pub Date : 2019-04-03 DOI:10.1080/08898480.2019.1597577
Enrica Amaturo, Biagio Aragona
{"title":"社会科学中的大数据方法","authors":"Enrica Amaturo, Biagio Aragona","doi":"10.1080/08898480.2019.1597577","DOIUrl":null,"url":null,"abstract":"The diffusion of digital technologies and social networks has multiplied the forms of digital data that can be employed for social research. The main two forms are native digital data, which are produced in social networks, search engines, or blogging, and digitized data, which are analog data transformed into digital (Rogers, 2013). Big data are originally produced in the Internet. They allow for analyzing behaviors without interfering with individuals (Webb et al., 1966). An example is the data used in web platforms analytics, such as Google Correlate, whose purpose is to reveal the co-occurrences associated with a keyword searched through the Google search engine. This tool helped to predict the flu epidemic in the US, well before the US Centre for Disease Control and Prevention (Ginsberg et al., 2009). This example demonstrates that digital web platforms enable innovations in data analysis. Another example of native digital data is the data voluntarily uploaded on social networks, blogs, and websites. These are mainly textual or visual (images and videos), often unstructured. A third example is transactional data and the Internet of things. Transactions made through digital devices, such as smart-phones, scanners, tablets, and cards with chips (credit cards, shopping cards) produce data with some structure. These data comprise metadata (date, time, duration, or expenditures) associated with transactions. The objects connected to the Internet (the Internet of things), such as sensors for health monitoring, house automation, and driving aid, usually produce structured data, which can be organized and analyzed. Digitized data previously existed in analog form, for example images, videos, and scanned or digitally photographed documents uploaded on the web, such as museum collections or libraries available on-line. Digital humanities have converted this material into digital form. Another example is the surveys assisted by computers, where the data are inserted into digital databases. Web surveys now are conducted through the Internet (by e-mail) (Amaturo and Aragona, 2016), and allow for reaching a large sample with a small budget.","PeriodicalId":49859,"journal":{"name":"Mathematical Population Studies","volume":null,"pages":null},"PeriodicalIF":1.4000,"publicationDate":"2019-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/08898480.2019.1597577","citationCount":"1","resultStr":"{\"title\":\"Methods for big data in social sciences\",\"authors\":\"Enrica Amaturo, Biagio Aragona\",\"doi\":\"10.1080/08898480.2019.1597577\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The diffusion of digital technologies and social networks has multiplied the forms of digital data that can be employed for social research. The main two forms are native digital data, which are produced in social networks, search engines, or blogging, and digitized data, which are analog data transformed into digital (Rogers, 2013). Big data are originally produced in the Internet. They allow for analyzing behaviors without interfering with individuals (Webb et al., 1966). An example is the data used in web platforms analytics, such as Google Correlate, whose purpose is to reveal the co-occurrences associated with a keyword searched through the Google search engine. This tool helped to predict the flu epidemic in the US, well before the US Centre for Disease Control and Prevention (Ginsberg et al., 2009). This example demonstrates that digital web platforms enable innovations in data analysis. Another example of native digital data is the data voluntarily uploaded on social networks, blogs, and websites. These are mainly textual or visual (images and videos), often unstructured. A third example is transactional data and the Internet of things. Transactions made through digital devices, such as smart-phones, scanners, tablets, and cards with chips (credit cards, shopping cards) produce data with some structure. These data comprise metadata (date, time, duration, or expenditures) associated with transactions. The objects connected to the Internet (the Internet of things), such as sensors for health monitoring, house automation, and driving aid, usually produce structured data, which can be organized and analyzed. Digitized data previously existed in analog form, for example images, videos, and scanned or digitally photographed documents uploaded on the web, such as museum collections or libraries available on-line. Digital humanities have converted this material into digital form. Another example is the surveys assisted by computers, where the data are inserted into digital databases. Web surveys now are conducted through the Internet (by e-mail) (Amaturo and Aragona, 2016), and allow for reaching a large sample with a small budget.\",\"PeriodicalId\":49859,\"journal\":{\"name\":\"Mathematical Population Studies\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2019-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1080/08898480.2019.1597577\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical Population Studies\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://doi.org/10.1080/08898480.2019.1597577\",\"RegionNum\":3,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"DEMOGRAPHY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Population Studies","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1080/08898480.2019.1597577","RegionNum":3,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"DEMOGRAPHY","Score":null,"Total":0}
引用次数: 1

摘要

数字技术和社交网络的传播使可用于社会研究的数字数据形式成倍增加。主要的两种形式是在社交网络、搜索引擎或博客中产生的原生数字数据,以及数字化数据,即转换为数字的模拟数据(Rogers,2013)。大数据最初是在互联网上产生的。它们允许在不干扰个人的情况下分析行为(Webb等人,1966)。一个例子是网络平台分析中使用的数据,如Google Correlate,其目的是揭示与通过Google搜索引擎搜索的关键词相关联的共同出现。早在美国疾病控制和预防中心(Ginsberg et al.,2009)之前,这一工具就有助于预测美国的流感疫情。这个例子表明,数字网络平台能够实现数据分析的创新。本地数字数据的另一个例子是自愿上传到社交网络、博客和网站上的数据。这些主要是文本或视觉(图像和视频),通常是非结构化的。第三个例子是事务数据和物联网。通过数字设备进行的交易,如智能手机、扫描仪、平板电脑和带芯片的卡(信用卡、购物卡),产生具有某种结构的数据。这些数据包括与交易相关联的元数据(日期、时间、持续时间或支出)。连接到互联网(物联网)的对象,如用于健康监测、房屋自动化和驾驶辅助的传感器,通常会产生结构化数据,这些数据可以进行组织和分析。数字化数据以前以模拟形式存在,例如图像、视频,以及上传到网络上的扫描或数字拍摄文件,例如博物馆藏品或在线图书馆。数字人文学科已经将这些材料转化为数字形式。另一个例子是由计算机辅助的调查,将数据插入数字数据库。现在,网络调查是通过互联网(通过电子邮件)进行的(Amaturo和Aragona,2016),可以用小预算接触到大样本。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Methods for big data in social sciences
The diffusion of digital technologies and social networks has multiplied the forms of digital data that can be employed for social research. The main two forms are native digital data, which are produced in social networks, search engines, or blogging, and digitized data, which are analog data transformed into digital (Rogers, 2013). Big data are originally produced in the Internet. They allow for analyzing behaviors without interfering with individuals (Webb et al., 1966). An example is the data used in web platforms analytics, such as Google Correlate, whose purpose is to reveal the co-occurrences associated with a keyword searched through the Google search engine. This tool helped to predict the flu epidemic in the US, well before the US Centre for Disease Control and Prevention (Ginsberg et al., 2009). This example demonstrates that digital web platforms enable innovations in data analysis. Another example of native digital data is the data voluntarily uploaded on social networks, blogs, and websites. These are mainly textual or visual (images and videos), often unstructured. A third example is transactional data and the Internet of things. Transactions made through digital devices, such as smart-phones, scanners, tablets, and cards with chips (credit cards, shopping cards) produce data with some structure. These data comprise metadata (date, time, duration, or expenditures) associated with transactions. The objects connected to the Internet (the Internet of things), such as sensors for health monitoring, house automation, and driving aid, usually produce structured data, which can be organized and analyzed. Digitized data previously existed in analog form, for example images, videos, and scanned or digitally photographed documents uploaded on the web, such as museum collections or libraries available on-line. Digital humanities have converted this material into digital form. Another example is the surveys assisted by computers, where the data are inserted into digital databases. Web surveys now are conducted through the Internet (by e-mail) (Amaturo and Aragona, 2016), and allow for reaching a large sample with a small budget.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Mathematical Population Studies
Mathematical Population Studies 数学-数学跨学科应用
CiteScore
3.20
自引率
11.10%
发文量
7
审稿时长
>12 weeks
期刊介绍: Mathematical Population Studies publishes carefully selected research papers in the mathematical and statistical study of populations. The journal is strongly interdisciplinary and invites contributions by mathematicians, demographers, (bio)statisticians, sociologists, economists, biologists, epidemiologists, actuaries, geographers, and others who are interested in the mathematical formulation of population-related questions. The scope covers both theoretical and empirical work. Manuscripts should be sent to Manuscript central for review. The editor-in-chief has final say on the suitability for publication.
期刊最新文献
Researching algorithm awareness: methodological approaches to investigate how people perceive, know, and interact with algorithms Detection of outliers in survey–weighted linear regression Fractional Lindley distribution generated by time scale theory, with application to discrete-time lifetime data Estimating the structure by age and sex of the US sexually active population Optimizing criterion for the upper limit of the signal response of brain neurons
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1