Taming Big Data: Integrating diverse public data sources for economic competitiveness analytics

Data4U '14 Pub Date : 2014-09-01 DOI:10.1145/2658840.2658845

R. Neamtu, Ramoza Ahsan, J. Stokes, Armend Hoxha, Jialiang Bao, Stefan Gvozdenovic, Ted Meyer, Nilesh Patel, Raghu Rangan, Yumou Wang, Dongyun Zhang, Elke A. Rundensteiner

{"title":"Taming Big Data: Integrating diverse public data sources for economic competitiveness analytics","authors":"R. Neamtu, Ramoza Ahsan, J. Stokes, Armend Hoxha, Jialiang Bao, Stefan Gvozdenovic, Ted Meyer, Nilesh Patel, Raghu Rangan, Yumou Wang, Dongyun Zhang, Elke A. Rundensteiner","doi":"10.1145/2658840.2658845","DOIUrl":null,"url":null,"abstract":"In an era where Big Data can greatly impact a broad population, many novel opportunities arise, chief among them the ability to integrate data from diverse sources and \"wrangle\" it to extract novel insights. Conceived as a tool that can help both expert and non-expert users better understand public data, MATTERS was collaboratively developed by the Massachusetts High Tech Council, WPI and other institutions as an analytic platform offering dynamic modeling capabilities. MATTERS is an integrative data source on high fidelity cost and talent competitiveness metrics. Its goal is to extract, integrate and model rich economic, financial, educational and technological information from renowned heterogeneous web data sources ranging from The US Census Bureau, The Bureau of Labor Statistics to the Institute of Education Sciences, all known to be critical factors influencing economic competitiveness of states. This demonstration of MATTERS illustrates how we tackle challenges of data acquisition, cleaning, integration and wrangling into appropriate representations, visualization and story-telling with data in the context of state competitiveness in the high-tech sector.","PeriodicalId":135661,"journal":{"name":"Data4U '14","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data4U '14","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2658840.2658845","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In an era where Big Data can greatly impact a broad population, many novel opportunities arise, chief among them the ability to integrate data from diverse sources and "wrangle" it to extract novel insights. Conceived as a tool that can help both expert and non-expert users better understand public data, MATTERS was collaboratively developed by the Massachusetts High Tech Council, WPI and other institutions as an analytic platform offering dynamic modeling capabilities. MATTERS is an integrative data source on high fidelity cost and talent competitiveness metrics. Its goal is to extract, integrate and model rich economic, financial, educational and technological information from renowned heterogeneous web data sources ranging from The US Census Bureau, The Bureau of Labor Statistics to the Institute of Education Sciences, all known to be critical factors influencing economic competitiveness of states. This demonstration of MATTERS illustrates how we tackle challenges of data acquisition, cleaning, integration and wrangling into appropriate representations, visualization and story-telling with data in the context of state competitiveness in the high-tech sector.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

驯服大数据:整合各种公共数据源，用于经济竞争力分析

在一个大数据可以极大地影响广泛人群的时代，出现了许多新的机会，其中最主要的是整合来自不同来源的数据并“争论”它以提取新颖见解的能力。作为一个可以帮助专家和非专业用户更好地理解公共数据的工具，MATTERS是由马萨诸塞州高科技委员会、WPI和其他机构合作开发的，作为一个提供动态建模功能的分析平台。MATTERS是一个关于高保真成本和人才竞争力指标的综合数据源。其目标是从著名的异构网络数据源中提取、整合和建模丰富的经济、金融、教育和技术信息，这些数据源包括美国人口普查局、劳工统计局和教育科学研究所，这些数据源都是影响国家经济竞争力的关键因素。这个MATTERS的演示说明了我们如何在国家在高科技领域的竞争力背景下应对数据采集、清理、整合和争论的挑战，以适当的表示、可视化和讲故事的数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Data4U '14

自引率

0.00%

发文量

期刊最新文献

An Efficient Processing of k-Dominant Skyline Query in MapReduce DiNoDB: Efficient Large-Scale Raw Data Analytics A Paradigm for Learning Queries on Big Data Affordable Analytics on Expensive Data Taming Big Data: Integrating diverse public data sources for economic competitiveness analytics