Dmap: Automating Domain Name Ecosystem Measurements and Applications

M. Wullink, G. Moura, Cristian Hesselman
{"title":"Dmap: Automating Domain Name Ecosystem Measurements and Applications","authors":"M. Wullink, G. Moura, Cristian Hesselman","doi":"10.23919/TMA.2018.8506521","DOIUrl":null,"url":null,"abstract":"Behind each Internet domain name, there is a set of entities/companies responsible for delivering the various services associated with it, such as Web hosting and e-mail. Together, they form what we refer to as DNS ecosystem. Currently, there is no single measurement tool designed to measure this ecosystem altogether. As a result, researchers that aim at analyzing (parts of) this ecosystem often have to spend significant amounts of time preparing and executing the multiple application measurements and post-processing their heterogeneous raw datasets. Given that time is a scare resource, this complexity diverts researcher's time from actual analysis, ultimately limiting how far many studies go. To help researchers facing this situation, we present Dmap, an active measurement application that reduces the complexity of executing both measurements and analysis. It does so by (i) automating the crawling of several application protocols (DNS, HTTP, TLS/SSL, SMTP, both over IPv4 and IPv6) and (ii) storing the results into a relational data base, enabling researchers to quickly perform hypothesis tests within interactive response times using SQL. Dmap current version has 40 classifiers that generate 166 derived features (e.g., CMS detection, page language), which can be used by researchers and operators to build applications and services. We present an evaluation of Dmap and show three applications that it can be used for, such as profiling the Alexa 1 million domains. We use Dmap at SIDN (.nl registry) for research on the. nl zone and make it open-source for researchers.","PeriodicalId":6607,"journal":{"name":"2018 Network Traffic Measurement and Analysis Conference (TMA)","volume":"1 1","pages":"1-8"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Network Traffic Measurement and Analysis Conference (TMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/TMA.2018.8506521","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

Behind each Internet domain name, there is a set of entities/companies responsible for delivering the various services associated with it, such as Web hosting and e-mail. Together, they form what we refer to as DNS ecosystem. Currently, there is no single measurement tool designed to measure this ecosystem altogether. As a result, researchers that aim at analyzing (parts of) this ecosystem often have to spend significant amounts of time preparing and executing the multiple application measurements and post-processing their heterogeneous raw datasets. Given that time is a scare resource, this complexity diverts researcher's time from actual analysis, ultimately limiting how far many studies go. To help researchers facing this situation, we present Dmap, an active measurement application that reduces the complexity of executing both measurements and analysis. It does so by (i) automating the crawling of several application protocols (DNS, HTTP, TLS/SSL, SMTP, both over IPv4 and IPv6) and (ii) storing the results into a relational data base, enabling researchers to quickly perform hypothesis tests within interactive response times using SQL. Dmap current version has 40 classifiers that generate 166 derived features (e.g., CMS detection, page language), which can be used by researchers and operators to build applications and services. We present an evaluation of Dmap and show three applications that it can be used for, such as profiling the Alexa 1 million domains. We use Dmap at SIDN (.nl registry) for research on the. nl zone and make it open-source for researchers.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Dmap:自动化域名生态系统测量和应用
在每个互联网域名的背后,都有一组实体/公司负责提供与之相关的各种服务,例如Web托管和电子邮件。它们共同构成了我们所说的DNS生态系统。目前,还没有一种单一的测量工具可以测量整个生态系统。因此,旨在分析(部分)该生态系统的研究人员通常不得不花费大量时间准备和执行多个应用程序测量,并对其异构原始数据集进行后处理。考虑到时间是一种稀缺资源,这种复杂性分散了研究人员用于实际分析的时间,最终限制了许多研究的进展。为了帮助研究人员面对这种情况,我们提出了Dmap,一个主动测量应用程序,减少了执行测量和分析的复杂性。它通过(i)自动抓取多个应用协议(DNS, HTTP, TLS/SSL, SMTP, IPv4和IPv6)和(ii)将结果存储到关系数据库中,使研究人员能够使用SQL在交互式响应时间内快速执行假设测试。Dmap当前版本有40个分类器,产生166个派生特征(例如,CMS检测,页面语言),研究人员和操作人员可以使用它们来构建应用程序和服务。我们提出了对Dmap的评估,并展示了它可以用于的三个应用程序,例如分析Alexa 100万个域。我们在SIDN()上使用Dmap。Nl注册)的研究。并将其开放给研究人员。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
On the Analysis of Network Measurements Through Machine Learning: The Power of the Crowd App for Dynamic Crowdsourced QoE Studies of HTTP Adaptive Streaming on Mobile Devices Dmap: Automating Domain Name Ecosystem Measurements and Applications Anycaston the Move: A Look at Mobile Anycast Performance A Second Screen Journey to the Cup: Twitter Dynamics During the Stanley Cup Playoffs
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1