The Semantic Snake Charmer Search Engine: A Tool to Facilitate Data Science in High-tech Industry Domains

Corrado Grappiolo, E. V. Gerwen, J. Verhoosel, L. Somers
{"title":"The Semantic Snake Charmer Search Engine: A Tool to Facilitate Data Science in High-tech Industry Domains","authors":"Corrado Grappiolo, E. V. Gerwen, J. Verhoosel, L. Somers","doi":"10.1145/3295750.3298915","DOIUrl":null,"url":null,"abstract":"The booming popularity of data science is also affecting high-tech industries. However, since these usually have different core competencies --- building cyber-physical systems rather than e.g. machine learning or data mining algorithms --- delving into data science by domain experts such as system engineers or architects might be more cumbersome than expected. In order to help domain experts to delve into data science we designed the Semantic Snake Charmer (SSC), a domain knowledge-based search engine for Jupyter Notebooks. SSC is composed of three modules: (1) a human-machine cooperative module to identify internal documentation which contains the most relevant domain knowledge, (2) a natural language processing module capable of transforming relevant documentation into several semantic graph types, (3) a reinforcement-learning based search engine which learns, given user feedback, the best mapping between input queries and semantic graph type to rely on. We believe SSC can be a fundamental asset to allow the easy landing of data science in industrial domains.","PeriodicalId":187771,"journal":{"name":"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3295750.3298915","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

The booming popularity of data science is also affecting high-tech industries. However, since these usually have different core competencies --- building cyber-physical systems rather than e.g. machine learning or data mining algorithms --- delving into data science by domain experts such as system engineers or architects might be more cumbersome than expected. In order to help domain experts to delve into data science we designed the Semantic Snake Charmer (SSC), a domain knowledge-based search engine for Jupyter Notebooks. SSC is composed of three modules: (1) a human-machine cooperative module to identify internal documentation which contains the most relevant domain knowledge, (2) a natural language processing module capable of transforming relevant documentation into several semantic graph types, (3) a reinforcement-learning based search engine which learns, given user feedback, the best mapping between input queries and semantic graph type to rely on. We believe SSC can be a fundamental asset to allow the easy landing of data science in industrial domains.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
语义耍蛇人搜索引擎:促进高科技产业领域数据科学的工具
数据科学的蓬勃发展也影响着高科技产业。然而,由于这些通常具有不同的核心能力——构建网络物理系统而不是机器学习或数据挖掘算法——由系统工程师或架构师等领域专家深入研究数据科学可能比预期的要麻烦得多。为了帮助领域专家深入研究数据科学,我们为Jupyter Notebooks设计了基于领域知识的搜索引擎Semantic Snake Charmer (SSC)。SSC由三个模块组成:(1)人机协作模块,用于识别包含最相关领域知识的内部文档;(2)自然语言处理模块,能够将相关文档转换为几种语义图类型;(3)基于强化学习的搜索引擎,在给定用户反馈的情况下,学习输入查询与语义图类型之间的最佳映射。我们相信,SSC可以成为数据科学在工业领域轻松落地的基础资产。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Assisting Health Consumers While Searching the Web through Medical Annotations Do Metrics Matter? Understanding Context for Tasks and Activities NotifyMeHere: Intelligent Notification Delivery in Multi-Device Environments Computational Surprise, Perceptual Surprise, and Personal Background in Text Understanding
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1