MEX Interfaces: Automating Machine Learning Metadata Generation

Diego Esteves, Pablo N. Mendes, Diego Moussallem, J. C. Duarte, A. Zaveri, Jens Lehmann
{"title":"MEX Interfaces: Automating Machine Learning Metadata Generation","authors":"Diego Esteves, Pablo N. Mendes, Diego Moussallem, J. C. Duarte, A. Zaveri, Jens Lehmann","doi":"10.1145/2993318.2993320","DOIUrl":null,"url":null,"abstract":"Despite recent efforts to achieve a high level of interoperability of Machine Learning (ML) experiments, positively collaborating with the Reproducible Research context, we still run into problems created due to the existence of different ML platforms: each of those have a specific conceptualization or schema for representing data and metadata. This scenario leads to an extra coding-effort to achieve both the desired interoperability and a better provenance level as well as a more automatized environment for obtaining the generated results. Hence, when using ML libraries, it is a common task to re-design specific data models (schemata) and develop wrappers to manage the produced outputs. In this article, we discuss this gap focusing on the solution for the question: \"What is the cleanest and lowest-impact solution, i.e., the minimal effort to achieve both higher interoperability and provenance metadata levels in the Integrated Development Environments (IDE) context and how to facilitate the inherent data querying task?\". We introduce a novel and low-impact methodology specifically designed for code built in that context, combining Semantic Web concepts and reflection in order to minimize the gap for exporting ML metadata in a structured manner, allowing embedded code annotations that are, in run-time, converted in one of the state-of-the-art ML schemas for the Semantic Web: MEX Vocabulary.","PeriodicalId":177013,"journal":{"name":"Proceedings of the 12th International Conference on Semantic Systems","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 12th International Conference on Semantic Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2993318.2993320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Despite recent efforts to achieve a high level of interoperability of Machine Learning (ML) experiments, positively collaborating with the Reproducible Research context, we still run into problems created due to the existence of different ML platforms: each of those have a specific conceptualization or schema for representing data and metadata. This scenario leads to an extra coding-effort to achieve both the desired interoperability and a better provenance level as well as a more automatized environment for obtaining the generated results. Hence, when using ML libraries, it is a common task to re-design specific data models (schemata) and develop wrappers to manage the produced outputs. In this article, we discuss this gap focusing on the solution for the question: "What is the cleanest and lowest-impact solution, i.e., the minimal effort to achieve both higher interoperability and provenance metadata levels in the Integrated Development Environments (IDE) context and how to facilitate the inherent data querying task?". We introduce a novel and low-impact methodology specifically designed for code built in that context, combining Semantic Web concepts and reflection in order to minimize the gap for exporting ML metadata in a structured manner, allowing embedded code annotations that are, in run-time, converted in one of the state-of-the-art ML schemas for the Semantic Web: MEX Vocabulary.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MEX接口:自动化机器学习元数据生成
尽管最近努力实现机器学习(ML)实验的高水平互操作性,积极地与可复制研究上下文合作,但我们仍然遇到由于不同ML平台的存在而产生的问题:每个平台都有一个特定的概念或模式来表示数据和元数据。这种情况会导致额外的编码工作,以实现所需的互操作性和更好的来源级别,以及获得生成结果的更自动化的环境。因此,在使用ML库时,重新设计特定的数据模型(模式)和开发包装器来管理生成的输出是一项常见的任务。在本文中,我们将重点讨论以下问题的解决方案:“什么是最干净和影响最小的解决方案,即在集成开发环境(IDE)上下文中实现更高的互操作性和源元数据级别的最小努力,以及如何促进固有的数据查询任务?”我们引入了一种新颖的、低影响的方法,专门为在这种情况下构建的代码设计,结合语义Web概念和反射,以最大限度地减少以结构化方式导出ML元数据的差距,允许在运行时将嵌入的代码注释转换为用于语义Web的最先进的ML模式之一:MEX Vocabulary。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Top-level Ideas about Importing, Translating and Exporting Knowledge via an Ontology of Representation Languages Cross-Evaluation of Entity Linking and Disambiguation Systems for Clinical Text Annotation Executing SPARQL queries over Mapped Document Store with SparqlMap-M Evaluating Query and Storage Strategies for RDF Archives Linking Images to Semantic Knowledge Base with User-generated Tags
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1