数据分析的机器学习管道生成方法

Zhao Ru-tao, Wang Jing, Chen Gao-jian, Li Qian-wen, Yuan Yun-jing
{"title":"数据分析的机器学习管道生成方法","authors":"Zhao Ru-tao, Wang Jing, Chen Gao-jian, Li Qian-wen, Yuan Yun-jing","doi":"10.1109/ICCC51575.2020.9345123","DOIUrl":null,"url":null,"abstract":"Data analysis requires a high level of expertise for domain workers, and AutoML aims to make these decisions in an automated way. But it is still a difficult problem to automatically generate machine learning pipelines with high performance in acceptable time. This paper presents a DFSR (Data Feature and Service Association) approach to automatically generating machine learning pipelines utilizing data features and service associations. The experimental results showed that the performance of the generated pipelines reached the satisfactory level of current AutoML tools, and the time consumption is reduced to the minute level.","PeriodicalId":386048,"journal":{"name":"2020 IEEE 6th International Conference on Computer and Communications (ICCC)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Machine Learning Pipeline Generation Approach for Data Analysis\",\"authors\":\"Zhao Ru-tao, Wang Jing, Chen Gao-jian, Li Qian-wen, Yuan Yun-jing\",\"doi\":\"10.1109/ICCC51575.2020.9345123\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data analysis requires a high level of expertise for domain workers, and AutoML aims to make these decisions in an automated way. But it is still a difficult problem to automatically generate machine learning pipelines with high performance in acceptable time. This paper presents a DFSR (Data Feature and Service Association) approach to automatically generating machine learning pipelines utilizing data features and service associations. The experimental results showed that the performance of the generated pipelines reached the satisfactory level of current AutoML tools, and the time consumption is reduced to the minute level.\",\"PeriodicalId\":386048,\"journal\":{\"name\":\"2020 IEEE 6th International Conference on Computer and Communications (ICCC)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 6th International Conference on Computer and Communications (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCC51575.2020.9345123\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 6th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC51575.2020.9345123","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

数据分析需要领域工作者具备高水平的专业知识,而AutoML旨在以自动化的方式做出这些决策。但如何在可接受的时间内自动生成高性能的机器学习管道仍然是一个难题。本文提出了一种利用数据特征和服务关联自动生成机器学习管道的DFSR(数据特征和服务关联)方法。实验结果表明,所生成的管道的性能达到了现有AutoML工具的满意水平,并且将耗时降低到分钟级。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Machine Learning Pipeline Generation Approach for Data Analysis
Data analysis requires a high level of expertise for domain workers, and AutoML aims to make these decisions in an automated way. But it is still a difficult problem to automatically generate machine learning pipelines with high performance in acceptable time. This paper presents a DFSR (Data Feature and Service Association) approach to automatically generating machine learning pipelines utilizing data features and service associations. The experimental results showed that the performance of the generated pipelines reached the satisfactory level of current AutoML tools, and the time consumption is reduced to the minute level.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Safe and Stable Timing Method over Air Interface Based on Multi-Base Station Cooperation Peak to Average Power Ratio (PAPR) Mitigation for Underwater Acoustic OFDM System by Using an Efficient Hybridization Technique Monocular Visual-Inertial Odometry Based on Point and Line Features Block Halftoning for Size-Invariant Visual Cryptography Based on Two-Dimensional Lattices Airborne STAP with Unknown Mutual Coupling for Coprime Sampling Structure
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1