野外的自动机器学习

C. Perlich
{"title":"野外的自动机器学习","authors":"C. Perlich","doi":"10.1145/2959100.2959191","DOIUrl":null,"url":null,"abstract":"Machine Learning research is progressing at an ever increasing pace. Fueled by technology advances commonly referred to as \"Big Data\", all data related fields are teaming with scientific and applied activity: our communities explore new application areas, develop new learning algorithms, and continuously scale and improve optimization and estimation methods. But from an industry perspective, many of the most impeding challenges are entirely elsewhere. This talk takes a fresh look at the practical state of affairs in the context of running a large-scale automated machine learning system that supports 50 Billion decision daily on behalf of hundreds of digital advertisers. Some of the key lessons are 1) robustness beats peak performance almost always, 2) support for the constant dynamic fluctuations in the data stream is essential, 3) models exploiting unknowingly any weakness of your metrics, and finally 4) the fact that despite big data, the data you really want never exists.","PeriodicalId":315651,"journal":{"name":"Proceedings of the 10th ACM Conference on Recommender Systems","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated Machine Learning in the Wild\",\"authors\":\"C. Perlich\",\"doi\":\"10.1145/2959100.2959191\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine Learning research is progressing at an ever increasing pace. Fueled by technology advances commonly referred to as \\\"Big Data\\\", all data related fields are teaming with scientific and applied activity: our communities explore new application areas, develop new learning algorithms, and continuously scale and improve optimization and estimation methods. But from an industry perspective, many of the most impeding challenges are entirely elsewhere. This talk takes a fresh look at the practical state of affairs in the context of running a large-scale automated machine learning system that supports 50 Billion decision daily on behalf of hundreds of digital advertisers. Some of the key lessons are 1) robustness beats peak performance almost always, 2) support for the constant dynamic fluctuations in the data stream is essential, 3) models exploiting unknowingly any weakness of your metrics, and finally 4) the fact that despite big data, the data you really want never exists.\",\"PeriodicalId\":315651,\"journal\":{\"name\":\"Proceedings of the 10th ACM Conference on Recommender Systems\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-09-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 10th ACM Conference on Recommender Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2959100.2959191\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 10th ACM Conference on Recommender Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2959100.2959191","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

机器学习研究正在以越来越快的速度发展。在通常被称为“大数据”的技术进步的推动下,所有与数据相关的领域都与科学和应用活动相结合:我们的社区探索新的应用领域,开发新的学习算法,不断扩展和改进优化和估计方法。但从行业的角度来看,许多最具阻碍的挑战完全在其他地方。这次演讲以全新的视角审视了在运行一个大型自动化机器学习系统的背景下,事务的实际状态,该系统每天代表数百个数字广告商支持500亿个决策。一些关键的经验教训是:1)健壮性几乎总是胜过峰值性能,2)对数据流中恒定动态波动的支持是必不可少的,3)模型在不知情的情况下利用你的指标的任何弱点,最后4)尽管有大数据,但你真正想要的数据永远不存在。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Automated Machine Learning in the Wild
Machine Learning research is progressing at an ever increasing pace. Fueled by technology advances commonly referred to as "Big Data", all data related fields are teaming with scientific and applied activity: our communities explore new application areas, develop new learning algorithms, and continuously scale and improve optimization and estimation methods. But from an industry perspective, many of the most impeding challenges are entirely elsewhere. This talk takes a fresh look at the practical state of affairs in the context of running a large-scale automated machine learning system that supports 50 Billion decision daily on behalf of hundreds of digital advertisers. Some of the key lessons are 1) robustness beats peak performance almost always, 2) support for the constant dynamic fluctuations in the data stream is essential, 3) models exploiting unknowingly any weakness of your metrics, and finally 4) the fact that despite big data, the data you really want never exists.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Opening Remarks Mining Information for the Cold-Item Problem Are You Influenced by Others When Rating?: Improve Rating Prediction by Conformity Modeling Contrasting Offline and Online Results when Evaluating Recommendation Algorithms Intent-Aware Diversification Using a Constrained PLSA
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1