野外的自动机器学习

Proceedings of the 10th ACM Conference on Recommender Systems Pub Date : 2016-09-07 DOI:10.1145/2959100.2959191

C. Perlich

{"title":"野外的自动机器学习","authors":"C. Perlich","doi":"10.1145/2959100.2959191","DOIUrl":null,"url":null,"abstract":"Machine Learning research is progressing at an ever increasing pace. Fueled by technology advances commonly referred to as \"Big Data\", all data related fields are teaming with scientific and applied activity: our communities explore new application areas, develop new learning algorithms, and continuously scale and improve optimization and estimation methods. But from an industry perspective, many of the most impeding challenges are entirely elsewhere. This talk takes a fresh look at the practical state of affairs in the context of running a large-scale automated machine learning system that supports 50 Billion decision daily on behalf of hundreds of digital advertisers. Some of the key lessons are 1) robustness beats peak performance almost always, 2) support for the constant dynamic fluctuations in the data stream is essential, 3) models exploiting unknowingly any weakness of your metrics, and finally 4) the fact that despite big data, the data you really want never exists.","PeriodicalId":315651,"journal":{"name":"Proceedings of the 10th ACM Conference on Recommender Systems","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated Machine Learning in the Wild\",\"authors\":\"C. Perlich\",\"doi\":\"10.1145/2959100.2959191\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine Learning research is progressing at an ever increasing pace. Fueled by technology advances commonly referred to as \\\"Big Data\\\", all data related fields are teaming with scientific and applied activity: our communities explore new application areas, develop new learning algorithms, and continuously scale and improve optimization and estimation methods. But from an industry perspective, many of the most impeding challenges are entirely elsewhere. This talk takes a fresh look at the practical state of affairs in the context of running a large-scale automated machine learning system that supports 50 Billion decision daily on behalf of hundreds of digital advertisers. Some of the key lessons are 1) robustness beats peak performance almost always, 2) support for the constant dynamic fluctuations in the data stream is essential, 3) models exploiting unknowingly any weakness of your metrics, and finally 4) the fact that despite big data, the data you really want never exists.\",\"PeriodicalId\":315651,\"journal\":{\"name\":\"Proceedings of the 10th ACM Conference on Recommender Systems\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-09-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 10th ACM Conference on Recommender Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2959100.2959191\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 10th ACM Conference on Recommender Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2959100.2959191","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

机器学习研究正在以越来越快的速度发展。在通常被称为“大数据”的技术进步的推动下，所有与数据相关的领域都与科学和应用活动相结合:我们的社区探索新的应用领域，开发新的学习算法，不断扩展和改进优化和估计方法。但从行业的角度来看，许多最具阻碍的挑战完全在其他地方。这次演讲以全新的视角审视了在运行一个大型自动化机器学习系统的背景下，事务的实际状态，该系统每天代表数百个数字广告商支持500亿个决策。一些关键的经验教训是:1)健壮性几乎总是胜过峰值性能，2)对数据流中恒定动态波动的支持是必不可少的，3)模型在不知情的情况下利用你的指标的任何弱点，最后4)尽管有大数据，但你真正想要的数据永远不存在。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Automated Machine Learning in the Wild

Machine Learning research is progressing at an ever increasing pace. Fueled by technology advances commonly referred to as "Big Data", all data related fields are teaming with scientific and applied activity: our communities explore new application areas, develop new learning algorithms, and continuously scale and improve optimization and estimation methods. But from an industry perspective, many of the most impeding challenges are entirely elsewhere. This talk takes a fresh look at the practical state of affairs in the context of running a large-scale automated machine learning system that supports 50 Billion decision daily on behalf of hundreds of digital advertisers. Some of the key lessons are 1) robustness beats peak performance almost always, 2) support for the constant dynamic fluctuations in the data stream is essential, 3) models exploiting unknowingly any weakness of your metrics, and finally 4) the fact that despite big data, the data you really want never exists.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 10th ACM Conference on Recommender Systems

自引率

0.00%

发文量

期刊最新文献

Opening Remarks Mining Information for the Cold-Item Problem Are You Influenced by Others When Rating?: Improve Rating Prediction by Conformity Modeling Contrasting Offline and Online Results when Evaluating Recommendation Algorithms Intent-Aware Diversification Using a Constrained PLSA