首页 > 最新文献

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining最新文献

英文 中文
Deep Natural Language Processing for Search and Recommender Systems 搜索和推荐系统的深度自然语言处理
Weiwei Guo, Huiji Gao, Jun Shi, Bo Long, Liang Zhang, Bee-Chung Chen, D. Agarwal
Search and recommender systems share many fundamental components including language understanding, retrieval and ranking, and language generation. Building powerful search and recommender systems requires processing natural language effectively and efficiently. Recent rapid growth of deep learning technologies has presented both opportunities and challenges in this area. This tutorial offers an overview of deep learning based natural language processing (NLP) for search and recommender systems from an industry perspective. It first introduces deep learning based NLP technologies, including language understanding and language generation. Then it details how those technologies can be applied to common tasks in search and recommender systems, including query and document understanding, retrieval and ranking, and language generation. Applications in LinkedIn production systems are presented. The tutorial concludes with discussion of future trend.
搜索和推荐系统共享许多基本组件,包括语言理解、检索和排名以及语言生成。构建强大的搜索和推荐系统需要有效地处理自然语言。近年来深度学习技术的快速发展为这一领域带来了机遇和挑战。本教程从行业角度概述了用于搜索和推荐系统的基于深度学习的自然语言处理(NLP)。它首先介绍了基于深度学习的NLP技术,包括语言理解和语言生成。然后详细介绍了如何将这些技术应用于搜索和推荐系统中的常见任务,包括查询和文档理解、检索和排名以及语言生成。介绍了LinkedIn生产系统中的应用。本教程最后讨论了未来的趋势。
{"title":"Deep Natural Language Processing for Search and Recommender Systems","authors":"Weiwei Guo, Huiji Gao, Jun Shi, Bo Long, Liang Zhang, Bee-Chung Chen, D. Agarwal","doi":"10.1145/3292500.3332290","DOIUrl":"https://doi.org/10.1145/3292500.3332290","url":null,"abstract":"Search and recommender systems share many fundamental components including language understanding, retrieval and ranking, and language generation. Building powerful search and recommender systems requires processing natural language effectively and efficiently. Recent rapid growth of deep learning technologies has presented both opportunities and challenges in this area. This tutorial offers an overview of deep learning based natural language processing (NLP) for search and recommender systems from an industry perspective. It first introduces deep learning based NLP technologies, including language understanding and language generation. Then it details how those technologies can be applied to common tasks in search and recommender systems, including query and document understanding, retrieval and ranking, and language generation. Applications in LinkedIn production systems are presented. The tutorial concludes with discussion of future trend.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127877213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Testing Dynamic Incentive Compatibility in Display Ad Auctions 展示广告拍卖的动态激励兼容性测试
Yuan Deng, Sébastien Lahaie
The question of transparency has become a key point of contention between buyers and sellers of display advertising space: ads are allocated via complex, black-box auction systems whose mechanics can be difficult to model let alone optimize against. Motivated by this concern, this paper takes the perspective of a single advertiser and develops statistical tests to confirm whether an underlying auction mechanism is dynamically incentive compatible (IC), so that truthful bidding in each individual auction and across time is an optimal strategy. The most general notion of dynamic-IC presumes that the seller knows how buyers discount future surplus, which is questionable in practice. We characterize dynamic mechanisms that are dynamic-IC for all possible discounting factors according to two intuitive conditions: the mechanism should be IC at each stage in the usual sense, and expected present utility (under truthful bidding) should be independent of past bids. The conditions motivate two separate experiments based on bid perturbations that can be run simultaneously on the same impression traffic. We provide a novel statistical test of stage-IC along with a test for utility-independence that can detect lags in how the seller uses past bid information. We evaluate our tests on display ad data from a major ad exchange and show how they can accurately uncover evidence of first- or second-price auctions coupled with dynamic reserve prices, among other types of dynamic mechanisms.
透明度问题已成为展示广告空间买家和卖家之间争论的关键点:广告是通过复杂的黑盒拍卖系统分配的,其机制很难建模,更不用说优化了。受此启发,本文从单个广告主的角度出发,开发了统计测试,以确认潜在的拍卖机制是否具有动态激励兼容(IC),从而使每次拍卖和跨时间的真实出价是最优策略。最一般的动态概念假定卖方知道买方如何贴现未来剩余,这在实践中是值得怀疑的。根据两个直观的条件,我们描述了所有可能的贴现因素都是动态动态的动态机制:在通常意义上,该机制在每个阶段都应该是动态动态的,并且预期当前效用(在真实出价下)应该独立于过去的出价。这些条件激发了两个基于出价扰动的独立实验,这些实验可以同时在相同的印象流量上运行。我们提供了一种新的阶段ic统计测试,以及一种可以检测卖方如何使用过去出价信息的效用独立性测试。我们对一家大型广告交易所的展示广告数据进行了测试,并展示了它们如何准确地发现第一或第二价格拍卖与动态保留价格以及其他类型的动态机制相结合的证据。
{"title":"Testing Dynamic Incentive Compatibility in Display Ad Auctions","authors":"Yuan Deng, Sébastien Lahaie","doi":"10.1145/3292500.3330943","DOIUrl":"https://doi.org/10.1145/3292500.3330943","url":null,"abstract":"The question of transparency has become a key point of contention between buyers and sellers of display advertising space: ads are allocated via complex, black-box auction systems whose mechanics can be difficult to model let alone optimize against. Motivated by this concern, this paper takes the perspective of a single advertiser and develops statistical tests to confirm whether an underlying auction mechanism is dynamically incentive compatible (IC), so that truthful bidding in each individual auction and across time is an optimal strategy. The most general notion of dynamic-IC presumes that the seller knows how buyers discount future surplus, which is questionable in practice. We characterize dynamic mechanisms that are dynamic-IC for all possible discounting factors according to two intuitive conditions: the mechanism should be IC at each stage in the usual sense, and expected present utility (under truthful bidding) should be independent of past bids. The conditions motivate two separate experiments based on bid perturbations that can be run simultaneously on the same impression traffic. We provide a novel statistical test of stage-IC along with a test for utility-independence that can detect lags in how the seller uses past bid information. We evaluate our tests on display ad data from a major ad exchange and show how they can accurately uncover evidence of first- or second-price auctions coupled with dynamic reserve prices, among other types of dynamic mechanisms.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126365749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Internal Promotion Optimization 内部推广优化
Rupesh Gupta, Guangde Chen, Shipeng Yu
Most large Internet companies run internal promotions to cross-promote their different products and/or to educate members on how to obtain additional value from the products that they already use. This in turn drives engagement and/or revenue for the company. However, since these internal promotions can distract a member away from the product or page where these are shown, there is a non-zero cannibalization loss incurred for showing these internal promotions. This loss has to be carefully weighed against the gain from showing internal promotions. This can be a complex problem if different internal promotions optimize for different objectives. In that case, it is difficult to compare not just the gain from a conversion through an internal promotion against the loss incurred for showing that internal promotion, but also the gains from conversions through different internal promotions. Hence, we need a principled approach for deciding which internal promotion (if any) to serve to a member in each opportunity to serve an internal promotion. This approach should optimize not just for the net gain to the company, but also for the member's experience. In this paper, we discuss our approach for optimization of internal promotions at LinkedIn. In particular, we present a cost-benefit analysis of showing internal promotions, our formulation of internal promotion optimization as a constrained optimization problem, the architecture of the system for solving the optimization problem and serving internal promotions in real-time, and experimental results from online A/B tests.
大多数大型互联网公司都会进行内部促销,以交叉推广不同的产品和/或教育成员如何从他们已经使用的产品中获得额外的价值。这反过来又推动了公司的粘性和/或收益。然而,由于这些内部促销活动可能会分散会员对产品或页面的注意力,因此显示这些内部促销活动会产生非零的蚕食损失。必须仔细权衡这种损失与展示内部晋升所带来的收益。如果不同的内部促销针对不同的目标进行优化,这可能是一个复杂的问题。在这种情况下,不仅很难比较通过内部推广获得的转化收益与显示该内部推广所造成的损失,而且很难比较通过不同的内部推广获得的转化收益。因此,我们需要一种原则性的方法来决定在每次为内部晋升服务的机会中为成员提供哪种内部晋升服务(如果有的话)。这种方法不仅要优化公司的净收益,还要优化会员的体验。在本文中,我们讨论了我们在LinkedIn内部促销的优化方法。特别地,我们给出了显示内部促销的成本效益分析,我们将内部促销优化作为约束优化问题的表述,解决优化问题和实时服务内部促销的系统架构,以及在线a /B测试的实验结果。
{"title":"Internal Promotion Optimization","authors":"Rupesh Gupta, Guangde Chen, Shipeng Yu","doi":"10.1145/3292500.3330715","DOIUrl":"https://doi.org/10.1145/3292500.3330715","url":null,"abstract":"Most large Internet companies run internal promotions to cross-promote their different products and/or to educate members on how to obtain additional value from the products that they already use. This in turn drives engagement and/or revenue for the company. However, since these internal promotions can distract a member away from the product or page where these are shown, there is a non-zero cannibalization loss incurred for showing these internal promotions. This loss has to be carefully weighed against the gain from showing internal promotions. This can be a complex problem if different internal promotions optimize for different objectives. In that case, it is difficult to compare not just the gain from a conversion through an internal promotion against the loss incurred for showing that internal promotion, but also the gains from conversions through different internal promotions. Hence, we need a principled approach for deciding which internal promotion (if any) to serve to a member in each opportunity to serve an internal promotion. This approach should optimize not just for the net gain to the company, but also for the member's experience. In this paper, we discuss our approach for optimization of internal promotions at LinkedIn. In particular, we present a cost-benefit analysis of showing internal promotions, our formulation of internal promotion optimization as a constrained optimization problem, the architecture of the system for solving the optimization problem and serving internal promotions in real-time, and experimental results from online A/B tests.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126217289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
TrajGuard TrajGuard
Zheyi Pan, J. Bao, Weinan Zhang, Yŏng-ik Yu, Yu Zheng
Trajectory data has been widely used in many urban applications. Sharing trajectory data with effective supervision is a vital task, as it contains private information of moving objects. However, malicious data users can modify trajectories in various ways to avoid data distribution tracking by the hashing-based data signatures, e.g., MD5. Moreover, the existing trajectory data protection scheme can only protect trajectories from either spatial or temporal modifications. Finally, so far there is no authoritative third party for trajectory data sharing process, as trajectory data is too sensitive. To this end, we propose a novel trajectory copyright protection scheme, which can protect trajectory data from comprehensive types of data modifications/attacks. Three main techniques are employed to effectively guarantee the robustness and comprehensiveness of the proposed data sharing scheme: 1) the identity information is embedded distributively across a set of sub-trajectories partitioned based on the spatio-temporal regions; 2) the centroid distance of the sub-trajectories is served as a stable trajectory attribute to embed the information; and 3) the blockchain technique is used as a trusted third party to log all data transaction history for data distribution tracking in a decentralized manner. Extensive experiments were conducted based on two real-world trajectory datasets to demonstrate the effectiveness of our proposed scheme.
{"title":"TrajGuard","authors":"Zheyi Pan, J. Bao, Weinan Zhang, Yŏng-ik Yu, Yu Zheng","doi":"10.1145/3292500.3330685","DOIUrl":"https://doi.org/10.1145/3292500.3330685","url":null,"abstract":"Trajectory data has been widely used in many urban applications. Sharing trajectory data with effective supervision is a vital task, as it contains private information of moving objects. However, malicious data users can modify trajectories in various ways to avoid data distribution tracking by the hashing-based data signatures, e.g., MD5. Moreover, the existing trajectory data protection scheme can only protect trajectories from either spatial or temporal modifications. Finally, so far there is no authoritative third party for trajectory data sharing process, as trajectory data is too sensitive. To this end, we propose a novel trajectory copyright protection scheme, which can protect trajectory data from comprehensive types of data modifications/attacks. Three main techniques are employed to effectively guarantee the robustness and comprehensiveness of the proposed data sharing scheme: 1) the identity information is embedded distributively across a set of sub-trajectories partitioned based on the spatio-temporal regions; 2) the centroid distance of the sub-trajectories is served as a stable trajectory attribute to embed the information; and 3) the blockchain technique is used as a trusted third party to log all data transaction history for data distribution tracking in a decentralized manner. Extensive experiments were conducted based on two real-world trajectory datasets to demonstrate the effectiveness of our proposed scheme.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115837566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Smart Roles: Inferring Professional Roles in Email Networks 智能角色:推断电子邮件网络中的专业角色
Di Jin, Mark Heimann, Tara Safavi, Mengdi Wang, Wei Lee, Lindsay Snider, Danai Koutra
Email is ubiquitous in the workplace. Naturally, machine learning models that make third-party email clients "smarter" can dramatically impact employees' productivity and efficiency. Motivated by this potential, we study the task of professional role inference from email data, which is crucial for email prioritization and contact recommendation systems. The central question we address is: Given limited data about employees, as is common in third-party email applications, can we infer where in the organizational hierarchy these employees belong based on their email behavior? Toward our goal, in this paper we study professional role inference on a unique new email dataset comprising billions of email exchanges across thousands of organizations. Taking a network approach in which nodes are employees and edges represent email communication, we propose EMBER, or EMBedding Email-based Roles, which finds email-centric embeddings of network nodes to be used in professional role inference tasks. EMBER automatically captures behavioral similarity between employees in the email network, leading to embeddings that naturally distinguish employees of different hierarchical roles. EMBER often outperforms the state-of-the-art by 2-20% in role inference accuracy and 2.5-344x in speed. We also use EMBER with our unique dataset to study how inferred professional roles compare between organizations of different sizes and sectors, gaining new insights into organizational hierarchy.
电子邮件在工作场所无处不在。当然,使第三方电子邮件客户端“更智能”的机器学习模型可以极大地影响员工的生产力和效率。在这种潜力的激励下,我们研究了来自电子邮件数据的专业角色推断任务,这对于电子邮件优先级和联系人推荐系统至关重要。我们要解决的核心问题是:鉴于员工的数据有限(这在第三方电子邮件应用程序中很常见),我们能否根据这些员工的电子邮件行为推断出他们在组织层级中的位置?为了实现我们的目标,在本文中,我们研究了一个独特的新电子邮件数据集上的专业角色推断,该数据集包含数千个组织的数十亿封电子邮件交换。采用网络方法,其中节点是员工,边缘代表电子邮件通信,我们提出了EMBER,即嵌入基于电子邮件的角色,它找到以电子邮件为中心的网络节点嵌入,用于专业角色推断任务。EMBER自动捕获电子邮件网络中员工之间的行为相似性,从而产生嵌入,自然地区分不同层次角色的员工。EMBER通常在角色推理精度上比最先进的系统高出2-20%,在速度上高出2.5-344倍。我们还使用EMBER和我们独特的数据集来研究不同规模和行业的组织之间如何比较推断的专业角色,从而获得对组织层次的新见解。
{"title":"Smart Roles: Inferring Professional Roles in Email Networks","authors":"Di Jin, Mark Heimann, Tara Safavi, Mengdi Wang, Wei Lee, Lindsay Snider, Danai Koutra","doi":"10.1145/3292500.3330735","DOIUrl":"https://doi.org/10.1145/3292500.3330735","url":null,"abstract":"Email is ubiquitous in the workplace. Naturally, machine learning models that make third-party email clients \"smarter\" can dramatically impact employees' productivity and efficiency. Motivated by this potential, we study the task of professional role inference from email data, which is crucial for email prioritization and contact recommendation systems. The central question we address is: Given limited data about employees, as is common in third-party email applications, can we infer where in the organizational hierarchy these employees belong based on their email behavior? Toward our goal, in this paper we study professional role inference on a unique new email dataset comprising billions of email exchanges across thousands of organizations. Taking a network approach in which nodes are employees and edges represent email communication, we propose EMBER, or EMBedding Email-based Roles, which finds email-centric embeddings of network nodes to be used in professional role inference tasks. EMBER automatically captures behavioral similarity between employees in the email network, leading to embeddings that naturally distinguish employees of different hierarchical roles. EMBER often outperforms the state-of-the-art by 2-20% in role inference accuracy and 2.5-344x in speed. We also use EMBER with our unique dataset to study how inferred professional roles compare between organizations of different sizes and sectors, gaining new insights into organizational hierarchy.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129999833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Pythia: AI-assisted Code Completion System 皮媞亚:人工智能辅助代码完成系统
Alexey Svyatkovskiy, Ying Zhao, Shengyu Fu, Neel Sundaresan
In this paper, we propose a novel end-to-end approach for AI-assisted code completion called Pythia. It generates ranked lists of method and API recommendations which can be used by software developers at edit time. The system is currently deployed as part of Intellicode extension in Visual Studio Code IDE. Pythia exploits state-of-the-art large-scale deep learning models trained on code contexts extracted from abstract syntax trees. It is designed to work at a high throughput predicting the best matching code completions on the order of 100 ms. We describe the architecture of the system, perform comparisons to frequency-based approach and invocation-based Markov Chain language model, and discuss challenges serving Pythia models on lightweight client devices. The offline evaluation results obtained on 2700 Python open source software GitHub repositories show a top-5 accuracy of 92%, surpassing the baseline models by 20% averaged over classes, for both intra and cross-project settings.
在本文中,我们提出了一种新颖的端到端ai辅助代码完成方法,称为Pythia。它生成方法和API推荐的排名列表,供软件开发人员在编辑时使用。该系统目前作为Intellicode扩展的一部分部署在Visual Studio Code IDE中。Pythia利用从抽象语法树中提取的代码上下文训练的最先进的大规模深度学习模型。它被设计成在高吞吐量下工作,以100毫秒的顺序预测最佳匹配代码完成。我们描述了系统的架构,与基于频率的方法和基于调用的马尔可夫链语言模型进行了比较,并讨论了在轻量级客户端设备上提供Pythia模型的挑战。在2700个Python开源软件GitHub存储库上获得的离线评估结果显示,在类别中,无论是内部还是跨项目设置,前5名的准确率都达到92%,比基线模型平均高出20%。
{"title":"Pythia: AI-assisted Code Completion System","authors":"Alexey Svyatkovskiy, Ying Zhao, Shengyu Fu, Neel Sundaresan","doi":"10.1145/3292500.3330699","DOIUrl":"https://doi.org/10.1145/3292500.3330699","url":null,"abstract":"In this paper, we propose a novel end-to-end approach for AI-assisted code completion called Pythia. It generates ranked lists of method and API recommendations which can be used by software developers at edit time. The system is currently deployed as part of Intellicode extension in Visual Studio Code IDE. Pythia exploits state-of-the-art large-scale deep learning models trained on code contexts extracted from abstract syntax trees. It is designed to work at a high throughput predicting the best matching code completions on the order of 100 ms. We describe the architecture of the system, perform comparisons to frequency-based approach and invocation-based Markov Chain language model, and discuss challenges serving Pythia models on lightweight client devices. The offline evaluation results obtained on 2700 Python open source software GitHub repositories show a top-5 accuracy of 92%, surpassing the baseline models by 20% averaged over classes, for both intra and cross-project settings.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131771171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 109
E.T.-RNN: Applying Deep Learning to Credit Loan Applications E.T.-RNN:将深度学习应用于信用贷款申请
Dmitrii Babaev, M. Savchenko, A. Tuzhilin, Dmitrii Umerenkov
In this paper we present a novel approach to credit scoring of retail customers in the banking industry based on deep learning methods. We used RNNs on fine grained transnational data to compute credit scores for the loan applicants. We demonstrate that our approach significantly outperforms the baselines based on the customer data of a large European bank. We also conducted a pilot study on loan applicants of the bank, and the study produced significant financial gains for the organization. In addition, our method has several other advantages described in the paper that are very significant for the bank.
本文提出了一种基于深度学习方法的银行业零售客户信用评分新方法。我们在细粒度跨国数据上使用rnn来计算贷款申请人的信用评分。我们证明,我们的方法明显优于基于大型欧洲银行客户数据的基线。我们还对银行的贷款申请人进行了试点研究,该研究为组织带来了显著的财务收益。此外,我们的方法还具有论文中描述的对银行非常重要的其他几个优点。
{"title":"E.T.-RNN: Applying Deep Learning to Credit Loan Applications","authors":"Dmitrii Babaev, M. Savchenko, A. Tuzhilin, Dmitrii Umerenkov","doi":"10.1145/3292500.3330693","DOIUrl":"https://doi.org/10.1145/3292500.3330693","url":null,"abstract":"In this paper we present a novel approach to credit scoring of retail customers in the banking industry based on deep learning methods. We used RNNs on fine grained transnational data to compute credit scores for the loan applicants. We demonstrate that our approach significantly outperforms the baselines based on the customer data of a large European bank. We also conducted a pilot study on loan applicants of the bank, and the study produced significant financial gains for the organization. In addition, our method has several other advantages described in the paper that are very significant for the bank.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130792742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 69
AI for Small Businesses and Consumers: Applications and Innovations 面向小型企业和消费者的人工智能:应用与创新
Ashok Srivastava
Small businesses are the lifeblood of the U.S. economy, representing an astounding 99.9 percent of all businesses, creating two-thirds of net new jobs, and accounting for 44 percent of economic activity. Yet, 50 percent of small businesses go out of business in the first 5 years. What's behind this dismal statistic? Among the top contributing factors is cash flow management. Owners who cannot efficiently manage the inflow and outflow of cash are almost certain to fail. And, those who can are more likely to break through the statistical 5-year barrier to build thriving businesses. In this talk, we'll describe novel applications of artificial intelligence and large-scale machine learning aimed at addressing the problem of forecasting cash flow for small businesses. These are sparse, high-dimensional correlated time series. We'll present new results on forecasting this type of time series, using scalable Gaussian Processes with kernels formed through the use of deep learning. These methods yield highly accurate predictions but also include a principled approach for generating confidence intervals.
小企业是美国经济的命脉,占所有企业的99.9%,创造了三分之二的净新就业机会,占经济活动的44%。然而,50%的小企业在头5年倒闭。这个令人沮丧的数据背后是什么?其中最重要的因素是现金流管理。不能有效管理现金流入和流出的所有者几乎肯定会破产。而且,那些有能力的人更有可能突破5年的统计障碍,建立蓬勃发展的企业。在这次演讲中,我们将介绍人工智能和大规模机器学习的新应用,旨在解决预测小企业现金流的问题。这些是稀疏的,高维相关的时间序列。我们将展示预测这类时间序列的新结果,使用可扩展的高斯过程和通过使用深度学习形成的核。这些方法产生了高度准确的预测,但也包括一种产生置信区间的原则方法。
{"title":"AI for Small Businesses and Consumers: Applications and Innovations","authors":"Ashok Srivastava","doi":"10.1145/3292500.3340398","DOIUrl":"https://doi.org/10.1145/3292500.3340398","url":null,"abstract":"Small businesses are the lifeblood of the U.S. economy, representing an astounding 99.9 percent of all businesses, creating two-thirds of net new jobs, and accounting for 44 percent of economic activity. Yet, 50 percent of small businesses go out of business in the first 5 years. What's behind this dismal statistic? Among the top contributing factors is cash flow management. Owners who cannot efficiently manage the inflow and outflow of cash are almost certain to fail. And, those who can are more likely to break through the statistical 5-year barrier to build thriving businesses. In this talk, we'll describe novel applications of artificial intelligence and large-scale machine learning aimed at addressing the problem of forecasting cash flow for small businesses. These are sparse, high-dimensional correlated time series. We'll present new results on forecasting this type of time series, using scalable Gaussian Processes with kernels formed through the use of deep learning. These methods yield highly accurate predictions but also include a principled approach for generating confidence intervals.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131079563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Co-Prediction of Multiple Transportation Demands Based on Deep Spatio-Temporal Neural Network 基于深度时空神经网络的多重交通需求协同预测
Junchen Ye, Leilei Sun, Bowen Du, Yanjie Fu, Xinran Tong, Hui Xiong
Taxi and sharing bike bring great convenience to urban transportation. A lot of efforts have been made to improve the efficiency of taxi service or bike sharing system by predicting the next-period pick-up or drop-off demand. Different from the existing research, this paper is motivated by the following two facts: 1) From a micro view, an observed spatial demand at any time slot could be decomposed as a combination of many hidden spatial demand bases; 2) From a macro view, the multiple transportation demands are strongly correlated with each other, both spatially and temporally. Definitely, the above two views have great potential to revolutionize the existing taxi or bike demand prediction methods. Along this line, this paper provides a novel Co-prediction method based on Spatio-Temporal neural Network, namely, CoST-Net. In particular, a deep convolutional neural network is constructed to decompose a spatial demand into a combination of hidden spatial demand bases. The combination weight vector is used as a representation of the decomposed spatial demand. Then, a heterogeneous Long Short-Term Memory (LSTM) is proposed to integrate the states of multiple transportation demands, and also model the dynamics of them mixedly. Last, the environmental features such as humidity and temperature are incorporated with the achieved overall hidden states to predict the multiple demands simultaneously. Experiments have been conducted on real-world taxi and sharing bike demand data, results demonstrate the superiority of the proposed method over both classical and the state-of-the-art transportation demand prediction methods.
出租车和共享单车给城市交通带来了极大的便利。通过预测下一阶段的上下车需求,为提高出租车服务或共享单车系统的效率做出了很多努力。与已有研究不同的是,本文的研究动机有以下两点:1)从微观角度看,观察到的任何时隙的空间需求都可以分解为许多隐藏的空间需求基础的组合;2)从宏观上看,多种交通需求在空间和时间上都具有很强的相关性。当然,上述两种观点有很大的潜力来彻底改变现有的出租车或自行车需求预测方法。在此基础上,本文提出了一种基于时空神经网络的协同预测方法,即CoST-Net。特别地,构建深度卷积神经网络将空间需求分解为隐藏空间需求基的组合。组合权重向量作为分解后的空间需求的表示。在此基础上,提出了一种异质长短期记忆(LSTM)模型来整合多种交通运输需求的状态,并对其进行混合动态建模。最后,将湿度和温度等环境特征与实现的总体隐藏状态相结合,同时预测多个需求。对现实世界的出租车和共享单车需求数据进行了实验,结果表明该方法优于经典和最先进的交通需求预测方法。
{"title":"Co-Prediction of Multiple Transportation Demands Based on Deep Spatio-Temporal Neural Network","authors":"Junchen Ye, Leilei Sun, Bowen Du, Yanjie Fu, Xinran Tong, Hui Xiong","doi":"10.1145/3292500.3330887","DOIUrl":"https://doi.org/10.1145/3292500.3330887","url":null,"abstract":"Taxi and sharing bike bring great convenience to urban transportation. A lot of efforts have been made to improve the efficiency of taxi service or bike sharing system by predicting the next-period pick-up or drop-off demand. Different from the existing research, this paper is motivated by the following two facts: 1) From a micro view, an observed spatial demand at any time slot could be decomposed as a combination of many hidden spatial demand bases; 2) From a macro view, the multiple transportation demands are strongly correlated with each other, both spatially and temporally. Definitely, the above two views have great potential to revolutionize the existing taxi or bike demand prediction methods. Along this line, this paper provides a novel Co-prediction method based on Spatio-Temporal neural Network, namely, CoST-Net. In particular, a deep convolutional neural network is constructed to decompose a spatial demand into a combination of hidden spatial demand bases. The combination weight vector is used as a representation of the decomposed spatial demand. Then, a heterogeneous Long Short-Term Memory (LSTM) is proposed to integrate the states of multiple transportation demands, and also model the dynamics of them mixedly. Last, the environmental features such as humidity and temperature are incorporated with the achieved overall hidden states to predict the multiple demands simultaneously. Experiments have been conducted on real-world taxi and sharing bike demand data, results demonstrate the superiority of the proposed method over both classical and the state-of-the-art transportation demand prediction methods.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131697148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 91
Do Simpler Models Exist and How Can We Find Them? 是否存在更简单的模型,我们如何找到它们?
C. Rudin
While the trend in machine learning has tended towards more complex hypothesis spaces, it is not clear that this extra complexity is always necessary or helpful for many domains. In particular, models and their predictions are often made easier to understand by adding interpretability constraints. These constraints shrink the hypothesis space; that is, they make the model simpler. Statistical learning theory suggests that generalization may be improved as a result as well. However, adding extra constraints can make optimization (exponentially) harder. For instance it is much easier in practice to create an accurate neural network than an accurate and sparse decision tree. We address the following question: Can we show that a simple-but-accurate machine learning model might exist for our problem, before actually finding it? If the answer is promising, it would then be worthwhile to solve the harder constrained optimization problem to find such a model. In this talk, I present an easy calculation to check for the possibility of a simpler model. This calculation indicates that simpler-but-accurate models do exist in practice more often than you might think. I then briefly overview several new methods for interpretable machine learning. These methods are for (i) sparse optimal decision trees, (ii) sparse linear integer models (also called medical scoring systems), and (iii) interpretable case-based reasoning in deep neural networks for computer vision.
虽然机器学习的趋势趋向于更复杂的假设空间,但并不清楚这种额外的复杂性对于许多领域来说总是必要的或有帮助的。特别是,通过添加可解释性约束,模型及其预测通常更容易理解。这些约束缩小了假设空间;也就是说,它们使模型更简单。统计学习理论表明,泛化也可能因此得到改善。然而,添加额外的约束会使优化(成倍地)变得更加困难。例如,在实践中,创建一个精确的神经网络要比创建一个精确的稀疏决策树容易得多。我们解决了以下问题:在真正找到问题之前,我们能否证明一个简单但准确的机器学习模型可能存在?如果答案是有希望的,那么解决更难的约束优化问题来找到这样一个模型是值得的。在这次演讲中,我提出了一个简单的计算来检查一个更简单模型的可能性。这一计算表明,在实践中,更简单但更精确的模型确实比您想象的更常见。然后简要概述了可解释机器学习的几种新方法。这些方法适用于(i)稀疏最优决策树,(ii)稀疏线性整数模型(也称为医疗评分系统),以及(iii)用于计算机视觉的深度神经网络中可解释的基于案例的推理。
{"title":"Do Simpler Models Exist and How Can We Find Them?","authors":"C. Rudin","doi":"10.1145/3292500.3330823","DOIUrl":"https://doi.org/10.1145/3292500.3330823","url":null,"abstract":"While the trend in machine learning has tended towards more complex hypothesis spaces, it is not clear that this extra complexity is always necessary or helpful for many domains. In particular, models and their predictions are often made easier to understand by adding interpretability constraints. These constraints shrink the hypothesis space; that is, they make the model simpler. Statistical learning theory suggests that generalization may be improved as a result as well. However, adding extra constraints can make optimization (exponentially) harder. For instance it is much easier in practice to create an accurate neural network than an accurate and sparse decision tree. We address the following question: Can we show that a simple-but-accurate machine learning model might exist for our problem, before actually finding it? If the answer is promising, it would then be worthwhile to solve the harder constrained optimization problem to find such a model. In this talk, I present an easy calculation to check for the possibility of a simpler model. This calculation indicates that simpler-but-accurate models do exist in practice more often than you might think. I then briefly overview several new methods for interpretable machine learning. These methods are for (i) sparse optimal decision trees, (ii) sparse linear integer models (also called medical scoring systems), and (iii) interpretable case-based reasoning in deep neural networks for computer vision.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115408444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1