Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining最新文献_第9页

Active Deep Learning for Activity Recognition with Context Aware Annotator Selection 基于上下文感知注释器选择的活动识别主动深度学习

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-25 DOI: 10.1145/3292500.3330688

H. S. Hossain, Nirmalya Roy

Machine learning models are bounded by the credibility of ground truth data used for both training and testing. Regardless of the problem domain, this ground truth annotation is objectively manual and tedious as it needs considerable amount of human intervention. With the advent of Active Learning with multiple annotators, the burden can be somewhat mitigated by actively acquiring labels of most informative data instances. However, multiple annotators with varying degrees of expertise poses new set of challenges in terms of quality of the label received and availability of the annotator. Due to limited amount of ground truth information addressing the variabilities of Activity of Daily Living (ADLs), activity recognition models using wearable and mobile devices are still not robust enough for real-world deployment. In this paper, we first propose an active learning combined deep model which updates its network parameters based on the optimization of a joint loss function. We then propose a novel annotator selection model by exploiting the relationships among the users while considering their heterogeneity with respect to their expertise, physical and spatial context. Our proposed model leverages model-free deep reinforcement learning in a partially observable environment setting to capture the action-reward interaction among multiple annotators. Our experiments in real-world settings exhibit that our active deep model converges to optimal accuracy with fewer labeled instances and achieves ~8% improvement in accuracy in fewer iterations.

机器学习模型受到用于训练和测试的真实数据可信度的限制。无论问题领域是什么，这种基础真理注释客观上都是手工的，而且冗长乏味，因为它需要大量的人工干预。随着带有多个注释器的主动学习的出现，通过主动获取大多数信息数据实例的标签可以在一定程度上减轻负担。然而，具有不同专业知识程度的多个注释者在收到的标签质量和注释者的可用性方面提出了一系列新的挑战。由于处理日常生活活动(adl)可变性的地面真实信息数量有限，使用可穿戴和移动设备的活动识别模型仍然不够健壮，无法用于现实世界的部署。在本文中，我们首先提出了一种主动学习组合深度模型，该模型基于联合损失函数的优化更新其网络参数。然后，我们通过利用用户之间的关系，同时考虑他们在专业知识、物理和空间背景方面的异质性，提出了一种新的注释者选择模型。我们提出的模型在部分可观察的环境设置中利用无模型深度强化学习来捕获多个注释者之间的动作-奖励交互。我们在现实环境中的实验表明，我们的主动深度模型在更少的标记实例下收敛到最优精度，并且在更少的迭代中实现了8%的精度提高。

{"title":"Active Deep Learning for Activity Recognition with Context Aware Annotator Selection","authors":"H. S. Hossain, Nirmalya Roy","doi":"10.1145/3292500.3330688","DOIUrl":"https://doi.org/10.1145/3292500.3330688","url":null,"abstract":"Machine learning models are bounded by the credibility of ground truth data used for both training and testing. Regardless of the problem domain, this ground truth annotation is objectively manual and tedious as it needs considerable amount of human intervention. With the advent of Active Learning with multiple annotators, the burden can be somewhat mitigated by actively acquiring labels of most informative data instances. However, multiple annotators with varying degrees of expertise poses new set of challenges in terms of quality of the label received and availability of the annotator. Due to limited amount of ground truth information addressing the variabilities of Activity of Daily Living (ADLs), activity recognition models using wearable and mobile devices are still not robust enough for real-world deployment. In this paper, we first propose an active learning combined deep model which updates its network parameters based on the optimization of a joint loss function. We then propose a novel annotator selection model by exploiting the relationships among the users while considering their heterogeneity with respect to their expertise, physical and spatial context. Our proposed model leverages model-free deep reinforcement learning in a partially observable environment setting to capture the action-reward interaction among multiple annotators. Our experiments in real-world settings exhibit that our active deep model converges to optimal accuracy with fewer labeled instances and achieves ~8% improvement in accuracy in fewer iterations.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133040318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 31

The Unreasonable Effectiveness, and Difficulty, of Data in Healthcare 医疗保健中数据的不合理有效性和难度

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-25 DOI: 10.1145/3292500.3330645

Peter Lee

Data and data analysis are widely assumed to be the key part of the solution to healthcare systems' problems. Indeed, there are countless ways in which data can be converted into better medical diagnostic tools, more effective therapeutics, and improved productivity for clinicians. But while there is clearly great potential, some big challenges remain to make this all a reality, including making access to health data easier, addressing privacy and ethics concerns, and ensuring the clinical safety of "learning" systems. This talk illustrates what is possible in healthcare technology, and details key challenges that currently prevent this from becoming a reality.

数据和数据分析被广泛认为是解决医疗保健系统问题的关键部分。事实上，有无数种方法可以将数据转化为更好的医疗诊断工具、更有效的治疗方法，并提高临床医生的工作效率。但是，尽管有明显的巨大潜力，但要使这一切成为现实，仍然存在一些重大挑战，包括使获取健康数据更容易，解决隐私和伦理问题，以及确保“学习”系统的临床安全。本次演讲阐述了医疗保健技术的可能性，并详细介绍了目前阻碍其成为现实的关键挑战。

引用次数: 2

Sequence Multi-task Learning to Forecast Mental Wellbeing from Sparse Self-reported Data 序列多任务学习从稀疏自我报告数据预测心理健康

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-25 DOI: 10.1145/3292500.3330730

Dimitris Spathis, S. S. Rodríguez, K. Farrahi, C. Mascolo, Jason Rentfrow

Smartphones have started to be used as self reporting tools for mental health state as they accompany individuals during their days and can therefore gather temporally fine grained data. However, the analysis of self reported mood data offers challenges related to non-homogeneity of mood assessment among individuals due to the complexity of the feeling and the reporting scales, as well as the noise and sparseness of the reports when collected in the wild. In this paper, we propose a new end-to-end ML model inspired by video frame prediction and machine translation, that forecasts future sequences of mood from previous self-reported moods collected in the real world using mobile devices. Contrary to traditional time series forecasting algorithms, our multi-task encoder-decoder recurrent neural network learns patterns from different users, allowing and improving the prediction for users with limited number of self-reports. Unlike traditional feature-based machine learning algorithms, the encoder-decoder architecture enables to forecast a sequence of future moods rather than one single step. Meanwhile, multi-task learning exploits some unique characteristics of the data (mood is bi-dimensional), achieving better results than when training single-task networks or other classifiers. Our experiments using a real-world dataset of 33,000 user-weeks revealed that (i) 3 weeks of sparsely reported mood is the optimal number to accurately forecast mood, (ii) multi-task learning models both dimensions of mood "valence and arousal" with higher accuracy than separate or traditional ML models, and (iii) mood variability, personality traits and day of the week play a key role in the performance of our model. We believe this work provides psychologists and developers of future mobile mental health applications with a ready-to-use and effective tool for early diagnosis of mental health issues at scale.

智能手机已经开始被用作心理健康状况的自我报告工具，因为它们在白天陪伴着个人，因此可以收集时间上细粒度的数据。然而，对自我报告的情绪数据的分析提出了与个体之间情绪评估的非同质性相关的挑战，这是由于感觉和报告尺度的复杂性，以及在野外收集的报告的噪音和稀疏性。在本文中，我们提出了一个受视频帧预测和机器翻译启发的新的端到端机器学习模型，该模型使用移动设备从现实世界中收集的先前自我报告的情绪中预测未来的情绪序列。与传统的时间序列预测算法相反，我们的多任务编码器-解码器递归神经网络从不同的用户那里学习模式，允许并改进对有限数量自我报告的用户的预测。与传统的基于特征的机器学习算法不同，编码器-解码器架构能够预测未来情绪的序列，而不是单一步骤。同时，多任务学习利用了数据的一些独特特征(情绪是二维的)，比训练单任务网络或其他分类器获得了更好的结果。我们使用33,000用户周的真实世界数据集进行的实验显示:(i) 3周的稀疏报告情绪是准确预测情绪的最佳数字，(ii)多任务学习模型的情绪“效价和唤醒”两个维度比单独或传统的ML模型具有更高的准确性，以及(iii)情绪可变性，个性特征和一周中的一天在我们模型的性能中起着关键作用。我们相信，这项工作为心理学家和未来移动心理健康应用程序的开发人员提供了一个现成的、有效的工具，可以大规模地早期诊断心理健康问题。

{"title":"Sequence Multi-task Learning to Forecast Mental Wellbeing from Sparse Self-reported Data","authors":"Dimitris Spathis, S. S. Rodríguez, K. Farrahi, C. Mascolo, Jason Rentfrow","doi":"10.1145/3292500.3330730","DOIUrl":"https://doi.org/10.1145/3292500.3330730","url":null,"abstract":"Smartphones have started to be used as self reporting tools for mental health state as they accompany individuals during their days and can therefore gather temporally fine grained data. However, the analysis of self reported mood data offers challenges related to non-homogeneity of mood assessment among individuals due to the complexity of the feeling and the reporting scales, as well as the noise and sparseness of the reports when collected in the wild. In this paper, we propose a new end-to-end ML model inspired by video frame prediction and machine translation, that forecasts future sequences of mood from previous self-reported moods collected in the real world using mobile devices. Contrary to traditional time series forecasting algorithms, our multi-task encoder-decoder recurrent neural network learns patterns from different users, allowing and improving the prediction for users with limited number of self-reports. Unlike traditional feature-based machine learning algorithms, the encoder-decoder architecture enables to forecast a sequence of future moods rather than one single step. Meanwhile, multi-task learning exploits some unique characteristics of the data (mood is bi-dimensional), achieving better results than when training single-task networks or other classifiers. Our experiments using a real-world dataset of 33,000 user-weeks revealed that (i) 3 weeks of sparsely reported mood is the optimal number to accurately forecast mood, (ii) multi-task learning models both dimensions of mood \"valence and arousal\" with higher accuracy than separate or traditional ML models, and (iii) mood variability, personality traits and day of the week play a key role in the performance of our model. We believe this work provides psychologists and developers of future mobile mental health applications with a ready-to-use and effective tool for early diagnosis of mental health issues at scale.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123982651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 24

Uncovering Pattern Formation of Information Flow 揭示信息流的模式形成

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-25 DOI: 10.1145/3292500.3330971

Chengxi Zang, Peng Cui, Chaoming Song, Wenwu Zhu, Fei Wang

Pattern formation is a ubiquitous phenomenon that describes the generation of orderly outcomes by self-organization. In both physical society and online social media, patterns formed by social interactions are mainly driven by information flow. Despite an increasing number of studies aiming to understand the spreads of information flow, little is known about the geometry of these spreading patterns and how they were formed during the spreading. In this paper, by exploring 432 million information flow patterns extracted from a large-scale online social media dataset, we uncover a wide range of complex geometric patterns characterized by a three-dimensional metric space. In contrast, the existing understanding of spreading patterns are limited to fanning-out or narrow tree-like geometries. We discover three key ingredients that govern the formation of complex geometric patterns of information flow. As a result, we propose a stochastic process model incorporating these ingredients, demonstrating that it successfully reproduces the diverse geometries discovered from the empirical spreading patterns. Our discoveries provide a theoretical foundation for the microscopic mechanisms of information flow, potentially leading to wide implications for prediction, control and policy decisions in social media.

模式形成是一种普遍存在的现象，它描述了自组织产生有序结果的过程。在实体社会和网络社交媒体中，社会互动形成的模式主要是由信息流驱动的。尽管越来越多的研究旨在了解信息流的传播，但人们对这些传播模式的几何形状以及它们在传播过程中是如何形成的知之甚少。本文通过对从大型在线社交媒体数据集中提取的4.32亿个信息流模式进行研究，揭示了以三维度量空间为特征的各种复杂几何模式。相比之下，现有的对扩散模式的理解仅限于扇形或狭窄的树状几何形状。我们发现了控制信息流复杂几何模式形成的三个关键因素。因此，我们提出了一个包含这些成分的随机过程模型，证明它成功地再现了从经验扩展模式中发现的各种几何形状。我们的发现为信息流的微观机制提供了理论基础，可能对社交媒体的预测、控制和政策决策产生广泛影响。

{"title":"Uncovering Pattern Formation of Information Flow","authors":"Chengxi Zang, Peng Cui, Chaoming Song, Wenwu Zhu, Fei Wang","doi":"10.1145/3292500.3330971","DOIUrl":"https://doi.org/10.1145/3292500.3330971","url":null,"abstract":"Pattern formation is a ubiquitous phenomenon that describes the generation of orderly outcomes by self-organization. In both physical society and online social media, patterns formed by social interactions are mainly driven by information flow. Despite an increasing number of studies aiming to understand the spreads of information flow, little is known about the geometry of these spreading patterns and how they were formed during the spreading. In this paper, by exploring 432 million information flow patterns extracted from a large-scale online social media dataset, we uncover a wide range of complex geometric patterns characterized by a three-dimensional metric space. In contrast, the existing understanding of spreading patterns are limited to fanning-out or narrow tree-like geometries. We discover three key ingredients that govern the formation of complex geometric patterns of information flow. As a result, we propose a stochastic process model incorporating these ingredients, demonstrating that it successfully reproduces the diverse geometries discovered from the empirical spreading patterns. Our discoveries provide a theoretical foundation for the microscopic mechanisms of information flow, potentially leading to wide implications for prediction, control and policy decisions in social media.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121241982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Scaling Multi-Armed Bandit Algorithms 缩放多武装强盗算法

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-25 DOI: 10.1145/3292500.3330862

Edouard Fouché, Junpei Komiyama, Klemens Böhm

The Multi-Armed Bandit (MAB) is a fundamental model capturing the dilemma between exploration and exploitation in sequential decision making. At every time step, the decision maker selects a set of arms and observes a reward from each of the chosen arms. In this paper, we present a variant of the problem, which we call the Scaling MAB (S-MAB): The goal of the decision maker is not only to maximize the cumulative rewards, i.e., choosing the arms with the highest expected reward, but also to decide how many arms to select so that, in expectation, the cost of selecting arms does not exceed the rewards. This problem is relevant to many real-world applications, e.g., online advertising, financial investments or data stream monitoring. We propose an extension of Thompson Sampling, which has strong theoretical guarantees and is reported to perform well in practice. Our extension dynamically controls the number of arms to draw. Furthermore, we combine the proposed method with ADWIN, a state-of-the-art change detector, to deal with non-static environments. We illustrate the benefits of our contribution via a real-world use case on predictive maintenance.

多武装强盗模型(Multi-Armed Bandit, MAB)是一个基本模型，它反映了顺序决策中勘探与开发之间的两难困境。在每一个时间步骤中，决策者选择一组手臂，并从每一个选择的手臂中观察奖励。在本文中，我们提出了该问题的一个变体，我们称之为尺度MAB (S-MAB):决策者的目标不仅是最大化累积奖励，即选择期望奖励最高的武器，而且还要决定选择多少武器，以便在期望中，选择武器的成本不超过奖励。这个问题与许多现实世界的应用相关，例如，在线广告、金融投资或数据流监控。我们提出了一种扩展的汤普森抽样，它有很强的理论保证，并在实践中表现良好。我们的扩展动态控制手臂的数量绘制。此外，我们将提出的方法与最先进的变化检测器ADWIN相结合，以处理非静态环境。我们通过预测性维护的实际用例说明了我们的贡献的好处。

{"title":"Scaling Multi-Armed Bandit Algorithms","authors":"Edouard Fouché, Junpei Komiyama, Klemens Böhm","doi":"10.1145/3292500.3330862","DOIUrl":"https://doi.org/10.1145/3292500.3330862","url":null,"abstract":"The Multi-Armed Bandit (MAB) is a fundamental model capturing the dilemma between exploration and exploitation in sequential decision making. At every time step, the decision maker selects a set of arms and observes a reward from each of the chosen arms. In this paper, we present a variant of the problem, which we call the Scaling MAB (S-MAB): The goal of the decision maker is not only to maximize the cumulative rewards, i.e., choosing the arms with the highest expected reward, but also to decide how many arms to select so that, in expectation, the cost of selecting arms does not exceed the rewards. This problem is relevant to many real-world applications, e.g., online advertising, financial investments or data stream monitoring. We propose an extension of Thompson Sampling, which has strong theoretical guarantees and is reported to perform well in practice. Our extension dynamically controls the number of arms to draw. Furthermore, we combine the proposed method with ADWIN, a state-of-the-art change detector, to deal with non-static environments. We illustrate the benefits of our contribution via a real-world use case on predictive maintenance.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128545532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 19

Modeling and Applications for Temporal Point Processes 时间点过程的建模与应用

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-25 DOI: 10.1145/3292500.3332298

Junchi Yan, Hongteng Xu, Liangda Li

Real-world entities' behaviors, associated with their side information, are often recorded over time as asynchronous event sequences. Such event sequences are the basis of many practical applications, neural spiking train study, earth quack prediction, crime analysis, infectious disease diffusion forecasting, condition-based preventative maintenance, information retrieval and behavior-based network analysis and services, etc. Temporal point process (TPP) is a principled mathematical tool for the modeling and learning of asynchronous event sequences, which captures the instantaneous happening rate of the events and the temporal dependency between historical and current events. TPP provides us with an interpretable model to describe the generative mechanism of event sequences, which is beneficial for event prediction and causality analysis. Recently, it has been shown that TPP has potentials to many machine learning and data science applications and can be combined with other cutting-edge machine learning techniques like deep learning, reinforcement learning, adversarial learning, and so on. We will start with an elementary introduction of TPP model, including the basic concepts of the model, the simulation method of event sequences; in the second part of the tutorial, we will introduce typical TPP models and their traditional learning methods; in the third part of the tutorial, we will discuss the recent progress on the modeling and learning of TPP, including neural network-based TPP models, generative adversarial networks (GANs) for TPP, and deep reinforcement learning of TPP. We will further talk about the practical application of TPP, including useful data augmentation methods for learning from imperfect observations, typical applications and examples like healthcare and industry maintenance, and existing open source toolboxes.

现实世界实体的行为，与其附带信息相关联，经常被记录为异步事件序列。这些事件序列是许多实际应用的基础，如神经脉冲序列研究、地球江湖医生预测、犯罪分析、传染病扩散预测、基于状态的预防性维护、信息检索和基于行为的网络分析与服务等。时间点过程(TPP)是一种用于异步事件序列建模和学习的数学工具，它捕获事件的瞬时发生速率以及历史事件和当前事件之间的时间依赖性。TPP为我们描述事件序列的生成机制提供了一个可解释的模型，有利于事件预测和因果分析。最近，研究表明TPP在许多机器学习和数据科学应用中具有潜力，并且可以与其他尖端机器学习技术(如深度学习、强化学习、对抗学习等)相结合。我们将首先对TPP模型进行初步介绍，包括模型的基本概念、事件序列的仿真方法;在本教程的第二部分，我们将介绍典型的TPP模型及其传统的学习方法;在本教程的第三部分，我们将讨论TPP建模和学习的最新进展，包括基于神经网络的TPP模型、TPP的生成对抗网络(GANs)和TPP的深度强化学习。我们将进一步讨论TPP的实际应用，包括从不完善的观察中学习的有用数据增强方法、医疗保健和行业维护等典型应用程序和示例，以及现有的开源工具箱。

{"title":"Modeling and Applications for Temporal Point Processes","authors":"Junchi Yan, Hongteng Xu, Liangda Li","doi":"10.1145/3292500.3332298","DOIUrl":"https://doi.org/10.1145/3292500.3332298","url":null,"abstract":"Real-world entities' behaviors, associated with their side information, are often recorded over time as asynchronous event sequences. Such event sequences are the basis of many practical applications, neural spiking train study, earth quack prediction, crime analysis, infectious disease diffusion forecasting, condition-based preventative maintenance, information retrieval and behavior-based network analysis and services, etc. Temporal point process (TPP) is a principled mathematical tool for the modeling and learning of asynchronous event sequences, which captures the instantaneous happening rate of the events and the temporal dependency between historical and current events. TPP provides us with an interpretable model to describe the generative mechanism of event sequences, which is beneficial for event prediction and causality analysis. Recently, it has been shown that TPP has potentials to many machine learning and data science applications and can be combined with other cutting-edge machine learning techniques like deep learning, reinforcement learning, adversarial learning, and so on. We will start with an elementary introduction of TPP model, including the basic concepts of the model, the simulation method of event sequences; in the second part of the tutorial, we will introduce typical TPP models and their traditional learning methods; in the third part of the tutorial, we will discuss the recent progress on the modeling and learning of TPP, including neural network-based TPP models, generative adversarial networks (GANs) for TPP, and deep reinforcement learning of TPP. We will further talk about the practical application of TPP, including useful data augmentation methods for learning from imperfect observations, typical applications and examples like healthcare and industry maintenance, and existing open source toolboxes.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115359393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

Effective and Efficient Reuse of Past Travel Behavior for Route Recommendation 路线推荐中对过去旅行行为的有效重用

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-25 DOI: 10.1145/3292500.3330835

Lisi Chen, Shuo Shang, Christian S. Jensen, Bin Yao, Zhiwei Zhang, Ling Shao

With the increasing availability of moving-object tracking data, use of this data for route search and recommendation is increasingly important. To this end, we propose a novel parallel split-and-combine approach to enable route search by locations (RSL-Psc). Given a set of routes, a set of places to visit O, and a threshold θ, we retrieve the route composed of sub-routes that (i) has similarity to O no less than θ and (ii) contains the minimum number of sub-route combinations. The resulting functionality targets a broad range of applications, including route planning and recommendation, ridesharing, and location-based services in general. To enable efficient and effective RSL-Psc computation on massive route data, we develop novel search space pruning techniques and enable use of the parallel processing capabilities of modern processors. Specifically, we develop two parallel algorithms, Fully-Split Parallel Search (FSPS) and Group-Split Parallel Search (GSPS). We divide the route split-and-combine task into ∑k=0 M S(|O|,k+1) sub-tasks, where M is the maximum number of combinations and S(⋅) is the Stirling number of the second kind. In each sub-task, we use network expansion and exploit spatial similarity bounds for pruning. The algorithms split candidate routes into sub-routes and combine them to construct new routes. The sub-tasks are independent and are performed in parallel. Extensive experiments with real data offer insight into the performance of the algorithms, indicating that our RSL-Psc problem can generate high-quality results and that the two algorithms are capable of achieving high efficiency and scalability.

随着移动目标跟踪数据的可用性越来越高，使用这些数据进行路线搜索和推荐变得越来越重要。为此，我们提出了一种新的并行拆分合并方法来实现按位置的路由搜索(RSL-Psc)。给定一组路由，一组访问O的地点和一个阈值θ，我们检索由子路由组成的路由，其中(i)与O的相似性不小于θ， (ii)包含最小数量的子路由组合。由此产生的功能针对广泛的应用，包括路线规划和推荐，乘车共享和基于位置的服务。为了在大量路由数据上实现高效的RSL-Psc计算，我们开发了新的搜索空间修剪技术，并启用了现代处理器的并行处理能力。具体来说，我们开发了两种并行算法，全分割并行搜索(FSPS)和组分割并行搜索(GSPS)。我们将路径分割合并任务划分为∑k=0 M S(|O|，k+1)个子任务，其中M为最大组合数，S(⋅)为第二类斯特林数。在每个子任务中，我们使用网络扩展和利用空间相似界进行修剪。该算法将候选路由分解成子路由，并将它们组合成新的路由。子任务是独立的，并且并行执行。大量的真实数据实验提供了对算法性能的深入了解，表明我们的RSL-Psc问题可以产生高质量的结果，并且两种算法能够实现高效率和可扩展性。

{"title":"Effective and Efficient Reuse of Past Travel Behavior for Route Recommendation","authors":"Lisi Chen, Shuo Shang, Christian S. Jensen, Bin Yao, Zhiwei Zhang, Ling Shao","doi":"10.1145/3292500.3330835","DOIUrl":"https://doi.org/10.1145/3292500.3330835","url":null,"abstract":"With the increasing availability of moving-object tracking data, use of this data for route search and recommendation is increasingly important. To this end, we propose a novel parallel split-and-combine approach to enable route search by locations (RSL-Psc). Given a set of routes, a set of places to visit O, and a threshold θ, we retrieve the route composed of sub-routes that (i) has similarity to O no less than θ and (ii) contains the minimum number of sub-route combinations. The resulting functionality targets a broad range of applications, including route planning and recommendation, ridesharing, and location-based services in general. To enable efficient and effective RSL-Psc computation on massive route data, we develop novel search space pruning techniques and enable use of the parallel processing capabilities of modern processors. Specifically, we develop two parallel algorithms, Fully-Split Parallel Search (FSPS) and Group-Split Parallel Search (GSPS). We divide the route split-and-combine task into ∑k=0 M S(|O|,k+1) sub-tasks, where M is the maximum number of combinations and S(⋅) is the Stirling number of the second kind. In each sub-task, we use network expansion and exploit spatial similarity bounds for pruning. The algorithms split candidate routes into sub-routes and combine them to construct new routes. The sub-tasks are independent and are performed in parallel. Extensive experiments with real data offer insight into the performance of the algorithms, indicating that our RSL-Psc problem can generate high-quality results and that the two algorithms are capable of achieving high efficiency and scalability.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114418382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 45

MeLU

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-25 DOI: 10.1145/3292500.3330859

Ho-Yong Lee, Jinbae Im, Seongwon Jang, H. Cho, Sehee Chung

This paper proposes a recommender system to alleviate the cold-start problem that can estimate user preferences based on only a small number of items. To identify a user's preference in the cold state, existing recommender systems, such as Netflix, initially provide items to a user; we call those items evidence candidates. Recommendations are then made based on the items selected by the user. Previous recommendation studies have two limitations: (1) the users who consumed a few items have poor recommendations and (2) inadequate evidence candidates are used to identify user preferences. We propose a meta-learning-based recommender system called MeLU to overcome these two limitations. From meta-learning, which can rapidly adopt new task with a few examples, MeLU can estimate new user's preferences with a few consumed items. In addition, we provide an evidence candidate selection strategy that determines distinguishing items for customized preference estimation. We validate MeLU with two benchmark datasets, and the proposed model reduces at least 5.92% mean absolute error than two comparative models on the datasets. We also conduct a user study experiment to verify the evidence selection strategy.

{"title":"MeLU","authors":"Ho-Yong Lee, Jinbae Im, Seongwon Jang, H. Cho, Sehee Chung","doi":"10.1145/3292500.3330859","DOIUrl":"https://doi.org/10.1145/3292500.3330859","url":null,"abstract":"This paper proposes a recommender system to alleviate the cold-start problem that can estimate user preferences based on only a small number of items. To identify a user's preference in the cold state, existing recommender systems, such as Netflix, initially provide items to a user; we call those items evidence candidates. Recommendations are then made based on the items selected by the user. Previous recommendation studies have two limitations: (1) the users who consumed a few items have poor recommendations and (2) inadequate evidence candidates are used to identify user preferences. We propose a meta-learning-based recommender system called MeLU to overcome these two limitations. From meta-learning, which can rapidly adopt new task with a few examples, MeLU can estimate new user's preferences with a few consumed items. In addition, we provide an evidence candidate selection strategy that determines distinguishing items for customized preference estimation. We validate MeLU with two benchmark datasets, and the proposed model reduces at least 5.92% mean absolute error than two comparative models on the datasets. We also conduct a user study experiment to verify the evidence selection strategy.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114492515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 245

Building a Better Self-Driving Car: Hardware, Software, and Knowledge 打造更好的自动驾驶汽车:硬件、软件和知识

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-25 DOI: 10.1145/3292500.3340409

K. Chellapilla

Lyft's mission is to improve people's lives with the world's best transportation. Self driving vehicles have the potential to deliver unprecedented improvements to safety and quality, at a price and convenience that challenges traditional models of vehicle ownership. A combination of hardware, software, and knowledge technologies are needed to build self-driving cars. In this talk, I'll present the core problems in self-driving and how recent advances in computer vision, robotics, and machine learning are powering this revolution. The car is carefully designed with a variety of sensors that complement each other to address a wide variety of driving scenarios. Sensor fusion bring all of these signals together into an interpretable AI engine comprising of perception, prediction, planning, and controls. For example, deep learning models and large scale machine learning have closed the gap between human and machine perception. In contrast, predicting the behavior of other humans and effectively planning and negotiating maneuvers continue to be hard problems. Combining AI technologies with deep knowledge about the real world is key to addressing these.

Lyft的使命是用世界上最好的交通工具改善人们的生活。自动驾驶汽车有可能在安全性和质量方面带来前所未有的改进，其价格和便利性将挑战传统的汽车拥有模式。制造自动驾驶汽车需要硬件、软件和知识技术的结合。在这次演讲中，我将介绍自动驾驶的核心问题，以及计算机视觉、机器人技术和机器学习的最新进展如何推动这场革命。这款车经过精心设计，配备了各种传感器，相互补充，以应对各种驾驶场景。传感器融合将所有这些信号整合到一个可解释的人工智能引擎中，包括感知、预测、规划和控制。例如，深度学习模型和大规模机器学习已经缩小了人类和机器感知之间的差距。相比之下，预测其他人的行为以及有效地规划和谈判策略仍然是难题。将人工智能技术与对现实世界的深入了解相结合是解决这些问题的关键。

{"title":"Building a Better Self-Driving Car: Hardware, Software, and Knowledge","authors":"K. Chellapilla","doi":"10.1145/3292500.3340409","DOIUrl":"https://doi.org/10.1145/3292500.3340409","url":null,"abstract":"Lyft's mission is to improve people's lives with the world's best transportation. Self driving vehicles have the potential to deliver unprecedented improvements to safety and quality, at a price and convenience that challenges traditional models of vehicle ownership. A combination of hardware, software, and knowledge technologies are needed to build self-driving cars. In this talk, I'll present the core problems in self-driving and how recent advances in computer vision, robotics, and machine learning are powering this revolution. The car is carefully designed with a variety of sensors that complement each other to address a wide variety of driving scenarios. Sensor fusion bring all of these signals together into an interpretable AI engine comprising of perception, prediction, planning, and controls. For example, deep learning models and large scale machine learning have closed the gap between human and machine perception. In contrast, predicting the behavior of other humans and effectively planning and negotiating maneuvers continue to be hard problems. Combining AI technologies with deep knowledge about the real world is key to addressing these.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115242712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Statistical Mechanics Methods for Discovering Knowledge from Modern Production Quality Neural Networks 从现代生产质量神经网络中发现知识的统计力学方法

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-25 DOI: 10.1145/3292500.3332294

Charles H. Martin, Michael W. Mahoney

There have long been connections between statistical mechanics and neural networks, but in recent decades these connections have withered. However, in light of recent failings of statistical learning theory and stochastic optimization theory to describe, even qualitatively, many properties of production-quality neural network models, researchers have revisited ideas from the statistical mechanics of neural networks. This tutorial will provide an overview of the area; it will go into detail on how connections with random matrix theory and heavy-tailed random matrix theory can lead to a practical phenomenological theory for large-scale deep neural networks; and it will describe future directions.

统计力学和神经网络之间的联系由来已久，但近几十年来，这种联系已经消失了。然而，鉴于最近统计学习理论和随机优化理论在描述(甚至定性)生产质量神经网络模型的许多特性方面的失败，研究人员重新审视了神经网络统计力学的思想。本教程将提供该领域的概述;它将详细介绍与随机矩阵理论和重尾随机矩阵理论的联系如何导致大规模深度神经网络的实用现象学理论;它将描述未来的发展方向。

引用次数: 3