首页 > 最新文献

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining最新文献

英文 中文
Identifiability of Cause and Effect using Regularized Regression 用正则化回归分析因果关系的可辨识性
Alexander Marx, Jilles Vreeken
We consider the problem of telling apart cause from effect between two univariate continuous-valued random variables X and Y. In general, it is impossible to make definite statements about causality without making assumptions on the underlying model; one of the most important aspects of causal inference is hence to determine under which assumptions are we able to do so. In this paper we show under which general conditions we can identify cause from effect by simply choosing the direction with the best regression score. We define a general framework of identifiable regression-based scoring functions, and show how to instantiate it in practice using regression splines. Compared to existing methods that either give strong guarantees, but are hardly applicable in practice, or provide no guarantees, but do work well in practice, our instantiation combines the best of both worlds; it gives guarantees, while empirical evaluation on synthetic and real-world data shows that it performs at least as well as the state of the art.
我们考虑在两个单变量连续值随机变量X和y之间区分因果关系的问题。一般来说,如果不对基础模型做出假设,就不可能对因果关系做出明确的陈述;因此,因果推理的一个最重要的方面是确定在哪些假设下我们能够这样做。在本文中,我们证明了在一般条件下,我们可以通过简单地选择具有最佳回归分数的方向来识别因果关系。我们定义了一个可识别的基于回归的评分函数的一般框架,并展示了如何在实践中使用回归样条实例化它。与现有的方法相比,要么提供强大的保证,但在实践中很难适用,要么不提供保证,但在实践中工作得很好,我们的实例结合了这两个世界的最好的;它提供了保证,而对合成数据和真实世界数据的经验评估表明,它的性能至少与最先进的技术一样好。
{"title":"Identifiability of Cause and Effect using Regularized Regression","authors":"Alexander Marx, Jilles Vreeken","doi":"10.1145/3292500.3330854","DOIUrl":"https://doi.org/10.1145/3292500.3330854","url":null,"abstract":"We consider the problem of telling apart cause from effect between two univariate continuous-valued random variables X and Y. In general, it is impossible to make definite statements about causality without making assumptions on the underlying model; one of the most important aspects of causal inference is hence to determine under which assumptions are we able to do so. In this paper we show under which general conditions we can identify cause from effect by simply choosing the direction with the best regression score. We define a general framework of identifiable regression-based scoring functions, and show how to instantiate it in practice using regression splines. Compared to existing methods that either give strong guarantees, but are hardly applicable in practice, or provide no guarantees, but do work well in practice, our instantiation combines the best of both worlds; it gives guarantees, while empirical evaluation on synthetic and real-world data shows that it performs at least as well as the state of the art.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114967132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Integrating Domain-Knowledge into Deep Learning 将领域知识集成到深度学习中
R. Salakhutdinov
In this talk, I will first discuss deep learning models that can find semantically meaningful representations of words, learn to read documents and answer questions about their content. I will show how we can encode external linguistic knowledge as an explicit memory in recurrent neural networks, and use it to model co-reference relations in text. I will further introduce methods that can augment neural representation of text with structured data from Knowledge Bases for question answering, and show how we can use structured prior knowledge from Knowledge Graphs for image classification. Finally, I will introduce the notion of structured memory as being a crucial part of an intelligent agent's ability to plan and reason in partially observable environments. I will present a modular hierarchical reinforcement learning agent that can learn to store arbitrary information about the environment over long time lags, perform efficient exploration and long-term planning, while generalizing across domains and tasks.
在这次演讲中,我将首先讨论深度学习模型,它可以找到单词的语义有意义的表示,学习阅读文档并回答有关其内容的问题。我将展示我们如何将外部语言知识编码为循环神经网络中的外显记忆,并使用它来模拟文本中的共同引用关系。我将进一步介绍一些方法,这些方法可以用知识库中的结构化数据增强文本的神经表示,用于回答问题,并展示我们如何使用知识图中的结构化先验知识进行图像分类。最后,我将介绍结构化记忆的概念,它是智能代理在部分可观察环境中进行计划和推理的关键部分。我将介绍一个模块化的分层强化学习代理,它可以学习在长时间滞后的情况下存储关于环境的任意信息,执行有效的探索和长期规划,同时跨领域和任务进行推广。
{"title":"Integrating Domain-Knowledge into Deep Learning","authors":"R. Salakhutdinov","doi":"10.1145/3292500.3340416","DOIUrl":"https://doi.org/10.1145/3292500.3340416","url":null,"abstract":"In this talk, I will first discuss deep learning models that can find semantically meaningful representations of words, learn to read documents and answer questions about their content. I will show how we can encode external linguistic knowledge as an explicit memory in recurrent neural networks, and use it to model co-reference relations in text. I will further introduce methods that can augment neural representation of text with structured data from Knowledge Bases for question answering, and show how we can use structured prior knowledge from Knowledge Graphs for image classification. Finally, I will introduce the notion of structured memory as being a crucial part of an intelligent agent's ability to plan and reason in partially observable environments. I will present a modular hierarchical reinforcement learning agent that can learn to store arbitrary information about the environment over long time lags, perform efficient exploration and long-term planning, while generalizing across domains and tasks.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"136 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124257655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Fast and Accurate Anomaly Detection in Dynamic Graphs with a Two-Pronged Approach 基于双管齐下方法的动态图快速准确异常检测
Minji Yoon, Bryan Hooi, Kijung Shin, C. Faloutsos
Given a dynamic graph stream, how can we detect the sudden appearance of anomalous patterns, such as link spam, follower boosting, or denial of service attacks? Additionally, can we categorize the types of anomalies that occur in practice, and theoretically analyze the anomalous signs arising from each type? In this work, we propose AnomRank, an online algorithm for anomaly detection in dynamic graphs. AnomRank uses a two-pronged approach defining two novel metrics for anomalousness. Each metric tracks the derivatives of its own version of a 'node score' (or node importance) function. This allows us to detect sudden changes in the importance of any node. We show theoretically and experimentally that the two-pronged approach successfully detects two common types of anomalies: sudden weight changes along an edge, and sudden structural changes to the graph. AnomRank is (a) Fast and Accurate: up to 49.5x faster or 35% more accurate than state-of-the-art methods, (b) Scalable: linear in the number of edges in the input graph, processing millions of edges within 2 seconds on a stock laptop/desktop, and (c) Theoretically Sound: providing theoretical guarantees of the two-pronged approach.
给定一个动态图形流,我们如何检测突然出现的异常模式,如链接垃圾邮件、关注者提升或拒绝服务攻击?另外,我们能否对实践中出现的异常类型进行分类,并从理论上分析每种类型产生的异常迹象?在这项工作中,我们提出了一种用于动态图异常检测的在线算法AnomRank。AnomRank使用双管齐下的方法定义两个新的异常度量。每个指标都追踪自己版本的“节点得分”(或节点重要性)函数的衍生物。这使我们能够检测到任何节点重要性的突然变化。我们从理论上和实验上证明,双管齐下的方法成功地检测了两种常见的异常类型:沿边缘的突然权重变化和图的突然结构变化。AnomRank是(a)快速准确:比最先进的方法快49.5倍或准确35%,(b)可扩展:输入图中的边缘数量呈线性,在库存笔记本电脑/台式机上在2秒内处理数百万条边缘,(c)理论上合理:为双管齐下的方法提供理论保证。
{"title":"Fast and Accurate Anomaly Detection in Dynamic Graphs with a Two-Pronged Approach","authors":"Minji Yoon, Bryan Hooi, Kijung Shin, C. Faloutsos","doi":"10.1145/3292500.3330946","DOIUrl":"https://doi.org/10.1145/3292500.3330946","url":null,"abstract":"Given a dynamic graph stream, how can we detect the sudden appearance of anomalous patterns, such as link spam, follower boosting, or denial of service attacks? Additionally, can we categorize the types of anomalies that occur in practice, and theoretically analyze the anomalous signs arising from each type? In this work, we propose AnomRank, an online algorithm for anomaly detection in dynamic graphs. AnomRank uses a two-pronged approach defining two novel metrics for anomalousness. Each metric tracks the derivatives of its own version of a 'node score' (or node importance) function. This allows us to detect sudden changes in the importance of any node. We show theoretically and experimentally that the two-pronged approach successfully detects two common types of anomalies: sudden weight changes along an edge, and sudden structural changes to the graph. AnomRank is (a) Fast and Accurate: up to 49.5x faster or 35% more accurate than state-of-the-art methods, (b) Scalable: linear in the number of edges in the input graph, processing millions of edges within 2 seconds on a stock laptop/desktop, and (c) Theoretically Sound: providing theoretical guarantees of the two-pronged approach.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116660609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 54
Riker 瑞克
Jie Zhao, Ziyu Guan, Huan Sun
This work studies product question answering (PQA) which aims to answer product-related questions based on customer reviews. Most recent PQA approaches adopt end2end semantic matching methodologies, which map questions and answers to a latent vector space to measure their relevance. Such methods often achieve superior performance but it tends to be difficult to interpret why. On the other hand, simple keyword-based search methods exhibit natural interpretability through matched keywords, but often suffer from the lexical gap problem. In this work, we develop a new PQA framework (named Riker) that enjoys the benefits of both interpretability and effectiveness. Riker mines rich keyword representations of a question with two major components, internal word re-weighting and external word association, which predict the importance of each question word and associate the question with outside relevant keywords respectively, and can be jointly trained under weak supervision with large-scale QA pairs. The keyword representations from Riker can be directly used as input to a keyword-based search module, enabling the whole process to be effective while preserving good interpretability. We conduct extensive experiments using Amazon QA and review datasets from 5 different departments, and our results show that Riker substantially outperforms previous state-of-the-art methods in both synthetic settings and real user evaluations. In addition, we compare keyword representations from Riker and those from attention mechanisms popularly used for deep neural networks through case studies, showing that the former are more effective and interpretable.
{"title":"Riker","authors":"Jie Zhao, Ziyu Guan, Huan Sun","doi":"10.1145/3292500.3330985","DOIUrl":"https://doi.org/10.1145/3292500.3330985","url":null,"abstract":"This work studies product question answering (PQA) which aims to answer product-related questions based on customer reviews. Most recent PQA approaches adopt end2end semantic matching methodologies, which map questions and answers to a latent vector space to measure their relevance. Such methods often achieve superior performance but it tends to be difficult to interpret why. On the other hand, simple keyword-based search methods exhibit natural interpretability through matched keywords, but often suffer from the lexical gap problem. In this work, we develop a new PQA framework (named Riker) that enjoys the benefits of both interpretability and effectiveness. Riker mines rich keyword representations of a question with two major components, internal word re-weighting and external word association, which predict the importance of each question word and associate the question with outside relevant keywords respectively, and can be jointly trained under weak supervision with large-scale QA pairs. The keyword representations from Riker can be directly used as input to a keyword-based search module, enabling the whole process to be effective while preserving good interpretability. We conduct extensive experiments using Amazon QA and review datasets from 5 different departments, and our results show that Riker substantially outperforms previous state-of-the-art methods in both synthetic settings and real user evaluations. In addition, we compare keyword representations from Riker and those from attention mechanisms popularly used for deep neural networks through case studies, showing that the former are more effective and interpretable.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"3 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120986416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Naranjo Question Answering using End-to-End Multi-task Learning Model 使用端到端多任务学习模型的纳兰霍问答
Bhanu Pratap Singh Rawat, Fei Li, Hong Yu
In the clinical domain, it is important to understand whether an adverse drug reaction (ADR) is caused by a particular medication. Clinical judgement studies help judge the causal relation between a medication and its ADRs. In this study, we present the first attempt to automatically infer the causality between a drug and an ADR from electronic health records (EHRs) by answering the Naranjo questionnaire, the validated clinical question answering set used by domain experts for ADR causality assessment. Using physicians' annotation as the gold standard, our proposed joint model, which uses multi-task learning to predict the answers of a subset of the Naranjo questionnaire, significantly outperforms the baseline pipeline model with a good margin, achieving a macro-weighted f-score between 0.3652-0.5271 and micro-weighted f-score between 0.9523-0.9918.
在临床领域,了解药物不良反应(ADR)是否由特定药物引起是很重要的。临床判断研究有助于判断药物与其不良反应之间的因果关系。在这项研究中,我们首次尝试通过回答Naranjo问卷,从电子健康记录(EHRs)中自动推断药物与ADR之间的因果关系,Naranjo问卷是领域专家用于ADR因果关系评估的经过验证的临床问题回答集。以医生注释为金标准,我们提出的联合模型使用多任务学习来预测Naranjo问卷子集的答案,显著优于基线管道模型,并且有很好的裕度,宏观加权f得分在0.3652-0.5271之间,微观加权f得分在0.9523-0.9918之间。
{"title":"Naranjo Question Answering using End-to-End Multi-task Learning Model","authors":"Bhanu Pratap Singh Rawat, Fei Li, Hong Yu","doi":"10.1145/3292500.3330770","DOIUrl":"https://doi.org/10.1145/3292500.3330770","url":null,"abstract":"In the clinical domain, it is important to understand whether an adverse drug reaction (ADR) is caused by a particular medication. Clinical judgement studies help judge the causal relation between a medication and its ADRs. In this study, we present the first attempt to automatically infer the causality between a drug and an ADR from electronic health records (EHRs) by answering the Naranjo questionnaire, the validated clinical question answering set used by domain experts for ADR causality assessment. Using physicians' annotation as the gold standard, our proposed joint model, which uses multi-task learning to predict the answers of a subset of the Naranjo questionnaire, significantly outperforms the baseline pipeline model with a good margin, achieving a macro-weighted f-score between 0.3652-0.5271 and micro-weighted f-score between 0.9523-0.9918.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127148409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Investigate Transitions into Drug Addiction through Text Mining of Reddit Data 通过对Reddit数据的文本挖掘来研究药物成瘾的转变
John Lu, S. Sridhar, Ritika Pandey, M. Hasan, G. Mohler
Increasing rates of opioid drug abuse and heightened prevalence of online support communities underscore the necessity of employing data mining techniques to better understand drug addiction using these rapidly developing online resources. In this work, we obtained data from Reddit, an online collection of forums, to gather insight into drug use/misuse using text snippets from users narratives. Specifically, using users' posts, we trained a binary classifier which predicts a user's transitions from casual drug discussion forums to drug recovery forums. We also proposed a Cox regression model that outputs likelihoods of such transitions. In doing so, we found that utterances of select drugs and certain linguistic features contained in one's posts can help predict these transitions. Using unfiltered drug-related posts, our research delineates drugs that are associated with higher rates of transitions from recreational drug discussion to support/recovery discussion, offers insight into modern drug culture, and provides tools with potential applications in combating the opioid crisis.
阿片类药物滥用率不断上升,在线支持社区日益普及,这突出表明有必要利用这些迅速发展的在线资源,采用数据挖掘技术更好地了解吸毒情况。在这项工作中,我们从Reddit(一个在线论坛集合)获取数据,通过用户叙述的文本片段来收集对药物使用/滥用的见解。具体来说,使用用户的帖子,我们训练了一个二元分类器,该分类器可以预测用户从随意的药物讨论论坛到药物恢复论坛的过渡。我们还提出了一个Cox回归模型,输出这种转变的可能性。在这样做的过程中,我们发现选择药物的话语和某些语言特征包含在一个人的帖子中可以帮助预测这些转变。通过使用未经过滤的药物相关帖子,我们的研究描绘了与从娱乐性药物讨论到支持/恢复讨论的较高比率相关的药物,提供了对现代毒品文化的洞察,并提供了在对抗阿片类药物危机方面具有潜在应用的工具。
{"title":"Investigate Transitions into Drug Addiction through Text Mining of Reddit Data","authors":"John Lu, S. Sridhar, Ritika Pandey, M. Hasan, G. Mohler","doi":"10.1145/3292500.3330737","DOIUrl":"https://doi.org/10.1145/3292500.3330737","url":null,"abstract":"Increasing rates of opioid drug abuse and heightened prevalence of online support communities underscore the necessity of employing data mining techniques to better understand drug addiction using these rapidly developing online resources. In this work, we obtained data from Reddit, an online collection of forums, to gather insight into drug use/misuse using text snippets from users narratives. Specifically, using users' posts, we trained a binary classifier which predicts a user's transitions from casual drug discussion forums to drug recovery forums. We also proposed a Cox regression model that outputs likelihoods of such transitions. In doing so, we found that utterances of select drugs and certain linguistic features contained in one's posts can help predict these transitions. Using unfiltered drug-related posts, our research delineates drugs that are associated with higher rates of transitions from recreational drug discussion to support/recovery discussion, offers insight into modern drug culture, and provides tools with potential applications in combating the opioid crisis.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125890449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining 第25届ACM SIGKDD知识发现与数据挖掘国际会议论文集
{"title":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","authors":"","doi":"10.1145/3292500","DOIUrl":"https://doi.org/10.1145/3292500","url":null,"abstract":"","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126635590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 69
Recent Progress in Zeroth Order Optimization and Its Applications to Adversarial Robustness in Data Mining and Machine Learning 零阶优化及其在数据挖掘和机器学习中的对抗鲁棒性应用研究进展
Pin-Yu Chen, Sijia Liu
Zeroth-order (ZO) optimization is increasingly embraced for solving big data and machine learning problems when explicit expressions of the gradients are difficult or infeasible to obtain. It achieves gradient-free optimization by approximating the full gradient via efficient gradient estimators. Some recent important applications include: a) generation of prediction-evasive, black-box adversarial attacks on deep neural networks, b) online network management with limited computation capacity, c) parameter inference of black-box/complex systems, and d) bandit optimization in which a player receives partial feedback in terms of loss function values revealed by her adversary. This tutorial aims to provide a comprehensive introduction to recent advances in ZO optimization methods in both theory and applications. On the theory side, we will cover convergence rate and iteration complexity analysis of ZO algorithms and make comparisons to their first-order counterparts. On the application side, we will highlight one appealing application of ZO optimization to studying the robustness of deep neural networks - practical and efficient adversarial attacks that generate adversarial examples from a black-box machine learning model. We will also summarize potential research directions regarding ZO optimization, big data challenges and some open-ended data mining and machine learning problems.
当梯度的显式表达式难以或不可获得时,零阶(ZO)优化越来越多地用于解决大数据和机器学习问题。它通过高效梯度估计器逼近全梯度来实现无梯度优化。最近一些重要的应用包括:a)在深度神经网络上产生预测规避,黑盒对抗攻击,b)有限计算能力的在线网络管理,c)黑盒/复杂系统的参数推理,d)强盗优化,其中玩家接收到对手透露的损失函数值的部分反馈。本教程旨在全面介绍ZO优化方法在理论和应用方面的最新进展。在理论方面,我们将介绍ZO算法的收敛速度和迭代复杂度分析,并与一阶算法进行比较。在应用方面,我们将重点介绍ZO优化在研究深度神经网络鲁棒性方面的一个吸引人的应用——从黑箱机器学习模型生成对抗性示例的实用而有效的对抗性攻击。我们还将总结关于ZO优化、大数据挑战以及一些开放式数据挖掘和机器学习问题的潜在研究方向。
{"title":"Recent Progress in Zeroth Order Optimization and Its Applications to Adversarial Robustness in Data Mining and Machine Learning","authors":"Pin-Yu Chen, Sijia Liu","doi":"10.1145/3292500.3332288","DOIUrl":"https://doi.org/10.1145/3292500.3332288","url":null,"abstract":"Zeroth-order (ZO) optimization is increasingly embraced for solving big data and machine learning problems when explicit expressions of the gradients are difficult or infeasible to obtain. It achieves gradient-free optimization by approximating the full gradient via efficient gradient estimators. Some recent important applications include: a) generation of prediction-evasive, black-box adversarial attacks on deep neural networks, b) online network management with limited computation capacity, c) parameter inference of black-box/complex systems, and d) bandit optimization in which a player receives partial feedback in terms of loss function values revealed by her adversary. This tutorial aims to provide a comprehensive introduction to recent advances in ZO optimization methods in both theory and applications. On the theory side, we will cover convergence rate and iteration complexity analysis of ZO algorithms and make comparisons to their first-order counterparts. On the application side, we will highlight one appealing application of ZO optimization to studying the robustness of deep neural networks - practical and efficient adversarial attacks that generate adversarial examples from a black-box machine learning model. We will also summarize potential research directions regarding ZO optimization, big data challenges and some open-ended data mining and machine learning problems.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116163774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Applications of AI/ML in Established and New Industries AI/ML在现有和新兴行业中的应用
H. Sawaf
The advent of advanced modeling for general machine learning, and in particular computer vision, speech recognition and natural language processing, the applications of AI is enabling classical businesses to reinvent themselves, and new business fields to arise which were even not imaginable a few years back. Hassan will present some of these use cases, and dive into some in more detail, showing where current and future AI/ML technology is accelerating innovation.
通用机器学习的高级建模的出现,特别是计算机视觉、语音识别和自然语言处理,人工智能的应用使传统企业能够重塑自我,几年前甚至无法想象的新业务领域应运而生。Hassan将介绍其中一些用例,并更详细地探讨一些用例,展示当前和未来的AI/ML技术正在加速创新的地方。
{"title":"Applications of AI/ML in Established and New Industries","authors":"H. Sawaf","doi":"10.1145/3292500.3340403","DOIUrl":"https://doi.org/10.1145/3292500.3340403","url":null,"abstract":"The advent of advanced modeling for general machine learning, and in particular computer vision, speech recognition and natural language processing, the applications of AI is enabling classical businesses to reinvent themselves, and new business fields to arise which were even not imaginable a few years back. Hassan will present some of these use cases, and dive into some in more detail, showing where current and future AI/ML technology is accelerating innovation.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122337706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tackle Balancing Constraint for Incremental Semi-Supervised Support Vector Learning 解决增量半监督支持向量学习的平衡约束问题
Shuyang Yu, Bin Gu, Kunpeng Ning, Haiyan Chen, J. Pei, Heng Huang
Semi-Supervised Support Vector Machine (S3VM) is one of the most popular methods for semi-supervised learning. To avoid the trivial solution of classifying all the unlabeled examples to a same class, balancing constraint is often used with S3VM (denoted as BCS3VM). Recently, a novel incremental learning algorithm (IL-S3VM) based on the path following technique was proposed to significantly scale up S3VM. However, the dynamic relationship of balancing constraint with previous labeled and unlabeled samples impede their incremental method for handling BCS3VM. To fill this gap, in this paper, we propose a new incremental S3VM algorithm (IL-BCS3VM) based on IL-S3VM which can effectively handle the balancing constraint and directly update the solution of BCS3VM. Specifically, to handle the dynamic relationship of balancing constraint with previous labeled and unlabeled samples, we design two unique procedures which can respectively eliminate and add the balancing constraint into S3VM. More importantly, we provide the finite convergence analysis for our IL-BCS3VM algorithm. Experimental results on a variety of benchmark datasets not only confirm the finite convergence of IL-BCS3VM, but also show a huge reduction of computational time compared with existing batch and incremental learning algorithms, while retaining the similar generalization performance.
半监督支持向量机(S3VM)是目前最流行的半监督学习方法之一。为了避免将所有未标记的示例分类到同一类的琐碎解决方案,平衡约束通常与S3VM一起使用(表示为BCS3VM)。最近,一种新的基于路径跟踪技术的增量学习算法(IL-S3VM)被提出,以显着扩展S3VM。然而,平衡约束与先前标记和未标记样本的动态关系阻碍了他们处理BCS3VM的增量方法。为了填补这一空白,本文提出了一种新的基于IL-S3VM的增量式S3VM算法(IL-BCS3VM),该算法可以有效地处理平衡约束并直接更新BCS3VM的解。具体来说,为了处理平衡约束与之前标记和未标记样本之间的动态关系,我们设计了两个独特的程序,分别可以消除和添加平衡约束到S3VM中。更重要的是,我们对我们的IL-BCS3VM算法进行了有限收敛分析。在各种基准数据集上的实验结果不仅证实了IL-BCS3VM的有限收敛性,而且与现有的批处理和增量学习算法相比,IL-BCS3VM的计算时间大大减少,同时保持了相似的泛化性能。
{"title":"Tackle Balancing Constraint for Incremental Semi-Supervised Support Vector Learning","authors":"Shuyang Yu, Bin Gu, Kunpeng Ning, Haiyan Chen, J. Pei, Heng Huang","doi":"10.1145/3292500.3330962","DOIUrl":"https://doi.org/10.1145/3292500.3330962","url":null,"abstract":"Semi-Supervised Support Vector Machine (S3VM) is one of the most popular methods for semi-supervised learning. To avoid the trivial solution of classifying all the unlabeled examples to a same class, balancing constraint is often used with S3VM (denoted as BCS3VM). Recently, a novel incremental learning algorithm (IL-S3VM) based on the path following technique was proposed to significantly scale up S3VM. However, the dynamic relationship of balancing constraint with previous labeled and unlabeled samples impede their incremental method for handling BCS3VM. To fill this gap, in this paper, we propose a new incremental S3VM algorithm (IL-BCS3VM) based on IL-S3VM which can effectively handle the balancing constraint and directly update the solution of BCS3VM. Specifically, to handle the dynamic relationship of balancing constraint with previous labeled and unlabeled samples, we design two unique procedures which can respectively eliminate and add the balancing constraint into S3VM. More importantly, we provide the finite convergence analysis for our IL-BCS3VM algorithm. Experimental results on a variety of benchmark datasets not only confirm the finite convergence of IL-BCS3VM, but also show a huge reduction of computational time compared with existing batch and incremental learning algorithms, while retaining the similar generalization performance.","PeriodicalId":186134,"journal":{"name":"Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"26 13","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114032800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1