2022 IEEE International Conference on Data Mining Workshops (ICDMW)最新文献

英文中文

SGKD: A Scalable and Effective Knowledge Distillation Framework for Graph Representation Learning SGKD:用于图表示学习的可扩展和有效的知识蒸馏框架

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00091

Yufei He, Yao Ma

As Graph Neural Networks (GNNs) are widely used in various fields, there is a growing demand for improving their efficiency and scalablity. Knowledge Distillation (KD), a classical methods for model compression and acceleration, has been gradually introduced into the field of graph learning. More recently, it has been shown that, through knowledge distillation, the predictive capability of a well-trained GNN model can be transferred to lightweight and easy-to-deploy MLP models. Such distilled MLPs are able to achieve comparable performance as their corresponding G NN teachers while being significantly more efficient in terms of both space and time. However, the research of KD for graph learning is still in its early stage and there exist several limitations in the existing KD framework. The major issues lie in distilled MLPs lack useful information about the graph structure and logits of teacher are not always reliable. In this paper, we propose a Scalable and effective graph neural network Knowledge Distillation framework (SGKD) to address these issues. Specifically, to include the graph, we use feature propagation as preprocessing to provide MLPs with graph structure-aware features in the original feature space; to address unreliable logits of teacher, we introduce simple yet effective training strategies such as masking and temperature. With these innovations, our framework is able to be more effective while remaining scalable and efficient in training and inference. We conducted comprehensive experiments on eight datasets of different sizes - up to 100 million nodes - under various settings. The results demonstrated that SG KD is able to significantly outperform existing KD methods and even achieve comparable performance with their state-of-the-art GNN teachers.

随着图神经网络在各个领域的广泛应用，人们对其效率和可扩展性的要求越来越高。知识蒸馏(Knowledge Distillation, KD)是一种经典的模型压缩和加速方法，已逐渐被引入图学习领域。最近，研究表明，通过知识蒸馏，训练有素的GNN模型的预测能力可以转移到轻量级且易于部署的MLP模型中。这种经过提炼的mlp能够达到与相应的gnn教师相当的性能，同时在空间和时间方面都显着提高效率。然而，KD用于图学习的研究还处于起步阶段，现有的KD框架存在一些局限性。主要问题在于提取的mlp缺乏关于图结构的有用信息，并且教师的逻辑并不总是可靠的。在本文中，我们提出了一个可扩展和有效的图神经网络知识蒸馏框架(SGKD)来解决这些问题。具体来说，为了包含图，我们使用特征传播作为预处理，在原始特征空间中为mlp提供图结构感知特征;为了解决教师逻辑不可靠的问题，我们引入了简单而有效的训练策略，如遮蔽和温度。通过这些创新，我们的框架能够更有效，同时在训练和推理中保持可扩展性和效率。我们在8个不同规模的数据集上进行了全面的实验，在不同的设置下，多达1亿个节点。结果表明，SG KD能够显著优于现有的KD方法，甚至达到与他们最先进的GNN教师相当的性能。

{"title":"SGKD: A Scalable and Effective Knowledge Distillation Framework for Graph Representation Learning","authors":"Yufei He, Yao Ma","doi":"10.1109/ICDMW58026.2022.00091","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00091","url":null,"abstract":"As Graph Neural Networks (GNNs) are widely used in various fields, there is a growing demand for improving their efficiency and scalablity. Knowledge Distillation (KD), a classical methods for model compression and acceleration, has been gradually introduced into the field of graph learning. More recently, it has been shown that, through knowledge distillation, the predictive capability of a well-trained GNN model can be transferred to lightweight and easy-to-deploy MLP models. Such distilled MLPs are able to achieve comparable performance as their corresponding G NN teachers while being significantly more efficient in terms of both space and time. However, the research of KD for graph learning is still in its early stage and there exist several limitations in the existing KD framework. The major issues lie in distilled MLPs lack useful information about the graph structure and logits of teacher are not always reliable. In this paper, we propose a Scalable and effective graph neural network Knowledge Distillation framework (SGKD) to address these issues. Specifically, to include the graph, we use feature propagation as preprocessing to provide MLPs with graph structure-aware features in the original feature space; to address unreliable logits of teacher, we introduce simple yet effective training strategies such as masking and temperature. With these innovations, our framework is able to be more effective while remaining scalable and efficient in training and inference. We conducted comprehensive experiments on eight datasets of different sizes - up to 100 million nodes - under various settings. The results demonstrated that SG KD is able to significantly outperform existing KD methods and even achieve comparable performance with their state-of-the-art GNN teachers.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125519595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Data-Driven Usage Profiling and Anomaly Detection in Support of Sustainable Machining Processes 支持可持续加工过程的数据驱动使用分析和异常检测

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00026

Fabian Fingerhut, Chaitra Harsha, Amirmohammad Eghbalian, Tom Jacobs, Mahdi Tabassian, R. Verbeke, E. Tsiporkova

There is a lot of room for improvement towards more sustainability in manufacturing companies. During the machining operations, replacement of the cutting tools is not done in an optimal way, resulting in sub-optimal usage of resources and inefficiencies during the production process. Using data-driven approaches to extend the usage of tools can greatly improve on this shortcoming by optimizing the replacement process of these tools. This study is therefore sought to investigate the value of several data-driven approaches, applied to an industrial dataset, to achieve this goal. Although the examined data-driven methods were applied to a dataset which has been generated under a wide variety of machining conditions and lacks reliable ground truth, the obtained experimental results confirm that these methods are indeed capable of extracting informative profiles from the tool usages and can identify anomalous patterns and signs in the time-series datasets collected during different machining processes.

制造业企业在可持续发展方面还有很大的改进空间。在加工过程中，刀具的更换没有以最优的方式进行，导致生产过程中的资源利用不优化，效率低下。使用数据驱动的方法来扩展工具的使用，可以通过优化这些工具的替换过程来极大地改善这一缺点。因此，本研究旨在探讨应用于工业数据集的几种数据驱动方法的价值，以实现这一目标。虽然所研究的数据驱动方法应用于在各种加工条件下生成的数据集，并且缺乏可靠的地面真理，但所获得的实验结果证实，这些方法确实能够从刀具使用中提取信息轮廓，并且可以识别在不同加工过程中收集的时间序列数据集中的异常模式和标志。

引用次数: 0

Stress Identification in Online Social Networks 在线社交网络中的压力识别

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00063

Ashok Kumar, T. Trueman, E. Cambria

Online social networks have become one of the primary ways of communication to individuals. It rapidly gen-erates a large volume of textual and non-textual data such as images, audio, and videos. In particular, textual data plays a vital role in detecting mental health-related problems such as stress, depression, anxiety, and emotional and behavioral disorders. In this paper, we identify the mental stress of online users in social networks using a transformers-based RoBERTa model and an autoregressive model, also called XLNet. We implement this model in both a constrained system and an unconstrained system. The constrained system maintains the gold standard datasets such as training, validation, and testing. On the other hand, the unconstrained system divides the given dataset into user-specific training, validation, and test sets. Our results indicate that the proposed transformers-based RoBERTa model achieves a better result in both constrained and unconstrained systems than the state-of-the-art models.

在线社交网络已经成为个人交流的主要方式之一。它可以快速生成大量的文本和非文本数据，如图像、音频和视频。特别是，文本数据在检测精神健康相关问题(如压力、抑郁、焦虑、情绪和行为障碍)方面起着至关重要的作用。在本文中，我们使用基于转换器的RoBERTa模型和自回归模型(也称为XLNet)来识别社交网络中在线用户的心理压力。我们在有约束系统和无约束系统中都实现了这个模型。约束系统维护黄金标准数据集，如训练、验证和测试。另一方面，无约束系统将给定的数据集划分为特定于用户的训练集、验证集和测试集。我们的结果表明，所提出的基于变压器的RoBERTa模型在有约束和无约束系统中都比最先进的模型取得了更好的结果。

引用次数: 0

Compression Methods for Transformers in Multidomain Sentiment Analysis 多域情感分析中变压器的压缩方法

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00062

Wojciech Korczynski, Jan Kocoń

Transformer models like BERT have significantly improved performance on many NLP tasks, e.g., sentiment analysis. However, their large number of parameters makes real-world applications difficult because of computational costs and latency. Many compression methods have been proposed to solve this problem using quantization, weight pruning, and knowledge distillation. In this work, we explore some of these task-specific and task-agnostic methods by comparing their effectiveness and quality on the MultiEmo sentiment analysis dataset. Additionally, we analyze their ability to generalize and capture sentiment features by conducting domain-sentiment experiments. The results show that the compression methods reduce the model size by 8.6 times and the inference time by 6.9 times compared to the original model while maintaining unimpaired quality. Smaller models perform better on tasks with fewer data and retain more remarkable generalization ability after fine-tuning because they are less prone to overfitting. The best trade-off is obtained using the task-agnostic XtremeDistil model.

像BERT这样的变形模型在许多NLP任务上显著提高了性能，例如情绪分析。然而，由于计算成本和延迟，它们的大量参数使实际应用变得困难。为了解决这一问题，人们提出了许多压缩方法，如量化、权值修剪和知识蒸馏。在这项工作中，我们通过比较它们在MultiEmo情感分析数据集上的有效性和质量，探索了其中一些任务特定和任务不可知的方法。此外，我们通过进行领域情感实验来分析它们概括和捕获情感特征的能力。结果表明，与原始模型相比，压缩方法在保持原始模型质量不变的情况下，将模型大小减少了8.6倍，推理时间减少了6.9倍。较小的模型在数据较少的任务上表现更好，并且在微调后保留了更显著的泛化能力，因为它们不容易出现过拟合。使用任务不可知的xtremeditil模型获得了最佳权衡。

引用次数: 3

Disentangling the Information Flood on OSNs: Finding Notable Posts and Topics 解开osn上的信息洪流:发现值得关注的帖子和话题

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00152

P. Caso, Martino Trevisan, L. Vassio

Online Social Networks (OSN s) are an integral part of modern life for sharing thoughts, stories, and news. An ecosystem of influencers generates a flood of content in the form of posts, some of which have an unusually high level of engagement with the influencer's fan base. These posts relate to blossoming topics of discussion that generate particular interest among users: The COVID-19 pandemic is a prominent example. Studying these phenomena provides an understanding of the OSN landscape and requires appropriate methods. This paper presents a methodology to discover notable posts and group them according to their related topic. By combining anomaly detection, graph modelling and community detection techniques, we pinpoint salient events automatically, with the ability to tune the amount of them. We showcase our approach using a large Instagram dataset and extract some notable weekly topics that gained momentum from 1.4 million posts. We then illustrate some use cases ranging from the COVID-19 outbreak to sporting events.

在线社交网络(OSN)是现代生活中不可或缺的一部分，用于分享思想、故事和新闻。网红生态系统会以帖子的形式产生大量内容，其中一些内容与网红的粉丝群有着不同寻常的高参与度。这些帖子与引起用户特别兴趣的热门讨论话题有关:COVID-19大流行就是一个突出的例子。研究这些现象有助于了解OSN的情况，并需要适当的方法。本文提出了一种方法来发现值得注意的帖子，并根据他们的相关主题进行分组。通过结合异常检测、图形建模和社区检测技术，我们可以自动定位显著事件，并能够调整它们的数量。我们使用一个大型Instagram数据集来展示我们的方法，并从140万篇帖子中提取出一些值得注意的每周话题。然后，我们举例说明了从COVID-19爆发到体育赛事的一些用例。

引用次数: 0

FastFlow: AI for Fast Urban Wind Velocity Prediction FastFlow:用于快速城市风速预测的人工智能

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00028

Shi Jer Low, V. Raghavan, H. Gopalan, Jian Cheng Wong, J. Yeoh, C. Ooi

Data-driven approaches, including deep learning, have shown great promise as surrogate models across many domains, including computer vision and natural language pro-cessing. These extend to various areas in sustainability, including for satellite image analysis to obtain information such as land usage and extent of development. An interesting direction for which data-driven methods have not been applied much yet is in the quick quantitative evaluation of urban layouts for planning and design. In particular, urban designs typically involve complex trade-offs between multiple objectives, including limits on urban build-up and/or consideration of urban heat island effect. Hence, it can be beneficial to urban planners to have a fast surrogate model to predict urban characteristics of a hypothetical layout, e.g. pedestrian-level wind velocity, without having to run compu-tationally expensive and time-consuming high-fidelity numerical simulations each time. This fast surrogate can then be potentially integrated into other design optimization frameworks, including generative models or other gradient-based methods. Here we present an investigation into the use of convolutional neural networks as a surrogate for urban layout characterization that is typically done via high-fidelity numerical simulation. We then further apply this model towards a first demonstration of its utility for data-driven pedestrian-level wind velocity prediction. The data set in this work comprises results from high-fidelity numerical simulations of wind velocities for a diverse set of realistic urban layouts, based on randomized samples from a real-world, highly built-up urban city. We then provide prediction results obtained from the neural network trained on this data-set, demonstrating test errors of under 0.1 m/s for previously unseen novel urban layouts. We further illustrate how this can be useful for purposes such as rapid evaluation of pedestrian wind velocity for a potential new layout. In addition, it is hoped that this data set will further inspire, facilitate and accelerate research in data-driven urban AI, even as our baseline model facilitates quantitative comparison to future, more innovative methods.

数据驱动的方法，包括深度学习，已经在许多领域显示出巨大的前景，包括计算机视觉和自然语言处理。这些扩展到可持续性的各个领域，包括卫星图像分析，以获得诸如土地使用和发展程度等信息。数据驱动的方法尚未应用的一个有趣的方向是用于规划和设计的城市布局的快速定量评估。特别是，城市设计通常涉及多个目标之间的复杂权衡，包括限制城市建设和/或考虑城市热岛效应。因此，对于城市规划者来说，有一个快速的替代模型来预测假设布局的城市特征(例如行人水平的风速)是有益的，而不必每次都运行计算上昂贵且耗时的高保真数值模拟。然后，这个快速代理可以潜在地集成到其他设计优化框架中，包括生成模型或其他基于梯度的方法。在这里，我们提出了一项关于使用卷积神经网络作为城市布局表征的替代品的研究，该表征通常通过高保真数值模拟完成。然后，我们进一步将该模型应用于数据驱动的行人级风速预测的首次演示。这项工作中的数据集包括对多种现实城市布局的高保真风速数值模拟的结果，这些模拟基于来自现实世界中高度建设的城市的随机样本。然后，我们提供了从该数据集上训练的神经网络获得的预测结果，证明对于以前未见过的新城市布局，测试误差低于0.1 m/s。我们进一步说明了这在快速评估潜在新布局的行人风速等方面是如何有用的。此外，希望该数据集将进一步启发、促进和加速数据驱动的城市人工智能研究，即使我们的基线模型有助于与未来更创新的方法进行定量比较。

{"title":"FastFlow: AI for Fast Urban Wind Velocity Prediction","authors":"Shi Jer Low, V. Raghavan, H. Gopalan, Jian Cheng Wong, J. Yeoh, C. Ooi","doi":"10.1109/ICDMW58026.2022.00028","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00028","url":null,"abstract":"Data-driven approaches, including deep learning, have shown great promise as surrogate models across many domains, including computer vision and natural language pro-cessing. These extend to various areas in sustainability, including for satellite image analysis to obtain information such as land usage and extent of development. An interesting direction for which data-driven methods have not been applied much yet is in the quick quantitative evaluation of urban layouts for planning and design. In particular, urban designs typically involve complex trade-offs between multiple objectives, including limits on urban build-up and/or consideration of urban heat island effect. Hence, it can be beneficial to urban planners to have a fast surrogate model to predict urban characteristics of a hypothetical layout, e.g. pedestrian-level wind velocity, without having to run compu-tationally expensive and time-consuming high-fidelity numerical simulations each time. This fast surrogate can then be potentially integrated into other design optimization frameworks, including generative models or other gradient-based methods. Here we present an investigation into the use of convolutional neural networks as a surrogate for urban layout characterization that is typically done via high-fidelity numerical simulation. We then further apply this model towards a first demonstration of its utility for data-driven pedestrian-level wind velocity prediction. The data set in this work comprises results from high-fidelity numerical simulations of wind velocities for a diverse set of realistic urban layouts, based on randomized samples from a real-world, highly built-up urban city. We then provide prediction results obtained from the neural network trained on this data-set, demonstrating test errors of under 0.1 m/s for previously unseen novel urban layouts. We further illustrate how this can be useful for purposes such as rapid evaluation of pedestrian wind velocity for a potential new layout. In addition, it is hoped that this data set will further inspire, facilitate and accelerate research in data-driven urban AI, even as our baseline model facilitates quantitative comparison to future, more innovative methods.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115061443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Students Temporal Profiling and e-Learning Resources Recommendation Based on NLP's Terms Extraction 基于NLP术语提取的学生时间特征分析与网络学习资源推荐

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00044

André Picado, A. Finamore, Ana Moura Santos, C. Antunes

Online education has gained significant relevance over the last few years, and the pandemic situation has brought evidence that it plays a fundamental role nowadays. However, even with the increasing number of students enrolled in online courses, these still do not allow for enough personalization, often leading students to become demotivated and dropping out. The goal of better adapting online courses to students aims to support them in an inclusive and equitable way, since the learners are often students from quite diverse backgrounds. The continuous demand for online learning, and the need to customize it according to the students' profile has led to a succession of attempts at recommendation systems. Nevertheless, many of them were entirely based on collaborative filtering, almost ignoring profiling requirements. In this paper, we propose a recommendation system to be integrated into MOOCs (Massive Open Online Courses), following a hybrid architecture. In our proposal, learning resources are described by a set of terms, extracted directly from the supporting texts in the MOOC. From these terms, those which are included in the exercises will be used to specify the important skills learners must acquire, and the results achieved by each learner in them are used to characterize the particular student's state, at a given moment. Those states are then used to make the recommendation collaboratively, allowing for different recommendations for each particular student over time. The system is validated across several MOOCs.

在线教育在过去几年中具有重要意义，大流行的形势证明，在线教育在当今发挥着重要作用。然而，尽管越来越多的学生参加了在线课程，但这些课程仍然没有提供足够的个性化，这往往导致学生失去动力并辍学。更好地适应学生的在线课程的目标是以包容和公平的方式支持他们，因为学习者通常是来自不同背景的学生。对在线学习的持续需求，以及根据学生的个人资料对其进行定制的需求，导致了对推荐系统的一系列尝试。然而，它们中的许多完全基于协同过滤，几乎忽略了分析需求。在本文中，我们提出了一个基于混合架构的推荐系统，并将其集成到mooc (Massive Open Online Courses)中。在我们的建议中，学习资源由一组术语来描述，这些术语直接从MOOC的支持文本中提取。从这些术语中，那些包含在练习中的术语将被用来指定学习者必须获得的重要技能，并且每个学习者在其中获得的结果被用来表征特定学生在给定时刻的状态。然后使用这些状态进行协作推荐，允许对每个特定学生进行不同的推荐。该系统在多个mooc上进行了验证。

{"title":"Students Temporal Profiling and e-Learning Resources Recommendation Based on NLP's Terms Extraction","authors":"André Picado, A. Finamore, Ana Moura Santos, C. Antunes","doi":"10.1109/ICDMW58026.2022.00044","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00044","url":null,"abstract":"Online education has gained significant relevance over the last few years, and the pandemic situation has brought evidence that it plays a fundamental role nowadays. However, even with the increasing number of students enrolled in online courses, these still do not allow for enough personalization, often leading students to become demotivated and dropping out. The goal of better adapting online courses to students aims to support them in an inclusive and equitable way, since the learners are often students from quite diverse backgrounds. The continuous demand for online learning, and the need to customize it according to the students' profile has led to a succession of attempts at recommendation systems. Nevertheless, many of them were entirely based on collaborative filtering, almost ignoring profiling requirements. In this paper, we propose a recommendation system to be integrated into MOOCs (Massive Open Online Courses), following a hybrid architecture. In our proposal, learning resources are described by a set of terms, extracted directly from the supporting texts in the MOOC. From these terms, those which are included in the exercises will be used to specify the important skills learners must acquire, and the results achieved by each learner in them are used to characterize the particular student's state, at a given moment. Those states are then used to make the recommendation collaboratively, allowing for different recommendations for each particular student over time. The system is validated across several MOOCs.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123211161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Streaming Traffic Flow Prediction Based on Continuous Reinforcement Learning 基于持续强化学习的流交通流预测

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00011

Yanan Xiao, Minyu Liu, Zichen Zhang, Lu Jiang, Minghao Yin, Jianan Wang

Traffic flow prediction is an important part of smart transportation. The goal is to predict future traffic conditions based on historical data recorded by sensors and the traffic net-work. As the city continues to build, parts of the transportation network will be added or modified. How to accurately predict expanding and evolving long-term streaming networks is of great significance. To this end, we propose a new simulation-based criterion that considers teaching autonomous agents to mimic sensor patterns, planning their next visit based on the sensor's profile (e.g., traffic, speed, occupancy). The data recorded by the sensor is most accurate when the agent can perfectly simulate the sensor's activity pattern. We propose to formulate the problem as a continuous reinforcement learning task, where the agent is the next flow value predictor, the action is the next time-series flow value in the sensor, and the environment state is a dynamically fused representation of the sensor and transportation network. Actions taken by the agent change the environment, which in turn forces the agent's mode to update, while the agent further explores changes in the dynamic traffic network, which helps the agent predict its next visit more accurately. Therefore, we develop a strategy in which sensors and traffic networks update each other and incorporate temporal context to quantify state representations evolving over time. Along these lines, we propose streaming traffic flow prediction based on continuous reinforcement learning model (ST-CRL), a kind of predictive model based on reinforcement learning and continuous learning, and an analytical algorithm based on KL divergence that cleverly incorporates long-term novel patterns into model induction. Second, we introduce a prioritized experience replay strategy to consolidate and aggregate previously learned core knowledge into the model. The proposed model is able to continuously learn and predict as the traffic flow network expands and evolves over time. Extensive experiments show that the algorithm has great potential in predicting long-term streaming media networks, while achieving data privacy protection to a certain extent.

交通流预测是智能交通的重要组成部分。目标是根据传感器和交通网络记录的历史数据预测未来的交通状况。随着城市的继续建设，部分交通网络将被增加或修改。如何准确预测不断扩大和演变的长期流媒体网络具有重要意义。为此，我们提出了一种新的基于模拟的标准，该标准考虑教自主代理模仿传感器模式，根据传感器的配置文件(例如，交通，速度，占用率)计划下一次访问。当agent能够完美地模拟传感器的活动模式时，传感器记录的数据是最准确的。我们建议将该问题表述为一个连续强化学习任务，其中智能体是下一个流量值预测器，动作是传感器中的下一个时间序列流量值，环境状态是传感器和运输网络的动态融合表示。agent所采取的行动改变了环境，这反过来又迫使agent的模式更新，同时agent进一步探索动态交通网络的变化，这有助于agent更准确地预测自己的下一次访问。因此，我们开发了一种策略，其中传感器和交通网络相互更新，并结合时间上下文来量化随时间演变的状态表示。沿着这些思路，我们提出了基于连续强化学习模型(ST-CRL)的流交通流预测，一种基于强化学习和连续学习的预测模型，以及一种基于KL散度的分析算法，该算法巧妙地将长期新模式纳入模型归纳。其次，我们引入了一种优先体验重放策略，将先前学习的核心知识整合和聚合到模型中。该模型能够随着交通流网络的不断扩展和演变而不断学习和预测。大量实验表明，该算法在预测长期流媒体网络方面具有很大的潜力，同时在一定程度上实现了数据隐私保护。

{"title":"Streaming Traffic Flow Prediction Based on Continuous Reinforcement Learning","authors":"Yanan Xiao, Minyu Liu, Zichen Zhang, Lu Jiang, Minghao Yin, Jianan Wang","doi":"10.1109/ICDMW58026.2022.00011","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00011","url":null,"abstract":"Traffic flow prediction is an important part of smart transportation. The goal is to predict future traffic conditions based on historical data recorded by sensors and the traffic net-work. As the city continues to build, parts of the transportation network will be added or modified. How to accurately predict expanding and evolving long-term streaming networks is of great significance. To this end, we propose a new simulation-based criterion that considers teaching autonomous agents to mimic sensor patterns, planning their next visit based on the sensor's profile (e.g., traffic, speed, occupancy). The data recorded by the sensor is most accurate when the agent can perfectly simulate the sensor's activity pattern. We propose to formulate the problem as a continuous reinforcement learning task, where the agent is the next flow value predictor, the action is the next time-series flow value in the sensor, and the environment state is a dynamically fused representation of the sensor and transportation network. Actions taken by the agent change the environment, which in turn forces the agent's mode to update, while the agent further explores changes in the dynamic traffic network, which helps the agent predict its next visit more accurately. Therefore, we develop a strategy in which sensors and traffic networks update each other and incorporate temporal context to quantify state representations evolving over time. Along these lines, we propose streaming traffic flow prediction based on continuous reinforcement learning model (ST-CRL), a kind of predictive model based on reinforcement learning and continuous learning, and an analytical algorithm based on KL divergence that cleverly incorporates long-term novel patterns into model induction. Second, we introduce a prioritized experience replay strategy to consolidate and aggregate previously learned core knowledge into the model. The proposed model is able to continuously learn and predict as the traffic flow network expands and evolves over time. Extensive experiments show that the algorithm has great potential in predicting long-term streaming media networks, while achieving data privacy protection to a certain extent.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123908499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Knowledge Distillation-enabled Multi-stage Incremental Learning for Online Process Monitoring in Advanced Manufacturing 基于知识提炼的先进制造过程在线监控多阶段增量学习

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00154

Zhangyue Shi, Yuxuan Li, Chenang Liu

In advanced manufacturing, the incorporation of online sensing technologies has enabled great potentials to achieve effective in-situ process monitoring via machine learning-based approaches. In manufacturing practice, the online sensor data are usually collected in a progressive manner, and the stream data collected at latter stages may also contain informative knowledge for process monitoring. Therefore, it is highly valuable to make the machine learning-based monitoring model learn incrementally in manufacturing. To achieve this goal, this paper develops a multi-stage incremental learning approach enabled by the knowledge distillation, which distills representative information from the machine learning model trained at early/offline stage and then enhances the monitoring performance at the latter stages. To validate its effectiveness, a real-world case study in additive manufacturing, which is an emerging advanced manufacturing technology, is conducted. The experimental results show that the developed knowledge distillation-enabled multi-stage incremental learning is very promising to improve the online monitoring performance in advanced manufacturing.

在先进制造业中，在线传感技术的结合使得通过基于机器学习的方法实现有效的原位过程监测具有很大的潜力。在制造实践中，在线传感器数据通常以渐进的方式收集，后期收集的流数据也可能包含用于过程监控的信息性知识。因此，使基于机器学习的监控模型在制造业中实现渐进式学习具有重要的应用价值。为了实现这一目标，本文开发了一种基于知识蒸馏的多阶段增量学习方法，该方法从早期/离线阶段训练的机器学习模型中提取有代表性的信息，然后增强后期阶段的监控性能。为了验证其有效性，对增材制造这一新兴的先进制造技术进行了实际案例研究。实验结果表明，本文提出的基于知识提炼的多阶段增量学习方法对提高先进制造业在线监测性能有很大的帮助。

{"title":"Knowledge Distillation-enabled Multi-stage Incremental Learning for Online Process Monitoring in Advanced Manufacturing","authors":"Zhangyue Shi, Yuxuan Li, Chenang Liu","doi":"10.1109/ICDMW58026.2022.00154","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00154","url":null,"abstract":"In advanced manufacturing, the incorporation of online sensing technologies has enabled great potentials to achieve effective in-situ process monitoring via machine learning-based approaches. In manufacturing practice, the online sensor data are usually collected in a progressive manner, and the stream data collected at latter stages may also contain informative knowledge for process monitoring. Therefore, it is highly valuable to make the machine learning-based monitoring model learn incrementally in manufacturing. To achieve this goal, this paper develops a multi-stage incremental learning approach enabled by the knowledge distillation, which distills representative information from the machine learning model trained at early/offline stage and then enhances the monitoring performance at the latter stages. To validate its effectiveness, a real-world case study in additive manufacturing, which is an emerging advanced manufacturing technology, is conducted. The experimental results show that the developed knowledge distillation-enabled multi-stage incremental learning is very promising to improve the online monitoring performance in advanced manufacturing.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122634163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Using Genetic Programming to Identify Probability Distribution behind Data: A Preliminary Trial 使用遗传规划识别数据背后的概率分布:初步试验

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00056

Yang Syu, Chien-Min Wang

Before conducting any further applications or performing more advanced processing, analyzing and realizing the probability distribution of data is a crucial task. Traditionally, statistical methods are being developed for this procedure. In recent years, researchers in computer science have proposed and applied different machine learning-based techniques to address the abovementioned problem. However, the existing solutions remain problematic and inconvenient, such as the need for human intervention and the complexity of the resulting models. Thus, in this paper, without causing deficiency and inconvenience, a genetic programming-based approach for the identification of probability functions is proposed, implemented, and tested. Based on our empirical trials, in an immense search space of mathematical expressions, the proposed and developed approach can effectively recognize (retrieve) the probability distribution function behind data.

在进行任何进一步的应用或执行更高级的处理之前，分析和实现数据的概率分布是一项至关重要的任务。传统上，正在为这一程序开发统计方法。近年来，计算机科学研究人员提出并应用了不同的基于机器学习的技术来解决上述问题。然而，现有的解决方案仍然存在问题和不便，例如需要人工干预和产生的模型的复杂性。因此，在本文中，在不造成缺陷和不便的情况下，提出了一种基于遗传规划的概率函数识别方法，并进行了实现和测试。根据我们的经验试验，在巨大的数学表达式搜索空间中，提出和开发的方法可以有效地识别(检索)数据背后的概率分布函数。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀