首页 > 最新文献

2022 IEEE International Conference on Data Mining Workshops (ICDMW)最新文献

英文 中文
Adversarial Removal of Population Bias in Genomics Phenotype Prediction 基因组学表型预测中种群偏差的对抗性去除
Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00052
Honggang Zhao, Wenlu Wang
Many factors impact trait prediction from genotype data. One of the major confounding factors comes from the presence of population structure among sampled individuals, namely population stratification. When exists, it will lead to biased quantitative phenotype prediction, therefore hampering the unambiguous conclusions about prediction and limiting the downstream usage like disease evaluation or epidemiology survey. Population stratification is an implicit bias that can not be easily removed by data preprocessing. With the purpose of training a phenotype prediction model, we propose an adversarial training framework that ensures the genomics encoder is agnostic to sample populations. For better generalization, our adversarial training framework is orthogonal to the genomics encoder and phenotype prediction model. We experimentally ascertain our debiasing framework by testing on a real-world yield (phenotype) prediction dataset with soybean genomics. The developed frame-work is designed for general genomic data (e.g., human, livestock, and crops) while the phenotype can be either continuous or categorical variables.
许多因素影响着基因型数据的性状预测。其中一个主要的混杂因素来自于样本个体中存在的种群结构,即种群分层。当存在时,会导致定量表型预测的偏差,从而影响预测的明确结论,限制下游的使用,如疾病评估或流行病学调查。人口分层是一种不容易通过数据预处理消除的隐性偏差。为了训练表型预测模型,我们提出了一种对抗性训练框架,以确保基因组编码器对样本群体不可知。为了更好地泛化,我们的对抗性训练框架与基因组编码器和表型预测模型正交。我们通过大豆基因组学在实际产量(表型)预测数据集上进行测试,实验确定了我们的去偏框架。开发的框架是为一般基因组数据(例如,人类,牲畜和作物)设计的,而表型可以是连续变量或分类变量。
{"title":"Adversarial Removal of Population Bias in Genomics Phenotype Prediction","authors":"Honggang Zhao, Wenlu Wang","doi":"10.1109/ICDMW58026.2022.00052","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00052","url":null,"abstract":"Many factors impact trait prediction from genotype data. One of the major confounding factors comes from the presence of population structure among sampled individuals, namely population stratification. When exists, it will lead to biased quantitative phenotype prediction, therefore hampering the unambiguous conclusions about prediction and limiting the downstream usage like disease evaluation or epidemiology survey. Population stratification is an implicit bias that can not be easily removed by data preprocessing. With the purpose of training a phenotype prediction model, we propose an adversarial training framework that ensures the genomics encoder is agnostic to sample populations. For better generalization, our adversarial training framework is orthogonal to the genomics encoder and phenotype prediction model. We experimentally ascertain our debiasing framework by testing on a real-world yield (phenotype) prediction dataset with soybean genomics. The developed frame-work is designed for general genomic data (e.g., human, livestock, and crops) while the phenotype can be either continuous or categorical variables.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133327903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Online Deep Knowledge Tracing 在线深度知识追踪
Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00047
Wenxin Zhang, Yupei Zhang, Shuhui Liu, Xuequn Shang
This study focuses on solving the problem of knowledge tracing in a practical situation, where the responses from students come in a stream. Most current works of deep knowledge tracing are pursuing to integrate of more side information or data structure, but they often fail to make self-update in the dynamic learning situation. Towards this end, we here proposed an online deep knowledge tracing model, dubbed ODKT, by utilizing the online gradient descent algorithm to develop the traditional deep knowledge tracing (DKT) into online learning. Rather than learning a perfect model, the ODKT aims to train DKT in its using process step by step. Experiments were conducted on four public datasets for knowledge tracing. The results demonstrate that the ODKT model is effective and more suitable for practical applications.
本研究的重点是在一个实际情境中解决知识追溯的问题,在这个情境中,学生的反应是源源不断的。目前大多数深度知识跟踪的工作都是追求更多侧信息或数据结构的集成,但往往不能在动态学习的情况下进行自我更新。为此,本文提出了一种在线深度知识跟踪模型ODKT,利用在线梯度下降算法将传统的深度知识跟踪(DKT)发展为在线学习。ODKT的目标不是学习一个完美的模型,而是一步一步地训练DKT的使用过程。在四个公共数据集上进行了知识跟踪实验。结果表明,ODKT模型是有效的,更适合于实际应用。
{"title":"Online Deep Knowledge Tracing","authors":"Wenxin Zhang, Yupei Zhang, Shuhui Liu, Xuequn Shang","doi":"10.1109/ICDMW58026.2022.00047","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00047","url":null,"abstract":"This study focuses on solving the problem of knowledge tracing in a practical situation, where the responses from students come in a stream. Most current works of deep knowledge tracing are pursuing to integrate of more side information or data structure, but they often fail to make self-update in the dynamic learning situation. Towards this end, we here proposed an online deep knowledge tracing model, dubbed ODKT, by utilizing the online gradient descent algorithm to develop the traditional deep knowledge tracing (DKT) into online learning. Rather than learning a perfect model, the ODKT aims to train DKT in its using process step by step. Experiments were conducted on four public datasets for knowledge tracing. The results demonstrate that the ODKT model is effective and more suitable for practical applications.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127975768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Backdoor Poisoning of Encrypted Traffic Classifiers 加密流量分类器的后门中毒
Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00080
J. Holodnak, Olivia Brown, J. Matterer, Andrew Lemke
Significant recent research has focused on applying deep neural network models to the problem of network traffic classification. At the same time, much has been written about the vulnerability of deep neural networks to adversarial inputs, both during training and inference. In this work, we consider launching backdoor poisoning attacks against an encrypted network traffic classifier. We consider attacks based on padding network packets, which has the benefit of preserving the functionality of the network traffic. In particular, we consider a handcrafted attack, as well as an optimized attack leveraging universal adversarial perturbations. We find that poisoning attacks can be extremely successful if the adversary has the ability to modify both the labels and the data (dirty label attacks) and somewhat successful, depending on the attack strength and the target class, if the adversary perturbs only the data (clean label attacks).
近年来的重要研究集中在将深度神经网络模型应用于网络流量分类问题上。与此同时,关于深度神经网络在训练和推理过程中对对抗性输入的脆弱性的文章也很多。在这项工作中,我们考虑对加密的网络流量分类器发起后门投毒攻击。我们考虑基于填充网络数据包的攻击,这有利于保留网络流量的功能。特别是,我们考虑了手工攻击,以及利用普遍对抗性扰动的优化攻击。我们发现,如果攻击者有能力修改标签和数据(脏标签攻击),那么中毒攻击就会非常成功;如果攻击者只干扰数据(干净标签攻击),那么中毒攻击就会取得一定程度的成功,这取决于攻击强度和目标类别。
{"title":"Backdoor Poisoning of Encrypted Traffic Classifiers","authors":"J. Holodnak, Olivia Brown, J. Matterer, Andrew Lemke","doi":"10.1109/ICDMW58026.2022.00080","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00080","url":null,"abstract":"Significant recent research has focused on applying deep neural network models to the problem of network traffic classification. At the same time, much has been written about the vulnerability of deep neural networks to adversarial inputs, both during training and inference. In this work, we consider launching backdoor poisoning attacks against an encrypted network traffic classifier. We consider attacks based on padding network packets, which has the benefit of preserving the functionality of the network traffic. In particular, we consider a handcrafted attack, as well as an optimized attack leveraging universal adversarial perturbations. We find that poisoning attacks can be extremely successful if the adversary has the ability to modify both the labels and the data (dirty label attacks) and somewhat successful, depending on the attack strength and the target class, if the adversary perturbs only the data (clean label attacks).","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114328334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Above Ground Biomass Estimation of a Cocoa Plantation using Machine Learning 利用机器学习估算可可种植园地上生物量
Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00147
Sabrina Sankar, Marvin B. Lewis, Patrick Hosein
The rapid increase in carbon dioxide in the atmosphere and its associated effects on climate change and global warming has raised the importance of monitoring carbon sequestration levels. Estimating above ground biomass (AGB) is one way of monitoring carbon sequestration in forested areas. Quantifying above ground biomass using direct methods is costly, time-consuming and, in many cases, impractical. However, remote sensing technologies such as LiDAR (Light Detection And Ranging) captures three dimensional information which can be used to perform this estimation. In particular, LiDAR can be used to estimate the diameter of a tree at breast height (DBH) and from this we can estimate its AGB. For this research we used LiDAR data, along with various Machine Learning (ML) algorithms (Multiple Linear Regression, Random Forest, Support Vector Regression and Regression Tree) to estimate DBH of cocoa trees. Various feature selection methods were used to select the most significant features for our model. The best performing algorithm was Random Forest which achieved an R2 value of 0.83 and Root Mean Square Estimate (RMSE) value of 0.062. This algorithm then estimated an AGB value of 28.75 ± 2.34 Mg/ha (Megagram per hectare). We compared this result with that obtained from locally-developed allometric equations for the same cocoa plot. The comparison proved our estimate to be 14.7% lower than the allometric equation. The results demonstrated that using ML with LiDAR measurements for AGB estimation is quite promising.
大气中二氧化碳的迅速增加及其对气候变化和全球变暖的相关影响提高了监测碳固存水平的重要性。估算地上生物量(AGB)是监测森林地区碳固存的一种方法。使用直接方法对地上生物量进行量化既昂贵又耗时,而且在许多情况下不切实际。然而,遥感技术,如激光雷达(光探测和测距)捕获三维信息,可用于执行这种估计。特别是,激光雷达可以用来估计树在胸高(DBH)的直径,从中我们可以估计它的AGB。在这项研究中,我们使用激光雷达数据,以及各种机器学习(ML)算法(多元线性回归、随机森林、支持向量回归和回归树)来估计可可树的胸径。我们使用了各种特征选择方法来为我们的模型选择最重要的特征。表现最好的算法是Random Forest, R2值为0.83,RMSE值为0.062。该算法估计AGB值为28.75±2.34 Mg/ha (Megagram /公顷)。我们将这一结果与同一块可可地本地开发的异速生长方程得到的结果进行了比较。结果表明,我们的估计值比异速生长方程低14.7%。结果表明,使用ML与LiDAR测量进行AGB估计是非常有前途的。
{"title":"Above Ground Biomass Estimation of a Cocoa Plantation using Machine Learning","authors":"Sabrina Sankar, Marvin B. Lewis, Patrick Hosein","doi":"10.1109/ICDMW58026.2022.00147","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00147","url":null,"abstract":"The rapid increase in carbon dioxide in the atmosphere and its associated effects on climate change and global warming has raised the importance of monitoring carbon sequestration levels. Estimating above ground biomass (AGB) is one way of monitoring carbon sequestration in forested areas. Quantifying above ground biomass using direct methods is costly, time-consuming and, in many cases, impractical. However, remote sensing technologies such as LiDAR (Light Detection And Ranging) captures three dimensional information which can be used to perform this estimation. In particular, LiDAR can be used to estimate the diameter of a tree at breast height (DBH) and from this we can estimate its AGB. For this research we used LiDAR data, along with various Machine Learning (ML) algorithms (Multiple Linear Regression, Random Forest, Support Vector Regression and Regression Tree) to estimate DBH of cocoa trees. Various feature selection methods were used to select the most significant features for our model. The best performing algorithm was Random Forest which achieved an R2 value of 0.83 and Root Mean Square Estimate (RMSE) value of 0.062. This algorithm then estimated an AGB value of 28.75 ± 2.34 Mg/ha (Megagram per hectare). We compared this result with that obtained from locally-developed allometric equations for the same cocoa plot. The comparison proved our estimate to be 14.7% lower than the allometric equation. The results demonstrated that using ML with LiDAR measurements for AGB estimation is quite promising.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"253 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114107989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
End-to-End Modeling of Hierarchical Time Series Using Autoregressive Transformer and Conditional Normalizing Flow-based Reconciliation 基于自回归变压器和条件归一化流调节的分层时间序列端到端建模
Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00141
Shiyu Wang, Fan Zhou, Yinbo Sun, Lintao Ma, James Zhang, Yang Zheng, Lei Lei, Yun Hu
Multivariate time series forecasting with hierarchi-cal structure is pervasive in real-world applications, demanding not only predicting each level of the hierarchy, but also recon-ciling all forecasts to ensure coherency, i.e., the forecasts should satisfy the hierarchical aggregation constraints. Moreover, the disparities of statistical characteristics between levels can be huge, worsened by non-Gaussian distributions and non-linear correlations. To this extent, we propose a novel end-to-end hierarchical time series forecasting model, based on conditioned normalizing flow-based autoregressive transformer reconciliation, to represent complex data distribution while simultaneously reconciling the forecasts to ensure coherency. Unlike other state-of-the-art methods, we achieve the forecasting and reconciliation simultaneously without requiring any explicit post-processing step. In addition, by harnessing the power of deep model, we do not rely on any assumption such as unbiased estimates or Gaussian distribution. Our evaluation experiments are conducted on four real-world hierarchical datasets from different industrial domains (three public ones and a dataset from the application servers of Alipay11Alipay is the world's leading company in payment technology. https:/len.wikipedia.org/wiki/Alipay) and the preliminary results demonstrate efficacy of our proposed method.
具有层次结构的多变量时间序列预测在实际应用中非常普遍,它不仅需要预测层次结构的每一层,而且需要协调所有预测以确保一致性,即预测应满足层次聚集约束。此外,水平之间的统计特征差异可能是巨大的,非高斯分布和非线性相关性使其恶化。为此,我们提出了一种新的端到端分层时间序列预测模型,该模型基于条件归一化流自回归变压器调节,以表示复杂的数据分布,同时调节预测以确保一致性。与其他最先进的方法不同,我们同时实现预测和调节,而不需要任何明确的后处理步骤。此外,通过利用深度模型的力量,我们不依赖于任何假设,如无偏估计或高斯分布。我们的评估实验是在来自不同行业领域的四个真实世界的分层数据集(三个公共数据集和一个来自支付宝应用服务器的数据集)上进行的。支付宝是世界领先的支付技术公司。https:/len.wikipedia.org/wiki/Alipay),初步结果证明了我们提出的方法的有效性。
{"title":"End-to-End Modeling of Hierarchical Time Series Using Autoregressive Transformer and Conditional Normalizing Flow-based Reconciliation","authors":"Shiyu Wang, Fan Zhou, Yinbo Sun, Lintao Ma, James Zhang, Yang Zheng, Lei Lei, Yun Hu","doi":"10.1109/ICDMW58026.2022.00141","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00141","url":null,"abstract":"Multivariate time series forecasting with hierarchi-cal structure is pervasive in real-world applications, demanding not only predicting each level of the hierarchy, but also recon-ciling all forecasts to ensure coherency, i.e., the forecasts should satisfy the hierarchical aggregation constraints. Moreover, the disparities of statistical characteristics between levels can be huge, worsened by non-Gaussian distributions and non-linear correlations. To this extent, we propose a novel end-to-end hierarchical time series forecasting model, based on conditioned normalizing flow-based autoregressive transformer reconciliation, to represent complex data distribution while simultaneously reconciling the forecasts to ensure coherency. Unlike other state-of-the-art methods, we achieve the forecasting and reconciliation simultaneously without requiring any explicit post-processing step. In addition, by harnessing the power of deep model, we do not rely on any assumption such as unbiased estimates or Gaussian distribution. Our evaluation experiments are conducted on four real-world hierarchical datasets from different industrial domains (three public ones and a dataset from the application servers of Alipay11Alipay is the world's leading company in payment technology. https:/len.wikipedia.org/wiki/Alipay) and the preliminary results demonstrate efficacy of our proposed method.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124347423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Hybrid ConvLSTM Deep Neural Network for Noise Reduction and Data Augmentation for Prediction of Non-linear Dynamics of Streamflow 基于混合ConvLSTM深度神经网络的流场非线性动力学降噪与数据增强预测
Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00146
J. Rochac, N. Zhang, T. Deksissa, Jiajun Xu, Lara A. Thompson
Long Short-Term Memory (LSTM) models are at the cutting edge of artificial learning and ecoinformatics in regards to water quantity prediction. However, one driver for more accuracy, efficient, and robust, water pollution perdition methods is climate change, and in particular global sea level rising. Statistical systems are no longer reliable and new prediction models need to be explored due to the increasing nonlinearity of streamflow predictors and extremes sea level changes. Another driver is that, in places with legacy infrastructure, updated water monitoring systems and unreliable forecasting framework, state-of-the-art LSTM -based models suffer due to the presence of noisy data. This paper proposes multiple LSTM-based models with Scharr filtering to improve the streamflow prediction accuracy against noise. A hybrid ConvLSTM approach is realized to overcome the nonlinearity of the main predictors and the noises. The evaluation results demonstrate that the proposed hybrid ConvLSTM model can effectively improve the overall prediction accuracy for both real-world data and the noise-augmented data. The hybrid ConvLSTM model also obtained competitive and even better performance compared with several state-of-the-art methods. In addition, our proposed design achieves comparable performance in terms of prediction time.
长短期记忆(LSTM)模型在水量预测方面处于人工学习和生态信息学的前沿。然而,气候变化,特别是全球海平面上升,是更准确、更有效、更可靠的水污染预测方法的一个驱动因素。由于流量预测器的非线性增加和海平面的极端变化,统计系统不再可靠,需要探索新的预测模式。另一个驱动因素是,在基础设施陈旧、水监测系统更新和预测框架不可靠的地方,最先进的基于LSTM的模型由于噪声数据的存在而受到影响。本文提出了多个基于lstm的沙尔滤波模型,以提高对噪声的流量预测精度。为了克服主要预测量和噪声的非线性,实现了一种混合卷积stm方法。评价结果表明,所提出的混合ConvLSTM模型可以有效提高实际数据和噪声增强数据的整体预测精度。混合ConvLSTM模型也获得了具有竞争力甚至更好的性能。此外,我们提出的设计在预测时间方面达到了相当的性能。
{"title":"A Hybrid ConvLSTM Deep Neural Network for Noise Reduction and Data Augmentation for Prediction of Non-linear Dynamics of Streamflow","authors":"J. Rochac, N. Zhang, T. Deksissa, Jiajun Xu, Lara A. Thompson","doi":"10.1109/ICDMW58026.2022.00146","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00146","url":null,"abstract":"Long Short-Term Memory (LSTM) models are at the cutting edge of artificial learning and ecoinformatics in regards to water quantity prediction. However, one driver for more accuracy, efficient, and robust, water pollution perdition methods is climate change, and in particular global sea level rising. Statistical systems are no longer reliable and new prediction models need to be explored due to the increasing nonlinearity of streamflow predictors and extremes sea level changes. Another driver is that, in places with legacy infrastructure, updated water monitoring systems and unreliable forecasting framework, state-of-the-art LSTM -based models suffer due to the presence of noisy data. This paper proposes multiple LSTM-based models with Scharr filtering to improve the streamflow prediction accuracy against noise. A hybrid ConvLSTM approach is realized to overcome the nonlinearity of the main predictors and the noises. The evaluation results demonstrate that the proposed hybrid ConvLSTM model can effectively improve the overall prediction accuracy for both real-world data and the noise-augmented data. The hybrid ConvLSTM model also obtained competitive and even better performance compared with several state-of-the-art methods. In addition, our proposed design achieves comparable performance in terms of prediction time.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"206 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123266297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EnD: Enhanced Dedensification for Graph Compressing and Embedding 结束:图形压缩和嵌入的增强去密度化
Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00092
Tanvir Hossain, Esra Akbas, Muhammad Ifte Khairul Islam
Graph representation learning is essential in applying machine learning methods on large-scale networks. Several embedding approaches have shown promising outcomes in recent years. Nonetheless, on massive graphs, it may be time-consuming and space inefficient for direct applications of existing embedding methods. This paper presents a novel graph compression approach based on dedensification called Enhanced Dedensification with degree-based compression (EnD). The principal goal of our system is to assure decent compression of large graphs that eloquently favor their representation learning. For this purpose, we first compress the low-degree nodes and dedensify them to reduce the high-degree nodes' loads. Then, we embed the compressed graph instead of the original graph to decrease the representation learning cost. Our approach is a general meta-strategy that attains time and space efficiency over the original graph by applying the state-of-the-art graph embedding methods: Node2vec, DeepWalk, RiWalk, and xNetMf. Comprehensive ex-periments on large-scale real-world graphs validate the viability of our method, which shows sound performance on single and multi-label node classification tasks without losing accuracy.
图表示学习是将机器学习方法应用于大规模网络的关键。近年来,几种嵌入方法已经显示出有希望的结果。然而,在海量图上,直接应用现有的嵌入方法可能会耗费时间和空间。本文提出了一种新的基于脱密的图形压缩方法,称为基于程度压缩的增强脱密。我们系统的主要目标是确保对大型图进行适当的压缩,从而有力地支持它们的表示学习。为此,我们首先压缩低度节点并对其进行致密化,以减少高度节点的负载。然后,我们嵌入压缩图代替原始图,以减少表示学习成本。我们的方法是一种通用的元策略,通过应用最先进的图嵌入方法(Node2vec、DeepWalk、RiWalk和xNetMf),在原始图上实现时间和空间效率。在大规模真实图上的综合实验验证了我们的方法的可行性,该方法在单标签和多标签节点分类任务上表现出良好的性能,同时又不失准确性。
{"title":"EnD: Enhanced Dedensification for Graph Compressing and Embedding","authors":"Tanvir Hossain, Esra Akbas, Muhammad Ifte Khairul Islam","doi":"10.1109/ICDMW58026.2022.00092","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00092","url":null,"abstract":"Graph representation learning is essential in applying machine learning methods on large-scale networks. Several embedding approaches have shown promising outcomes in recent years. Nonetheless, on massive graphs, it may be time-consuming and space inefficient for direct applications of existing embedding methods. This paper presents a novel graph compression approach based on dedensification called Enhanced Dedensification with degree-based compression (EnD). The principal goal of our system is to assure decent compression of large graphs that eloquently favor their representation learning. For this purpose, we first compress the low-degree nodes and dedensify them to reduce the high-degree nodes' loads. Then, we embed the compressed graph instead of the original graph to decrease the representation learning cost. Our approach is a general meta-strategy that attains time and space efficiency over the original graph by applying the state-of-the-art graph embedding methods: Node2vec, DeepWalk, RiWalk, and xNetMf. Comprehensive ex-periments on large-scale real-world graphs validate the viability of our method, which shows sound performance on single and multi-label node classification tasks without losing accuracy.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127091399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting Unobserved Node States with spatio-temporal Graph Neural Networks 时空图神经网络预测未观测节点状态
Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00101
Andreas Roth, T. Liebig
Forecasting future states of sensors is key to solving tasks like weather prediction, route planning, and many others when dealing with networks of sensors. But complete spatial coverage of sensors is generally unavailable and would practically be infeasible due to limitations in budget and other resources during deployment and maintenance. Currently existing approaches using machine learning are limited to the spatial locations where data was observed, causing limitations to downstream tasks. Inspired by the recent surge of Graph Neural Networks for spatio-temporal data processing, we investigate whether these can also forecast the state of locations with no sensors available. For this purpose, we develop a framework, named Forecasting Unobserved Node States (FUNS), that allows forecasting the state at entirely unobserved locations based on spatio-temporal correlations and the graph inductive bias. FUNS serves as a blueprint for optimizing models only on observed data and demonstrates good generalization capabilities for predicting the state at entirely unobserved locations during the testing stage. Our framework can be combined with any spatio-temporal Graph Neural Network, that exploits spatio-temporal correlations with surrounding observed locations by using the network's graph structure. Our employed model builds on a previous model by also allowing us to exploit prior knowledge about locations of interest, e.g. the road type. Our empirical evaluation of both simulated and real-world datasets demonstrates that Graph Neural Networks are well-suited for this task.
在处理传感器网络时,预测传感器的未来状态是解决天气预报、路线规划等任务的关键。但是,传感器的完全空间覆盖通常是不可能的,而且由于预算和其他资源在部署和维护期间的限制,实际上是不可行的。目前使用机器学习的现有方法仅限于观察数据的空间位置,从而限制了下游任务。受最近用于时空数据处理的图神经网络的启发,我们研究了这些网络是否也可以预测没有传感器可用的位置的状态。为此,我们开发了一个名为预测未观测节点状态(FUNS)的框架,该框架允许基于时空相关性和图归纳偏差预测完全未观测位置的状态。FUNS作为仅在观测数据上优化模型的蓝图,并展示了在测试阶段预测完全未观测位置的状态的良好泛化能力。我们的框架可以与任何时空图神经网络相结合,通过使用网络的图结构来利用与周围观测位置的时空相关性。我们所采用的模型建立在之前的模型之上,也允许我们利用关于感兴趣位置的先验知识,例如道路类型。我们对模拟和现实世界数据集的经验评估表明,图神经网络非常适合这项任务。
{"title":"Forecasting Unobserved Node States with spatio-temporal Graph Neural Networks","authors":"Andreas Roth, T. Liebig","doi":"10.1109/ICDMW58026.2022.00101","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00101","url":null,"abstract":"Forecasting future states of sensors is key to solving tasks like weather prediction, route planning, and many others when dealing with networks of sensors. But complete spatial coverage of sensors is generally unavailable and would practically be infeasible due to limitations in budget and other resources during deployment and maintenance. Currently existing approaches using machine learning are limited to the spatial locations where data was observed, causing limitations to downstream tasks. Inspired by the recent surge of Graph Neural Networks for spatio-temporal data processing, we investigate whether these can also forecast the state of locations with no sensors available. For this purpose, we develop a framework, named Forecasting Unobserved Node States (FUNS), that allows forecasting the state at entirely unobserved locations based on spatio-temporal correlations and the graph inductive bias. FUNS serves as a blueprint for optimizing models only on observed data and demonstrates good generalization capabilities for predicting the state at entirely unobserved locations during the testing stage. Our framework can be combined with any spatio-temporal Graph Neural Network, that exploits spatio-temporal correlations with surrounding observed locations by using the network's graph structure. Our employed model builds on a previous model by also allowing us to exploit prior knowledge about locations of interest, e.g. the road type. Our empirical evaluation of both simulated and real-world datasets demonstrates that Graph Neural Networks are well-suited for this task.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130665518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
MultiAspectEmo: Multilingual and Language-Agnostic Aspect-Based Sentiment Analysis MultiAspectEmo:多语言和语言不可知论的基于方面的情感分析
Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00065
Joanna Szolomicka, Jan Kocoń
The paper addresses the important problem of multilingual and language-agnostic approaches to the aspect-based sentiment analysis (ABSA) task, using modern approaches based on transformer models. We propose a new dataset based on automatic translation of the Polish AspectEmo dataset together with cross-lingual transfer of tags describing aspect polarity. The result is a MultiAspectEmo dataset translated into five other languages: English, Czech, Spanish, French and Dutch. In this paper, we also present the original Tr Asp (Transformer-based Aspect Extraction and Classification) method, which is significantly better than methods from the literature in the ABSA task. In addition, we present multilingual and language-agnostic variants of this method, evaluated on the MultiAspectEmo and also the SemEval2016 datasets. We also test various language models for the ABSA task, including compressed models that give promising results while significantly reducing inference time and memory usage.
本文利用基于转换模型的现代方法,解决了基于方面的情感分析(ABSA)任务的多语言和语言不可知论方法的重要问题。我们提出了一个基于波兰语AspectEmo数据集的自动翻译和描述方面极性标签的跨语言迁移的新数据集。结果是将MultiAspectEmo数据集翻译成其他五种语言:英语、捷克语、西班牙语、法语和荷兰语。在本文中,我们还提出了原始的Tr Asp(基于transformer的Aspect Extraction and Classification)方法,该方法在ABSA任务中明显优于文献中的方法。此外,我们提出了该方法的多语言和语言不确定变体,并在MultiAspectEmo和SemEval2016数据集上进行了评估。我们还为ABSA任务测试了各种语言模型,包括压缩模型,这些模型提供了有希望的结果,同时显著减少了推理时间和内存使用。
{"title":"MultiAspectEmo: Multilingual and Language-Agnostic Aspect-Based Sentiment Analysis","authors":"Joanna Szolomicka, Jan Kocoń","doi":"10.1109/ICDMW58026.2022.00065","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00065","url":null,"abstract":"The paper addresses the important problem of multilingual and language-agnostic approaches to the aspect-based sentiment analysis (ABSA) task, using modern approaches based on transformer models. We propose a new dataset based on automatic translation of the Polish AspectEmo dataset together with cross-lingual transfer of tags describing aspect polarity. The result is a MultiAspectEmo dataset translated into five other languages: English, Czech, Spanish, French and Dutch. In this paper, we also present the original Tr Asp (Transformer-based Aspect Extraction and Classification) method, which is significantly better than methods from the literature in the ABSA task. In addition, we present multilingual and language-agnostic variants of this method, evaluated on the MultiAspectEmo and also the SemEval2016 datasets. We also test various language models for the ABSA task, including compressed models that give promising results while significantly reducing inference time and memory usage.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133207528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep Heterogeneous Graph Neural Networks via Similarity Regularization Loss and Hierarchical Fusion 基于相似性正则化损失和层次融合的深度异构图神经网络
Pub Date : 2022-11-01 DOI: 10.1109/ICDMW58026.2022.00104
Zhilong Xiong, Jia Cai
Recently, Graph Neural Networks (GNNs) have emerged as a promising and powerful method for tackling graph-structured data. However, most real-world graph-structured data contains distinct types of objects (nodes) and links (edges), which is called heterogeneous graph. The heterogeneity and rich semantic information indeed increase the difficulties in handling heterogeneous graph. Most of the current heterogeneous graph neural networks (HeteGNNs) can only build on a very shallow structure. This is caused by a phenomenon called semantic confusion, where the node embeddings become indistinguishable with the growth of model depth, leading to the degradation of the model performance. In this paper, we address this problem by proposing a similarity regularization loss and hierarchical fusion based heterogeneous graph neural networks (SHGNN). The hierarchical fusion strategy is utilized to fuse the features of the node embeddings at each layer, which can improve the expressive power of the model, and then a similarity regularization loss is introduced, by which the problem of indistinguishability among nodes can be alleviated. Our approach is flexible to combine various HeteGNNs effectively. Experimental results on real-world heterogeneous graph-structured data demonstrate the state-of-the-art performance of the proposed approach, which can efficiently mitigate the semantic confusion problem.
最近,图神经网络(gnn)作为处理图结构数据的一种有前途和强大的方法而出现。然而,大多数现实世界的图结构数据包含不同类型的对象(节点)和链接(边),这被称为异构图。异构性和丰富的语义信息确实增加了异构图处理的难度。目前大多数异构图神经网络(hetegnn)只能建立在一个非常浅的结构上。这是由一种称为语义混淆的现象引起的,其中节点嵌入随着模型深度的增长而变得无法区分,导致模型性能下降。本文提出了一种基于相似性正则化损失和层次融合的异构图神经网络(SHGNN)来解决这一问题。采用分层融合策略融合各层节点嵌入的特征,提高模型的表达能力;引入相似度正则化损失,缓解节点间不可分辨的问题。我们的方法是灵活的,可以有效地组合各种hetegnn。在实际异构图结构数据上的实验结果表明,该方法能够有效地缓解语义混淆问题。
{"title":"Deep Heterogeneous Graph Neural Networks via Similarity Regularization Loss and Hierarchical Fusion","authors":"Zhilong Xiong, Jia Cai","doi":"10.1109/ICDMW58026.2022.00104","DOIUrl":"https://doi.org/10.1109/ICDMW58026.2022.00104","url":null,"abstract":"Recently, Graph Neural Networks (GNNs) have emerged as a promising and powerful method for tackling graph-structured data. However, most real-world graph-structured data contains distinct types of objects (nodes) and links (edges), which is called heterogeneous graph. The heterogeneity and rich semantic information indeed increase the difficulties in handling heterogeneous graph. Most of the current heterogeneous graph neural networks (HeteGNNs) can only build on a very shallow structure. This is caused by a phenomenon called semantic confusion, where the node embeddings become indistinguishable with the growth of model depth, leading to the degradation of the model performance. In this paper, we address this problem by proposing a similarity regularization loss and hierarchical fusion based heterogeneous graph neural networks (SHGNN). The hierarchical fusion strategy is utilized to fuse the features of the node embeddings at each layer, which can improve the expressive power of the model, and then a similarity regularization loss is introduced, by which the problem of indistinguishability among nodes can be alleviated. Our approach is flexible to combine various HeteGNNs effectively. Experimental results on real-world heterogeneous graph-structured data demonstrate the state-of-the-art performance of the proposed approach, which can efficiently mitigate the semantic confusion problem.","PeriodicalId":146687,"journal":{"name":"2022 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134336843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 IEEE International Conference on Data Mining Workshops (ICDMW)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1