首页 > 最新文献

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)最新文献

英文 中文
Do You Know My Emotion? Emotion-Aware Strategy Recognition towards a Persuasive Dialogue System 你知道我的情绪吗?说服性对话系统的情绪感知策略识别
Wei Peng, Yue Hu, Luxi Xing, Yuqiang Xie, Yajing Sun
Persuasive strategy recognition task requires the system to recognize the adopted strategy of the persuader according to the conversation. However, previous methods mainly focus on the contextual information, little is known about incorporating the psychological feedback, i.e. emotion of the persuadee, to predict the strategy. In this paper, we propose a Cross-channel Feedback memOry Network (CFO-Net) to leverage the emotional feedback to iteratively measure the potential benefits of strategies and incorporate them into the contextual-aware dialogue information. Specifically, CFO-Net designs a feedback memory module, including strategy pool and feedback pool, to obtain emotion-aware strategy representation. The strategy pool aims to store historical strategies and the feedback pool is to obtain updated strategy weight based on feedback emotional information. Furthermore, a cross-channel fusion predictor is developed to make a mutual interaction between the emotion-aware strategy representation and the contextual-aware dialogue information for strategy recognition. Experimental results on textsc{PersuasionForGood} confirm that the proposed model CFO-Net is effective to improve the performance on M-F1 from 61.74 to 65.41.
说服策略识别任务要求系统根据对话来识别说服者所采用的策略。然而,以往的方法主要集中在语境信息上,很少有人知道如何结合心理反馈,即被说服者的情绪来预测策略。在本文中,我们提出了一个跨渠道反馈记忆网络(CFO-Net)来利用情绪反馈迭代测量策略的潜在利益,并将其纳入上下文感知对话信息。具体而言,CFO-Net设计了一个反馈记忆模块,包括策略池和反馈池,以获得情绪感知的策略表示。策略池用于存储历史策略,反馈池用于根据反馈的情绪信息获取更新的策略权重。在此基础上,提出了一种跨通道融合预测器,在情感感知策略表示和上下文感知对话信息之间进行交互,实现策略识别。在textsc{PersuasionForGood}上的实验结果证实了所提出的模型CFO-Net可以有效地将M-F1的性能从61.74提高到65.41。
{"title":"Do You Know My Emotion? Emotion-Aware Strategy Recognition towards a Persuasive Dialogue System","authors":"Wei Peng, Yue Hu, Luxi Xing, Yuqiang Xie, Yajing Sun","doi":"10.48550/arXiv.2206.12101","DOIUrl":"https://doi.org/10.48550/arXiv.2206.12101","url":null,"abstract":"Persuasive strategy recognition task requires the system to recognize the adopted strategy of the persuader according to the conversation. However, previous methods mainly focus on the contextual information, little is known about incorporating the psychological feedback, i.e. emotion of the persuadee, to predict the strategy. In this paper, we propose a Cross-channel Feedback memOry Network (CFO-Net) to leverage the emotional feedback to iteratively measure the potential benefits of strategies and incorporate them into the contextual-aware dialogue information. Specifically, CFO-Net designs a feedback memory module, including strategy pool and feedback pool, to obtain emotion-aware strategy representation. The strategy pool aims to store historical strategies and the feedback pool is to obtain updated strategy weight based on feedback emotional information. Furthermore, a cross-channel fusion predictor is developed to make a mutual interaction between the emotion-aware strategy representation and the contextual-aware dialogue information for strategy recognition. Experimental results on textsc{PersuasionForGood} confirm that the proposed model CFO-Net is effective to improve the performance on M-F1 from 61.74 to 65.41.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84034788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Inferring Tie Strength in Temporal Networks 推断时间网络中的纽带强度
Lutz Oettershagen, A. Konstantinidis, G. Italiano
Inferring tie strengths in social networks is an essential task in social network analysis. Common approaches classify the ties as weak and strong ties based on the strong triadic closure (STC). The STC states that if for three nodes, $A$, $B$, and $C$, there are strong ties between $A$ and $B$, as well as $A$ and $C$, there has to be a (weak or strong) tie between $B$ and $C$. So far, most works discuss the STC in static networks. However, modern large-scale social networks are usually highly dynamic, providing user contacts and communications as streams of edge updates. Temporal networks capture these dynamics. To apply the STC to temporal networks, we first generalize the STC and introduce a weighted version such that empirical a priori knowledge given in the form of edge weights is respected by the STC. The weighted STC is hard to compute, and our main contribution is an efficient 2-approximative streaming algorithm for the weighted STC in temporal networks. As a technical contribution, we introduce a fully dynamic 2-approximation for the minimum weight vertex cover problem, which is a crucial component of our streaming algorithm. Our evaluation shows that the weighted STC leads to solutions that capture the a priori knowledge given by the edge weights better than the non-weighted STC. Moreover, we show that our streaming algorithm efficiently approximates the weighted STC in large-scale social networks.
推断社会网络中的联系强度是社会网络分析中的一项重要任务。常用的方法是根据强三元闭包(STC)将连接分为弱连接和强连接。STC指出,如果对于$A$, $B$和$C$三个节点,$A$和$B$之间以及$A$和$C$之间存在强联系,则$B$和$C$之间必须存在(弱或强)联系。到目前为止,大多数研究都是讨论静态网络中的STC。然而,现代大型社交网络通常是高度动态的,以边缘更新流的形式提供用户联系和通信。时间网络捕捉到了这些动态。为了将STC应用于时间网络,我们首先对STC进行了推广,并引入了一个加权版本,使得STC尊重以边缘权重形式给出的经验先验知识。加权STC很难计算,我们的主要贡献是为时间网络中的加权STC提供了一种高效的2逼近流算法。作为技术上的贡献,我们为最小权重顶点覆盖问题引入了一个完全动态的2逼近,这是我们的流算法的关键组成部分。我们的评估表明,加权STC导致的解决方案比非加权STC更好地捕获由边缘权重给出的先验知识。此外,我们证明了我们的流算法有效地近似于大规模社交网络中的加权STC。
{"title":"Inferring Tie Strength in Temporal Networks","authors":"Lutz Oettershagen, A. Konstantinidis, G. Italiano","doi":"10.48550/arXiv.2206.11705","DOIUrl":"https://doi.org/10.48550/arXiv.2206.11705","url":null,"abstract":"Inferring tie strengths in social networks is an essential task in social network analysis. Common approaches classify the ties as weak and strong ties based on the strong triadic closure (STC). The STC states that if for three nodes, $A$, $B$, and $C$, there are strong ties between $A$ and $B$, as well as $A$ and $C$, there has to be a (weak or strong) tie between $B$ and $C$. So far, most works discuss the STC in static networks. However, modern large-scale social networks are usually highly dynamic, providing user contacts and communications as streams of edge updates. Temporal networks capture these dynamics. To apply the STC to temporal networks, we first generalize the STC and introduce a weighted version such that empirical a priori knowledge given in the form of edge weights is respected by the STC. The weighted STC is hard to compute, and our main contribution is an efficient 2-approximative streaming algorithm for the weighted STC in temporal networks. As a technical contribution, we introduce a fully dynamic 2-approximation for the minimum weight vertex cover problem, which is a crucial component of our streaming algorithm. Our evaluation shows that the weighted STC leads to solutions that capture the a priori knowledge given by the edge weights better than the non-weighted STC. Moreover, we show that our streaming algorithm efficiently approximates the weighted STC in large-scale social networks.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89457404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Waypoint Generation in Row-based Crops with Deep Learning and Contrastive Clustering 基于深度学习和对比聚类的行基作物路径点生成
Francesco Salvetti, Simone Angarano, Mauro Martini, Simone Cerrato, M. Chiaberge
{"title":"Waypoint Generation in Row-based Crops with Deep Learning and Contrastive Clustering","authors":"Francesco Salvetti, Simone Angarano, Mauro Martini, Simone Cerrato, M. Chiaberge","doi":"10.1007/978-3-031-26422-1_13","DOIUrl":"https://doi.org/10.1007/978-3-031-26422-1_13","url":null,"abstract":"","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75210822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Automated Cancer Subtyping via Vector Quantization Mutual Information Maximization 基于矢量量化互信息最大化的自动化癌症亚型分型
Zheng Chen, Lingwei Zhu, Ziwei Yang, Takashi Matsubara
Cancer subtyping is crucial for understanding the nature of tumors and providing suitable therapy. However, existing labelling methods are medically controversial, and have driven the process of subtyping away from teaching signals. Moreover, cancer genetic expression profiles are high-dimensional, scarce, and have complicated dependence, thereby posing a serious challenge to existing subtyping models for outputting sensible clustering. In this study, we propose a novel clustering method for exploiting genetic expression profiles and distinguishing subtypes in an unsupervised manner. The proposed method adaptively learns categorical correspondence from latent representations of expression profiles to the subtypes output by the model. By maximizing the problem -- agnostic mutual information between input expression profiles and output subtypes, our method can automatically decide a suitable number of subtypes. Through experiments, we demonstrate that our proposed method can refine existing controversial labels, and, by further medical analysis, this refinement is proven to have a high correlation with cancer survival rates.
癌症亚型对于了解肿瘤的性质和提供合适的治疗至关重要。然而,现有的标记方法在医学上是有争议的,并且已经使分型过程远离了教学信号。此外,癌症基因表达谱具有高维、稀缺和复杂的依赖性,这对现有的亚型模型输出合理聚类提出了严峻的挑战。在这项研究中,我们提出了一种新的聚类方法,以无监督的方式利用基因表达谱和区分亚型。该方法自适应地学习从表达谱的潜在表示到模型输出的子类型的分类对应关系。通过最大化输入表达式配置文件和输出子类型之间与问题无关的互信息,我们的方法可以自动确定适当数量的子类型。通过实验,我们证明了我们提出的方法可以改进现有的有争议的标签,并且通过进一步的医学分析,这种改进被证明与癌症存活率有很高的相关性。
{"title":"Automated Cancer Subtyping via Vector Quantization Mutual Information Maximization","authors":"Zheng Chen, Lingwei Zhu, Ziwei Yang, Takashi Matsubara","doi":"10.48550/arXiv.2206.10801","DOIUrl":"https://doi.org/10.48550/arXiv.2206.10801","url":null,"abstract":"Cancer subtyping is crucial for understanding the nature of tumors and providing suitable therapy. However, existing labelling methods are medically controversial, and have driven the process of subtyping away from teaching signals. Moreover, cancer genetic expression profiles are high-dimensional, scarce, and have complicated dependence, thereby posing a serious challenge to existing subtyping models for outputting sensible clustering. In this study, we propose a novel clustering method for exploiting genetic expression profiles and distinguishing subtypes in an unsupervised manner. The proposed method adaptively learns categorical correspondence from latent representations of expression profiles to the subtypes output by the model. By maximizing the problem -- agnostic mutual information between input expression profiles and output subtypes, our method can automatically decide a suitable number of subtypes. Through experiments, we demonstrate that our proposed method can refine existing controversial labels, and, by further medical analysis, this refinement is proven to have a high correlation with cancer survival rates.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90519808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Route to Time and Time to Route: Travel Time Estimation from Sparse Trajectories 路线到时间和时间到路线:从稀疏轨迹估计旅行时间
Zhiwen Zhang, Hongjun Wang, Z. Fan, Jiyuan Chen, Xuan Song, R. Shibasaki
Due to the rapid development of Internet of Things (IoT) technologies, many online web apps (e.g., Google Map and Uber) estimate the travel time of trajectory data collected by mobile devices. However, in reality, complex factors, such as network communication and energy constraints, make multiple trajectories collected at a low sampling rate. In this case, this paper aims to resolve the problem of travel time estimation (TTE) and route recovery in sparse scenarios, which often leads to the uncertain label of travel time and route between continuously sampled GPS points. We formulate this problem as an inexact supervision problem in which the training data has coarsely grained labels and jointly solve the tasks of TTE and route recovery. And we argue that both two tasks are complementary to each other in the model-learning procedure and hold such a relation: more precise travel time can lead to better inference for routes, in turn, resulting in a more accurate time estimation). Based on this assumption, we propose an EM algorithm to alternatively estimate the travel time of inferred route through weak supervision in E step and retrieve the route based on estimated travel time in M step for sparse trajectories. We conducted experiments on three real-world trajectory datasets and demonstrated the effectiveness of the proposed method.
由于物联网(IoT)技术的快速发展,许多在线web应用程序(如谷歌地图和优步)估计移动设备收集的轨迹数据的旅行时间。但在现实中,由于网络通信、能量约束等复杂因素,使得采集到的多轨迹采样率较低。在这种情况下,本文旨在解决稀疏场景下旅行时间估计(TTE)和路线恢复问题,该问题经常导致连续采样GPS点之间的旅行时间和路线标记不确定。我们将此问题表述为训练数据具有粗粒度标签的非精确监督问题,共同解决TTE和路由恢复的任务。我们认为这两个任务在模型学习过程中是相互补充的,并且保持这样的关系:更精确的旅行时间可以导致更好的路线推断,反过来,导致更准确的时间估计)。基于这一假设,我们提出了一种EM算法,交替地在E步中通过弱监督估计推断路线的行程时间,并在M步中根据估计的行程时间检索稀疏轨迹。我们在三个真实的轨迹数据集上进行了实验,并证明了所提出方法的有效性。
{"title":"Route to Time and Time to Route: Travel Time Estimation from Sparse Trajectories","authors":"Zhiwen Zhang, Hongjun Wang, Z. Fan, Jiyuan Chen, Xuan Song, R. Shibasaki","doi":"10.48550/arXiv.2206.10418","DOIUrl":"https://doi.org/10.48550/arXiv.2206.10418","url":null,"abstract":"Due to the rapid development of Internet of Things (IoT) technologies, many online web apps (e.g., Google Map and Uber) estimate the travel time of trajectory data collected by mobile devices. However, in reality, complex factors, such as network communication and energy constraints, make multiple trajectories collected at a low sampling rate. In this case, this paper aims to resolve the problem of travel time estimation (TTE) and route recovery in sparse scenarios, which often leads to the uncertain label of travel time and route between continuously sampled GPS points. We formulate this problem as an inexact supervision problem in which the training data has coarsely grained labels and jointly solve the tasks of TTE and route recovery. And we argue that both two tasks are complementary to each other in the model-learning procedure and hold such a relation: more precise travel time can lead to better inference for routes, in turn, resulting in a more accurate time estimation). Based on this assumption, we propose an EM algorithm to alternatively estimate the travel time of inferred route through weak supervision in E step and retrieve the route based on estimated travel time in M step for sparse trajectories. We conducted experiments on three real-world trajectory datasets and demonstrated the effectiveness of the proposed method.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89298187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
R2-AD2: Detecting Anomalies by Analysing the Raw Gradient R2-AD2:通过分析原始梯度来检测异常
Jan-Philipp Schulze, Philip Sperl, Ana Ruaductoiu, Carla Sagebiel, Konstantin Bottinger
{"title":"R2-AD2: Detecting Anomalies by Analysing the Raw Gradient","authors":"Jan-Philipp Schulze, Philip Sperl, Ana Ruaductoiu, Carla Sagebiel, Konstantin Bottinger","doi":"10.1007/978-3-031-26387-3_13","DOIUrl":"https://doi.org/10.1007/978-3-031-26387-3_13","url":null,"abstract":"","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81118275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
TrafficFlowGAN: Physics-informed Flow based Generative Adversarial Network for Uncertainty Quantification 基于物理信息流的不确定性量化生成对抗网络
Zhaobin Mo, Yongjie Fu, Daran Xu, Xuan Di
This paper proposes the TrafficFlowGAN, a physics-informed flow based generative adversarial network (GAN), for uncertainty quantification (UQ) of dynamical systems. TrafficFlowGAN adopts a normalizing flow model as the generator to explicitly estimate the data likelihood. This flow model is trained to maximize the data likelihood and to generate synthetic data that can fool a convolutional discriminator. We further regularize this training process using prior physics information, so-called physics-informed deep learning (PIDL). To the best of our knowledge, we are the first to propose an integration of flow, GAN and PIDL for the UQ problems. We take the traffic state estimation (TSE), which aims to estimate the traffic variables (e.g. traffic density and velocity) using partially observed data, as an example to demonstrate the performance of our proposed model. We conduct numerical experiments where the proposed model is applied to learn the solutions of stochastic differential equations. The results demonstrate the robustness and accuracy of the proposed model, together with the ability to learn a machine learning surrogate model. We also test it on a real-world dataset, the Next Generation SIMulation (NGSIM), to show that the proposed TrafficFlowGAN can outperform the baselines, including the pure flow model, the physics-informed flow model, and the flow based GAN model.
本文提出了一种基于物理信息流的生成对抗网络(GAN),用于动态系统的不确定性量化(UQ)。TrafficFlowGAN采用归一化流模型作为生成器来显式估计数据的似然。该流模型被训练为最大化数据的似然性,并生成可以欺骗卷积鉴别器的合成数据。我们使用先前的物理信息进一步规范这个训练过程,即所谓的物理信息深度学习(PIDL)。据我们所知,我们是第一个提出流,GAN和PIDL集成UQ问题的人。我们以交通状态估计(TSE)为例来证明我们提出的模型的性能,该模型旨在使用部分观测数据来估计交通变量(如交通密度和速度)。我们进行了数值实验,将所提出的模型应用于学习随机微分方程的解。结果证明了所提出模型的鲁棒性和准确性,以及学习机器学习代理模型的能力。我们还在真实世界的数据集下一代模拟(NGSIM)上对其进行了测试,以表明所提出的TrafficFlowGAN可以优于基线,包括纯流模型、物理信息流模型和基于流的GAN模型。
{"title":"TrafficFlowGAN: Physics-informed Flow based Generative Adversarial Network for Uncertainty Quantification","authors":"Zhaobin Mo, Yongjie Fu, Daran Xu, Xuan Di","doi":"10.48550/arXiv.2206.09319","DOIUrl":"https://doi.org/10.48550/arXiv.2206.09319","url":null,"abstract":"This paper proposes the TrafficFlowGAN, a physics-informed flow based generative adversarial network (GAN), for uncertainty quantification (UQ) of dynamical systems. TrafficFlowGAN adopts a normalizing flow model as the generator to explicitly estimate the data likelihood. This flow model is trained to maximize the data likelihood and to generate synthetic data that can fool a convolutional discriminator. We further regularize this training process using prior physics information, so-called physics-informed deep learning (PIDL). To the best of our knowledge, we are the first to propose an integration of flow, GAN and PIDL for the UQ problems. We take the traffic state estimation (TSE), which aims to estimate the traffic variables (e.g. traffic density and velocity) using partially observed data, as an example to demonstrate the performance of our proposed model. We conduct numerical experiments where the proposed model is applied to learn the solutions of stochastic differential equations. The results demonstrate the robustness and accuracy of the proposed model, together with the ability to learn a machine learning surrogate model. We also test it on a real-world dataset, the Next Generation SIMulation (NGSIM), to show that the proposed TrafficFlowGAN can outperform the baselines, including the pure flow model, the physics-informed flow model, and the flow based GAN model.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74639085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
SAViR-T: Spatially Attentive Visual Reasoning with Transformers SAViR-T:变形金刚的空间专注视觉推理
Pritish Sahu, Kalliopi Basioti, V. Pavlovic
We present a novel computational model,"SAViR-T", for the family of visual reasoning problems embodied in the Raven's Progressive Matrices (RPM). Our model considers explicit spatial semantics of visual elements within each image in the puzzle, encoded as spatio-visual tokens, and learns the intra-image as well as the inter-image token dependencies, highly relevant for the visual reasoning task. Token-wise relationship, modeled through a transformer-based SAViR-T architecture, extract group (row or column) driven representations by leveraging the group-rule coherence and use this as the inductive bias to extract the underlying rule representations in the top two row (or column) per token in the RPM. We use this relation representations to locate the correct choice image that completes the last row or column for the RPM. Extensive experiments across both synthetic RPM benchmarks, including RAVEN, I-RAVEN, RAVEN-FAIR, and PGM, and the natural image-based"V-PROM"demonstrate that SAViR-T sets a new state-of-the-art for visual reasoning, exceeding prior models' performance by a considerable margin.
我们提出了一种新的计算模型,“SAViR-T”,用于体现在Raven渐进矩阵(RPM)中的视觉推理问题家族。我们的模型考虑了拼图中每个图像中视觉元素的显式空间语义,编码为空间视觉标记,并学习图像内部和图像间的标记依赖关系,这与视觉推理任务高度相关。标记智能关系,通过基于转换器的SAViR-T体系结构建模,通过利用组规则一致性提取组(行或列)驱动的表示,并将其用作归纳偏差,以提取RPM中每个标记的前两行(或列)中的底层规则表示。我们使用这种关系表示来定位完成RPM的最后一行或最后一列的正确选择映像。在包括RAVEN, I-RAVEN, RAVEN- fair和PGM在内的两种合成RPM基准以及基于自然图像的“V-PROM”上进行的广泛实验表明,SAViR-T为视觉推理设置了新的最先进的技术,大大超过了先前模型的性能。
{"title":"SAViR-T: Spatially Attentive Visual Reasoning with Transformers","authors":"Pritish Sahu, Kalliopi Basioti, V. Pavlovic","doi":"10.48550/arXiv.2206.09265","DOIUrl":"https://doi.org/10.48550/arXiv.2206.09265","url":null,"abstract":"We present a novel computational model,\"SAViR-T\", for the family of visual reasoning problems embodied in the Raven's Progressive Matrices (RPM). Our model considers explicit spatial semantics of visual elements within each image in the puzzle, encoded as spatio-visual tokens, and learns the intra-image as well as the inter-image token dependencies, highly relevant for the visual reasoning task. Token-wise relationship, modeled through a transformer-based SAViR-T architecture, extract group (row or column) driven representations by leveraging the group-rule coherence and use this as the inductive bias to extract the underlying rule representations in the top two row (or column) per token in the RPM. We use this relation representations to locate the correct choice image that completes the last row or column for the RPM. Extensive experiments across both synthetic RPM benchmarks, including RAVEN, I-RAVEN, RAVEN-FAIR, and PGM, and the natural image-based\"V-PROM\"demonstrate that SAViR-T sets a new state-of-the-art for visual reasoning, exceeding prior models' performance by a considerable margin.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81683000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Learning to Teach Fairness-aware Deep Multi-task Learning 学习教授公平意识的深度多任务学习
Arjun Roy, Eirini Ntoutsi
Fairness-aware learning mainly focuses on single task learning (STL). The fairness implications of multi-task learning (MTL) have only recently been considered and a seminal approach has been proposed that considers the fairness-accuracy trade-off for each task and the performance trade-off among different tasks. Instead of a rigid fairness-accuracy trade-off formulation, we propose a flexible approach that learns how to be fair in a MTL setting by selecting which objective (accuracy or fairness) to optimize at each step. We introduce the L2T-FMT algorithm that is a teacher-student network trained collaboratively; the student learns to solve the fair MTL problem while the teacher instructs the student to learn from either accuracy or fairness, depending on what is harder to learn for each task. Moreover, this dynamic selection of which objective to use at each step for each task reduces the number of trade-off weights from 2T to T, where T is the number of tasks. Our experiments on three real datasets show that L2T-FMT improves on both fairness (12-19%) and accuracy (up to 2%) over state-of-the-art approaches.
公平感知学习主要集中在单任务学习(STL)。多任务学习(MTL)的公平性影响直到最近才被考虑,并提出了一种开创性的方法,该方法考虑了每个任务的公平性-准确性权衡以及不同任务之间的性能权衡。我们提出了一种灵活的方法,通过选择在每个步骤优化哪个目标(准确性或公平性)来学习如何在MTL设置中保持公平,而不是严格的公平性-准确性权衡公式。我们介绍了L2T-FMT算法,它是一个师生协作训练的网络;学生学习解决公平的MTL问题,而老师指导学生学习准确性或公平性,这取决于每个任务更难学的东西。此外,在每个任务的每个步骤中使用哪个目标的动态选择将权衡权重的数量从2T减少到T,其中T是任务的数量。我们在三个真实数据集上的实验表明,与最先进的方法相比,L2T-FMT在公平性(12-19%)和准确性(高达2%)方面都有所提高。
{"title":"Learning to Teach Fairness-aware Deep Multi-task Learning","authors":"Arjun Roy, Eirini Ntoutsi","doi":"10.48550/arXiv.2206.08403","DOIUrl":"https://doi.org/10.48550/arXiv.2206.08403","url":null,"abstract":"Fairness-aware learning mainly focuses on single task learning (STL). The fairness implications of multi-task learning (MTL) have only recently been considered and a seminal approach has been proposed that considers the fairness-accuracy trade-off for each task and the performance trade-off among different tasks. Instead of a rigid fairness-accuracy trade-off formulation, we propose a flexible approach that learns how to be fair in a MTL setting by selecting which objective (accuracy or fairness) to optimize at each step. We introduce the L2T-FMT algorithm that is a teacher-student network trained collaboratively; the student learns to solve the fair MTL problem while the teacher instructs the student to learn from either accuracy or fairness, depending on what is harder to learn for each task. Moreover, this dynamic selection of which objective to use at each step for each task reduces the number of trade-off weights from 2T to T, where T is the number of tasks. Our experiments on three real datasets show that L2T-FMT improves on both fairness (12-19%) and accuracy (up to 2%) over state-of-the-art approaches.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88623908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
'John ate 5 apples' != 'John ate some apples': Self-Supervised Paraphrase Quality Detection for Algebraic Word Problems “约翰吃了5个苹果”=“约翰吃了一些苹果”:代数词问题的自我监督释义质量检测
Rishabh Gupta, V. Venktesh, M. Mohania, Vikram Goyal
This paper introduces the novel task of scoring paraphrases for Algebraic Word Problems (AWP) and presents a self-supervised method for doing so. In the current online pedagogical setting, paraphrasing these problems is helpful for academicians to generate multiple syntactically diverse questions for assessments. It also helps induce variation to ensure that the student has understood the problem instead of just memorizing it or using unfair means to solve it. The current state-of-the-art paraphrase generation models often cannot effectively paraphrase word problems, losing a critical piece of information (such as numbers or units) which renders the question unsolvable. There is a need for paraphrase scoring methods in the context of AWP to enable the training of good paraphrasers. Thus, we propose ParaQD, a self-supervised paraphrase quality detection method using novel data augmentations that can learn latent representations to separate a high-quality paraphrase of an algebraic question from a poor one by a wide margin. Through extensive experimentation, we demonstrate that our method outperforms existing state-of-the-art self-supervised methods by up to 32% while also demonstrating impressive zero-shot performance.
本文介绍了代数词问题(AWP)释义评分的新任务,并提出了一种自监督的评分方法。在当前的在线教学环境中,对这些问题进行解释有助于学者产生多个语法不同的问题进行评估。它还有助于诱导变化,以确保学生已经理解了问题,而不是仅仅记住它或使用不公平的方法来解决它。当前最先进的释义生成模型通常不能有效地释义单词问题,丢失关键信息(如数字或单位),从而使问题无法解决。在AWP的背景下,有必要使用释义评分方法来训练优秀的释义者。因此,我们提出了ParaQD,一种使用新颖数据增强的自监督释义质量检测方法,该方法可以学习潜在表征,从而将代数问题的高质量释义从差的释义中分离出来。通过大量的实验,我们证明了我们的方法比现有的最先进的自监督方法高出32%,同时也展示了令人印象深刻的零射击性能。
{"title":"'John ate 5 apples' != 'John ate some apples': Self-Supervised Paraphrase Quality Detection for Algebraic Word Problems","authors":"Rishabh Gupta, V. Venktesh, M. Mohania, Vikram Goyal","doi":"10.48550/arXiv.2206.08263","DOIUrl":"https://doi.org/10.48550/arXiv.2206.08263","url":null,"abstract":"This paper introduces the novel task of scoring paraphrases for Algebraic Word Problems (AWP) and presents a self-supervised method for doing so. In the current online pedagogical setting, paraphrasing these problems is helpful for academicians to generate multiple syntactically diverse questions for assessments. It also helps induce variation to ensure that the student has understood the problem instead of just memorizing it or using unfair means to solve it. The current state-of-the-art paraphrase generation models often cannot effectively paraphrase word problems, losing a critical piece of information (such as numbers or units) which renders the question unsolvable. There is a need for paraphrase scoring methods in the context of AWP to enable the training of good paraphrasers. Thus, we propose ParaQD, a self-supervised paraphrase quality detection method using novel data augmentations that can learn latent representations to separate a high-quality paraphrase of an algebraic question from a poor one by a wide margin. Through extensive experimentation, we demonstrate that our method outperforms existing state-of-the-art self-supervised methods by up to 32% while also demonstrating impressive zero-shot performance.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89243493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1