首页 > 最新文献

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence最新文献

英文 中文
Fast and Accurate Binary Neural Networks Based on Depth-Width Reshaping 基于深度-宽度重构的快速精确二值神经网络
Ping Xue, Yang Lu, Jingfei Chang, Xing Wei, Zhenchun Wei
Network binarization (i.e., binary neural networks, BNNs) can efficiently compress deep neural networks and accelerate model inference but cause severe accuracy degradation. Existing BNNs are mainly implemented based on the commonly used full-precision network backbones, and then the accuracy is improved with various techniques. However, there is a question of whether the full-precision network backbone is well adapted to BNNs. We start from the factors of the performance degradation of BNNs and analyze the problems of directly using full-precision network backbones for BNNs: for a given computational budget, the backbone of a BNN may need to be shallower and wider compared to the backbone of a full-precision network. With this in mind, Depth-Width Reshaping (DWR) is proposed to reshape the depth and width of existing full-precision network backbones and further optimize them by incorporating pruning techniques to better fit the BNNs. Extensive experiments demonstrate the analytical result and the effectiveness of the proposed method. Compared with the original backbones, the DWR backbones constructed by the proposed method result in close to O(√s) decrease in activations, while achieving an absolute accuracy increase by up to 1.7% with comparable computational cost. Besides, by using the DWR backbones, existing methods can achieve new state-of-the-art (SOTA) accuracy (e.g., 67.2% on ImageNet with ResNet-18 as the original backbone). We hope this work provides a novel insight into the backbone design of BNNs. The code is available at https://github.com/pingxue-hfut/DWR.
网络二值化(即二元神经网络,bnn)可以有效地压缩深度神经网络并加速模型推理,但会导致严重的精度下降。现有的bnn主要基于常用的全精度网络骨干网实现,然后通过各种技术提高精度。然而,全精度网络骨干网是否能很好地适应bnn是一个问题。我们从影响BNN性能下降的因素入手,分析了直接使用全精度网络骨干网的问题:对于给定的计算预算,与全精度网络骨干网相比,BNN的骨干网可能需要更浅、更宽。考虑到这一点,提出了深度-宽度重塑(DWR)来重塑现有的全精度网络骨干网的深度和宽度,并通过结合修剪技术进一步优化它们,以更好地适应bnn。大量的实验证明了分析结果和所提方法的有效性。与原始主干网相比,采用该方法构建的DWR主干网的激活次数减少了近0(√s),在计算成本相当的情况下,绝对精度提高了1.7%。此外,通过使用DWR主干,现有方法可以达到新的最先进的(SOTA)精度(例如,在使用ResNet-18作为原始主干的ImageNet上,精度为67.2%)。我们希望这项工作能为bnn的主干设计提供新的见解。代码可在https://github.com/pingxue-hfut/DWR上获得。
{"title":"Fast and Accurate Binary Neural Networks Based on Depth-Width Reshaping","authors":"Ping Xue, Yang Lu, Jingfei Chang, Xing Wei, Zhenchun Wei","doi":"10.1609/aaai.v37i9.26268","DOIUrl":"https://doi.org/10.1609/aaai.v37i9.26268","url":null,"abstract":"Network binarization (i.e., binary neural networks, BNNs) can efficiently compress deep neural networks and accelerate model inference but cause severe accuracy degradation. Existing BNNs are mainly implemented based on the commonly used full-precision network backbones, and then the accuracy is improved with various techniques. However, there is a question of whether the full-precision network backbone is well adapted to BNNs. We start from the factors of the performance degradation of BNNs and analyze the problems of directly using full-precision network backbones for BNNs: for a given computational budget, the backbone of a BNN may need to be shallower and wider compared to the backbone of a full-precision network. With this in mind, Depth-Width Reshaping (DWR) is proposed to reshape the depth and width of existing full-precision network backbones and further optimize them by incorporating pruning techniques to better fit the BNNs. Extensive experiments demonstrate the analytical result and the effectiveness of the proposed method. Compared with the original backbones, the DWR backbones constructed by the proposed method result in close to O(√s) decrease in activations, while achieving an absolute accuracy increase by up to 1.7% with comparable computational cost. Besides, by using the DWR backbones, existing methods can achieve new state-of-the-art (SOTA) accuracy (e.g., 67.2% on ImageNet with ResNet-18 as the original backbone). We hope this work provides a novel insight into the backbone design of BNNs. The code is available at https://github.com/pingxue-hfut/DWR.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"150 1","pages":"10684-10692"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77477351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Level Wavelet Mapping Correlation for Statistical Dependence Measurement: Methodology and Performance 统计相关性测量的多级小波映射关联:方法与性能
Yixin Ren, Hao Zhang, Yewei Xia, J. Guan, Shuigeng Zhou
We propose a new criterion for measuring dependence between two real variables, namely, Multi-level Wavelet Mapping Correlation (MWMC). MWMC can capture the nonlinear dependencies between variables by measuring their correlation under different levels of wavelet mappings. We show that the empirical estimate of MWMC converges exponentially to its population quantity. To support independence test better with MWMC, we further design a permutation test based on MWMC and prove that our test can not only control the type I error rate (the rate of false positives) well but also ensure that the type II error rate (the rate of false negatives) is upper bounded by O(1/n) (n is the sample size) with finite permutations. By extensive experiments on (conditional) independence tests and causal discovery, we show that our method outperforms existing independence test methods.
我们提出了一种新的度量两个实变量之间相关性的准则,即多级小波映射相关(MWMC)。MWMC可以通过测量变量在不同小波映射层次下的相关性来捕捉变量之间的非线性依赖关系。我们证明了MWMC的经验估计是指数收敛于其种群数量的。为了更好地支持MWMC的独立性检验,我们进一步设计了基于MWMC的排列检验,并证明了我们的检验不仅可以很好地控制I类错误率(假阳性率),而且可以保证在有限排列情况下II类错误率(假阴性率)的上限为O(1/n) (n为样本量)。通过对(条件)独立性测试和因果发现的大量实验,我们表明我们的方法优于现有的独立性测试方法。
{"title":"Multi-Level Wavelet Mapping Correlation for Statistical Dependence Measurement: Methodology and Performance","authors":"Yixin Ren, Hao Zhang, Yewei Xia, J. Guan, Shuigeng Zhou","doi":"10.1609/aaai.v37i5.25799","DOIUrl":"https://doi.org/10.1609/aaai.v37i5.25799","url":null,"abstract":"We propose a new criterion for measuring dependence between two real variables, namely, Multi-level Wavelet Mapping Correlation (MWMC). MWMC can capture the nonlinear dependencies between variables by measuring their correlation under different levels of wavelet mappings. We show that the empirical estimate of MWMC converges exponentially to its population quantity. To support independence test better with MWMC, we further design a permutation test based on MWMC and prove that our test can not only control the type I error rate (the rate of false positives) well but also ensure that the type II error rate (the rate of false negatives) is upper bounded by O(1/n) (n is the sample size) with finite permutations. By extensive experiments on (conditional) independence tests and causal discovery, we show that our method outperforms existing independence test methods.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"53 1","pages":"6499-6506"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77489627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Debiasing Intrinsic Bias and Application Bias Jointly via Invariant Risk Minimization (Student Abstract) 基于不变风险最小化的联合消除内在偏差和应用偏差(学生摘要)
Yuzhou Mao, Liu Yu, Yi Yang, Fan Zhou, Ting Zhong
Demographic biases and social stereotypes are common in pretrained language models (PLMs), while the fine-tuning in downstream applications can also produce new biases or amplify the impact of the original biases. Existing works separate the debiasing from the fine-tuning procedure, which results in a gap between intrinsic bias and application bias. In this work, we propose a debiasing framework CauDebias to eliminate both biases, which directly combines debiasing with fine-tuning and can be applied for any PLMs in downstream tasks. We distinguish the bias-relevant (non-causal factors) and label-relevant (causal factors) parts in sentences from a causal invariant perspective. Specifically, we perform intervention on non-causal factors in different demographic groups, and then devise an invariant risk minimization loss to trade-off performance between bias mitigation and task accuracy. Experimental results on three downstream tasks show that our CauDebias can remarkably reduce biases in PLMs while minimizing the impact on downstream tasks.
人口统计偏差和社会刻板印象在预训练语言模型(PLMs)中很常见,而下游应用程序的微调也可能产生新的偏差或放大原始偏差的影响。现有的工作将去除偏置与微调过程分开,这导致了内在偏置与应用偏置之间的差距。在这项工作中,我们提出了一个去除偏置的框架CauDebias来消除这两种偏置,它直接结合了去除偏置和微调,可以应用于下游任务中的任何plm。我们从因果不变的角度区分句子中的偏差相关部分(非因果因素)和标签相关部分(因果因素)。具体而言,我们对不同人口统计群体的非因果因素进行干预,然后设计一个不变的风险最小化损失,以权衡偏差缓解和任务准确性之间的性能。三个下游任务的实验结果表明,我们的CauDebias可以显著减少plm中的偏差,同时最大限度地减少对下游任务的影响。
{"title":"Debiasing Intrinsic Bias and Application Bias Jointly via Invariant Risk Minimization (Student Abstract)","authors":"Yuzhou Mao, Liu Yu, Yi Yang, Fan Zhou, Ting Zhong","doi":"10.1609/aaai.v37i13.27000","DOIUrl":"https://doi.org/10.1609/aaai.v37i13.27000","url":null,"abstract":"Demographic biases and social stereotypes are common in pretrained language models (PLMs), while the fine-tuning in downstream applications can also produce new biases or amplify the impact of the original biases. Existing works separate the debiasing from the fine-tuning procedure, which results in a gap between intrinsic bias and application bias. In this work, we propose a debiasing framework CauDebias to eliminate both biases, which directly combines debiasing with fine-tuning and can be applied for any PLMs in downstream tasks. We distinguish the bias-relevant (non-causal factors) and label-relevant (causal factors) parts in sentences from a causal invariant perspective. Specifically, we perform intervention on non-causal factors in different demographic groups, and then devise an invariant risk minimization loss to trade-off performance between bias mitigation and task accuracy. Experimental results on three downstream tasks show that our CauDebias can remarkably reduce biases in PLMs while minimizing the impact on downstream tasks.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"197 1","pages":"16280-16281"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79768635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Isometric Manifold Learning Using Hierarchical Flow 等量流形学习使用分层流
Ziqi Pan, Jianfu Zhang, Li Niu, Liqing Zhang
We propose the Hierarchical Flow (HF) model constrained by isometric regularizations for manifold learning that combines manifold learning goals such as dimensionality reduction, inference, sampling, projection and density estimation into one unified framework. Our proposed HF model is regularized to not only produce embeddings preserving the geometric structure of the manifold, but also project samples onto the manifold in a manner conforming to the rigorous definition of projection. Theoretical guarantees are provided for our HF model to satisfy the two desired properties. In order to detect the real dimensionality of the manifold, we also propose a two-stage dimensionality reduction algorithm, which is a time-efficient algorithm thanks to the hierarchical architecture design of our HF model. Experimental results justify our theoretical analysis, demonstrate the superiority of our dimensionality reduction algorithm in cost of training time, and verify the effect of the aforementioned properties in improving performances on downstream tasks such as anomaly detection.
本文提出了基于等距正则化约束的流形学习层次流模型,该模型将降维、推理、采样、投影和密度估计等流形学习目标整合到一个统一的框架中。我们提出的高频模型经过正则化,不仅产生了保留流形几何结构的嵌入,而且还以符合投影严格定义的方式将样本投影到流形上。理论保证了高频模型能够满足这两个期望的性质。为了检测流形的真实维数,我们还提出了一种两阶段降维算法,由于我们的高频模型的分层结构设计,这是一种省时的算法。实验结果验证了我们的理论分析,证明了我们的降维算法在训练时间成本上的优势,并验证了上述特性在提高下游任务(如异常检测)性能方面的效果。
{"title":"Isometric Manifold Learning Using Hierarchical Flow","authors":"Ziqi Pan, Jianfu Zhang, Li Niu, Liqing Zhang","doi":"10.1609/aaai.v37i8.26124","DOIUrl":"https://doi.org/10.1609/aaai.v37i8.26124","url":null,"abstract":"We propose the Hierarchical Flow (HF) model constrained by isometric regularizations for manifold learning that combines manifold learning goals such as dimensionality reduction, inference, sampling, projection and density estimation into one unified framework. Our proposed HF model is regularized to not only produce embeddings preserving the geometric structure of the manifold, but also project samples onto the manifold in a manner conforming to the rigorous definition of projection. Theoretical guarantees are provided for our HF model to satisfy the two desired properties. In order to detect the real dimensionality of the manifold, we also propose a two-stage dimensionality reduction algorithm, which is a time-efficient algorithm thanks to the hierarchical architecture design of our HF model. Experimental results justify our theoretical analysis, demonstrate the superiority of our dimensionality reduction algorithm in cost of training time, and verify the effect of the aforementioned properties in improving performances on downstream tasks such as anomaly detection.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"72 1","pages":"9381-9388"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80126068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting Exclusive Language during Pair Programming 在结对编程中检测排他语言
S. Ubani, Rodney D. Nielsen, Helen Li
Inclusive team participation is one of the most important factors that aids effective collaboration and pair programming. In this paper, we investigated the ability of linguistic features and a transformer-based language model to detect exclusive and inclusive language. The task of detecting exclusive language was approached as a text classification problem. We created a research community resource consisting of a dataset of 40,490 labeled utterances obtained from three programming assignments involving 34 students pair programming in a remote environment. This research involves the first successful automated detection of exclusive language during pair programming. Additionally, this is the first work to perform a computational linguistic analysis on the verbal interaction common in the context of inclusive and exclusive language during pair programming.
包容性的团队参与是帮助有效协作和结对编程的最重要因素之一。在本文中,我们研究了语言特征和基于转换器的语言模型来检测排他性和包容性语言的能力。将排他语言的检测作为文本分类问题进行研究。我们创建了一个研究社区资源,该资源由来自34名学生在远程环境中结对编程的三个编程作业的40,490个标记话语的数据集组成。本研究首次成功实现了对偶编程过程中排他性语言的自动检测。此外,这是第一个对结对编程中包容性和排他性语言背景下常见的口头交互进行计算语言学分析的工作。
{"title":"Detecting Exclusive Language during Pair Programming","authors":"S. Ubani, Rodney D. Nielsen, Helen Li","doi":"10.1609/aaai.v37i13.26895","DOIUrl":"https://doi.org/10.1609/aaai.v37i13.26895","url":null,"abstract":"Inclusive team participation is one of the most important factors that aids effective collaboration and pair programming. In this paper, we investigated the ability of linguistic features and a transformer-based language model to detect exclusive and inclusive language. The task of detecting exclusive language was approached as a text classification problem. We created a research community resource consisting of a dataset of 40,490 labeled utterances obtained from three programming assignments involving 34 students pair programming in a remote environment. This research involves the first successful automated detection of exclusive language during pair programming. Additionally, this is the first work to perform a computational linguistic analysis on the verbal interaction common in the context of inclusive and exclusive language during pair programming.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"103 1","pages":"15964-15971"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80378434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-Supervised Joint Dynamic Scene Reconstruction and Optical Flow Estimation for Spiking Camera 自监督联合动态场景重建和光流估计
Shiyan Chen, Zhaofei Yu, Tiejun Huang
Spiking camera, a novel retina-inspired vision sensor, has shown its great potential for capturing high-speed dynamic scenes with a sampling rate of 40,000 Hz. The spiking camera abandons the concept of exposure window, with each of its photosensitive units continuously capturing photons and firing spikes asynchronously. However, the special sampling mechanism prevents the frame-based algorithm from being used to spiking camera. It remains to be a challenge to reconstruct dynamic scenes and perform common computer vision tasks for spiking camera. In this paper, we propose a self-supervised joint learning framework for optical flow estimation and reconstruction of spiking camera. The framework reconstructs clean frame-based spiking representations in a self-supervised manner, and then uses them to train the optical flow networks. We also propose an optical flow based inverse rendering process to achieve self-supervision by minimizing the difference with respect to the original spiking temporal aggregation image. The experimental results demonstrate that our method bridges the gap between synthetic and real-world scenes and achieves desired results in real-world scenarios. To the best of our knowledge, this is the first attempt to jointly reconstruct dynamic scenes and estimate optical flow for spiking camera from a self-supervised learning perspective.
spike camera是一种新型的视网膜视觉传感器,它以40000 Hz的采样率显示出捕捉高速动态场景的巨大潜力。脉冲相机放弃了曝光窗口的概念,它的每个感光单元都连续捕获光子并异步发射脉冲。然而,由于其特殊的采样机制,使得基于帧的算法无法应用于尖峰摄像机。动态场景的重建和常见的计算机视觉任务对于脉冲摄像机来说仍然是一个挑战。本文提出了一种自监督联合学习框架,用于脉冲相机的光流估计和重建。该框架以自监督的方式重建干净的基于帧的尖峰表示,然后使用它们来训练光流网络。我们还提出了一种基于光流的反向渲染过程,通过最小化与原始尖峰时间聚合图像的差异来实现自我监督。实验结果表明,我们的方法弥补了合成场景和真实场景之间的差距,并在真实场景中达到了预期的效果。据我们所知,这是第一次尝试从自监督学习的角度来共同重建动态场景和光流估计。
{"title":"Self-Supervised Joint Dynamic Scene Reconstruction and Optical Flow Estimation for Spiking Camera","authors":"Shiyan Chen, Zhaofei Yu, Tiejun Huang","doi":"10.1609/aaai.v37i1.25108","DOIUrl":"https://doi.org/10.1609/aaai.v37i1.25108","url":null,"abstract":"Spiking camera, a novel retina-inspired vision sensor, has shown its great potential for capturing high-speed dynamic scenes with a sampling rate of 40,000 Hz. The spiking camera abandons the concept of exposure window, with each of its photosensitive units continuously capturing photons and firing spikes asynchronously. However, the special sampling mechanism prevents the frame-based algorithm from being used to spiking camera. It remains to be a challenge to reconstruct dynamic scenes and perform common computer vision tasks for spiking camera. In this paper, we propose a self-supervised joint learning framework for optical flow estimation and reconstruction of spiking camera. The framework reconstructs clean frame-based spiking representations in a self-supervised manner, and then uses them to train the optical flow networks. We also propose an optical flow based inverse rendering process to achieve self-supervision by minimizing the difference with respect to the original spiking temporal aggregation image. The experimental results demonstrate that our method bridges the gap between synthetic and real-world scenes and achieves desired results in real-world scenarios. To the best of our knowledge, this is the first attempt to jointly reconstruct dynamic scenes and estimate optical flow for spiking camera from a self-supervised learning perspective.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"125 1","pages":"350-358"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79254909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Improved Approximation Algorithm for Wage Determination and Online Task Allocation in Crowd-Sourcing 众包中工资确定与在线任务分配的改进逼近算法
Yuya Hikima, Yasunori Akagi, Hideaki Kim, Taichi Asami
Crowd-sourcing has attracted much attention due to its growing importance to society, and numerous studies have been conducted on task allocation and wage determination. Recent works have focused on optimizing task allocation and workers' wages, simultaneously. However, existing methods do not provide good solutions for real-world crowd-sourcing platforms due to the low approximation ratio or myopic problem settings. We tackle an optimization problem for wage determination and online task allocation in crowd-sourcing and propose a fast 1-1/(k+3)^(1/2)-approximation algorithm, where k is the minimum of tasks' budgets (numbers of possible assignments). This approximation ratio is greater than or equal to the existing method. The proposed method reduces the tackled problem to a non-convex multi-period continuous optimization problem by approximating the objective function. Then, the method transforms the reduced problem into a minimum convex cost flow problem, which is a well-known combinatorial optimization problem, and solves it by the capacity scaling algorithm. Synthetic experiments and simulation experiments using real crowd-sourcing data show that the proposed method solves the problem faster and outputs higher objective values than existing methods.
众包因其在社会中的重要性而备受关注,在任务分配和工资确定方面进行了大量研究。最近的研究主要集中在同时优化任务分配和工人工资。然而,现有的方法由于近似比低或问题设置短视,并不能很好地解决现实世界的众包平台。我们解决了众包中工资确定和在线任务分配的优化问题,并提出了一个快速的1-1/(k+3)^(1/2)-近似算法,其中k是任务预算的最小值(可能分配的数量)。该近似比大于或等于现有方法。该方法通过逼近目标函数,将所处理的问题简化为非凸多周期连续优化问题。然后,该方法将约简后的问题转化为一个著名的组合优化问题——最小凸代价流问题,并采用容量缩放算法求解。基于真实众包数据的综合实验和仿真实验表明,该方法比现有方法解决问题的速度更快,输出的目标值更高。
{"title":"An Improved Approximation Algorithm for Wage Determination and Online Task Allocation in Crowd-Sourcing","authors":"Yuya Hikima, Yasunori Akagi, Hideaki Kim, Taichi Asami","doi":"10.1609/aaai.v37i4.25512","DOIUrl":"https://doi.org/10.1609/aaai.v37i4.25512","url":null,"abstract":"Crowd-sourcing has attracted much attention due to its growing importance to society, and numerous studies have been conducted on task allocation and wage determination. Recent works have focused on optimizing task allocation and workers' wages, simultaneously. However, existing methods do not provide good solutions for real-world crowd-sourcing platforms due to the low approximation ratio or myopic problem settings. We tackle an optimization problem for wage determination and online task allocation in crowd-sourcing and propose a fast 1-1/(k+3)^(1/2)-approximation algorithm, where k is the minimum of tasks' budgets (numbers of possible assignments). This approximation ratio is greater than or equal to the existing method. The proposed method reduces the tackled problem to a non-convex multi-period continuous optimization problem by approximating the objective function. Then, the method transforms the reduced problem into a minimum convex cost flow problem, which is a well-known combinatorial optimization problem, and solves it by the capacity scaling algorithm. Synthetic experiments and simulation experiments using real crowd-sourcing data show that the proposed method solves the problem faster and outputs higher objective values than existing methods.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"11 1","pages":"3977-3986"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81501815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trustworthy Residual Vehicle Value Prediction for Auto Finance 汽车金融的可信赖剩余价值预测
Mi-hyung Kim, Jimyung Choi, Jaehyun Kim, Wooyoung Kim, Yeonung Baek, Gisuk Bang, Kwangwoon Son, Yeonman Ryou, Kee-Eung Kim
The residual value (RV) of a vehicle refers to its estimated worth at some point in the future. It is a core component in every auto financial product, used to determine the credit lines and the leasing rates. As such, an accurate prediction of RV is critical for the auto finance industry, since it can pose a risk of revenue loss by over-prediction or make the financial product incompetent by under-prediction. Although there are a number of prior studies on training machine learning models on a large amount of used car sales data, we had to cope with real-world operational requirements such as compliance with regulations (i.e. monotonicity of output with respect to a subset of features) and generalization to unseen input (i.e. new and rare car models). In this paper, we describe how we coped with these practical challenges and created value for our business at Hyundai Capital Services, the top auto financial service provider in Korea.
车辆的残值(RV)是指其在未来某一时刻的估计价值。它是每个汽车金融产品的核心组成部分,用于确定信贷额度和租赁利率。因此,准确的RV预测对于汽车金融行业至关重要,因为它可能因过度预测而造成收入损失或因预测不足而使金融产品无法胜任。尽管之前有很多关于在大量二手车销售数据上训练机器学习模型的研究,但我们必须应对现实世界的操作需求,例如遵守规则(即相对于特征子集的输出单调性)和对未见输入(即新车和稀有车型)的泛化。在本文中,我们描述了我们如何应对这些实际挑战,并为韩国顶级汽车金融服务提供商现代资本服务公司的业务创造价值。
{"title":"Trustworthy Residual Vehicle Value Prediction for Auto Finance","authors":"Mi-hyung Kim, Jimyung Choi, Jaehyun Kim, Wooyoung Kim, Yeonung Baek, Gisuk Bang, Kwangwoon Son, Yeonman Ryou, Kee-Eung Kim","doi":"10.1609/aaai.v37i13.26842","DOIUrl":"https://doi.org/10.1609/aaai.v37i13.26842","url":null,"abstract":"The residual value (RV) of a vehicle refers to its estimated worth at some point in the future. It is a core component in every auto financial product, used to determine the credit lines and the leasing rates. As such, an accurate prediction of RV is critical for the auto finance industry, since it can pose a risk of revenue loss by over-prediction or make the financial product incompetent by under-prediction. Although there are a number of prior studies on training machine learning models on a large amount of used car sales data, we had to cope with real-world operational requirements such as compliance with regulations (i.e. monotonicity of output with respect to a subset of features) and generalization to unseen input (i.e. new and rare car models). In this paper, we describe how we coped with these practical challenges and created value for our business at Hyundai Capital Services, the top auto financial service provider in Korea.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"19 1","pages":"15537-15544"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81703206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
C-NTPP: Learning Cluster-Aware Neural Temporal Point Process C-NTPP:学习簇感知神经时间点过程
Fangyu Ding, Junchi Yan, Haiyang Wang
Event sequences in continuous time space are ubiquitous across applications and have been intensively studied with both classic temporal point process (TPP) and its recent deep network variants. This work is motivated by an observation that many of event data exhibit inherent clustering patterns in terms of the sparse correlation among events, while such characteristics are seldom explicitly considered in existing neural TPP models whereby the history encoders are often embodied by RNNs or Transformers. In this work, we propose a c-NTPP (Cluster-Aware Neural Temporal Point Process) model, which leverages a sequential variational autoencoder framework to infer the latent cluster each event belongs to in the sequence. Specially, a novel event-clustered attention mechanism is devised to learn each cluster and then aggregate them together to obtain the final representation for each event. Extensive experiments show that c-NTPP achieves superior performance on both real-world and synthetic datasets, and it can also uncover the underlying clustering correlations.
连续时间空间中的事件序列在各种应用中普遍存在,并已被经典的时间点过程(TPP)及其最近的深度网络变体所深入研究。这项工作的动机是观察到许多事件数据在事件之间的稀疏相关性方面表现出固有的聚类模式,而这些特征在现有的神经TPP模型中很少被明确考虑,其中历史编码器通常由rnn或transformer体现。在这项工作中,我们提出了一个c-NTPP(聚类感知神经时间点过程)模型,该模型利用顺序变分自编码器框架来推断序列中每个事件所属的潜在聚类。特别地,设计了一种新的事件聚类注意机制来学习每个聚类,然后将它们聚集在一起以获得每个事件的最终表示。大量的实验表明,c-NTPP在真实世界和合成数据集上都取得了卓越的性能,并且它还可以揭示潜在的聚类相关性。
{"title":"C-NTPP: Learning Cluster-Aware Neural Temporal Point Process","authors":"Fangyu Ding, Junchi Yan, Haiyang Wang","doi":"10.1609/aaai.v37i6.25897","DOIUrl":"https://doi.org/10.1609/aaai.v37i6.25897","url":null,"abstract":"Event sequences in continuous time space are ubiquitous across applications and have been intensively studied with both classic temporal point process (TPP) and its recent deep network variants. This work is motivated by an observation that many of event data exhibit inherent clustering patterns in terms of the sparse correlation among events, while such characteristics are seldom explicitly considered in existing neural TPP models whereby the history encoders are often embodied by RNNs or Transformers. In this work, we propose a c-NTPP (Cluster-Aware Neural Temporal Point Process) model, which leverages a sequential variational autoencoder framework to infer the latent cluster each event belongs to in the sequence. Specially, a novel event-clustered attention mechanism is devised to learn each cluster and then aggregate them together to obtain the final representation for each event. Extensive experiments show that c-NTPP achieves superior performance on both real-world and synthetic datasets, and it can also uncover the underlying clustering correlations.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"1 1","pages":"7369-7377"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81897102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PEN: Prediction-Explanation Network to Forecast Stock Price Movement with Better Explainability PEN:预测-解释网络以更好的可解释性预测股价走势
Shuqi Li, Weiheng Liao, Yuhan Chen, Rui Yan
Nowadays explainability in stock price movement prediction is attracting increasing attention in banks, hedge funds and asset managers, primarily due to audit or regulatory reasons. Text data such as financial news and social media posts can be part of the reasons for stock price movement. To this end, we propose a novel framework of Prediction-Explanation Network (PEN) jointly modeling text streams and price streams with alignment. The key component of the PEN model is an shared representation learning module that learns which texts are possibly associated with the stock price movement by modeling the interaction between the text data and stock price data with a salient vector characterizing their correlation. In this way, the PEN model is able to predict the stock price movement by identifying and utilizing abundant messages while on the other hand, the selected text messages also explain the stock price movement. Experiments on real-world datasets demonstrate that we are able to kill two birds with one stone: in terms of accuracy, the proposed PEN model outperforms the state-of-art baseline; on explainability, the PEN model are demonstrated to be far superior to attention mechanism, capable of picking out the crucial texts with a very high confidence.
如今,由于审计或监管方面的原因,股价走势预测的可解释性越来越受到银行、对冲基金和资产管理公司的关注。金融新闻和社交媒体帖子等文本数据可能是股价波动的部分原因。为此,我们提出了一种新的预测-解释网络(PEN)框架,该框架联合对文本流和价格流进行对齐建模。PEN模型的关键组件是一个共享表示学习模块,该模块通过对文本数据和股票价格数据之间的交互建模,并使用表征它们相关性的显著向量来学习哪些文本可能与股票价格运动相关。这样,PEN模型可以通过识别和利用大量的短信来预测股价走势,而另一方面,所选择的短信也可以解释股价走势。在真实世界数据集上的实验表明,我们能够一石二鸟:就准确性而言,所提出的PEN模型优于最先进的基线;在可解释性上,PEN模型被证明远远优于注意机制,能够以非常高的置信度挑选出关键文本。
{"title":"PEN: Prediction-Explanation Network to Forecast Stock Price Movement with Better Explainability","authors":"Shuqi Li, Weiheng Liao, Yuhan Chen, Rui Yan","doi":"10.1609/aaai.v37i4.25648","DOIUrl":"https://doi.org/10.1609/aaai.v37i4.25648","url":null,"abstract":"Nowadays explainability in stock price movement prediction is attracting increasing attention in banks, hedge funds and asset managers, primarily due to audit or regulatory reasons. Text data such as financial news and social media posts can be part of the reasons for stock price movement. To this end, we propose a novel framework of Prediction-Explanation Network (PEN) jointly modeling text streams and price streams with alignment. The key component of the PEN model is an shared representation learning module that learns which texts are possibly associated with the stock price movement by modeling the interaction between the text data and stock price data with a salient vector characterizing their correlation. In this way, the PEN model is able to predict the stock price movement by identifying and utilizing abundant messages while on the other hand, the selected text messages also explain the stock price movement. Experiments on real-world datasets demonstrate that we are able to kill two birds with one stone: in terms of accuracy, the proposed PEN model outperforms the state-of-art baseline; on explainability, the PEN model are demonstrated to be far superior to attention mechanism, capable of picking out the crucial texts with a very high confidence.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"43 5 1","pages":"5187-5194"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84198529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1