首页 > 最新文献

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence最新文献

英文 中文
Self-Supervised Logic Induction for Explainable Fuzzy Temporal Commonsense Reasoning 可解释模糊时间常识推理的自监督逻辑归纳
Bibo Cai, Xiao Ding, Zhouhao Sun, Bing Qin, Ting Liu, Baojun Wang, Lifeng Shang
Understanding temporal commonsense concepts, such as times of occurrence and durations is crucial for event-centric language understanding. Reasoning about such temporal concepts in a complex context requires reasoning over both the stated context and the world knowledge that underlines it. A recent study shows massive pre-trained LM still struggle with such temporal reasoning under complex contexts (e.g., dialog) because they only implicitly encode the relevant contexts and fail to explicitly uncover the underlying logical compositions for complex inference, thus may not be robust enough. In this work, we propose to augment LMs with the temporal logic induction ability, which frames the temporal reasoning by defining three modular components: temporal dependency inducer and temporal concept defuzzifier and logic validator. The former two components disentangle the explicit/implicit dependency between temporal concepts across context (before, after, ...) and the specific meaning of fuzzy temporal concepts, respectively, while the validator combines the intermediate reasoning clues for robust contextual reasoning about the temporal concepts. Extensive experimental results on TIMEDIAL, a challenging dataset for temporal reasoning over dialog, show that our method, Logic Induction Enhanced Contextualized TEmporal Reasoning (LECTER), can yield great improvements over the traditional language model for temporal reasoning.
理解时间常识性概念,如发生时间和持续时间,对于以事件为中心的语言理解至关重要。在一个复杂的背景下对这种时间概念进行推理,需要对所陈述的背景和强调它的世界知识进行推理。最近的一项研究表明,大量预训练的LM仍然在复杂上下文(例如对话)下进行这种时间推理,因为它们只是隐式地对相关上下文进行编码,而不能显式地揭示复杂推理的底层逻辑组合,因此可能不够健壮。在这项工作中,我们提出用时间逻辑归纳能力来增强LMs,该能力通过定义三个模块组件来构建时间推理:时间依赖诱导器、时间概念解模糊器和逻辑验证器。前两个组件分别解开跨上下文(before, after,…)的时间概念之间的显式/隐式依赖关系和模糊时间概念的特定含义,而验证器则结合中间推理线索,对时间概念进行鲁棒上下文推理。大量的实验结果表明,我们的方法,逻辑归纳增强上下文时态推理(LECTER),可以比传统的语言模型在时间推理方面产生很大的改进。
{"title":"Self-Supervised Logic Induction for Explainable Fuzzy Temporal Commonsense Reasoning","authors":"Bibo Cai, Xiao Ding, Zhouhao Sun, Bing Qin, Ting Liu, Baojun Wang, Lifeng Shang","doi":"10.1609/aaai.v37i11.26481","DOIUrl":"https://doi.org/10.1609/aaai.v37i11.26481","url":null,"abstract":"Understanding temporal commonsense concepts, such as times of occurrence and durations is crucial for event-centric language understanding. Reasoning about such temporal concepts in a complex context requires reasoning over both the stated context and the world knowledge that underlines it. A recent study shows massive pre-trained LM still struggle with such temporal reasoning under complex contexts (e.g., dialog) because they only implicitly encode the relevant contexts and fail to explicitly uncover the underlying logical compositions for complex inference, thus may not be robust enough. In this work, we propose to augment LMs with the temporal logic induction ability, which frames the temporal reasoning by defining three modular components: temporal dependency inducer and temporal concept defuzzifier and logic validator. The former two components disentangle the explicit/implicit dependency between temporal concepts across context (before, after, ...) and the specific meaning of fuzzy temporal concepts, respectively, while the validator combines the intermediate reasoning clues for robust contextual reasoning about the temporal concepts. Extensive experimental results on TIMEDIAL, a challenging dataset for temporal reasoning over dialog, show that our method, Logic Induction Enhanced Contextualized TEmporal Reasoning (LECTER), can yield great improvements over the traditional language model for temporal reasoning.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"1 1","pages":"12580-12588"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77174303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Fair Incentive Scheme for Community Health Workers 社区卫生工作者公平奖励计划
Avinandan Bose, Tracey Li, Arunesh Sinha, Tien Mai
Community health workers (CHWs) play a crucial role inthe last mile delivery of essential health services to underservedpopulations in low-income countries. Many nongovernmentalorganizations (NGOs) provide training andsupport to enable CHWs to deliver health services to theircommunities, with no charge to the recipients of the services.This includes monetary compensation for the work thatCHWs perform, which is broken down into a series of well definedtasks. In this work, we partner with a NGO D-TreeInternational to design a fair monetary compensation schemefor tasks performed by CHWs in the semi-autonomous regionof Zanzibar in Tanzania, Africa. In consultation withstakeholders, we interpret fairness as the equal opportunityto earn, which means that each CHW has the opportunity toearn roughly the same total payment over a given T monthperiod, if the CHW reacts to the incentive scheme almost rationally.We model this problem as a reward design problemfor a Markov Decision Process (MDP) formulation for theCHWs’ earning. There is a need for the mechanism to besimple so that it is understood by the CHWs, thus, we explorelinear and piecewise linear rewards in the CHWs’ measuredunits of work. We solve this design problem via a novelpolicy-reward gradient result. Our experiments using two realworld parameters from the ground provide evidence of reasonableincentive output by our scheme.
社区卫生工作者在向低收入国家服务不足人群提供最后一英里基本卫生服务方面发挥着关键作用。许多非政府组织(ngo)提供培训和支持,使保健员能够向他们的社区提供保健服务,而不向接受服务的人收费。这包括对chw所做工作的金钱补偿,这些工作被分解为一系列明确定义的任务。在这项工作中,我们与非政府组织D-TreeInternational合作,为非洲坦桑尼亚桑给巴尔半自治地区的卫生工作者设计了一个公平的货币补偿方案。在与利益相关者协商后,我们将公平解释为平等的赚钱机会,这意味着每个CHW都有机会在给定的T个月内获得大致相同的总报酬,如果CHW对激励计划的反应几乎是理性的。我们将这一问题建模为一个针对医疗保健人员学习的马尔可夫决策过程(MDP)公式的奖励设计问题。有必要简化机制,以便chw能够理解,因此,我们在chw的测量工作单位中探索线性和分段线性奖励。我们通过一个新的策略-奖励梯度结果解决了这个设计问题。我们的实验使用了两个来自地面的真实世界参数,证明了我们的方案具有合理的激励输出。
{"title":"A Fair Incentive Scheme for Community Health Workers","authors":"Avinandan Bose, Tracey Li, Arunesh Sinha, Tien Mai","doi":"10.1609/aaai.v37i12.26653","DOIUrl":"https://doi.org/10.1609/aaai.v37i12.26653","url":null,"abstract":"Community health workers (CHWs) play a crucial role in\u0000the last mile delivery of essential health services to underserved\u0000populations in low-income countries. Many nongovernmental\u0000organizations (NGOs) provide training and\u0000support to enable CHWs to deliver health services to their\u0000communities, with no charge to the recipients of the services.\u0000This includes monetary compensation for the work that\u0000CHWs perform, which is broken down into a series of well defined\u0000tasks. In this work, we partner with a NGO D-Tree\u0000International to design a fair monetary compensation scheme\u0000for tasks performed by CHWs in the semi-autonomous region\u0000of Zanzibar in Tanzania, Africa. In consultation with\u0000stakeholders, we interpret fairness as the equal opportunity\u0000to earn, which means that each CHW has the opportunity to\u0000earn roughly the same total payment over a given T month\u0000period, if the CHW reacts to the incentive scheme almost rationally.\u0000We model this problem as a reward design problem\u0000for a Markov Decision Process (MDP) formulation for the\u0000CHWs’ earning. There is a need for the mechanism to be\u0000simple so that it is understood by the CHWs, thus, we explore\u0000linear and piecewise linear rewards in the CHWs’ measured\u0000units of work. We solve this design problem via a novel\u0000policy-reward gradient result. Our experiments using two real\u0000world parameters from the ground provide evidence of reasonable\u0000incentive output by our scheme.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"46 1","pages":"14127-14135"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77176073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Iteratively Enhanced Semidefinite Relaxations for Efficient Neural Network Verification 高效神经网络验证的迭代增强半定松弛
Jianglin Lan, Yang Zheng, A. Lomuscio
We propose an enhanced semidefinite program (SDP) relaxation to enable the tight and efficient verification of neural networks (NNs). The tightness improvement is achieved by introducing a nonlinear constraint to existing SDP relaxations previously proposed for NN verification. The efficiency of the proposal stems from the iterative nature of the proposed algorithm in that it solves the resulting non-convex SDP by recursively solving auxiliary convex layer-based SDP problems. We show formally that the solution generated by our algorithm is tighter than state-of-the-art SDP-based solutions for the problem. We also show that the solution sequence converges to the optimal solution of the non-convex enhanced SDP relaxation. The experimental results on standard benchmarks in the area show that our algorithm achieves the state-of-the-art performance whilst maintaining an acceptable computational cost.
我们提出了一种增强的半定程序(SDP)松弛,以实现神经网络(nn)的严密和有效验证。通过对先前提出的用于神经网络验证的现有SDP松弛引入非线性约束来提高紧密性。该建议的效率源于所提出算法的迭代性质,即通过递归地求解辅助的基于凸层的SDP问题来解决所得到的非凸SDP。我们正式表明,我们的算法生成的解决方案比最先进的基于sdp的解决方案更严格。我们还证明了解序列收敛于非凸增强SDP松弛的最优解。在该领域的标准基准上的实验结果表明,我们的算法在保持可接受的计算成本的同时达到了最先进的性能。
{"title":"Iteratively Enhanced Semidefinite Relaxations for Efficient Neural Network Verification","authors":"Jianglin Lan, Yang Zheng, A. Lomuscio","doi":"10.1609/aaai.v37i12.26744","DOIUrl":"https://doi.org/10.1609/aaai.v37i12.26744","url":null,"abstract":"We propose an enhanced semidefinite program (SDP) relaxation to enable the tight and efficient verification of neural networks (NNs). The tightness improvement is achieved by introducing a nonlinear constraint to existing SDP relaxations previously proposed for NN verification. The efficiency of the proposal stems from the iterative nature of the proposed algorithm in that it solves the resulting non-convex SDP by recursively solving auxiliary convex layer-based SDP problems. We show formally that the solution generated by our algorithm is tighter than state-of-the-art SDP-based solutions for the problem. We also show that the solution sequence converges to the optimal solution of the non-convex enhanced SDP relaxation. The experimental results on standard benchmarks in the area show that our algorithm achieves the state-of-the-art performance whilst maintaining an acceptable computational cost.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"29 1","pages":"14937-14945"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82169547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring Hypergraph of Earnings Call for Risk Prediction (Student Abstract) 探讨盈余呼叫超图的风险预测(学生摘要)
Yi He, Wenxin Tai, Fan Zhou, Yi Yang
In financial economics, studies have shown that the textual content in the earnings conference call transcript has predictive power for a firm's future risk. However, the conference call transcript is very long and contains diverse non-relevant content, which poses challenges for the text-based risk forecast. This study investigates the structural dependency within a conference call transcript by explicitly modeling the dialogue between managers and analysts. Specifically, we utilize TextRank to extract information and exploit the semantic correlation within a discussion using hypergraph learning. This novel design can improve the transcript representation performance and reduce the risk of forecast errors. Experimental results on a large-scale dataset show that our approach can significantly improve prediction performance compared to state-of-the-art text-based models.
在金融经济学中,研究表明,盈利电话会议记录中的文本内容对公司未来风险具有预测能力。然而,电话会议记录内容较长,不相关内容较多,这对基于文本的风险预测提出了挑战。本研究通过明确建模经理和分析师之间的对话来调查电话会议记录中的结构依赖性。具体来说,我们利用TextRank来提取信息,并利用超图学习来挖掘讨论中的语义相关性。这种新颖的设计可以提高文本表示性能,降低预测错误的风险。在大规模数据集上的实验结果表明,与最先进的基于文本的模型相比,我们的方法可以显著提高预测性能。
{"title":"Exploring Hypergraph of Earnings Call for Risk Prediction (Student Abstract)","authors":"Yi He, Wenxin Tai, Fan Zhou, Yi Yang","doi":"10.1609/aaai.v37i13.26973","DOIUrl":"https://doi.org/10.1609/aaai.v37i13.26973","url":null,"abstract":"In financial economics, studies have shown that the textual content in the earnings conference call transcript has predictive power for a firm's future risk. However, the conference call transcript is very long and contains diverse non-relevant content, which poses challenges for the text-based risk forecast. This study investigates the structural dependency within a conference call transcript by explicitly modeling the dialogue between managers and analysts. Specifically, we utilize TextRank to extract information and exploit the semantic correlation within a discussion using hypergraph learning. This novel design can improve the transcript representation performance and reduce the risk of forecast errors. Experimental results on a large-scale dataset show that our approach can significantly improve prediction performance compared to state-of-the-art text-based models.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"47 1","pages":"16226-16227"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82381685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical ConViT with Attention-Based Relational Reasoner for Visual Analogical Reasoning 基于注意的关系推理的层次卷积视觉类比推理
Wentao He, Jialu Zhang, Jianfeng Ren, Ruibin Bai, Xudong Jiang
Raven’s Progressive Matrices (RPMs) have been widely used to evaluate the visual reasoning ability of humans. To tackle the challenges of visual perception and logic reasoning on RPMs, we propose a Hierarchical ConViT with Attention-based Relational Reasoner (HCV-ARR). Traditional solution methods often apply relatively shallow convolution networks to visually perceive shape patterns in RPM images, which may not fully model the long-range dependencies of complex pattern combinations in RPMs. The proposed ConViT consists of a convolutional block to capture the low-level attributes of visual patterns, and a transformer block to capture the high-level image semantics such as pattern formations. Furthermore, the proposed hierarchical ConViT captures visual features from multiple receptive fields, where the shallow layers focus on the image fine details while the deeper layers focus on the image semantics. To better model the underlying reasoning rules embedded in RPM images, an Attention-based Relational Reasoner (ARR) is proposed to establish the underlying relations among images. The proposed ARR well exploits the hidden relations among question images through the developed element-wise attentive reasoner. Experimental results on three RPM datasets demonstrate that the proposed HCV-ARR achieves a significant performance gain compared with the state-of-the-art models. The source code is available at: https://github.com/wentaoheunnc/HCV-ARR.
Raven 's Progressive Matrices (rpm)被广泛用于评估人类的视觉推理能力。为了解决rpm的视觉感知和逻辑推理问题,我们提出了一种基于关注的关系推理器(HCV-ARR)的分层卷积神经网络。传统的解决方法通常使用相对浅的卷积网络来视觉感知RPM图像中的形状模式,这可能无法完全模拟RPM中复杂模式组合的长期依赖关系。所提出的ConViT由卷积块和转换块组成,卷积块用于捕获视觉模式的低级属性,转换块用于捕获高级图像语义,如模式形成。此外,提出的分层ConViT从多个接受域捕获视觉特征,其中浅层专注于图像的精细细节,而深层专注于图像的语义。为了更好地建模RPM图像中嵌入的底层推理规则,提出了一种基于注意力的关系推理器(ARR)来建立图像之间的底层关系。本文提出的ARR通过开发的元素关注推理器,很好地利用了问题图像之间的隐藏关系。在三个RPM数据集上的实验结果表明,与目前最先进的模型相比,所提出的HCV-ARR模型取得了显着的性能提升。源代码可从https://github.com/wentaoheunnc/HCV-ARR获得。
{"title":"Hierarchical ConViT with Attention-Based Relational Reasoner for Visual Analogical Reasoning","authors":"Wentao He, Jialu Zhang, Jianfeng Ren, Ruibin Bai, Xudong Jiang","doi":"10.1609/aaai.v37i1.25072","DOIUrl":"https://doi.org/10.1609/aaai.v37i1.25072","url":null,"abstract":"Raven’s Progressive Matrices (RPMs) have been widely used to evaluate the visual reasoning ability of humans. To tackle the challenges of visual perception and logic reasoning on RPMs, we propose a Hierarchical ConViT with Attention-based Relational Reasoner (HCV-ARR). Traditional solution methods often apply relatively shallow convolution networks to visually perceive shape patterns in RPM images, which may not fully model the long-range dependencies of complex pattern combinations in RPMs. The proposed ConViT consists of a convolutional block to capture the low-level attributes of visual patterns, and a transformer block to capture the high-level image semantics such as pattern formations. Furthermore, the proposed hierarchical ConViT captures visual features from multiple receptive fields, where the shallow layers focus on the image fine details while the deeper layers focus on the image semantics. To better model the underlying reasoning rules embedded in RPM images, an Attention-based Relational Reasoner (ARR) is proposed to establish the underlying relations among images. The proposed ARR well exploits the hidden relations among question images through the developed element-wise attentive reasoner. Experimental results on three RPM datasets demonstrate that the proposed HCV-ARR achieves a significant performance gain compared with the state-of-the-art models. The source code is available at: https://github.com/wentaoheunnc/HCV-ARR.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"15 1","pages":"22-30"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82427960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Securing Lifelines: Safe Delivery of Critical Services in Areas with Volatile Security Situation via a Stackelberg Game Approach 确保生命线:通过Stackelberg博弈方法在安全形势不稳定的地区安全提供关键服务
Tien Mai, Arunesh Sinha
Vaccine delivery in under-resourced locations with security risks is not just challenging but also life threatening. The COVID pandemic and the need to vaccinate added even more urgency to this issue. Motivated by this problem, we propose a general framework to set-up limited temporary (vaccination) centers that balance physical security and desired (vaccine) service coverage with limited resources. We set-up the problem as a Stackelberg game between the centers operator (defender) and an adversary, where the set of centers is not fixed a priori but is part of the decision output. This results in a mixed combinatorial and continuous optimization problem. As part of our scalable approximation solution, we provide a fundamental contribution by identifying general duality conditions of switching max and min when both discrete and continuous variables are involved. Via detailed experiments, we show that the solution proposed is scalable in practice.
在有安全风险的资源不足地区提供疫苗不仅具有挑战性,而且威胁生命。COVID大流行和接种疫苗的需求使这一问题更加紧迫。在这个问题的激励下,我们提出了一个建立有限临时(疫苗接种)中心的一般框架,以有限的资源平衡物理安全和期望的(疫苗)服务覆盖。我们将问题设置为中心运营者(防守者)和对手之间的Stackelberg博弈,其中中心的集合不是先验固定的,而是决策输出的一部分。这导致了一个混合组合和连续优化问题。作为可扩展近似解决方案的一部分,我们通过确定在涉及离散变量和连续变量时切换最大值和最小值的一般对偶条件提供了基本贡献。通过详细的实验,我们证明了所提出的解决方案在实践中具有可扩展性。
{"title":"Securing Lifelines: Safe Delivery of Critical Services in Areas with Volatile Security Situation via a Stackelberg Game Approach","authors":"Tien Mai, Arunesh Sinha","doi":"10.1609/aaai.v37i5.25720","DOIUrl":"https://doi.org/10.1609/aaai.v37i5.25720","url":null,"abstract":"Vaccine delivery in under-resourced locations with security risks is not just challenging but also life threatening. The COVID pandemic and the need to vaccinate added even more urgency to this issue. Motivated by this problem, we propose a general framework to set-up limited temporary (vaccination) centers that balance physical security and desired (vaccine) service coverage with limited resources. We set-up the problem as a Stackelberg game between the centers operator (defender) and an adversary, where the set of centers is not fixed a priori but is part of the decision output. This results in a mixed combinatorial and continuous optimization problem. As part of our scalable approximation solution, we provide a fundamental contribution by identifying general duality conditions of switching max and min when both discrete and continuous variables are involved. Via detailed experiments, we show that the solution proposed is scalable in practice.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"13 1","pages":"5805-5813"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81864405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Solving Math Word Problems concerning Systems of Equations with GPT-3 用GPT-3求解方程组数学应用题
M. Zong, Bhaskar Krishnamachari
Researchers have been interested in developing AI tools to help students learn various mathematical subjects. One challenging set of tasks for school students is learning to solve math word problems. We explore how recent advances in natural language processing, specifically the rise of powerful transformer based models, can be applied to help math learners with such problems. Concretely, we evaluate the use of GPT-3, a 1.75B parameter transformer model recently released by OpenAI, for three related challenges pertaining to math word problems corresponding to systems of two linear equations. The three challenges are classifying word problems, extracting equations from word problems, and generating word problems. For the first challenge, we define a set of problem classes and find that GPT-3 has generally very high accuracy in classifying word problems (80%-100%), for all but one of these classes. For the second challenge, we find the accuracy for extracting equations improves with number of examples provided to the model, ranging from an accuracy of 31% for zero-shot learning to about 69% using 3-shot learning, which is further improved to a high value of 80% with fine-tuning. For the third challenge, we find that GPT-3 is able to generate problems with accuracy ranging from 33% to 93%, depending on the problem type.
研究人员一直对开发人工智能工具来帮助学生学习各种数学科目感兴趣。对于学生来说,一组具有挑战性的任务是学习解决数学单词问题。我们探讨了自然语言处理的最新进展,特别是基于强大的变压器模型的兴起,如何应用于帮助数学学习者解决这些问题。具体而言,我们评估了GPT-3 (OpenAI最近发布的1.75B参数转换器模型)在三个相关挑战中的使用,这些挑战与两个线性方程组对应的数学单词问题有关。这三个挑战是对单词问题进行分类,从单词问题中提取方程,以及生成单词问题。对于第一个挑战,我们定义了一组问题类别,并发现GPT-3在分类单词问题方面通常具有非常高的准确率(80%-100%),除了这些类别中的一个。对于第二个挑战,我们发现提取方程的准确性随着提供给模型的样本数量的增加而提高,从零次学习的31%的准确率到使用3次学习的约69%的准确率,通过微调进一步提高到80%的高值。对于第三个挑战,我们发现GPT-3能够生成准确率在33%到93%之间的问题,具体取决于问题类型。
{"title":"Solving Math Word Problems concerning Systems of Equations with GPT-3","authors":"M. Zong, Bhaskar Krishnamachari","doi":"10.1609/aaai.v37i13.26896","DOIUrl":"https://doi.org/10.1609/aaai.v37i13.26896","url":null,"abstract":"Researchers have been interested in developing AI tools to help students learn various mathematical subjects. One challenging set of tasks for school students is learning to solve math word problems. We explore how recent advances in natural language processing, specifically the rise of powerful transformer based models, can be applied to help math learners with such problems. Concretely, we evaluate the use of GPT-3, a 1.75B parameter transformer model recently released by OpenAI, for three related challenges pertaining to math word problems corresponding to systems of two linear equations. The three challenges are classifying word problems, extracting equations from word problems, and generating word problems. For the first challenge, we define a set of problem classes and find that GPT-3 has generally very high accuracy in classifying word problems (80%-100%), for all but one of these classes. For the second challenge, we find the accuracy for extracting equations improves with number of examples provided to the model, ranging from an accuracy of 31% for zero-shot learning to about 69% using 3-shot learning, which is further improved to a high value of 80% with fine-tuning. For the third challenge, we find that GPT-3 is able to generate problems with accuracy ranging from 33% to 93%, depending on the problem type.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"2 1","pages":"15972-15979"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78654967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Separate but Equal: Equality in Belief Propagation for Single Cycle Graphs 分离但相等:单循环图的信念传播中的相等性
Erel Cohen, Omer Lev, R. Zivan
Belief propagation is a widely used incomplete optimization algorithm, whose main theoretical properties hold only under the assumptions that beliefs are not equal. Nevertheless, there is much evidence that equality between beliefs does occur. A method to overcome belief equality by using unary function-nodes is assumed to resolve the problem.We focus on Min-sum, the belief propagation version for solving constraint optimization problems. We prove that on a single cycle graph, belief equality can be avoided only when the algorithm converges to the optimal solution. In any other case, the unary function methods will not prevent equality, rendering some existing results in need of reassessment. We differentiate between belief equality, which includes equal beliefs in a single message, and assignment equality, that prevents a coherent selection of assignments to variables. We show the necessary and satisfying conditions for both.
信念传播是一种应用广泛的不完全优化算法,其主要理论性质仅在信念不相等的假设下成立。然而,有很多证据表明信仰之间的平等确实存在。提出了一种利用一元函数节点克服信念等式的方法来解决这一问题。我们专注于最小和,即解决约束优化问题的信念传播版本。证明了在单循环图上,只有当算法收敛到最优解时才能避免置信等式。在任何其他情况下,一元函数方法都不会阻止相等性,从而导致一些现有结果需要重新评估。我们区分了信念相等和赋值相等,前者包括单个消息中的相等信念,后者阻止了对变量赋值的连贯选择。我们给出了两者的必要条件和满足条件。
{"title":"Separate but Equal: Equality in Belief Propagation for Single Cycle Graphs","authors":"Erel Cohen, Omer Lev, R. Zivan","doi":"10.1609/aaai.v37i4.25506","DOIUrl":"https://doi.org/10.1609/aaai.v37i4.25506","url":null,"abstract":"Belief propagation is a widely used incomplete optimization algorithm, whose main theoretical properties hold only under the assumptions that beliefs are not equal. Nevertheless, there is much evidence that equality between beliefs does occur. A method to overcome belief equality by using unary function-nodes is assumed to resolve the problem.\u0000\u0000We focus on Min-sum, the belief propagation version for solving constraint optimization problems. We prove that on a single cycle graph, belief equality can be avoided only when the algorithm converges to the optimal solution. In any other case, the unary function methods will not prevent equality, rendering some existing results in need of reassessment. We differentiate between belief equality, which includes equal beliefs in a single message, and assignment equality, that prevents a coherent selection of assignments to variables. We show the necessary and satisfying conditions for both.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"1 1","pages":"3924-3931"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78871085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Learning Better Representations Using Auxiliary Knowledge 使用辅助知识学习更好的表征
Saed Rezayi
Representation Learning is the core of Machine Learning and Artificial Intelligence as it summarizes input data points into low dimensional vectors. This low dimensional vectors should be accurate portrayals of the input data, thus it is crucial to find the most effective and robust representation possible for given input as the performance of the ML task is dependent on the resulting representations. In this summary, we discuss an approach to augment representation learning which relies on external knowledge. We briefly describe the shortcoming of the existing techniques and describe how an auxiliary knowledge source could result in obtaining improved representations.
表征学习是机器学习和人工智能的核心,因为它将输入数据点总结为低维向量。这种低维向量应该是输入数据的准确描述,因此找到给定输入的最有效和最健壮的表示是至关重要的,因为ML任务的性能依赖于结果表示。在本总结中,我们讨论了一种依赖于外部知识的增强表征学习方法。我们简要地描述了现有技术的缺点,并描述了辅助知识来源如何能够获得改进的表示。
{"title":"Learning Better Representations Using Auxiliary Knowledge","authors":"Saed Rezayi","doi":"10.1609/aaai.v37i13.26927","DOIUrl":"https://doi.org/10.1609/aaai.v37i13.26927","url":null,"abstract":"Representation Learning is the core of Machine Learning and Artificial Intelligence as it summarizes input data points into low dimensional vectors. This low dimensional vectors should be accurate portrayals of the input data, thus it is crucial to find the most effective and robust representation possible for given input as the performance of the ML task is dependent on the resulting representations. In this summary, we discuss an approach to augment representation learning which relies on external knowledge. We briefly describe the shortcoming of the existing techniques and describe how an auxiliary knowledge source could result in obtaining improved representations.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"39 1","pages":"16133-16134"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76100014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Temporal-Ordered Representation for Spike Streams Based on Discrete Wavelet Transforms 基于离散小波变换的尖峰流时间有序表示学习
Jiyuan Zhang, Shanshan Jia, Zhaofei Yu, Tiejun Huang
Spike camera, a new type of neuromorphic visual sensor that imitates the sampling mechanism of the primate fovea, can capture photons and output 40000 Hz binary spike streams. Benefiting from the asynchronous sampling mechanism, the spike camera can record fast-moving objects and clear images can be recovered from the spike stream at any specified timestamps without motion blurring. Despite these, due to the dense time sequence information of the discrete spike stream, it is not easy to directly apply the existing algorithms of traditional cameras to the spike camera. Therefore, it is necessary and interesting to explore a universally effective representation of dense spike streams to better fit various network architectures. In this paper, we propose to mine temporal-robust features of spikes in time-frequency space with wavelet transforms. We present a novel Wavelet-Guided Spike Enhancing (WGSE) paradigm consisting of three consecutive steps: multi-level wavelet transform, CNN-based learnable module, and inverse wavelet transform. With the assistance of WGSE, the new streaming representation of spikes can be learned. We demonstrate the effectiveness of WGSE on two downstream tasks, achieving state-of-the-art performance on the image reconstruction task and getting considerable performance on semantic segmentation. Furthermore, We build a new spike-based synthesized dataset for semantic segmentation. Code and Datasets are available at https://github.com/Leozhangjiyuan/WGSE-SpikeCamera.
Spike camera是一种模仿灵长类动物中央凹采样机制的新型神经形态视觉传感器,可以捕获光子并输出40000 Hz的二进制Spike流。得益于异步采样机制,该相机可以记录快速运动的物体,并且可以在任何指定的时间戳从spike流中恢复清晰的图像,而不会产生运动模糊。尽管如此,由于离散尖峰流的时间序列信息密集,传统相机的现有算法不容易直接应用于尖峰相机。因此,探索密集尖峰流的普遍有效表示以更好地适应各种网络架构是必要和有趣的。本文提出用小波变换在时频空间中挖掘尖峰信号的时间鲁棒性特征。我们提出了一种新的小波引导尖峰增强(WGSE)范式,该范式由三个连续的步骤组成:多级小波变换、基于cnn的可学习模块和逆小波变换。在WGSE的帮助下,可以学习新的峰值流表示。我们展示了WGSE在两个下游任务上的有效性,在图像重建任务上取得了最先进的性能,在语义分割任务上取得了可观的性能。此外,我们建立了一个新的基于峰值的合成数据集用于语义分割。代码和数据集可在https://github.com/Leozhangjiyuan/WGSE-SpikeCamera上获得。
{"title":"Learning Temporal-Ordered Representation for Spike Streams Based on Discrete Wavelet Transforms","authors":"Jiyuan Zhang, Shanshan Jia, Zhaofei Yu, Tiejun Huang","doi":"10.1609/aaai.v37i1.25085","DOIUrl":"https://doi.org/10.1609/aaai.v37i1.25085","url":null,"abstract":"Spike camera, a new type of neuromorphic visual sensor that imitates the sampling mechanism of the primate fovea, can capture photons and output 40000 Hz binary spike streams. Benefiting from the asynchronous sampling mechanism, the spike camera can record fast-moving objects and clear images can be recovered from the spike stream at any specified timestamps without motion blurring. Despite these, due to the dense time sequence information of the discrete spike stream, it is not easy to directly apply the existing algorithms of traditional cameras to the spike camera. Therefore, it is necessary and interesting to explore a universally effective representation of dense spike streams to better fit various network architectures. In this paper, we propose to mine temporal-robust features of spikes in time-frequency space with wavelet transforms. We present a novel Wavelet-Guided Spike Enhancing (WGSE) paradigm consisting of three consecutive steps: multi-level wavelet transform, CNN-based learnable module, and inverse wavelet transform. With the assistance of WGSE, the new streaming representation of spikes can be learned. We demonstrate the effectiveness of WGSE on two downstream tasks, achieving state-of-the-art performance on the image reconstruction task and getting considerable performance on semantic segmentation. Furthermore, We build a new spike-based synthesized dataset for semantic segmentation. Code and Datasets are available at https://github.com/Leozhangjiyuan/WGSE-SpikeCamera.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"3 1","pages":"137-147"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76182204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1