首页 > 最新文献

arXiv - CS - Artificial Intelligence最新文献

英文 中文
Abductive explanations of classifiers under constraints: Complexity and properties 约束条件下分类器的归纳解释:复杂性与特性
Pub Date : 2024-09-18 DOI: arxiv-2409.12154
Martin Cooper, Leila Amgoud
Abductive explanations (AXp's) are widely used for understanding decisions ofclassifiers. Existing definitions are suitable when features are independent.However, we show that ignoring constraints when they exist between features maylead to an explosion in the number of redundant or superfluous AXp's. Wepropose three new types of explanations that take into account constraints andthat can be generated from the whole feature space or from a sample (such as adataset). They are based on a key notion of coverage of an explanation, the setof instances it explains. We show that coverage is powerful enough to discardredundant and superfluous AXp's. For each type, we analyse the complexity offinding an explanation and investigate its formal properties. The final resultis a catalogue of different forms of AXp's with different complexities anddifferent formal guarantees.
归纳式解释(AXp's)被广泛用于理解分类器的决策。然而,我们的研究表明,如果忽略了特征之间存在的约束条件,可能会导致冗余或多余的 AXp 数量激增。我们提出了三种考虑到约束条件的新型解释,它们可以从整个特征空间或样本(如数据集)中生成。它们基于解释的覆盖范围这一关键概念,即解释的实例集。我们证明,覆盖率足以摒弃多余和多余的 AXp。对于每种类型,我们都分析了寻找解释的复杂性,并研究了其形式属性。最终的结果是一个具有不同复杂性和不同形式保证的不同形式的 AXp 目录。
{"title":"Abductive explanations of classifiers under constraints: Complexity and properties","authors":"Martin Cooper, Leila Amgoud","doi":"arxiv-2409.12154","DOIUrl":"https://doi.org/arxiv-2409.12154","url":null,"abstract":"Abductive explanations (AXp's) are widely used for understanding decisions of\u0000classifiers. Existing definitions are suitable when features are independent.\u0000However, we show that ignoring constraints when they exist between features may\u0000lead to an explosion in the number of redundant or superfluous AXp's. We\u0000propose three new types of explanations that take into account constraints and\u0000that can be generated from the whole feature space or from a sample (such as a\u0000dataset). They are based on a key notion of coverage of an explanation, the set\u0000of instances it explains. We show that coverage is powerful enough to discard\u0000redundant and superfluous AXp's. For each type, we analyse the complexity of\u0000finding an explanation and investigate its formal properties. The final result\u0000is a catalogue of different forms of AXp's with different complexities and\u0000different formal guarantees.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142252650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach 利用证据权重 (WoE) 实现可解释的目标识别:以人为本的方法
Pub Date : 2024-09-18 DOI: arxiv-2409.11675
Abeer Alshehri, Amal Abdulrahman, Hajar Alamri, Tim Miller, Mor Vered
Goal recognition (GR) involves inferring an agent's unobserved goal from asequence of observations. This is a critical problem in AI with diverseapplications. Traditionally, GR has been addressed using 'inference to the bestexplanation' or abduction, where hypotheses about the agent's goals aregenerated as the most plausible explanations for observed behavior.Alternatively, some approaches enhance interpretability by ensuring that anagent's behavior aligns with an observer's expectations or by making thereasoning behind decisions more transparent. In this work, we tackle adifferent challenge: explaining the GR process in a way that is comprehensibleto humans. We introduce and evaluate an explainable model for goal recognition(GR) agents, grounded in the theoretical framework and cognitive processesunderlying human behavior explanation. Drawing on insights from two human-agentstudies, we propose a conceptual framework for human-centered explanations ofGR. Using this framework, we develop the eXplainable Goal Recognition (XGR)model, which generates explanations for both why and why not questions. Weevaluate the model computationally across eight GR benchmarks and through threeuser studies. The first study assesses the efficiency of generating human-likeexplanations within the Sokoban game domain, the second examines perceivedexplainability in the same domain, and the third evaluates the model'seffectiveness in aiding decision-making in illegal fishing detection. Resultsdemonstrate that the XGR model significantly enhances user understanding,trust, and decision-making compared to baseline models, underscoring itspotential to improve human-agent collaboration.
目标识别(GR)涉及从一系列观察结果中推断出一个代理的未观察到的目标。这是人工智能中的一个关键问题,有着多种多样的应用。传统上,人们使用 "最佳解释推理 "或归纳法来解决目标识别问题,在这种方法中,关于代理目标的假设被生成为对观察到的行为最合理的解释。在这项工作中,我们要解决一个不同的挑战:以人类能够理解的方式解释 GR 过程。我们以人类行为解释的理论框架和认知过程为基础,介绍并评估了目标识别(GR)代理的可解释模型。借鉴两项人类代理研究的见解,我们提出了一个以人为中心解释目标识别代理的概念框架。利用这个框架,我们开发了可解释目标识别(XGR)模型,该模型可以对为什么和为什么不的问题进行解释。我们通过八项 GR 基准和三项用户研究对该模型进行了计算评估。第一项研究评估了在推箱子游戏领域生成类人解释的效率,第二项研究考察了同一领域的感知可解释性,第三项研究评估了该模型在非法捕鱼检测中辅助决策的有效性。结果表明,与基线模型相比,XGR 模型大大增强了用户的理解力、信任度和决策能力,突出了该模型在改善人机协作方面的潜力。
{"title":"Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach","authors":"Abeer Alshehri, Amal Abdulrahman, Hajar Alamri, Tim Miller, Mor Vered","doi":"arxiv-2409.11675","DOIUrl":"https://doi.org/arxiv-2409.11675","url":null,"abstract":"Goal recognition (GR) involves inferring an agent's unobserved goal from a\u0000sequence of observations. This is a critical problem in AI with diverse\u0000applications. Traditionally, GR has been addressed using 'inference to the best\u0000explanation' or abduction, where hypotheses about the agent's goals are\u0000generated as the most plausible explanations for observed behavior.\u0000Alternatively, some approaches enhance interpretability by ensuring that an\u0000agent's behavior aligns with an observer's expectations or by making the\u0000reasoning behind decisions more transparent. In this work, we tackle a\u0000different challenge: explaining the GR process in a way that is comprehensible\u0000to humans. We introduce and evaluate an explainable model for goal recognition\u0000(GR) agents, grounded in the theoretical framework and cognitive processes\u0000underlying human behavior explanation. Drawing on insights from two human-agent\u0000studies, we propose a conceptual framework for human-centered explanations of\u0000GR. Using this framework, we develop the eXplainable Goal Recognition (XGR)\u0000model, which generates explanations for both why and why not questions. We\u0000evaluate the model computationally across eight GR benchmarks and through three\u0000user studies. The first study assesses the efficiency of generating human-like\u0000explanations within the Sokoban game domain, the second examines perceived\u0000explainability in the same domain, and the third evaluates the model's\u0000effectiveness in aiding decision-making in illegal fishing detection. Results\u0000demonstrate that the XGR model significantly enhances user understanding,\u0000trust, and decision-making compared to baseline models, underscoring its\u0000potential to improve human-agent collaboration.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142252652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Metric Hybrid Planning Approach to Solving Pandemic Planning Problems with Simple SIR Models 用简单 SIR 模型解决大流行规划问题的度量混合规划方法
Pub Date : 2024-09-18 DOI: arxiv-2409.11631
Ari Gestetner, Buser Say
A pandemic is the spread of a disease across large regions, and can havedevastating costs to the society in terms of health, economic and social. Assuch, the study of effective pandemic mitigation strategies can yieldsignificant positive impact on the society. A pandemic can be mathematicallydescribed using a compartmental model, such as the Susceptible Infected Removed(SIR) model. In this paper, we extend the solution equations of the SIR modelto a state transition model with lockdowns. We formalize a metric hybridplanning problem based on this state transition model, and solve it using ametric hybrid planner. We improve the runtime effectiveness of the metrichybrid planner with the addition of valid inequalities, and demonstrate thesuccess of our approach both theoretically and experimentally under variouschallenging settings.
大流行病是指一种疾病在大范围地区的传播,会给社会带来健康、经济和社会方面的巨大损失。因此,研究有效的大流行缓解策略会对社会产生重大的积极影响。大流行可以用分区模型来进行数学描述,如 "易感感染清除(SIR)"模型。在本文中,我们将 SIR 模型的求解方程扩展到带有锁定的状态转换模型。我们基于该状态转换模型形式化了一个度量混合规划问题,并使用度量混合规划器求解了该问题。通过添加有效不等式,我们提高了度量混合规划器的运行效率,并在各种挑战性设置下从理论和实验两方面证明了我们的方法是成功的。
{"title":"A Metric Hybrid Planning Approach to Solving Pandemic Planning Problems with Simple SIR Models","authors":"Ari Gestetner, Buser Say","doi":"arxiv-2409.11631","DOIUrl":"https://doi.org/arxiv-2409.11631","url":null,"abstract":"A pandemic is the spread of a disease across large regions, and can have\u0000devastating costs to the society in terms of health, economic and social. As\u0000such, the study of effective pandemic mitigation strategies can yield\u0000significant positive impact on the society. A pandemic can be mathematically\u0000described using a compartmental model, such as the Susceptible Infected Removed\u0000(SIR) model. In this paper, we extend the solution equations of the SIR model\u0000to a state transition model with lockdowns. We formalize a metric hybrid\u0000planning problem based on this state transition model, and solve it using a\u0000metric hybrid planner. We improve the runtime effectiveness of the metric\u0000hybrid planner with the addition of valid inequalities, and demonstrate the\u0000success of our approach both theoretically and experimentally under various\u0000challenging settings.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explaining Non-monotonic Normative Reasoning using Argumentation Theory with Deontic Logic 用论证理论与 Deontic Logic 解释非单调规范推理
Pub Date : 2024-09-18 DOI: arxiv-2409.11780
Zhe Yu, Yiwei Lu
In our previous research, we provided a reasoning system (called LeSAC) basedon argumentation theory to provide legal support to designers during the designprocess. Building on this, this paper explores how to provide designers witheffective explanations for their legally relevant design decisions. We extendthe previous system for providing explanations by specifying norms and the keylegal or ethical principles for justifying actions in normative contexts.Considering that first-order logic has strong expressive power, in the currentpaper we adopt a first-order deontic logic system with deontic operators andpreferences. We illustrate the advantages and necessity of introducing deonticlogic and designing explanations under LeSAC by modelling two cases in thecontext of autonomous driving. In particular, this paper also discusses therequirements of the updated LeSAC to guarantee rationality, and proves that awell-defined LeSAC can satisfy the rationality postulate for rule-basedargumentation frameworks. This ensures the system's ability to providecoherent, legally valid explanations for complex design decisions.
在我们之前的研究中,我们提供了一个基于论证理论的推理系统(称为 LeSAC),在设计过程中为设计师提供法律支持。在此基础上,本文探讨了如何为设计者提供与法律相关的设计决策的有效解释。考虑到一阶逻辑具有很强的表达能力,本文采用了带有deontic操作符和偏好的一阶deontic逻辑系统。考虑到一阶逻辑具有很强的表达能力,我们在本文中采用了带有deontic算子和偏好的一阶deontic逻辑系统。我们通过模拟自动驾驶背景下的两个案例,说明了在LeSAC下引入deontic逻辑和设计解释的优势和必要性。本文还特别讨论了更新后的 LeSAC 在保证合理性方面的要求,并证明了定义明确的 LeSAC 可以满足基于规则的论证框架的合理性假设。这确保了该系统能够为复杂的设计决策提供连贯、合法的解释。
{"title":"Explaining Non-monotonic Normative Reasoning using Argumentation Theory with Deontic Logic","authors":"Zhe Yu, Yiwei Lu","doi":"arxiv-2409.11780","DOIUrl":"https://doi.org/arxiv-2409.11780","url":null,"abstract":"In our previous research, we provided a reasoning system (called LeSAC) based\u0000on argumentation theory to provide legal support to designers during the design\u0000process. Building on this, this paper explores how to provide designers with\u0000effective explanations for their legally relevant design decisions. We extend\u0000the previous system for providing explanations by specifying norms and the key\u0000legal or ethical principles for justifying actions in normative contexts.\u0000Considering that first-order logic has strong expressive power, in the current\u0000paper we adopt a first-order deontic logic system with deontic operators and\u0000preferences. We illustrate the advantages and necessity of introducing deontic\u0000logic and designing explanations under LeSAC by modelling two cases in the\u0000context of autonomous driving. In particular, this paper also discusses the\u0000requirements of the updated LeSAC to guarantee rationality, and proves that a\u0000well-defined LeSAC can satisfy the rationality postulate for rule-based\u0000argumentation frameworks. This ensures the system's ability to provide\u0000coherent, legally valid explanations for complex design decisions.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142252651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning and Theory Ladenness -- A Phenomenological Account 机器学习与理论娴熟性 -- 一种现象学解释
Pub Date : 2024-09-17 DOI: arxiv-2409.11277
Alberto Termine, Emanuele Ratti, Alessandro Facchini
In recent years, the dissemination of machine learning (ML) methodologies inscientific research has prompted discussions on theory ladenness. Morespecifically, the issue of theory ladenness has remerged as questions aboutwhether and how ML models (MLMs) and ML modelling strategies are impacted bythe domain theory of the scientific field in which ML is used and implemented(e.g., physics, chemistry, biology, etc). On the one hand, some have arguedthat there is no difference between traditional (pre ML) and ML assistedscience. In both cases, theory plays an essential and unavoidable role in theanalysis of phenomena and the construction and use of models. Others haveargued instead that ML methodologies and models are theory independent and, insome cases, even theory free. In this article, we argue that both positions areoverly simplistic and do not advance our understanding of the interplay betweenML methods and domain theories. Specifically, we provide an analysis of theoryladenness in ML assisted science. Our analysis reveals that, while theconstruction of MLMs can be relatively independent of domain theory, thepractical implementation and interpretation of these models within a givenspecific domain still relies on fundamental theoretical assumptions andbackground knowledge.
近年来,机器学习(ML)方法论在科学研究领域的传播引发了关于理论阶梯性的讨论。具体地说,理论阶梯性问题是指机器学习模型(MLMs)和机器学习建模策略是否以及如何受到使用和实施机器学习的科学领域(如物理、化学、生物等)的领域理论的影响。一方面,有人认为传统科学(ML 之前)和 ML 辅助科学之间没有区别。在这两种情况下,理论在分析现象、构建和使用模型方面都起着不可或缺的作用。另一些人则认为,ML 方法论和模型与理论无关,在某些情况下甚至不需要理论。在这篇文章中,我们认为这两种立场都过于简单化,并没有促进我们对 ML 方法与领域理论之间相互作用的理解。具体来说,我们分析了 ML 辅助科学中的理论独立性。我们的分析揭示出,虽然构建 MLM 可以相对独立于领域理论,但这些模型在特定领域内的实际应用和解释仍然依赖于基本理论假设和背景知识。
{"title":"Machine Learning and Theory Ladenness -- A Phenomenological Account","authors":"Alberto Termine, Emanuele Ratti, Alessandro Facchini","doi":"arxiv-2409.11277","DOIUrl":"https://doi.org/arxiv-2409.11277","url":null,"abstract":"In recent years, the dissemination of machine learning (ML) methodologies in\u0000scientific research has prompted discussions on theory ladenness. More\u0000specifically, the issue of theory ladenness has remerged as questions about\u0000whether and how ML models (MLMs) and ML modelling strategies are impacted by\u0000the domain theory of the scientific field in which ML is used and implemented\u0000(e.g., physics, chemistry, biology, etc). On the one hand, some have argued\u0000that there is no difference between traditional (pre ML) and ML assisted\u0000science. In both cases, theory plays an essential and unavoidable role in the\u0000analysis of phenomena and the construction and use of models. Others have\u0000argued instead that ML methodologies and models are theory independent and, in\u0000some cases, even theory free. In this article, we argue that both positions are\u0000overly simplistic and do not advance our understanding of the interplay between\u0000ML methods and domain theories. Specifically, we provide an analysis of theory\u0000ladenness in ML assisted science. Our analysis reveals that, while the\u0000construction of MLMs can be relatively independent of domain theory, the\u0000practical implementation and interpretation of these models within a given\u0000specific domain still relies on fundamental theoretical assumptions and\u0000background knowledge.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142252655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent 利用多代理思想树验证代理改进 LLM 推理
Pub Date : 2024-09-17 DOI: arxiv-2409.11527
Fatemeh Haji, Mazal Bethany, Maryam Tabar, Jason Chiang, Anthony Rios, Peyman Najafirad
Multi-agent strategies have emerged as a promising approach to enhance thereasoning abilities of Large Language Models (LLMs) by assigning specializedroles in the problem-solving process. Concurrently, Tree of Thoughts (ToT)methods have shown potential in improving reasoning for complexquestion-answering tasks by exploring diverse reasoning paths. A criticallimitation in multi-agent reasoning is the 'Reasoner' agent's shallowexploration of reasoning paths. While ToT strategies could help mitigate thisproblem, they may generate flawed reasoning branches, which could harm thetrustworthiness of the final answer. To leverage the strengths of bothmulti-agent reasoning and ToT strategies, we introduce a novel approachcombining ToT-based Reasoner agents with a Thought Validator agent. MultipleReasoner agents operate in parallel, employing ToT to explore diverse reasoningpaths. The Thought Validator then scrutinizes these paths, considering aReasoner's conclusion only if its reasoning is valid. This method enables amore robust voting strategy by discarding faulty reasoning paths, enhancing thesystem's ability to tackle tasks requiring systematic and trustworthyreasoning. Our method demonstrates superior performance compared to existingtechniques when evaluated on the GSM8K dataset, outperforming the standard ToTstrategy by an average 5.6% across four LLMs.
多代理策略是通过在问题解决过程中分配专门角色来提高大型语言模型(LLM)推理能力的一种有前途的方法。与此同时,思维树(ToT)方法通过探索不同的推理路径,在提高复杂问题解答任务的推理能力方面显示出了潜力。推理者"(Reasoner)代理对推理路径的浅层探索是多代理推理中一个令人诟病的限制因素。虽然 ToT 策略有助于缓解这一问题,但它们可能会产生有缺陷的推理分支,从而损害最终答案的可信度。为了充分利用多代理推理和 ToT 策略的优势,我们引入了一种将基于 ToT 的推理代理与思想验证代理相结合的新方法。多个推理代理并行运行,利用 ToT 探索不同的推理路径。然后,"思想验证器 "会仔细检查这些路径,只有在推理有效的情况下,才会考虑推理者的结论。这种方法通过摒弃错误的推理路径,实现了更稳健的投票策略,增强了系统处理需要系统化和可信推理的任务的能力。在 GSM8K 数据集上进行评估时,与现有技术相比,我们的方法表现出更优越的性能,在四个 LLM 中平均比标准 ToT 策略高出 5.6%。
{"title":"Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent","authors":"Fatemeh Haji, Mazal Bethany, Maryam Tabar, Jason Chiang, Anthony Rios, Peyman Najafirad","doi":"arxiv-2409.11527","DOIUrl":"https://doi.org/arxiv-2409.11527","url":null,"abstract":"Multi-agent strategies have emerged as a promising approach to enhance the\u0000reasoning abilities of Large Language Models (LLMs) by assigning specialized\u0000roles in the problem-solving process. Concurrently, Tree of Thoughts (ToT)\u0000methods have shown potential in improving reasoning for complex\u0000question-answering tasks by exploring diverse reasoning paths. A critical\u0000limitation in multi-agent reasoning is the 'Reasoner' agent's shallow\u0000exploration of reasoning paths. While ToT strategies could help mitigate this\u0000problem, they may generate flawed reasoning branches, which could harm the\u0000trustworthiness of the final answer. To leverage the strengths of both\u0000multi-agent reasoning and ToT strategies, we introduce a novel approach\u0000combining ToT-based Reasoner agents with a Thought Validator agent. Multiple\u0000Reasoner agents operate in parallel, employing ToT to explore diverse reasoning\u0000paths. The Thought Validator then scrutinizes these paths, considering a\u0000Reasoner's conclusion only if its reasoning is valid. This method enables a\u0000more robust voting strategy by discarding faulty reasoning paths, enhancing the\u0000system's ability to tackle tasks requiring systematic and trustworthy\u0000reasoning. Our method demonstrates superior performance compared to existing\u0000techniques when evaluated on the GSM8K dataset, outperforming the standard ToT\u0000strategy by an average 5.6% across four LLMs.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142252654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Navigating Process Mining: A Case study using pm4py 流程挖掘导航:使用 pm4py 的案例研究
Pub Date : 2024-09-17 DOI: arxiv-2409.11294
Ali Jlidi, László Kovács
Process-mining techniques have emerged as powerful tools for analyzing eventdata to gain insights into business processes. In this paper, we present acomprehensive analysis of road traffic fine management processes using thepm4py library in Python. We start by importing an event log dataset and exploreits characteristics, including the distribution of activities and processvariants. Through filtering and statistical analysis, we uncover key patternsand variations in the process executions. Subsequently, we apply variousprocess-mining algorithms, including the Alpha Miner, Inductive Miner, andHeuristic Miner, to discover process models from the event log data. Wevisualize the discovered models to understand the workflow structures anddependencies within the process. Additionally, we discuss the strengths andlimitations of each mining approach in capturing the underlying processdynamics. Our findings shed light on the efficiency and effectiveness of roadtraffic fine management processes, providing valuable insights for processoptimization and decision-making. This study demonstrates the utility of pm4pyin facilitating process mining tasks and its potential for analyzing real-worldbusiness processes.
流程挖掘技术已成为分析事件数据以深入了解业务流程的强大工具。在本文中,我们使用 Python 中的pm4py 库对道路交通罚款管理流程进行了全面分析。我们从导入事件日志数据集开始,探索其特征,包括活动和流程变量的分布。通过过滤和统计分析,我们发现了流程执行中的关键模式和变化。随后,我们应用各种流程挖掘算法,包括阿尔法挖掘器、归纳挖掘器和逻辑挖掘器,从事件日志数据中发现流程模型。我们将发现的模型可视化,以了解流程内的工作流结构和依赖关系。此外,我们还讨论了每种挖掘方法在捕捉底层流程动力学方面的优势和局限性。我们的研究结果揭示了道路交通精细化管理流程的效率和有效性,为流程优化和决策提供了有价值的见解。这项研究证明了 pm4py 在促进流程挖掘任务方面的实用性及其在分析现实世界业务流程方面的潜力。
{"title":"Navigating Process Mining: A Case study using pm4py","authors":"Ali Jlidi, László Kovács","doi":"arxiv-2409.11294","DOIUrl":"https://doi.org/arxiv-2409.11294","url":null,"abstract":"Process-mining techniques have emerged as powerful tools for analyzing event\u0000data to gain insights into business processes. In this paper, we present a\u0000comprehensive analysis of road traffic fine management processes using the\u0000pm4py library in Python. We start by importing an event log dataset and explore\u0000its characteristics, including the distribution of activities and process\u0000variants. Through filtering and statistical analysis, we uncover key patterns\u0000and variations in the process executions. Subsequently, we apply various\u0000process-mining algorithms, including the Alpha Miner, Inductive Miner, and\u0000Heuristic Miner, to discover process models from the event log data. We\u0000visualize the discovered models to understand the workflow structures and\u0000dependencies within the process. Additionally, we discuss the strengths and\u0000limitations of each mining approach in capturing the underlying process\u0000dynamics. Our findings shed light on the efficiency and effectiveness of road\u0000traffic fine management processes, providing valuable insights for process\u0000optimization and decision-making. This study demonstrates the utility of pm4py\u0000in facilitating process mining tasks and its potential for analyzing real-world\u0000business processes.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural Networks for Vehicle Routing Problem 用于车辆路由问题的神经网络
Pub Date : 2024-09-17 DOI: arxiv-2409.11290
László Kovács, Ali Jlidi
The Vehicle Routing Problem is about optimizing the routes of vehicles tomeet the needs of customers at specific locations. The route graph consists ofdepots on several levels and customer positions. Several optimization methodshave been developed over the years, most of which are based on some type ofclassic heuristic: genetic algorithm, simulated annealing, tabu search, antcolony optimization, firefly algorithm. Recent developments in machine learningprovide a new toolset, the rich family of neural networks, for tackling complexproblems. The main area of application of neural networks is the area ofclassification and regression. Route optimization can be viewed as a newchallenge for neural networks. The article first presents an analysis of theapplicability of neural network tools, then a novel graphical neural networkmodel is presented in detail. The efficiency analysis based on test experimentsshows the applicability of the proposed NN architecture.
车辆路线问题是关于优化车辆路线以满足特定地点客户需求的问题。路线图由多个层次的配送点和客户位置组成。多年来,人们开发了多种优化方法,其中大多数都基于某种经典的启发式算法:遗传算法、模拟退火、塔布搜索、蚁群优化、萤火虫算法。机器学习的最新发展为解决复杂问题提供了一个新的工具集,即丰富的神经网络家族。神经网络的主要应用领域是分类和回归。路线优化可视为神经网络面临的新挑战。文章首先分析了神经网络工具的适用性,然后详细介绍了一种新型图形神经网络模型。基于测试实验的效率分析表明了所提出的神经网络架构的适用性。
{"title":"Neural Networks for Vehicle Routing Problem","authors":"László Kovács, Ali Jlidi","doi":"arxiv-2409.11290","DOIUrl":"https://doi.org/arxiv-2409.11290","url":null,"abstract":"The Vehicle Routing Problem is about optimizing the routes of vehicles to\u0000meet the needs of customers at specific locations. The route graph consists of\u0000depots on several levels and customer positions. Several optimization methods\u0000have been developed over the years, most of which are based on some type of\u0000classic heuristic: genetic algorithm, simulated annealing, tabu search, ant\u0000colony optimization, firefly algorithm. Recent developments in machine learning\u0000provide a new toolset, the rich family of neural networks, for tackling complex\u0000problems. The main area of application of neural networks is the area of\u0000classification and regression. Route optimization can be viewed as a new\u0000challenge for neural networks. The article first presents an analysis of the\u0000applicability of neural network tools, then a novel graphical neural network\u0000model is presented in detail. The efficiency analysis based on test experiments\u0000shows the applicability of the proposed NN architecture.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142252653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ReflectDiffu: Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework ReflectDiffu:通过 RL-Diffusion 框架在情感意向传染和模仿之间进行反射,以生成富有同情心的反应
Pub Date : 2024-09-16 DOI: arxiv-2409.10289
Jiahao Yuan, Zixiang Di, Zhiqing Cui, Guisong Yang, Usman Naseem
Empathetic response generation necessitates the integration of emotional andintentional dynamics to foster meaningful interactions. Existing researcheither neglects the intricate interplay between emotion and intent, leading tosuboptimal controllability of empathy, or resorts to large language models(LLMs), which incur significant computational overhead. In this paper, weintroduce ReflectDiffu, a lightweight and comprehensive framework forempathetic response generation. This framework incorporates emotion contagionto augment emotional expressiveness and employs an emotion-reasoning mask topinpoint critical emotional elements. Additionally, it integrates intentmimicry within reinforcement learning for refinement during diffusion. Byharnessing an intent twice reflect the mechanism ofExploring-Sampling-Correcting, ReflectDiffu adeptly translates emotionaldecision-making into precise intent actions, thereby addressing empatheticresponse misalignments stemming from emotional misrecognition. Throughreflection, the framework maps emotional states to intents, markedly enhancingboth response empathy and flexibility. Comprehensive experiments reveal thatReflectDiffu outperforms existing models regarding relevance, controllability,and informativeness, achieving state-of-the-art results in both automatic andhuman evaluations.
移情反应的生成需要整合情感和意图动态,以促进有意义的互动。现有的研究要么忽视了情感和意图之间错综复杂的相互作用,导致同理心的可控性不够理想,要么依赖于大型语言模型(LLM),从而产生了巨大的计算开销。在本文中,我们介绍了一个轻量级的综合性移情反应生成框架 ReflectDiffu。该框架结合了情感传染来增强情感表达能力,并采用情感推理掩码来指出关键的情感元素。此外,它还在强化学习中整合了意图模仿,以便在扩散过程中进行改进。通过利用意向两次反映探索-取样-纠正机制,ReflectDiffu 巧妙地将情感决策转化为精确的意向行动,从而解决了因情感识别错误而导致的移情反应失调问题。通过反思,该框架将情绪状态映射到意图上,显著增强了反应的移情性和灵活性。综合实验表明,ReflectDiffu 在相关性、可控性和信息性方面都优于现有模型,在自动评估和人工评估中都取得了最先进的结果。
{"title":"ReflectDiffu: Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework","authors":"Jiahao Yuan, Zixiang Di, Zhiqing Cui, Guisong Yang, Usman Naseem","doi":"arxiv-2409.10289","DOIUrl":"https://doi.org/arxiv-2409.10289","url":null,"abstract":"Empathetic response generation necessitates the integration of emotional and\u0000intentional dynamics to foster meaningful interactions. Existing research\u0000either neglects the intricate interplay between emotion and intent, leading to\u0000suboptimal controllability of empathy, or resorts to large language models\u0000(LLMs), which incur significant computational overhead. In this paper, we\u0000introduce ReflectDiffu, a lightweight and comprehensive framework for\u0000empathetic response generation. This framework incorporates emotion contagion\u0000to augment emotional expressiveness and employs an emotion-reasoning mask to\u0000pinpoint critical emotional elements. Additionally, it integrates intent\u0000mimicry within reinforcement learning for refinement during diffusion. By\u0000harnessing an intent twice reflect the mechanism of\u0000Exploring-Sampling-Correcting, ReflectDiffu adeptly translates emotional\u0000decision-making into precise intent actions, thereby addressing empathetic\u0000response misalignments stemming from emotional misrecognition. Through\u0000reflection, the framework maps emotional states to intents, markedly enhancing\u0000both response empathy and flexibility. Comprehensive experiments reveal that\u0000ReflectDiffu outperforms existing models regarding relevance, controllability,\u0000and informativeness, achieving state-of-the-art results in both automatic and\u0000human evaluations.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142252656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Logic Synthesis Optimization with Predictive Self-Supervision via Causal Transformers 通过因果变换器进行预测性自我监督的逻辑合成优化
Pub Date : 2024-09-16 DOI: arxiv-2409.10653
Raika Karimi, Faezeh Faez, Yingxue Zhang, Xing Li, Lei Chen, Mingxuan Yuan, Mahdi Biparva
Contemporary hardware design benefits from the abstraction provided byhigh-level logic gates, streamlining the implementation of logic circuits.Logic Synthesis Optimization (LSO) operates at one level of abstraction withinthe Electronic Design Automation (EDA) workflow, targeting improvements inlogic circuits with respect to performance metrics such as size and speed inthe final layout. Recent trends in the field show a growing interest inleveraging Machine Learning (ML) for EDA, notably through ML-guided logicsynthesis utilizing policy-based Reinforcement Learning (RL) methods.Despitethese advancements, existing models face challenges such as overfitting andlimited generalization, attributed to constrained public circuits and theexpressiveness limitations of graph encoders. To address these hurdles, andtackle data scarcity issues, we introduce LSOformer, a novel approachharnessing Autoregressive transformer models and predictive SSL to predict thetrajectory of Quality of Results (QoR). LSOformer integrates cross-attentionmodules to merge insights from circuit graphs and optimization sequences,thereby enhancing prediction accuracy for QoR metrics. Experimental studiesvalidate the effectiveness of LSOformer, showcasing its superior performanceover baseline architectures in QoR prediction tasks, where it achievesimprovements of 5.74%, 4.35%, and 17.06% on the EPFL, OABCD, and proprietarycircuits datasets, respectively, in inductive setup.
逻辑合成优化(LSO)在电子设计自动化(EDA)工作流程中的一个抽象层次上运行,目标是在最终布局的尺寸和速度等性能指标方面改进逻辑电路。该领域的最新趋势表明,人们对将机器学习(ML)应用于 EDA 的兴趣与日俱增,特别是通过利用基于策略的强化学习(RL)方法进行 ML 引导的逻辑合成。尽管取得了这些进步,但现有模型仍面临过度拟合和泛化受限等挑战,这归因于受限的公共电路和图编码器的可执行性限制。为了克服这些障碍并解决数据稀缺问题,我们引入了 LSOformer,这是一种利用自回归变压器模型和预测性 SSL 来预测结果质量(QoR)轨迹的新方法。LSOformer 集成了交叉注意模块,将电路图和优化序列中的见解融合在一起,从而提高了 QoR 指标的预测精度。实验研究验证了 LSOformer 的有效性,展示了它在 QoR 预测任务中优于基线架构的性能,在归纳设置中,LSOformer 在 EPFL、OABCD 和 proprietarycircuits 数据集上的性能分别提高了 5.74%、4.35% 和 17.06%。
{"title":"Logic Synthesis Optimization with Predictive Self-Supervision via Causal Transformers","authors":"Raika Karimi, Faezeh Faez, Yingxue Zhang, Xing Li, Lei Chen, Mingxuan Yuan, Mahdi Biparva","doi":"arxiv-2409.10653","DOIUrl":"https://doi.org/arxiv-2409.10653","url":null,"abstract":"Contemporary hardware design benefits from the abstraction provided by\u0000high-level logic gates, streamlining the implementation of logic circuits.\u0000Logic Synthesis Optimization (LSO) operates at one level of abstraction within\u0000the Electronic Design Automation (EDA) workflow, targeting improvements in\u0000logic circuits with respect to performance metrics such as size and speed in\u0000the final layout. Recent trends in the field show a growing interest in\u0000leveraging Machine Learning (ML) for EDA, notably through ML-guided logic\u0000synthesis utilizing policy-based Reinforcement Learning (RL) methods.Despite\u0000these advancements, existing models face challenges such as overfitting and\u0000limited generalization, attributed to constrained public circuits and the\u0000expressiveness limitations of graph encoders. To address these hurdles, and\u0000tackle data scarcity issues, we introduce LSOformer, a novel approach\u0000harnessing Autoregressive transformer models and predictive SSL to predict the\u0000trajectory of Quality of Results (QoR). LSOformer integrates cross-attention\u0000modules to merge insights from circuit graphs and optimization sequences,\u0000thereby enhancing prediction accuracy for QoR metrics. Experimental studies\u0000validate the effectiveness of LSOformer, showcasing its superior performance\u0000over baseline architectures in QoR prediction tasks, where it achieves\u0000improvements of 5.74%, 4.35%, and 17.06% on the EPFL, OABCD, and proprietary\u0000circuits datasets, respectively, in inductive setup.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1