arXiv - CS - Artificial Intelligence最新文献

英文中文

CauseJudger: Identifying the Cause with LLMs for Abductive Logical Reasoning CauseJudger：用 LLMs 找出原因，进行归纳逻辑推理

arXiv - CS - Artificial Intelligence

Pub Date : 2024-09-09 DOI: arxiv-2409.05559

Jinwei He, Feng Lu

Large language models (LLMs) have been utilized in solving diverse reasoningtasks, encompassing common sense, arithmetic and deduction tasks. However, withdifficulties of reversing thinking patterns and irrelevant premises, how todetermine the authenticity of the cause in abductive logical reasoning remainsunderexplored. Inspired by hypothesis and verification method andidentification of irrelevant information in human thinking process, we proposea new framework for LLMs abductive logical reasoning called CauseJudger (CJ),which identifies the authenticity of possible cause by transforming thinkingfrom reverse to forward and removing irrelevant information. In addition, weconstruct an abductive logical reasoning dataset for decision task calledCauseLogics, which contains 200,000 tasks of varying reasoning lengths. Ourexperiments show the efficiency of CJ with overall experiments and ablationexperiments as well as case studies on our dataset and reconstructed publicdataset. Notably, CJ's implementation is efficient, requiring only two calls toLLM. Its impact is profound: when using gpt-3.5, CJ achieves a maximumcorrectness improvement of 41% compared to Zero-Shot-CoT. Moreover, with gpt-4,CJ attains an accuracy exceeding 90% across all datasets.

大语言模型（LLM）已被用于解决各种推理任务，包括常识、算术和演绎任务。然而，在归纳逻辑推理中，由于存在思维模式逆转和前提不相关等困难，如何确定原因的真伪仍未得到探索。受假设和验证方法以及人类思维过程中无关信息识别方法的启发，我们提出了一种新的LLMs归纳逻辑推理框架，称为 "原因评判器（CJ）"，它通过将思维从逆向转化为正向并去除无关信息来识别可能原因的真实性。此外，我们还构建了一个名为 "原因逻辑"（CauseLogics）的决策任务归纳逻辑推理数据集，其中包含 200,000 个推理长度各不相同的任务。我们通过整体实验、消融实验以及在我们的数据集和重建的公共数据集上进行的案例研究，展示了 CJ 的效率。值得注意的是，CJ 的实现非常高效，只需要调用两次LLM。它的影响是深远的：在使用 gpt-3.5 时，CJ 与 Zero-Shot-CoT 相比，最大正确率提高了 41%。此外，在使用 gpt-4 时，CJ 在所有数据集上的正确率都超过了 90%。

{"title":"CauseJudger: Identifying the Cause with LLMs for Abductive Logical Reasoning","authors":"Jinwei He, Feng Lu","doi":"arxiv-2409.05559","DOIUrl":"https://doi.org/arxiv-2409.05559","url":null,"abstract":"Large language models (LLMs) have been utilized in solving diverse reasoning\u0000tasks, encompassing common sense, arithmetic and deduction tasks. However, with\u0000difficulties of reversing thinking patterns and irrelevant premises, how to\u0000determine the authenticity of the cause in abductive logical reasoning remains\u0000underexplored. Inspired by hypothesis and verification method and\u0000identification of irrelevant information in human thinking process, we propose\u0000a new framework for LLMs abductive logical reasoning called CauseJudger (CJ),\u0000which identifies the authenticity of possible cause by transforming thinking\u0000from reverse to forward and removing irrelevant information. In addition, we\u0000construct an abductive logical reasoning dataset for decision task called\u0000CauseLogics, which contains 200,000 tasks of varying reasoning lengths. Our\u0000experiments show the efficiency of CJ with overall experiments and ablation\u0000experiments as well as case studies on our dataset and reconstructed public\u0000dataset. Notably, CJ's implementation is efficient, requiring only two calls to\u0000LLM. Its impact is profound: when using gpt-3.5, CJ achieves a maximum\u0000correctness improvement of 41% compared to Zero-Shot-CoT. Moreover, with gpt-4,\u0000CJ attains an accuracy exceeding 90% across all datasets.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"62 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142224948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Dynamic Demand Management for Parcel Lockers 包裹柜的动态需求管理

arXiv - CS - Artificial Intelligence

Pub Date : 2024-09-08 DOI: arxiv-2409.05061

Daniela Sailer, Robert Klein, Claudius Steinhardt

In pursuit of a more sustainable and cost-efficient last mile, parcel lockershave gained a firm foothold in the parcel delivery landscape. To fully exploittheir potential and simultaneously ensure customer satisfaction, successfulmanagement of the locker's limited capacity is crucial. This is challenging asfuture delivery requests and pickup times are stochastic from the provider'sperspective. In response, we propose to dynamically control whether the lockeris presented as an available delivery option to each incoming customer with thegoal of maximizing the number of served requests weighted by their priority.Additionally, we take different compartment sizes into account, which entails asecond type of decision as parcels scheduled for delivery must be allocated. Weformalize the problem as an infinite-horizon sequential decision problem andfind that exact methods are intractable due to the curses of dimensionality. Inlight of this, we develop a solution framework that orchestrates multiplealgorithmic techniques rooted in Sequential Decision Analytics andReinforcement Learning, namely cost function approximation and an offlinetrained parametric value function approximation together with a truncatedonline rollout. Our innovative approach to combine these techniques enables usto address the strong interrelations between the two decision types. As ageneral methodological contribution, we enhance the training of our valuefunction approximation with a modified version of experience replay thatenforces structure in the value function. Our computational study shows thatour method outperforms a myopic benchmark by 13.7% and an industry-inspiredpolicy by 12.6%.

为了实现更具可持续性和成本效益的 "最后一公里"，包裹柜在包裹递送领域站稳了脚跟。为了充分挖掘其潜力，同时确保客户满意度，成功管理储物柜的有限容量至关重要。从提供商的角度来看，未来的投递请求和取件时间都是随机的，因此这具有挑战性。为此，我们建议动态控制储物柜是否向每位顾客提供可用的递送选项，目标是根据优先级加权最大化已服务请求的数量。此外，我们还考虑到了不同的储物箱大小，这就需要做出第二类决策，因为必须对预定递送的包裹进行分配。我们将该问题形式化为一个无限视距的顺序决策问题，并发现由于维度的限制，精确方法难以解决。有鉴于此，我们开发了一个解决方案框架，该框架协调了源于序列决策分析和强化学习的多种算法技术，即成本函数近似、离线训练的参数值函数近似和截断在线推出。我们结合这些技术的创新方法使我们能够解决这两种决策类型之间的密切联系。作为方法论上的一般贡献，我们通过改进版的经验重放加强了价值函数近似的训练，从而强化了价值函数的结构。我们的计算研究表明，我们的方法比近视基准优胜 13.7%，比行业启发政策优胜 12.6%。

{"title":"Dynamic Demand Management for Parcel Lockers","authors":"Daniela Sailer, Robert Klein, Claudius Steinhardt","doi":"arxiv-2409.05061","DOIUrl":"https://doi.org/arxiv-2409.05061","url":null,"abstract":"In pursuit of a more sustainable and cost-efficient last mile, parcel lockers\u0000have gained a firm foothold in the parcel delivery landscape. To fully exploit\u0000their potential and simultaneously ensure customer satisfaction, successful\u0000management of the locker's limited capacity is crucial. This is challenging as\u0000future delivery requests and pickup times are stochastic from the provider's\u0000perspective. In response, we propose to dynamically control whether the locker\u0000is presented as an available delivery option to each incoming customer with the\u0000goal of maximizing the number of served requests weighted by their priority.\u0000Additionally, we take different compartment sizes into account, which entails a\u0000second type of decision as parcels scheduled for delivery must be allocated. We\u0000formalize the problem as an infinite-horizon sequential decision problem and\u0000find that exact methods are intractable due to the curses of dimensionality. In\u0000light of this, we develop a solution framework that orchestrates multiple\u0000algorithmic techniques rooted in Sequential Decision Analytics and\u0000Reinforcement Learning, namely cost function approximation and an offline\u0000trained parametric value function approximation together with a truncated\u0000online rollout. Our innovative approach to combine these techniques enables us\u0000to address the strong interrelations between the two decision types. As a\u0000general methodological contribution, we enhance the training of our value\u0000function approximation with a modified version of experience replay that\u0000enforces structure in the value function. Our computational study shows that\u0000our method outperforms a myopic benchmark by 13.7% and an industry-inspired\u0000policy by 12.6%.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"36 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142224949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality MuAP：缺失模态视觉语言模型的多步自适应提示学习

arXiv - CS - Artificial Intelligence

Pub Date : 2024-09-07 DOI: arxiv-2409.04693

Ruiting Dai, Yuqiao Tan, Lisi Mo, Tao He, Ke Qin, Shuang Liang

Recently, prompt learning has garnered considerable attention for its successin various Vision-Language (VL) tasks. However, existing prompt-based modelsare primarily focused on studying prompt generation and prompt strategies withcomplete modality settings, which does not accurately reflect real-worldscenarios where partial modality information may be missing. In this paper, wepresent the first comprehensive investigation into prompt learning behaviorwhen modalities are incomplete, revealing the high sensitivity of prompt-basedmodels to missing modalities. To this end, we propose a novel Multi-stepAdaptive Prompt Learning (MuAP) framework, aiming to generate multimodalprompts and perform multi-step prompt tuning, which adaptively learns knowledgeby iteratively aligning modalities. Specifically, we generate multimodalprompts for each modality and devise prompt strategies to integrate them intothe Transformer model. Subsequently, we sequentially perform prompt tuning fromsingle-stage and alignment-stage, allowing each modality-prompt to beautonomously and adaptively learned, thereby mitigating the imbalance issuecaused by only textual prompts that are learnable in previous works. Extensiveexperiments demonstrate the effectiveness of our MuAP and this model achievessignificant improvements compared to the state-of-the-art on all benchmarkdatasets

最近，提示学习因其在各种视觉语言（VL）任务中的成功应用而备受关注。然而，现有的基于提示的模型主要侧重于研究完整模态设置下的提示生成和提示策略，这并不能准确反映部分模态信息可能缺失的真实世界场景。在本文中，我们首次对模态不完整时的提示学习行为进行了全面研究，揭示了基于提示的模型对模态缺失的高度敏感性。为此，我们提出了一个新颖的多步自适应提示学习（MuAP）框架，旨在生成多模态提示并执行多步提示调整，通过迭代调整模态来自适应地学习知识。具体来说，我们为每种模态生成多模态提示，并设计提示策略将其整合到变形器模型中。随后，我们依次从单个阶段和对齐阶段进行提示调整，使每种模态提示都能美化并自适应地学习，从而缓解了以往工作中只有文本提示可以学习所造成的不平衡问题。广泛的实验证明了我们的 MuAP 的有效性，在所有基准数据集上，该模型都比最先进的模型取得了显著的改进

{"title":"MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality","authors":"Ruiting Dai, Yuqiao Tan, Lisi Mo, Tao He, Ke Qin, Shuang Liang","doi":"arxiv-2409.04693","DOIUrl":"https://doi.org/arxiv-2409.04693","url":null,"abstract":"Recently, prompt learning has garnered considerable attention for its success\u0000in various Vision-Language (VL) tasks. However, existing prompt-based models\u0000are primarily focused on studying prompt generation and prompt strategies with\u0000complete modality settings, which does not accurately reflect real-world\u0000scenarios where partial modality information may be missing. In this paper, we\u0000present the first comprehensive investigation into prompt learning behavior\u0000when modalities are incomplete, revealing the high sensitivity of prompt-based\u0000models to missing modalities. To this end, we propose a novel Multi-step\u0000Adaptive Prompt Learning (MuAP) framework, aiming to generate multimodal\u0000prompts and perform multi-step prompt tuning, which adaptively learns knowledge\u0000by iteratively aligning modalities. Specifically, we generate multimodal\u0000prompts for each modality and devise prompt strategies to integrate them into\u0000the Transformer model. Subsequently, we sequentially perform prompt tuning from\u0000single-stage and alignment-stage, allowing each modality-prompt to be\u0000autonomously and adaptively learned, thereby mitigating the imbalance issue\u0000caused by only textual prompts that are learnable in previous works. Extensive\u0000experiments demonstrate the effectiveness of our MuAP and this model achieves\u0000significant improvements compared to the state-of-the-art on all benchmark\u0000datasets","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Algorithmic Scenario Generation as Quality Diversity Optimization 作为质量多样性优化的算法场景生成

arXiv - CS - Artificial Intelligence

Pub Date : 2024-09-07 DOI: arxiv-2409.04711

Stefanos Nikolaidis

The increasing complexity of robots and autonomous agents that interact withpeople highlights the critical need for approaches that systematically testthem before deployment. This review paper presents a general framework forsolving this problem, describes the insights that we have gained from workingon each component of the framework, and shows how integrating these componentsleads to the discovery of a diverse range of realistic and challengingscenarios that reveal previously unknown failures in deployed robotic systemsinteracting with people.

与人交互的机器人和自主代理的复杂性日益增加，这凸显了在部署前对它们进行系统测试的方法的迫切需求。这篇综述论文介绍了解决这一问题的总体框架，描述了我们从该框架的各个组成部分中获得的启示，并展示了如何通过整合这些组成部分来发现各种现实和具有挑战性的情景，从而揭示已部署的与人交互的机器人系统中以前未知的故障。

引用次数: 0

Action is the primary key: a categorical framework for episode description and logical reasoning 行动是首要关键：情节描述和逻辑推理的分类框架

arXiv - CS - Artificial Intelligence

Pub Date : 2024-09-07 DOI: arxiv-2409.04793

Yoshiki Fukada

This research presents a computational framework for describing andrecognizing episodes and for logical reasoning. This framework, namedcognitive-logs, consists of a set of relational and graph databases.Cognitive-logs record knowledge, particularly in episodes that consist of"actions" represented by verbs in natural languages and "participants" whoperform the actions. These objects are connected by arrows (morphisms) thatlink each action to its participant and link cause to effect. Operations basedon category theory enable comparisons between episodes and deductiveinferences, including abstractions of stories. One of the goals of this studyis to develop a database-driven artificial intelligence. This artificialintelligence thinks like a human but possesses the accuracy and rigour of amachine. The vast capacities of databases (up to petabyte scales in currenttechnologies) enable the artificial intelligence to store a greater volume ofknowledge than neural-network based artificial intelligences. Cognitive-logsserve as a model of human cognition and designed with references to cognitivelinguistics. Cognitive-logs also have the potential to model various human mindactivities.

这项研究提出了一个用于描述和识别情节以及逻辑推理的计算框架。认知日志记录知识，尤其是由自然语言中动词代表的 "行动 "和执行行动的 "参与者 "组成的事件。这些对象通过箭头（变形）连接起来，箭头将每个动作与其参与者联系起来，并将因果联系起来。基于范畴理论的运算可以进行情节之间的比较和演绎推理，包括故事的抽象。本研究的目标之一是开发一种数据库驱动的人工智能。这种人工智能的思维方式与人类相似，但具有机器的准确性和严谨性。与基于神经网络的人工智能相比，数据库的巨大容量（在目前的技术中可达到 PB 级）使人工智能能够存储更多的知识。认知日志是人类认知的模型，其设计参考了认知语言学。认知日志还有可能模拟人类的各种思维活动。

{"title":"Action is the primary key: a categorical framework for episode description and logical reasoning","authors":"Yoshiki Fukada","doi":"arxiv-2409.04793","DOIUrl":"https://doi.org/arxiv-2409.04793","url":null,"abstract":"This research presents a computational framework for describing and\u0000recognizing episodes and for logical reasoning. This framework, named\u0000cognitive-logs, consists of a set of relational and graph databases.\u0000Cognitive-logs record knowledge, particularly in episodes that consist of\u0000\"actions\" represented by verbs in natural languages and \"participants\" who\u0000perform the actions. These objects are connected by arrows (morphisms) that\u0000link each action to its participant and link cause to effect. Operations based\u0000on category theory enable comparisons between episodes and deductive\u0000inferences, including abstractions of stories. One of the goals of this study\u0000is to develop a database-driven artificial intelligence. This artificial\u0000intelligence thinks like a human but possesses the accuracy and rigour of a\u0000machine. The vast capacities of databases (up to petabyte scales in current\u0000technologies) enable the artificial intelligence to store a greater volume of\u0000knowledge than neural-network based artificial intelligences. Cognitive-logs\u0000serve as a model of human cognition and designed with references to cognitive\u0000linguistics. Cognitive-logs also have the potential to model various human mind\u0000activities.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"30 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

HULLMI: Human vs LLM identification with explainability HULLMI：人类与 LLM 对可解释性的识别

arXiv - CS - Artificial Intelligence

Pub Date : 2024-09-07 DOI: arxiv-2409.04808

Prathamesh Dinesh Joshi, Sahil Pocker, Raj Abhijit Dandekar, Rajat Dandekar, Sreedath Panat

As LLMs become increasingly proficient at producing human-like responses,there has been a rise of academic and industrial pursuits dedicated to flagginga given piece of text as "human" or "AI". Most of these pursuits involve modernNLP detectors like T5-Sentinel and RoBERTa-Sentinel, without paying too muchattention to issues of interpretability and explainability of these models. Inour study, we provide a comprehensive analysis that shows that traditional MLmodels (Naive-Bayes,MLP, Random Forests, XGBoost) perform as well as modern NLPdetectors, in human vs AI text detection. We achieve this by implementing arobust testing procedure on diverse datasets, including curated corpora andreal-world samples. Subsequently, by employing the explainable AI techniqueLIME, we uncover parts of the input that contribute most to the prediction ofeach model, providing insights into the detection process. Our studycontributes to the growing need for developing production-level LLM detectiontools, which can leverage a wide range of traditional as well as modern NLPdetectors we propose. Finally, the LIME techniques we demonstrate also have thepotential to equip these detection tools with interpretability analysisfeatures, making them more reliable and trustworthy in various domains likeeducation, healthcare, and media.

随着 LLM 越来越熟练地做出类似人类的反应，学术界和工业界开始致力于将特定文本标记为 "人类 "或 "人工智能"。这些研究大多涉及 T5-Sentinel 和 RoBERTa-Sentinel 等现代 NLP 检测器，却没有过多关注这些模型的可解释性和可解释性问题。在我们的研究中，我们进行了全面的分析，结果表明传统的 ML 模型（Naive-Bayes、MLP、Random Forests、XGBoost）在人类与人工智能文本检测中的表现与现代 NLP 检测器不相上下。为此，我们在各种数据集（包括策划的语料库和真实世界样本）上实施了一套可靠的测试程序。随后，通过使用可解释的人工智能技术LIME，我们发现了输入中对每个模型的预测贡献最大的部分，为检测过程提供了洞察力。我们的研究有助于满足开发生产级 LLM 检测工具的日益增长的需求，这些工具可以利用我们提出的各种传统和现代 NLP 检测器。最后，我们展示的 LIME 技术还有潜力为这些检测工具配备可解释性分析功能，使它们在教育、医疗保健和媒体等各个领域更加可靠和可信。

{"title":"HULLMI: Human vs LLM identification with explainability","authors":"Prathamesh Dinesh Joshi, Sahil Pocker, Raj Abhijit Dandekar, Rajat Dandekar, Sreedath Panat","doi":"arxiv-2409.04808","DOIUrl":"https://doi.org/arxiv-2409.04808","url":null,"abstract":"As LLMs become increasingly proficient at producing human-like responses,\u0000there has been a rise of academic and industrial pursuits dedicated to flagging\u0000a given piece of text as \"human\" or \"AI\". Most of these pursuits involve modern\u0000NLP detectors like T5-Sentinel and RoBERTa-Sentinel, without paying too much\u0000attention to issues of interpretability and explainability of these models. In\u0000our study, we provide a comprehensive analysis that shows that traditional ML\u0000models (Naive-Bayes,MLP, Random Forests, XGBoost) perform as well as modern NLP\u0000detectors, in human vs AI text detection. We achieve this by implementing a\u0000robust testing procedure on diverse datasets, including curated corpora and\u0000real-world samples. Subsequently, by employing the explainable AI technique\u0000LIME, we uncover parts of the input that contribute most to the prediction of\u0000each model, providing insights into the detection process. Our study\u0000contributes to the growing need for developing production-level LLM detection\u0000tools, which can leverage a wide range of traditional as well as modern NLP\u0000detectors we propose. Finally, the LIME techniques we demonstrate also have the\u0000potential to equip these detection tools with interpretability analysis\u0000features, making them more reliable and trustworthy in various domains like\u0000education, healthcare, and media.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Advancing Multi-Organ Disease Care: A Hierarchical Multi-Agent Reinforcement Learning Framework 推进多器官疾病护理：分层多代理强化学习框架

arXiv - CS - Artificial Intelligence

Pub Date : 2024-09-06 DOI: arxiv-2409.04224

Daniel J. Tan, Qianyi Xu, Kay Choong See, Dilruk Perera, Mengling Feng

Multi-organ diseases present significant challenges due to their simultaneousimpact on multiple organ systems, necessitating complex and adaptive treatmentstrategies. Despite recent advancements in AI-powered healthcare decisionsupport systems, existing solutions are limited to individual organ systems.They often ignore the intricate dependencies between organ system and therebyfails to provide holistic treatment recommendations that are useful inpractice. We propose a novel hierarchical multi-agent reinforcement learning(HMARL) framework to address these challenges. This framework uses dedicatedagents for each organ system, and model dynamic through explicit inter-agentcommunication channels, enabling coordinated treatment strategies acrossorgans. Furthermore, we introduce a dual-layer state representation techniqueto contextualize patient conditions at various hierarchical levels, enhancingthe treatment accuracy and relevance. Through extensive qualitative andquantitative evaluations in managing sepsis (a complex multi-organ disease),our approach demonstrates its ability to learn effective treatment policiesthat significantly improve patient survival rates. This framework marks asubstantial advancement in clinical decision support systems, pioneering acomprehensive approach for multi-organ treatment recommendations.

多器官疾病对多个器官系统同时产生影响，需要复杂的适应性治疗策略，因此带来了巨大的挑战。尽管最近人工智能医疗决策支持系统取得了进步，但现有的解决方案仅限于单个器官系统，它们往往忽略了器官系统之间错综复杂的依赖关系，因此无法提供在实践中有用的整体治疗建议。我们提出了一种新颖的分层多代理强化学习（HMARL）框架来应对这些挑战。该框架为每个器官系统使用专用的代理，并通过明确的代理间通信渠道建立动态模型，从而实现跨器官的协调治疗策略。此外，我们还引入了一种双层状态表示技术，在不同层次上将病人的情况上下文化，从而提高了治疗的准确性和相关性。通过在脓毒症（一种复杂的多器官疾病）治疗中进行广泛的定性和定量评估，我们的方法证明了其学习有效治疗策略的能力，从而显著提高了患者的存活率。这一框架标志着临床决策支持系统取得了实质性进展，开创了多器官治疗建议的综合方法。

{"title":"Advancing Multi-Organ Disease Care: A Hierarchical Multi-Agent Reinforcement Learning Framework","authors":"Daniel J. Tan, Qianyi Xu, Kay Choong See, Dilruk Perera, Mengling Feng","doi":"arxiv-2409.04224","DOIUrl":"https://doi.org/arxiv-2409.04224","url":null,"abstract":"Multi-organ diseases present significant challenges due to their simultaneous\u0000impact on multiple organ systems, necessitating complex and adaptive treatment\u0000strategies. Despite recent advancements in AI-powered healthcare decision\u0000support systems, existing solutions are limited to individual organ systems.\u0000They often ignore the intricate dependencies between organ system and thereby\u0000fails to provide holistic treatment recommendations that are useful in\u0000practice. We propose a novel hierarchical multi-agent reinforcement learning\u0000(HMARL) framework to address these challenges. This framework uses dedicated\u0000agents for each organ system, and model dynamic through explicit inter-agent\u0000communication channels, enabling coordinated treatment strategies across\u0000organs. Furthermore, we introduce a dual-layer state representation technique\u0000to contextualize patient conditions at various hierarchical levels, enhancing\u0000the treatment accuracy and relevance. Through extensive qualitative and\u0000quantitative evaluations in managing sepsis (a complex multi-organ disease),\u0000our approach demonstrates its ability to learn effective treatment policies\u0000that significantly improve patient survival rates. This framework marks a\u0000substantial advancement in clinical decision support systems, pioneering a\u0000comprehensive approach for multi-organ treatment recommendations.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"41 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Improved Parallel Algorithm for Non-Monotone Submodular Maximization under Knapsack Constraint Knapsack约束下非单调次模态最大化的改进并行算法

arXiv - CS - Artificial Intelligence

Pub Date : 2024-09-06 DOI: arxiv-2409.04415

Tan D. Tran, Canh V. Pham, Dung T. K. Ha, Phuong N. H. Pham

This work proposes an efficient parallel algorithm for non-monotonesubmodular maximization under a knapsack constraint problem over the ground setof size $n$. Our algorithm improves the best approximation factor of theexisting parallel one from $8+epsilon$ to $7+epsilon$ with $O(log n)$adaptive complexity. The key idea of our approach is to create a new alternate thresholdalgorithmic framework. This strategy alternately constructs two disjointcandidate solutions within a constant number of sequence rounds. Then, thealgorithm boosts solution quality without sacrificing the adaptive complexity.Extensive experimental studies on three applications, Revenue Maximization,Image Summarization, and Maximum Weighted Cut, show that our algorithm not onlysignificantly increases solution quality but also requires comparativeadaptivity to state-of-the-art algorithms.

本研究提出了一种高效的并行算法，用于在大小为 $n$ 的地面集合上的 knapsack 约束问题下的非单调次模态最大化。我们的算法将现有并行算法的最佳近似系数从 $8+epsilon$ 提高到 $7+epsilon$ ，自适应复杂度为 $O(log n)$。我们的方法的关键思路是创建一个新的交替阈值算法框架。该策略在一定数量的序列轮次内交替构建两个不相交的候选解。对收入最大化、图像汇总和最大加权剪切这三个应用的广泛实验研究表明，我们的算法不仅显著提高了解的质量，而且与最先进的算法相比具有可比性。

引用次数: 0

Intelligent tutoring systems by Bayesian networks with noisy gates 带有噪声门的贝叶斯网络智能辅导系统

arXiv - CS - Artificial Intelligence

Pub Date : 2024-09-06 DOI: arxiv-2409.04102

Alessandro Antonucci, Francesca Mangili, Claudio Bonesana, Giorgia Adorni

Directed graphical models such as Bayesian nets are often used to implementintelligent tutoring systems able to interact in real-time with learners in apurely automatic way. When coping with such models, keeping a bound on thenumber of parameters might be important for multiple reasons. First, as thesemodels are typically based on expert knowledge, a huge number of parameters toelicit might discourage practitioners from adopting them. Moreover, the numberof model parameters affects the complexity of the inferences, while a fastcomputation of the queries is needed for real-time feedback. We advocatelogical gates with uncertainty for a compact parametrization of the conditionalprobability tables in the underlying Bayesian net used by tutoring systems. Wediscuss the semantics of the model parameters to elicit and the assumptionsrequired to apply such approach in this domain. We also derive a dedicatedinference scheme to speed up computations.

有向图模型（如贝叶斯网）通常用于实现智能辅导系统，该系统能够以纯粹自动的方式与学习者进行实时交互。在处理这类模型时，出于多种原因，对参数数量保持一定的限制可能很重要。首先，由于这些模型通常以专家知识为基础，如果要获取大量参数，可能会让从业者望而却步。此外，模型参数的数量会影响推论的复杂性，而实时反馈需要快速计算查询。我们提倡在辅导系统使用的底层贝叶斯网中，用不确定性逻辑门对条件概率表进行紧凑的参数化。我们讨论了模型参数的语义，以及在该领域应用这种方法所需的假设。我们还推导出一种专用推理方案，以加快计算速度。

引用次数: 0

Neurosymbolic Methods for Dynamic Knowledge Graphs 动态知识图谱的神经符号方法

arXiv - CS - Artificial Intelligence

Pub Date : 2024-09-06 DOI: arxiv-2409.04572

Mehwish Alam, Genet Asefa Gesese, Pierre-Henri Paris

Knowledge graphs (KGs) have recently been used for many tools andapplications, making them rich resources in structured format. However, in thereal world, KGs grow due to the additions of new knowledge in the form ofentities and relations, making these KGs dynamic. This chapter formally definesseveral types of dynamic KGs and summarizes how these KGs can be represented.Additionally, many neurosymbolic methods have been proposed for learningrepresentations over static KGs for several tasks such as KG completion andentity alignment. This chapter further focuses on neurosymbolic methods fordynamic KGs with or without temporal information. More specifically, itprovides an insight into neurosymbolic methods for dynamic (temporal ornon-temporal) KG completion and entity alignment tasks. It further discussesthe challenges of current approaches and provides some future directions.

知识图谱（KGs）近来已被用于许多工具和应用中，成为结构化格式的丰富资源。然而，在现实世界中，由于实体和关系形式的新知识的加入，知识图谱会不断增长，从而使这些知识图谱成为动态图谱。本章正式定义了几种类型的动态 KG，并总结了这些 KG 的表示方法。此外，许多神经符号方法已被提出，用于学习静态 KG 的表示方法，以完成 KG 补充和实体对齐等任务。本章将进一步关注用于有时间信息或无时间信息动态 KG 的神经符号方法。更具体地说，本章深入探讨了用于动态（时态或非时态）KG补全和实体配准任务的神经符号方法。它进一步讨论了当前方法所面临的挑战，并提供了一些未来发展方向。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

arXiv - CS - Artificial Intelligence

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀