Luca Gherardini, Varun Ravi Varma, Karol Capała, Roger Woods, Jose Sousa
The availability of large data sets is providing the impetus for many current artificial intelligence developments. However, specific challenges arise in developing solutions that exploit small data sets, mainly due to practical and cost-effective deployment issues, as well as the opacity of deep learning models. To address this, the Comprehensive Abstraction and Classification Tool for Uncovering Structures (CACTUS) is presented as a means of improving secure analytics by effectively employing explainable artificial intelligence. CACTUS achieves this by providing additional support for categorical attributes, preserving their original meaning, optimising memory usage, and speeding up the computation through parallelisation. It exposes the frequency of attributes in each class to the user and ranks them by their discriminative power. Performance is assessed by applying it to various domains, including the Wisconsin Diagnostic Breast Cancer, Thyroid0387, Mushroom, Cleveland Heart Disease, and Adult Income data sets.
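The abstract does not specify CACTUS's exact ranking criterion; as a rough illustration of exposing per-class attribute frequencies and ranking attributes by discriminative power, a minimal Python sketch (with made-up data and a simple spread-based score standing in for the tool's own measure) might look like this:

```python
from collections import Counter, defaultdict

def attribute_class_frequencies(rows, labels):
    """Count how often each (attribute, value) pair occurs within each class."""
    freq = defaultdict(Counter)  # (attribute, value) -> Counter over classes
    for row, label in zip(rows, labels):
        for attribute, value in row.items():
            freq[(attribute, value)][label] += 1
    return freq

def rank_by_discriminative_power(freq):
    """Rank attribute values by how unevenly they are spread across classes
    (spread between most and least frequent class, normalised by total count)."""
    scores = {}
    for key, class_counts in freq.items():
        total = sum(class_counts.values())
        spread = (max(class_counts.values()) - min(class_counts.values())
                  if len(class_counts) > 1 else max(class_counts.values()))
        scores[key] = spread / total
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

# Toy categorical data set (hypothetical values, for illustration only).
rows = [{"cap": "bell", "odor": "none"}, {"cap": "flat", "odor": "foul"},
        {"cap": "bell", "odor": "none"}, {"cap": "flat", "odor": "none"}]
labels = ["edible", "poisonous", "edible", "edible"]
for (attr, val), score in rank_by_discriminative_power(attribute_class_frequencies(rows, labels)):
    print(f"{attr}={val}: {score:.2f}")
```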
{"title":"CACTUS: a Comprehensive Abstraction and Classification Tool for Uncovering Structures","authors":"Luca Gherardini, Varun Ravi Varma, Karol Capała, Roger Woods, Jose Sousa","doi":"10.1145/3649459","DOIUrl":"https://doi.org/10.1145/3649459","url":null,"abstract":"<p>The availability of large data sets is providing the impetus for driving many current artificial intelligent developments. However, specific challenges arise in developing solutions that exploit small data sets, mainly due to practical and cost-effective deployment issues, as well as the opacity of deep learning models. To address this, the Comprehensive Abstraction and Classification Tool for Uncovering Structures (CACTUS) is presented as a means of improving secure analytics by effectively employing explainable artificial intelligence. CACTUS achieves this by providing additional support for categorical attributes, preserving their original meaning, optimising memory usage, and speeding up the computation through parallelisation. It exposes to the user the frequency of the attributes in each class and ranks them by their discriminative power. Performance is assessed by applying it to various domains, including Wisconsin Diagnostic Breast Cancer, Thyroid0387, Mushroom, Cleveland Heart Disease, and Adult Income data sets.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"33 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139988195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Davor Vukadin, Petar Afrić, Marin Šilić, Goran Delač
Recent advances in deep neural network performance have led to new state-of-the-art approaches in numerous areas. However, the black-box nature of neural networks often prohibits their use in areas where model explainability and model transparency are crucial. Over the years, researchers have proposed many algorithms to aid neural network understanding and provide additional information to the human expert. One of the most popular methods is Layer-Wise Relevance Propagation (LRP), which assigns local relevance based on a pixel-wise decomposition of nonlinear classifiers. With the rise of attribution-method research, a pressing need has emerged to assess and evaluate their performance. Numerous metrics have been proposed, each assessing an individual property of attribution methods such as faithfulness, robustness, or localization. Unfortunately, no single metric is deemed optimal for every case, and researchers often use several metrics to test the quality of attribution maps. In this work, we address the shortcomings of current LRP formulations and introduce a novel method for determining the relevance of input neurons through layer-wise relevance propagation. Furthermore, we apply this approach to the recently developed Vision Transformer architecture and evaluate its performance against existing methods on two image classification datasets, ImageNet and PascalVOC. Our results clearly demonstrate the advantage of our proposed method. Furthermore, we discuss the insufficiencies of current evaluation metrics for attribution-based explainability and propose a new evaluation metric that combines the notions of faithfulness, robustness, and contrastiveness. We utilize this new metric to evaluate the performance of various attribution-based methods. Our code is available at: https://github.com/davor10105/relative-absolute-magnitude-propagation
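As background for readers unfamiliar with LRP (the paper's Relative Absolute Magnitude propagation modifies the standard rules, which the abstract does not detail), a minimal sketch of the classic LRP-ε backward rule for dense layers, applied to a toy randomly initialised two-layer network, is:

```python
import numpy as np

def lrp_epsilon(a, W, b, R_out, eps=1e-6):
    """One LRP-epsilon backward step through a dense layer.
    a: input activations (n_in,), W: weights (n_in, n_out), b: bias (n_out,),
    R_out: relevance arriving at the layer's outputs (n_out,)."""
    z = a @ W + b                              # forward pre-activations
    z = z + eps * np.where(z >= 0, 1.0, -1.0)  # stabiliser avoids division by ~0
    s = R_out / z                              # distribute relevance per output unit
    return a * (W @ s)                         # relevance credited to each input

# Tiny two-layer ReLU network with random weights (illustration only).
rng = np.random.default_rng(0)
x = rng.random(4)
W1, b1 = rng.normal(size=(4, 3)), np.zeros(3)
W2, b2 = rng.normal(size=(3, 2)), np.zeros(2)
h = np.maximum(0, x @ W1 + b1)
y = h @ W2 + b2

R = np.zeros_like(y); R[y.argmax()] = y.max()  # start from the winning logit
R = lrp_epsilon(h, W2, b2, R)                  # propagate to the hidden layer
R = lrp_epsilon(x, W1, b1, R)                  # propagate to the input features
print("input relevances:", R)
```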
{"title":"Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation","authors":"Davor Vukadin, Petar Afrić, Marin Šilić, Goran Delač","doi":"10.1145/3649458","DOIUrl":"https://doi.org/10.1145/3649458","url":null,"abstract":"<p>Recent advancement in deep-neural network performance led to the development of new state-of-the-art approaches in numerous areas. However, the black-box nature of neural networks often prohibits their use in areas where model explainability and model transparency are crucial. Over the years, researchers proposed many algorithms to aid neural network understanding and provide additional information to the human expert. One of the most popular methods being Layer-Wise Relevance Propagation (LRP). This method assigns local relevance based on the pixel-wise decomposition of nonlinear classifiers. With the rise of attribution method research, there has emerged a pressing need to assess and evaluate their performance. Numerous metrics have been proposed, each assessing an individual property of attribution methods such as faithfulness, robustness or localization. Unfortunately, no single metric is deemed optimal for every case, and researchers often use several metrics to test the quality of the attribution maps. In this work, we address the shortcomings of the current LRP formulations and introduce a novel method for determining the relevance of input neurons through layer-wise relevance propagation. Furthermore, we apply this approach to the recently developed Vision Transformer architecture and evaluate its performance against existing methods on two image classification datasets, namely ImageNet and PascalVOC. Our results clearly demonstrate the advantage of our proposed method. Furthermore, we discuss the insufficiencies of current evaluation metrics for attribution-based explainability and propose a new evaluation metric that combines the notions of faithfulness, robustness and contrastiveness. We utilize this new metric to evaluate the performance of various attribution-based methods. Our code is available at: https://github.com/davor10105/relative-absolute-magnitude-propagation\u0000</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"126 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139969692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xuri Ge, Joemon M. Jose, Songpei Xu, Xiao Liu, Hu Han
The Facial Action Coding System (FACS) encodes the action units (AUs) in facial images and has attracted extensive research attention due to its wide use in facial expression analysis. Many methods that perform well on automatic facial action unit (AU) detection primarily focus on modelling various AU relations between corresponding local muscle areas or on mining global attention-aware facial features; however, they neglect the dynamic interactions among local-global features. We argue that encoding AU features from just one perspective may not capture the rich contextual information between regional and global face features, nor the detailed variability across AUs, because of the diversity in expression and individual characteristics. In this paper, we propose a novel Multi-level Graph Relational Reasoning Network (termed MGRR-Net) for facial AU detection. Each layer of MGRR-Net performs multi-level (i.e., region-level, pixel-wise, and channel-wise) feature learning. On the one hand, region-level feature learning from local face patch features via a graph neural network encodes the correlation across different AUs. On the other hand, pixel-wise and channel-wise feature learning via graph attention networks (GAT) enhances the discrimination ability of AU features by adaptively recalibrating feature responses of pixels and channels from global face features. A hierarchical fusion strategy combines features from the three levels with gated fusion cells to improve AU discriminative ability. Extensive experiments on the DISFA and BP4D AU datasets show that the proposed approach outperforms state-of-the-art methods.
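The abstract does not give the exact form of the gated fusion cells; one common formulation, sketched here with hypothetical dimensions and softmax gates (not necessarily the cell used in MGRR-Net), weights the three feature levels with learned gates:

```python
import torch
import torch.nn as nn

class GatedFusion(nn.Module):
    """Fuse region-, pixel- and channel-level AU features with learned gates."""
    def __init__(self, dim):
        super().__init__()
        self.gate = nn.Linear(3 * dim, 3)  # one scalar gate per feature level

    def forward(self, f_region, f_pixel, f_channel):
        stacked = torch.stack([f_region, f_pixel, f_channel], dim=1)   # (B, 3, dim)
        gates = torch.softmax(self.gate(stacked.flatten(1)), dim=-1)   # (B, 3)
        return (gates.unsqueeze(-1) * stacked).sum(dim=1)              # (B, dim)

# Hypothetical feature vectors for a batch of 2 faces, 128-d per level.
fuse = GatedFusion(128)
f_r, f_p, f_c = (torch.randn(2, 128) for _ in range(3))
print(fuse(f_r, f_p, f_c).shape)  # torch.Size([2, 128])
```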
{"title":"MGRR-Net: Multi-level Graph Relational Reasoning Network for Facial Action Unit Detection","authors":"Xuri Ge, Joemon M. Jose, Songpei Xu, Xiao Liu, Hu Han","doi":"10.1145/3643863","DOIUrl":"https://doi.org/10.1145/3643863","url":null,"abstract":"<p>The Facial Action Coding System (FACS) encodes the action units (AUs) in facial images, which has attracted extensive research attention due to its wide use in facial expression analysis. Many methods that perform well on automatic facial action unit (AU) detection primarily focus on modelling various AU relations between corresponding local muscle areas or mining global attention-aware facial features; however, they neglect the dynamic interactions among local-global features. We argue that encoding AU features just from one perspective may not capture the rich contextual information between regional and global face features, as well as the detailed variability across AUs, because of the diversity in expression and individual characteristics. In this paper, we propose a novel Multi-level Graph Relational Reasoning Network (termed <i>MGRR-Net</i>) for facial AU detection. Each layer of MGRR-Net performs a multi-level (<i>i.e.</i>, region-level, pixel-wise and channel-wise level) feature learning. On the one hand, the region-level feature learning from the local face patch features via graph neural network can encode the correlation across different AUs. On the other hand, pixel-wise and channel-wise feature learning via graph attention networks (GAT) enhance the discrimination ability of AU features by adaptively recalibrating feature responses of pixels and channels from global face features. The hierarchical fusion strategy combines features from the three levels with gated fusion cells to improve AU discriminative ability. Extensive experiments on DISFA and BP4D AU datasets show that the proposed approach achieves superior performance than the state-of-the-art methods.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"24 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139762909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ming Li, Lin Li, Xiaohui Tao, Zhongwei Xie, Qing Xie, Jingling Yuan
Food computing, as a newly emerging topic, is closely linked to human life through computational methodologies. Meal recommendation, a food-related study concerning human health, aims to provide users with a meal whose courses are drawn from specific categories (e.g., appetizers, main dishes) and can be enjoyed as a service. Historical interaction data, as an important source of user information, is often used by existing models to learn user preferences. However, if a user's preferences favor less healthy meals, the model will follow that preference and make similar recommendations, potentially harming the user's long-term health. This emphasizes the need for health-oriented and responsible meal recommendation systems. In this paper, we propose a healthiness-aware and category-wise meal recommendation model called CateRec, which boosts healthiness exposure by using nutritional standards as knowledge to guide model training. Two fundamental questions are raised and answered: 1) How to evaluate the healthiness of meals? Two well-known nutritional standards, from the World Health Organisation and the United Kingdom Food Standards Agency, are used to calculate the healthiness score of a meal. 2) How to guide model training in a health-oriented way? We construct category-wise personalization partial rankings and category-wise healthiness partial rankings, and theoretically show that they satisfy the properties and assumptions required for training with the maximum posterior estimator under Bayesian probability. Data analysis confirms the existence of user preferences leaning towards less healthy meals in two public datasets. A comprehensive experiment demonstrates that CateRec effectively boosts healthiness exposure in terms of mean healthiness score and ranking exposure, while remaining comparable to the state-of-the-art model in terms of recommendation accuracy.
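The abstract does not reproduce the scoring formula; to illustrate how a nutritional standard can be turned into a meal-level healthiness score, the sketch below applies FSA-style "traffic light" banding, where both the per-100g thresholds and the point scheme are placeholders rather than the official values or CateRec's actual computation:

```python
# Illustrative per-100g thresholds in the spirit of the FSA "traffic light"
# scheme; the numbers below are placeholders, not the official values.
THRESHOLDS = {"fat": (3.0, 17.5), "saturates": (1.5, 5.0),
              "sugars": (5.0, 22.5), "salt": (0.3, 1.5)}

def traffic_light(nutrient, amount_per_100g):
    low, high = THRESHOLDS[nutrient]
    return "green" if amount_per_100g <= low else "red" if amount_per_100g > high else "amber"

def meal_healthiness(courses):
    """Average a simple per-course score: green=2, amber=1, red=0, rescaled to [0, 1]."""
    points = {"green": 2, "amber": 1, "red": 0}
    scores = []
    for course in courses:  # course: dict of nutrient -> grams per 100g
        lights = [traffic_light(n, v) for n, v in course.items()]
        scores.append(sum(points[l] for l in lights) / (2 * len(lights)))
    return sum(scores) / len(scores)

# Hypothetical three-course meal (appetizer, main, dessert).
meal = [{"fat": 2.0, "saturates": 0.5, "sugars": 4.0, "salt": 0.2},
        {"fat": 12.0, "saturates": 4.0, "sugars": 6.0, "salt": 1.0},
        {"fat": 20.0, "saturates": 9.0, "sugars": 30.0, "salt": 0.4}]
print(f"meal healthiness score: {meal_healthiness(meal):.2f}")
```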
{"title":"Boosting Healthiness Exposure in Category-constrained Meal Recommendation Using Nutritional Standards","authors":"Ming Li, Lin Li, Xiaohui Tao, Zhongwei Xie, Qing Xie, Jingling Yuan","doi":"10.1145/3643859","DOIUrl":"https://doi.org/10.1145/3643859","url":null,"abstract":"<p>Food computing, as a newly emerging topic, is closely linked to human life through computational methodologies. Meal recommendation, a food-related study about human health, aims to provide users a meal with courses constrained from specific categories (e.g., appetizers, main dishes) that can be enjoyed as a service. Historical interaction data, as important user information, is often used by existing models to learn user preferences. However, if user’s preferences favor less healthy meals, the model will follow that preference and make similar recommendations, potentially negatively impacting the user’s long-term health. This emphasizes the necessity for health-oriented and responsible meal recommendation systems. In this paper, we propose a healthiness-aware and category-wise meal recommendation model called CateRec, which boosts healthiness exposure by using nutritional standards as knowledge to guide the model training. Two fundamental questions are raised and answered: 1) How to <b>evaluate</b> the healthiness of meals? Two well-known nutritional standards from the World Health Organisation and the United Kingdom Food Standards Agency are used to calculate the healthiness score of the meal. 2) How to health-orientedly <b>guide</b> the model training? We construct category-wise personalization partial rankings and category-wise healthiness partial rankings, and theoretically analyze that they meet the necessary properties and assumptions required to be trained by the maximum posterior estimator under Bayesian probability. The data analysis confirms the existence of user preferences leaning towards less healthy meals in two public datasets. A comprehensive experiment demonstrates that our CateRec effectively boosts healthiness exposure in terms of mean healthiness score and ranking exposure, while being comparable to the state-of-the-art model in terms of recommendation accuracy.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"27 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139688816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nan Li, Bo Kang, Jefrey Lijffijt, Tijl De Bie
Recommendation in settings such as e-recruitment and online dating involves distributing limited opportunities, which differs from recommending practically unlimited goods such as in e-commerce or music recommendation. This setting calls for novel approaches to quantify and enforce fairness. Typical recommender systems recommend each user their top relevant items, such that desirable items may be recommended simultaneously to more and to less qualified individuals. This is arguably unfair to the latter: when they pursue such a desirable recommendation (e.g. by applying for a job), they are unlikely to be successful.
To quantify fairness in such settings, we introduce inferiority: a novel (un)fairness measure that quantifies the competitive disadvantage of a user for their recommended items. Inferiority is complementary to envy: a previously-proposed fairness notion that quantifies the extent to which a user prefers other users’ recommendations over their own. We propose to use both inferiority and envy in combination with an accuracy-related measure called utility: the aggregated relevancy scores of the recommended items. Unfortunately, none of these three measures are differentiable, making it hard to optimize them, and restricting their immediate use to evaluation only. To remedy this, we reformulate them in the context of a probabilistic interpretation of recommender systems, resulting in differentiable versions. We show how these loss functions can be combined in a multi-objective optimization problem that we call FEIR (Fairness through Envy and Inferiority Reduction), used as a post-processing of the scores from any standard recommender system.
Experiments on synthetic and real-world data show that the proposed approach effectively improves the trade-offs between inferiority, envy and utility, compared to naive recommendation and the state-of-the-art method for the related problem of congestion alleviation in job recommendation. We discuss and enhance the practical impact of our findings on a wide range of real-world recommendation scenarios, and we offer implementations of visualization tools to render the envy and inferiority metrics more accessible.
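The formal definitions of the three measures are given in the paper rather than the abstract; the sketch below implements plausible evaluation-only versions under simple assumptions (relevance scores double as qualification scores, fixed-size recommendation lists), purely to make the notions concrete:

```python
import numpy as np

def utility(scores, recs, u):
    """Aggregated relevance of user u for the items recommended to u."""
    return float(scores[u, recs[u]].sum())

def envy(scores, recs, u):
    """How much better, on average, other users' recommendation lists would
    score for user u than u's own list (clipped at zero)."""
    own = utility(scores, recs, u)
    gaps = [scores[u, recs[v]].sum() - own for v in range(len(recs)) if v != u]
    return float(np.mean(np.maximum(gaps, 0.0)))

def inferiority(scores, recs, u):
    """For each item recommended to u, count competitors who also received the
    item and look stronger for it (higher score), then average over u's items."""
    counts = []
    for item in recs[u]:
        rivals = [v for v in range(len(recs)) if v != u and item in recs[v]]
        counts.append(sum(scores[v, item] > scores[u, item] for v in rivals))
    return float(np.mean(counts))

# Toy example: 3 users, 4 jobs, top-2 recommendations per user (made-up scores).
scores = np.array([[0.9, 0.2, 0.4, 0.1],
                   [0.8, 0.7, 0.3, 0.2],
                   [0.6, 0.1, 0.9, 0.5]])
recs = [np.array([0, 2]), np.array([0, 1]), np.array([2, 3])]
for u in range(3):
    print(u, utility(scores, recs, u), envy(scores, recs, u), inferiority(scores, recs, u))
```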
{"title":"FEIR: Quantifying and Reducing Envy and Inferiority for Fair Recommendation of Limited Resources","authors":"Nan Li, Bo Kang, Jefrey Lijffijt, Tijl De Bie","doi":"10.1145/3643891","DOIUrl":"https://doi.org/10.1145/3643891","url":null,"abstract":"<p>Recommendation in settings such as e-recruitment and online dating involves distributing limited opportunities, which differs from recommending practically unlimited goods such as in e-commerce or music recommendation. This setting calls for novel approaches to quantify and enforce fairness. Indeed, typical recommender systems recommend each user their top relevant items, such that desirable items may be recommended simultaneously to more and to less qualified individuals. This is arguably unfair to the latter. Indeed, when they pursue such a desirable recommendation (e.g. by applying for a job), they are unlikely to be successful. </p><p>To quantify fairness in such settings, we introduce <i>inferiority</i>: a novel (un)fairness measure that quantifies the competitive disadvantage of a user for their recommended items. Inferiority is complementary to <i>envy</i>: a previously-proposed fairness notion that quantifies the extent to which a user prefers other users’ recommendations over their own. We propose to use both inferiority and envy in combination with an accuracy-related measure called <i>utility</i>: the aggregated relevancy scores of the recommended items. Unfortunately, none of these three measures are differentiable, making it hard to optimize them, and restricting their immediate use to evaluation only. To remedy this, we reformulate them in the context of a probabilistic interpretation of recommender systems, resulting in differentiable versions. We show how these loss functions can be combined in a multi-objective optimization problem that we call FEIR (Fairness through Envy and Inferiority Reduction), used as a post-processing of the scores from any standard recommender system. </p><p>Experiments on synthetic and real-world data show that the proposed approach effectively improves the trade-offs between inferiority, envy and utility, compared to the naive recommendation and the state of the art method for the related problem of congestion alleviation in job recommendation. We discuss and enhance the practical impact of our findings on a wide range of real-world recommendation scenarios, and we offer implementations of visualization tools to render the envy and inferiority metrics more accessible.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"61 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139668856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. M. Bhagya P. Samarakoon, M. A. Viraj J. Muthugala, Mohan Rajesh Elara
Reconfigurable robots are deployed for applications demanding area coverage, such as cleaning and inspection. Reconfiguration per context, going beyond a small set of predefined shapes, is crucial for area coverage performance. However, existing area coverage methods for reconfigurable robots are not always effective and require improvement to achieve the intended goal. Therefore, this paper proposes a novel coverage strategy based on internal rehearsals to improve the area coverage performance of a reconfigurable robot. In this regard, the robot is endowed with the cognitive ability to predict the outcomes of its actions before executing them. A genetic algorithm uses the results of the internal rehearsals to determine a set of the robot's coverage parameters, including positioning, heading, and reconfiguration, to maximize coverage in an obstacle cluster encountered by the robot. The experimental results confirm that the proposed method can significantly improve the area coverage performance of a reconfigurable robot.
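As an illustration of how internal rehearsals can drive the search, the sketch below runs a minimal genetic algorithm over position, heading, and a shape index against a stand-in rehearsal function (a toy coverage landscape, not the robot's actual predictor):

```python
import random

# Hypothetical internal-rehearsal stub: predicts area coverage for a candidate
# (x, y, heading, shape_index) without the robot physically executing it.
def rehearse_coverage(x, y, heading, shape):
    return -((x - 2.0) ** 2 + (y - 1.0) ** 2) - 0.1 * abs(heading) + shape  # toy landscape

def genetic_search(pop_size=30, generations=40, mutation=0.3):
    """Minimal GA over the robot's coverage parameters (illustrative only)."""
    rand = lambda: [random.uniform(0, 5), random.uniform(0, 5),
                    random.uniform(-180, 180), random.randint(0, 3)]
    population = [rand() for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=lambda c: rehearse_coverage(*c), reverse=True)
        parents = population[: pop_size // 2]          # keep the fittest half
        children = []
        while len(children) < pop_size - len(parents):
            a, b = random.sample(parents, 2)
            child = [random.choice(pair) for pair in zip(a, b)]   # uniform crossover
            if random.random() < mutation:                        # perturb one continuous gene
                i = random.randrange(3)
                child[i] += random.gauss(0, 0.5)
            children.append(child)
        population = parents + children
    return max(population, key=lambda c: rehearse_coverage(*c))

print("best coverage parameters:", genetic_search())
```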
{"title":"Internal Rehearsals for a Reconfigurable Robot to Improve Area Coverage Performance","authors":"S. M. Bhagya P. Samarakoon, M. A. Viraj J. Muthugala, Mohan Rajesh Elara","doi":"10.1145/3643854","DOIUrl":"https://doi.org/10.1145/3643854","url":null,"abstract":"<p>Reconfigurable robots are deployed for applications demanding area coverage, such as cleaning and inspections. Reconfiguration per context, considering beyond a small set of predefined shapes, is crucial for area coverage performance. However, the existing area coverage methods of reconfigurable robots are not always effective and require improvements for ascertaining the intended goal. Therefore, this paper proposes a novel coverage strategy based on internal rehearsals to improve the area coverage performance of a reconfigurable robot. In this regard, a reconfigurable robot is embodied with the cognitive ability to predict the outcomes of its actions before executing them. A genetic algorithm uses the results of the internal rehearsals to determine a set of the robot’s coverage parameters, including positioning, heading, and reconfiguration, to maximize coverage in an obstacle cluster encountered by the robot. The experimental results confirm that the proposed method can significantly improve the area coverage performance of a reconfigurable robot.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"11 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139668558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Sang Woo Kim
L2 regularization for weights in neural networks is widely used as a standard training trick. In addition to weights, the use of batch normalization involves an additional trainable parameter γ, which acts as a scaling factor. However, L2 regularization for γ remains an undiscussed mystery and is applied in different ways depending on the library and practitioner. In this paper, we study whether L2 regularization for γ is valid. To explore this issue, we consider two approaches: 1) variance control to make the residual network behave like an identity mapping and 2) stable optimization through improvement of the effective learning rate. Through these two analyses, we specify the desirable and undesirable γ for applying L2 regularization and propose four guidelines for managing them. In several experiments, we observed that applying L2 regularization to applicable γ increased classification accuracy by 1%–4%, whereas applying it to inapplicable γ decreased classification accuracy by 1%–3%, consistent with our four guidelines. The proposed guidelines were further validated across various tasks and architectures, including variants of residual networks and transformers.
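The four guidelines determine which γ should and should not be decayed; the mechanical part, placing batch-normalization γ parameters into separate optimizer parameter groups with different weight_decay values, can be sketched as follows (the per-layer decision rule here is a placeholder, and torchvision's ResNet-18 is used only to have a concrete residual network):

```python
import torch
import torch.nn as nn
import torchvision

model = torchvision.models.resnet18(num_classes=10)

decay_weights, decay_gammas, no_decay = [], [], []
for name, module in model.named_modules():
    if isinstance(module, nn.BatchNorm2d):
        # Placeholder rule for which gammas get L2; the paper's guidelines
        # (e.g. based on residual-block position) would replace this condition.
        (decay_gammas if "layer" in name else no_decay).append(module.weight)
        no_decay.append(module.bias)              # beta left undecayed here
    elif isinstance(module, (nn.Conv2d, nn.Linear)):
        decay_weights.append(module.weight)       # standard weight decay on weights
        if module.bias is not None:
            no_decay.append(module.bias)

optimizer = torch.optim.SGD(
    [{"params": decay_weights, "weight_decay": 1e-4},
     {"params": decay_gammas,  "weight_decay": 1e-4},   # L2 applied to these gammas
     {"params": no_decay,      "weight_decay": 0.0}],   # and skipped for the rest
    lr=0.1, momentum=0.9)
```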
{"title":"Guidelines for the Regularization of Gammas in Batch Normalization for Deep Residual Networks","authors":"Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Sang Woo Kim","doi":"10.1145/3643860","DOIUrl":"https://doi.org/10.1145/3643860","url":null,"abstract":"<p><i>L</i><sub>2</sub> regularization for weights in neural networks is widely used as a standard training trick. In addition to weights, the use of batch normalization involves an additional trainable parameter <i>γ</i>, which acts as a scaling factor. However, <i>L</i><sub>2</sub> regularization for <i>γ</i> remains an undiscussed mystery and is applied in different ways depending on the library and practitioner. In this paper, we study whether <i>L</i><sub>2</sub> regularization for <i>γ</i> is valid. To explore this issue, we consider two approaches: 1) variance control to make the residual network behave like an identity mapping and 2) stable optimization through the improvement of effective learning rate. Through two analyses, we specify the desirable and undesirable <i>γ</i> to apply <i>L</i><sub>2</sub> regularization and propose four guidelines for managing them. In several experiments, we observed that applying <i>L</i><sub>2</sub> regularization to applicable <i>γ</i> increased 1%–4% classification accuracy, whereas applying <i>L</i><sub>2</sub> regularization to inapplicable <i>γ</i> decreased 1%–3% classification accuracy, which is consistent with our four guidelines. Our proposed guidelines were further validated through various tasks and architectures, including variants of residual networks and transformers.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"20 5","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139656698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Paula G. Duran, Pere Gilabert, Santi Seguí, Jordi Vitrià
In today's digital landscape, recommender systems have become ubiquitous as a means of directing users towards personalized products, services, and content. However, despite their widespread adoption and a long track record of research, these systems are not immune to shortcomings. A significant challenge faced by recommender systems is the presence of biases, which produce various undesirable effects, most prominently popularity bias. This bias hampers the diversity of recommended items, thus restricting users' exposure to less popular or niche content. Furthermore, the issue is compounded when multiple stakeholders are considered, requiring the balance of multiple, potentially conflicting objectives.
In this paper, we present a new approach to address a wide range of undesired consequences in recommender systems that involve various stakeholders. Instead of adopting a consequentialist perspective that aims to mitigate the repercussions of a recommendation policy, we propose a deontological approach centered around a minimal set of ethical principles. More precisely, we introduce two distinct principles aimed at avoiding overconfidence in predictions and accurately modeling the genuine interests of users. The proposed approach circumvents the need to define a multi-objective system, which has been identified as one of the main limitations when developing complex recommenders. Through extensive experimentation, we show the efficacy of our approach in mitigating the adverse impact of the recommender from both user and item perspectives, ultimately enhancing various beyond-accuracy metrics. This study underscores the significance of responsible and equitable recommendations and proposes a strategy that can be easily deployed in real-world scenarios.
{"title":"Overcoming Diverse Undesired Effects in Recommender Systems: A Deontological Approach","authors":"Paula G. Duran, Pere Gilabert, Santi Seguí, Jordi Vitrià","doi":"10.1145/3643857","DOIUrl":"https://doi.org/10.1145/3643857","url":null,"abstract":"<p>In today’s digital landscape, recommender systems have gained ubiquity as a means of directing users towards personalized products, services, and content. However, despite their widespread adoption and a long track of research, these systems are not immune to shortcomings. A significant challenge faced by recommender systems is the presence of biases, which produces various undesirable effects, prominently the popularity bias. This bias hampers the diversity of recommended items, thus restricting users’ exposure to less popular or niche content. Furthermore, this issue is compounded when multiple stakeholders are considered, requiring the balance of multiple, potentially conflicting objectives. </p><p>In this paper, we present a new approach to address a wide range of undesired consequences in recommender systems that involve various stakeholders. Instead of adopting a consequentialist perspective that aims to mitigate the repercussions of a recommendation policy, we propose a deontological approach centered around a minimal set of ethical principles. More precisely, we introduce two distinct principles aimed at avoiding overconfidence in predictions and accurately modeling the genuine interests of users. The proposed approach circumvents the need for defining a multi-objective system, which has been identified as one of the main limitations when developing complex recommenders. Through extensive experimentation, we show the efficacy of our approach in mitigating the adverse impact of the recommender from both user and item perspectives, ultimately enhancing various beyond accuracy metrics. This study underscores the significance of responsible and equitable recommendations and proposes a strategy that can be easily deployed in real-world scenarios.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"291 2 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139656701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Simi Job, Xiaohui Tao, Lin Li, Haoran Xie, Taotao Cai, Jianming Yong, Qing Li
Personalized clinical decision support systems are increasingly being adopted due to the emergence of data-driven technologies, and this approach is now gaining recognition in critical care. Incorporating diverse patient conditions and treatment procedures into critical care decision-making can be challenging due to the heterogeneous nature of medical data. Advances in Artificial Intelligence (AI), particularly Reinforcement Learning (RL) techniques, enable the development of personalized treatment strategies for severe illnesses by using a learning agent to recommend optimal policies. In this study, we propose a Deep Reinforcement Learning (DRL) model with a tailored reward function and an LSTM-GRU-derived state representation to formulate optimal treatment policies for vasopressor administration in stabilizing patient physiological states in critical care settings. Using an ICU dataset and the Medical Information Mart for Intensive Care (MIMIC-III) dataset, we focus on patients with Acute Respiratory Distress Syndrome (ARDS) that has led to sepsis, to derive optimal policies that prioritize patient recovery over patient survival. Both the DDQN (RepDRL-DDQN) and Dueling DDQN (RepDRL-DDDQN) versions of the DRL model surpass the baseline performance, with the proposed model's learning agent achieving an optimal learning process across our performance-measuring schemes. The robust state representation serves as the foundation for enhancing the model's performance, ultimately providing an optimal treatment policy focused on rapid patient recovery.
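The abstract names DDQN and Dueling DDQN agents; as background, here is a minimal sketch of the standard Double DQN target computation (generic formulation, not the paper's tailored reward or LSTM-GRU state representation):

```python
import torch

def double_dqn_targets(online_q, target_q, rewards, next_states, dones, gamma=0.99):
    """Standard Double DQN target: the online network picks the next action,
    the target network evaluates it. Shapes: rewards/dones (B,), next_states (B, ...)."""
    with torch.no_grad():
        next_actions = online_q(next_states).argmax(dim=1, keepdim=True)     # (B, 1)
        next_values = target_q(next_states).gather(1, next_actions).squeeze(1)
        return rewards + gamma * (1.0 - dones) * next_values

# Hypothetical usage with any torch.nn.Module mapping states to per-action Q-values:
# targets = double_dqn_targets(online_net, target_net, r, s_next, done_flags)
# loss = torch.nn.functional.smooth_l1_loss(online_net(s).gather(1, a), targets.unsqueeze(1))
```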
{"title":"Optimal Treatment Strategies for Critical Patients with Deep Reinforcement Learning","authors":"Simi Job, Xiaohui Tao, Lin Li, Haoran Xie, Taotao Cai, Jianming Yong, Qing Li","doi":"10.1145/3643856","DOIUrl":"https://doi.org/10.1145/3643856","url":null,"abstract":"<p>Personalized clinical decision support systems are increasingly being adopted due to the emergence of data-driven technologies, with this approach now gaining recognition in critical care. The task of incorporating diverse patient conditions and treatment procedures into critical care decision-making can be challenging due to the heterogeneous nature of medical data. Advances in Artificial Intelligence (AI), particularly Reinforcement Learning (RL) techniques, enables the development of personalized treatment strategies for severe illnesses by using a learning agent to recommend optimal policies. In this study, we propose a Deep Reinforcement Learning (DRL) model with a tailored reward function and an LSTM-GRU-derived state representation to formulate optimal treatment policies for vasopressor administration in stabilizing patient physiological states in critical care settings. Using an ICU dataset and the Medical Information Mart for Intensive Care (MIMIC-III) dataset, we focus on patients with Acute Respiratory Distress Syndrome (ARDS) that has led to Sepsis, to derive optimal policies that can prioritize patient recovery over patient survival. Both the DDQN (<i>RepDRL-DDQN</i>) and Dueling DDQN (<i>RepDRL-DDDQN</i>) versions of the DRL model surpass the baseline performance, with the proposed model’s learning agent achieving an optimal learning process across our performance measuring schemes. The robust state representation served as the foundation for enhancing the model’s performance, ultimately providing an optimal treatment policy focused on rapid patient recovery.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"24 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139656702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mai Hao, Ming Cai, Minghui Fang, Linlin You
Domain knowledge is gradually taking on distinct autonomous characteristics, propelled by the shift of modern transportation systems (TS) towards autonomous TS (ATS), which comprise three progressive generations. A knowledge graph (KG) and its successive versions can help depict the evolving TS. Given that KG versions exhibit asymmetry, primarily due to variations in evolved knowledge, it is imperative to harmonize the evolved knowledge embodied by the same entity across disparate KG versions. Hence, this paper proposes a siamese-based graph convolutional network (GCN) model, namely SiG, to address unresolved issues of low accuracy, efficiency, and effectiveness in aligning asymmetric KGs. SiG can optimize entity alignment in ATS and support the analysis of future-stage ATS development. This goal is attained through: a) generating unified KGs to enhance data quality, b) defining a graph split to facilitate entire-graph computation, c) enhancing the GCN to extract intrinsic features, and d) designing a siamese network to train asymmetric KGs. The evaluation results suggest that SiG surpasses other commonly employed models, with average improvements of 23.90% and 37.89% in accuracy and efficiency, respectively. These findings have significant implications for TS evolution analysis and offer a novel perspective for research on complex systems limited by continuously updated knowledge.
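The abstract describes a siamese GCN trained on asymmetric KG versions but not its loss; the sketch below shows a generic shared-weight GCN layer with a margin-based alignment loss over toy adjacency matrices and features, as one plausible reading of the siamese setup:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SharedGCN(nn.Module):
    """One GCN layer with weights shared across both KG versions (siamese setup)."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.lin = nn.Linear(in_dim, out_dim, bias=False)

    def forward(self, adj_norm, features):
        return F.relu(self.lin(adj_norm @ features))   # H' = ReLU(Â H W)

def alignment_loss(emb_a, emb_b, pairs, margin=0.5):
    """Pull corresponding entities together, push shuffled (negative) pairs apart."""
    pos = 1 - F.cosine_similarity(emb_a[pairs[:, 0]], emb_b[pairs[:, 1]])
    neg = 1 - F.cosine_similarity(emb_a[pairs[:, 0]], emb_b[pairs[:, 1].roll(1)])
    return (pos + F.relu(margin - neg)).mean()

# Toy KG versions: 5 entities, identity stand-ins for normalised adjacency, random features.
n, d = 5, 16
gcn = SharedGCN(d, 8)
adj_a, adj_b = torch.eye(n), torch.eye(n)
x_a, x_b = torch.randn(n, d), torch.randn(n, d)
pairs = torch.arange(n).repeat(2, 1).T          # entity i in version A aligns with i in B
loss = alignment_loss(gcn(adj_a, x_a), gcn(adj_b, x_b), pairs)
loss.backward()
```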
{"title":"SiG: A Siamese-based Graph Convolutional Network to Align Knowledge in Autonomous Transportation Systems","authors":"Mai Hao, Ming Cai, Minghui Fang, Linlin You","doi":"10.1145/3643861","DOIUrl":"https://doi.org/10.1145/3643861","url":null,"abstract":"<p>Domain knowledge is gradually renovating its attributes to exhibit distinct features in autonomy, propelled by the shift of modern transportation systems (TS) towards autonomous TS (ATS) comprising three progressive generations. Knowledge graph (KG) and its corresponding versions can help depict the evolving TS. Given that KG versions exhibit asymmetry primarily due to variations in evolved knowledge, it is imperative to harmonize the evolved knowledge embodied by the entity across disparate KG versions. Hence, this paper proposes a siamese-based graph convolutional network (GCN) model, namely SiG, to address unresolved issues of low accuracy, efficiency, and effectiveness in aligning asymmetric KGs. SiG can optimize entity alignment in ATS and support the analysis of future-stage ATS development. Such a goal is attained through: a) generating unified KGs to enhance data quality, b) defining graph split to facilitate entire-graph computation, c) enhancing GCN to extract intrinsic features, and d) designing siamese network to train asymmetric KGs. The evaluation results suggest that SiG surpasses other commonly employed models, resulting in average improvements of 23.90% and 37.89% in accuracy and efficiency, respectively. These findings have significant implications for TS evolution analysis and offer a novel perspective for research on complex systems limited by continuously updated knowledge.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":"51 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139656696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}