首页 > 最新文献

AI Communications最新文献

英文 中文
Multi-feature fusion dehazing based on CycleGAN 基于 CycleGAN 的多特征融合去毛刺技术
IF 0.8 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-29 DOI: 10.3233/aic-230227
Jingpin Wang, Yuan Ge, Jie Zhao, Chao Han
Under the foggy environment, lane line images are obscured by haze, which leads to lower detection accuracy, higher false detection of lane lines. To address the above problems, a multi-layer feature fusion dehazing network based on CycleGAN architecture is proposed. Firstly, the foggy image is enhanced to remove the fog in the image, and then the lane line detection network is used for detection. For the dehazing network, a multi-layer feature fusion module is used in the generator to fuse the features of different coding layers of U-Net to enhance the network’s recovery of information such as details and edges, and a frequency domain channel attention mechanism is added at the key nodes of the network to enhance the network’s attention to different fog concentrations. At the same time, to improve the discriminant effect of the discriminator, the discriminator is extended to a global and local discriminator. The experimental results show that the dehaze effect on Reside and other test data sets is better than the comparison method. The peak signal-to-noise ratio is improved by 2.26 dB compared to the highest GCA-Net algorithm. According to the lane detection of fog images, it is found that the proposed network improves the accuracy of lane detection on foggy days.
在多雾环境下,车道线图像会被雾霾遮挡,导致检测精度降低,车道线误检率升高。针对上述问题,提出了一种基于 CycleGAN 架构的多层特征融合去毛刺网络。首先,对雾气图像进行增强以去除图像中的雾气,然后使用车道线检测网络进行检测。对于去雾网络,在生成器中使用了多层特征融合模块,将 U-Net 不同编码层的特征进行融合,以增强网络对细节和边缘等信息的恢复能力,并在网络的关键节点加入了频域信道关注机制,以增强网络对不同雾气浓度的关注。同时,为了提高判别器的判别效果,将判别器扩展为全局判别器和局部判别器。实验结果表明,在 Reside 和其他测试数据集上的去雾效果优于对比方法。与最高的 GCA-Net 算法相比,峰值信噪比提高了 2.26 dB。根据雾图像的车道检测,发现所提出的网络提高了雾天车道检测的准确性。
{"title":"Multi-feature fusion dehazing based on CycleGAN","authors":"Jingpin Wang, Yuan Ge, Jie Zhao, Chao Han","doi":"10.3233/aic-230227","DOIUrl":"https://doi.org/10.3233/aic-230227","url":null,"abstract":"Under the foggy environment, lane line images are obscured by haze, which leads to lower detection accuracy, higher false detection of lane lines. To address the above problems, a multi-layer feature fusion dehazing network based on CycleGAN architecture is proposed. Firstly, the foggy image is enhanced to remove the fog in the image, and then the lane line detection network is used for detection. For the dehazing network, a multi-layer feature fusion module is used in the generator to fuse the features of different coding layers of U-Net to enhance the network’s recovery of information such as details and edges, and a frequency domain channel attention mechanism is added at the key nodes of the network to enhance the network’s attention to different fog concentrations. At the same time, to improve the discriminant effect of the discriminator, the discriminator is extended to a global and local discriminator. The experimental results show that the dehaze effect on Reside and other test data sets is better than the comparison method. The peak signal-to-noise ratio is improved by 2.26 dB compared to the highest GCA-Net algorithm. According to the lane detection of fog images, it is found that the proposed network improves the accuracy of lane detection on foggy days.","PeriodicalId":50835,"journal":{"name":"AI Communications","volume":"73 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141507183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatio-temporal deep learning framework for pedestrian intention prediction in urban traffic scenes 用于预测城市交通场景中行人意图的时空深度学习框架
IF 0.8 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-28 DOI: 10.3233/aic-230053
Monika , Pardeep Singh, Satish Chand
Pedestrian intent prediction is an essential task for ensuring the safety of pedestrians and vehicles on the road. This task involves predicting whether a pedestrian intends to cross a road or not based on their behavior and surrounding environment. Previous studies have explored feature-based machine learning and vision-based deep learning models for this task but these methods have limitations in capturing the global spatio-temporal context and fusing different features of data effectively. To address these issues, we propose a novel hybrid framework HSTGCN for pedestrian intent prediction that combines spatio-temporal graph convolutional neural networks (STGCN) and long short-term memory (LSTM) networks. The proposed framework utilizes the strengths of both models by fusing multiple features, including skeleton pose, trajectory, height, orientation, and ego-vehicle speed, to predict their intentions accurately. The framework’s performance have been evaluated on the JAAD benchmark dataset and the results show that it outperforms the state-of-the-art methods. The proposed framework has potential applications in developing intelligent transportation systems, autonomous vehicles, and pedestrian safety technologies. The utilization of multiple features can significantly improve the performance of the pedestrian intent prediction task.
行人意图预测是确保道路上行人和车辆安全的一项重要任务。这项任务包括根据行人的行为和周围环境预测行人是否打算过马路。以往的研究探索了基于特征的机器学习和基于视觉的深度学习模型来完成这项任务,但这些方法在捕捉全局时空背景和有效融合数据的不同特征方面存在局限性。为了解决这些问题,我们提出了一种用于行人意图预测的新型混合框架 HSTGCN,它结合了时空图卷积神经网络(STGCN)和长短期记忆(LSTM)网络。所提出的框架通过融合骨架姿势、轨迹、高度、方向和自我车辆速度等多种特征,利用了这两种模型的优势,从而准确预测行人的意图。在 JAAD 基准数据集上对该框架的性能进行了评估,结果表明它优于最先进的方法。所提出的框架在开发智能交通系统、自动驾驶汽车和行人安全技术方面具有潜在的应用价值。利用多种特征可以显著提高行人意图预测任务的性能。
{"title":"Spatio-temporal deep learning framework for pedestrian intention prediction in urban traffic scenes","authors":"Monika , Pardeep Singh, Satish Chand","doi":"10.3233/aic-230053","DOIUrl":"https://doi.org/10.3233/aic-230053","url":null,"abstract":"Pedestrian intent prediction is an essential task for ensuring the safety of pedestrians and vehicles on the road. This task involves predicting whether a pedestrian intends to cross a road or not based on their behavior and surrounding environment. Previous studies have explored feature-based machine learning and vision-based deep learning models for this task but these methods have limitations in capturing the global spatio-temporal context and fusing different features of data effectively. To address these issues, we propose a novel hybrid framework HSTGCN for pedestrian intent prediction that combines spatio-temporal graph convolutional neural networks (STGCN) and long short-term memory (LSTM) networks. The proposed framework utilizes the strengths of both models by fusing multiple features, including skeleton pose, trajectory, height, orientation, and ego-vehicle speed, to predict their intentions accurately. The framework’s performance have been evaluated on the JAAD benchmark dataset and the results show that it outperforms the state-of-the-art methods. The proposed framework has potential applications in developing intelligent transportation systems, autonomous vehicles, and pedestrian safety technologies. The utilization of multiple features can significantly improve the performance of the pedestrian intent prediction task.","PeriodicalId":50835,"journal":{"name":"AI Communications","volume":"32 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141507158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Open-world object detection: A solution based on reselection mechanism and feature disentanglement 开放世界物体检测:基于重新选择机制和特征分解的解决方案
IF 0.8 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-25 DOI: 10.3233/aic-230270
Tian Lin, Li Hua, Li Linxuan, Bai Chuanao
Traditional object detection algorithms operate within a closed set, where the training data may not cover all real-world objects. Therefore, the issue of open-world object detection has attracted significant attention. Open-world object detection faces two major challenges: “neglecting unknown objects” and “misclassifying unknown objects as known ones.” In our study, we address these challenges by utilizing the Region Proposal Network (RPN) outputs to identify potential unknown objects with high object scores that do not overlap with ground truth annotations. We introduce the reselection mechanism, which separates unknown objects from the background. Subsequently, we employ the simulated annealing algorithm to disentangle features of unknown and known classes, guiding the detector’s learning process. Our method has improved on multiple evaluation metrics such as U-mAP, U-recall, and UDP, greatly alleviating the challenges faced by open world object detection.
传统的物体检测算法是在一个封闭的集合中运行的,训练数据可能无法涵盖现实世界中的所有物体。因此,开放世界物体检测问题引起了广泛关注。开放世界物体检测面临两大挑战:"忽略未知物体 "和 "将未知物体误分类为已知物体"。在我们的研究中,我们利用区域建议网络(RPN)的输出来识别潜在的未知物体,这些潜在的未知物体具有较高的物体分数,与地面实况注释不重叠,从而解决了这些难题。我们引入了重新选择机制,将未知物体从背景中分离出来。随后,我们采用模拟退火算法来分离未知类和已知类的特征,从而指导检测器的学习过程。我们的方法在 U-mAP、U-recall 和 UDP 等多个评估指标上都有所改进,大大缓解了开放世界物体检测所面临的挑战。
{"title":"Open-world object detection: A solution based on reselection mechanism and feature disentanglement","authors":"Tian Lin, Li Hua, Li Linxuan, Bai Chuanao","doi":"10.3233/aic-230270","DOIUrl":"https://doi.org/10.3233/aic-230270","url":null,"abstract":"Traditional object detection algorithms operate within a closed set, where the training data may not cover all real-world objects. Therefore, the issue of open-world object detection has attracted significant attention. Open-world object detection faces two major challenges: “neglecting unknown objects” and “misclassifying unknown objects as known ones.” In our study, we address these challenges by utilizing the Region Proposal Network (RPN) outputs to identify potential unknown objects with high object scores that do not overlap with ground truth annotations. We introduce the reselection mechanism, which separates unknown objects from the background. Subsequently, we employ the simulated annealing algorithm to disentangle features of unknown and known classes, guiding the detector’s learning process. Our method has improved on multiple evaluation metrics such as U-mAP, U-recall, and UDP, greatly alleviating the challenges faced by open world object detection.","PeriodicalId":50835,"journal":{"name":"AI Communications","volume":"17 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141507128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Token-modification adversarial attacks for natural language processing: A survey 自然语言处理中的标记修改对抗攻击:调查
IF 0.8 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-02 DOI: 10.3233/aic-230279
Tom Roth, Yansong Gao, Alsharif Abuadbba, Surya Nepal, Wei Liu
Many adversarial attacks target natural language processing systems, most of which succeed through modifying the individual tokens of a document. Despite the apparent uniqueness of each of these attacks, fundamentally they are simply a distinct configuration of four components: a goal function, allowable transformations, a search method, and constraints. In this survey, we systematically present the different components used throughout the literature, using an attack-independent framework which allows for easy comparison and categorisation of components. Our work aims to serve as a comprehensive guide for newcomers to the field and to spark targeted research into refining the individual attack components.
许多对抗性攻击都以自然语言处理系统为目标,其中大多数攻击都是通过修改文档中的单个标记而得逞的。尽管这些攻击表面上各具特色,但从根本上说,它们不过是由目标函数、允许的转换、搜索方法和限制条件这四个部分组成的独特配置。在本调查报告中,我们系统地介绍了文献中使用的不同组件,并使用了与攻击无关的框架,以便于对组件进行比较和分类。我们的工作旨在为该领域的新手提供全面的指导,并激发有针对性的研究,以完善各个攻击组件。
{"title":"Token-modification adversarial attacks for natural language processing: A survey","authors":"Tom Roth, Yansong Gao, Alsharif Abuadbba, Surya Nepal, Wei Liu","doi":"10.3233/aic-230279","DOIUrl":"https://doi.org/10.3233/aic-230279","url":null,"abstract":"Many adversarial attacks target natural language processing systems, most of which succeed through modifying the individual tokens of a document. Despite the apparent uniqueness of each of these attacks, fundamentally they are simply a distinct configuration of four components: a goal function, allowable transformations, a search method, and constraints. In this survey, we systematically present the different components used throughout the literature, using an attack-independent framework which allows for easy comparison and categorisation of components. Our work aims to serve as a comprehensive guide for newcomers to the field and to spark targeted research into refining the individual attack components.","PeriodicalId":50835,"journal":{"name":"AI Communications","volume":"17 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141507129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MantaRay-ProM: An efficient process model discovery algorithm MantaRay-ProM:一种高效的流程模型发现算法
IF 0.8 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-02 DOI: 10.3233/aic-220219
Shikha Gupta, Sonia Deshmukh, Naveen Kumar
Discovering the business process model from an organisation’s records of its operational processes is an active area of research in process mining. The discovered model may be used either during a new system rollout or to improve an existing system. In this paper, we present a process model discovery approach based on the recently proposed bio-inspired Manta Ray Foraging Optimization algorithm (MRFO). Since MRFO is designed to solve real-valued optimization problems, we adapted a binary version of MRFO to suit the domain of process mining. The proposed approach is compared with state-of-the-art process discovery algorithms on several synthetic and real-life event logs. The results show that compared to other algorithms, the proposed approach exhibits faster convergence and yields superior quality process models.
从企业的业务流程记录中发现业务流程模型是流程挖掘的一个活跃研究领域。发现的模型既可用于新系统的推广,也可用于改进现有系统。在本文中,我们介绍了一种基于最近提出的生物启发蝠鲼觅食优化算法(MRFO)的流程模型发现方法。由于 MRFO 是为解决实值优化问题而设计的,我们对二进制版本的 MRFO 进行了调整,以适应流程挖掘领域。我们在多个合成和真实事件日志上将所提出的方法与最先进的流程发现算法进行了比较。结果表明,与其他算法相比,所提出的方法收敛速度更快,所生成的流程模型质量更高。
{"title":"MantaRay-ProM: An efficient process model discovery algorithm","authors":"Shikha Gupta, Sonia Deshmukh, Naveen Kumar","doi":"10.3233/aic-220219","DOIUrl":"https://doi.org/10.3233/aic-220219","url":null,"abstract":"Discovering the business process model from an organisation’s records of its operational processes is an active area of research in process mining. The discovered model may be used either during a new system rollout or to improve an existing system. In this paper, we present a process model discovery approach based on the recently proposed bio-inspired Manta Ray Foraging Optimization algorithm (MRFO). Since MRFO is designed to solve real-valued optimization problems, we adapted a binary version of MRFO to suit the domain of process mining. The proposed approach is compared with state-of-the-art process discovery algorithms on several synthetic and real-life event logs. The results show that compared to other algorithms, the proposed approach exhibits faster convergence and yields superior quality process models.","PeriodicalId":50835,"journal":{"name":"AI Communications","volume":"41 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140799793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Information extraction tool text2alm: From narratives to action language system descriptions and query answering 信息提取工具 text2alm:从叙述到行动语言系统描述和查询回答
IF 0.8 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-03-21 DOI: 10.3233/aic-220194
Yuliya Lierler, Gang Ling, Craig Olson
In this work we design an information extraction tool text2alm capable of narrative understanding with a focus on action verbs. This tool uses an action language ALM to perform inferences on complex interactions of events described in narratives. The methodology used to implement the text2alm system was originally outlined by Lierler, Inclezan, and Gelfond (In IWCS 2017 – 12th International Conference on Computational Semantics – Short Papers (2017)) via a manual process of converting a narrative to an ALM model. We refine that theoretical methodology and utilize it in design of the text2alm system. This system relies on a conglomeration of resources and techniques from two distinct fields of artificial intelligence, namely, (i) knowledge representation and reasoning and (ii) natural language processing. The effectiveness of system text2alm is measured by its ability to correctly answer questions from the bAbI tasks published by Facebook Research in 2015. This tool matched or exceeded the performance of state-of-the-art machine learning methods in six of the seven tested tasks. We also illustrate that the text2alm approach generalizes to a broader spectrum of narratives. On the path to creating system text2alm, a semantic role labeler text2drs was designed. Its unique feature is the use of the elements of the fine grained linguistic ontology VerbNet as semantic roles/labels in annotating considered text. This paper provides an accurate account on the details behind the text2alm and text2drs systems.
在这项工作中,我们设计了一种信息提取工具 text2alm,它能够理解叙事,重点是动作动词。该工具使用动作语言 ALM 对叙述中描述的事件的复杂互动进行推理。用于实现 text2alm 系统的方法最初由 Lierler、Inclezan 和 Gelfond(In IWCS 2017 - 12th International Conference on Computational Semantics - Short Papers (2017))通过手动将叙述转换为 ALM 模型的过程概述。我们完善了这一理论方法,并将其用于 text2alm 系统的设计。该系统依赖于人工智能两个不同领域的资源和技术,即(i)知识表示和推理以及(ii)自然语言处理。系统 text2alm 的有效性是通过其正确回答 2015 年 Facebook Research 发布的 bAbI 任务中的问题的能力来衡量的。在七项测试任务中,该工具在六项任务中的表现与最先进的机器学习方法不相上下,甚至更胜一筹。我们还说明,text2alm 方法适用于更广泛的叙述。在创建 text2alm 系统的过程中,我们设计了一个语义角色标签器 text2drs。它的独特之处在于使用细粒度语言本体 VerbNet 的元素作为语义角色/标签来注释所考虑的文本。本文将准确介绍 text2alm 和 text2drs 系统背后的细节。
{"title":"Information extraction tool text2alm: From narratives to action language system descriptions and query answering","authors":"Yuliya Lierler, Gang Ling, Craig Olson","doi":"10.3233/aic-220194","DOIUrl":"https://doi.org/10.3233/aic-220194","url":null,"abstract":"In this work we design an information extraction tool text2alm capable of narrative understanding with a focus on action verbs. This tool uses an action language ALM to perform inferences on complex interactions of events described in narratives. The methodology used to implement the text2alm system was originally outlined by Lierler, Inclezan, and Gelfond (In IWCS 2017 – 12th International Conference on Computational Semantics – Short Papers (2017)) via a manual process of converting a narrative to an ALM model. We refine that theoretical methodology and utilize it in design of the text2alm system. This system relies on a conglomeration of resources and techniques from two distinct fields of artificial intelligence, namely, (i) knowledge representation and reasoning and (ii) natural language processing. The effectiveness of system text2alm is measured by its ability to correctly answer questions from the bAbI tasks published by Facebook Research in 2015. This tool matched or exceeded the performance of state-of-the-art machine learning methods in six of the seven tested tasks. We also illustrate that the text2alm approach generalizes to a broader spectrum of narratives. On the path to creating system text2alm, a semantic role labeler text2drs was designed. Its unique feature is the use of the elements of the fine grained linguistic ontology VerbNet as semantic roles/labels in annotating considered text. This paper provides an accurate account on the details behind the text2alm and text2drs systems.","PeriodicalId":50835,"journal":{"name":"AI Communications","volume":"23 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140196376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IMATSA – an improved and adaptive intelligent optimization algorithm based on tunicate swarm algorithm IMATSA - 基于调谐群算法的改进型自适应智能优化算法
IF 0.8 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-03-21 DOI: 10.3233/aic-220093
Yan Chen, Weizhen Dong, Xiaochun Hu
Swarm intelligence optimization algorithm has been proved to perform well in the field of parameter optimization. In order to further improve the performance of intelligent optimization algorithm, this paper proposes an improved and adaptive tunicate swarm algorithm (IMATSA) based on tunicate swarmalgorithm (TSA). IMATSA improves TSA in the following four aspects: population diversity, local search convergence speed, jumping out of local optimal position, and balancing global and local search. Firstly, IMATSA adopts Tent map and quadratic interpolation to initialize population and enhance the diversity. Secondly, IMATSA uses Golden-Sine algorithm to accelerate the convergence of local search. Thirdly, in the process of global development, IMATSA adopts Levy flight and the improved Gauss disturbance method to adaptively improves and coordinates the ability of global development and local search. Then, this paper verifies the performance of IMATSA based on 14 benchmark functions experiment, ablation experiment, parameter optimization experiments of Support Vector Machine (SVM) and Gradient Boosting Decision Tree (GBDT), Wilcoxon signed rank test and image multi-threshold segmentation experiment with the performance metrics are convergence speed, convergence value, significance level P-value, Peak Signal-to-Noise Ratio (PSNR) and Standard Deviation (STD). Experimental results show that IMATSA performs better in three kinds of benchmark functions; each component of IMATSA has a positive effect on the performance; IMATSA performs better in parameter optimization experiments of SVM experiment and GBDT; there is significant difference between IMATSA and other algorithms by Wilcoxon signed rank test; in image segmentation, the performance is directly proportional to the number of thresholds, and compared with other algorithms, IMATSA has better comprehensive performance.
事实证明,群智能优化算法在参数优化领域具有良好的性能。为了进一步提高智能优化算法的性能,本文在调谐蜂群算法(TSA)的基础上提出了一种改进的自适应调谐蜂群算法(IMATSA)。IMATSA 从种群多样性、局部搜索收敛速度、跳出局部最优位置、平衡全局搜索和局部搜索四个方面对 TSA 进行了改进。首先,IMATSA 采用 Tent map 和二次插值来初始化种群,提高了种群多样性。其次,IMATSA 采用 Golden-Sine 算法加快局部搜索的收敛速度。第三,在全局发展过程中,IMATSA 采用 Levy flight 和改进的高斯干扰法,自适应地提高和协调全局发展和局部搜索的能力。然后,本文基于14个基准函数实验、消融实验、支持向量机(SVM)和梯度提升决策树(GBDT)参数优化实验、Wilcoxon符号秩检验和图像多阈值分割实验验证了IMATSA的性能,性能指标包括收敛速度、收敛值、显著性水平P值、峰值信噪比(PSNR)和标准偏差(STD)。实验结果表明:IMATSA 在三种基准函数中表现较好;IMATSA 的每个组成部分都对性能有积极影响;IMATSA 在 SVM 实验和 GBDT 的参数优化实验中表现较好;通过 Wilcoxon 符号秩检验,IMATSA 与其他算法有显著差异;在图像分割中,性能与阈值数成正比,与其他算法相比,IMATSA 具有更好的综合性能。
{"title":"IMATSA – an improved and adaptive intelligent optimization algorithm based on tunicate swarm algorithm","authors":"Yan Chen, Weizhen Dong, Xiaochun Hu","doi":"10.3233/aic-220093","DOIUrl":"https://doi.org/10.3233/aic-220093","url":null,"abstract":"Swarm intelligence optimization algorithm has been proved to perform well in the field of parameter optimization. In order to further improve the performance of intelligent optimization algorithm, this paper proposes an improved and adaptive tunicate swarm algorithm (IMATSA) based on tunicate swarmalgorithm (TSA). IMATSA improves TSA in the following four aspects: population diversity, local search convergence speed, jumping out of local optimal position, and balancing global and local search. Firstly, IMATSA adopts Tent map and quadratic interpolation to initialize population and enhance the diversity. Secondly, IMATSA uses Golden-Sine algorithm to accelerate the convergence of local search. Thirdly, in the process of global development, IMATSA adopts Levy flight and the improved Gauss disturbance method to adaptively improves and coordinates the ability of global development and local search. Then, this paper verifies the performance of IMATSA based on 14 benchmark functions experiment, ablation experiment, parameter optimization experiments of Support Vector Machine (SVM) and Gradient Boosting Decision Tree (GBDT), Wilcoxon signed rank test and image multi-threshold segmentation experiment with the performance metrics are convergence speed, convergence value, significance level P-value, Peak Signal-to-Noise Ratio (PSNR) and Standard Deviation (STD). Experimental results show that IMATSA performs better in three kinds of benchmark functions; each component of IMATSA has a positive effect on the performance; IMATSA performs better in parameter optimization experiments of SVM experiment and GBDT; there is significant difference between IMATSA and other algorithms by Wilcoxon signed rank test; in image segmentation, the performance is directly proportional to the number of thresholds, and compared with other algorithms, IMATSA has better comprehensive performance.","PeriodicalId":50835,"journal":{"name":"AI Communications","volume":"131 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140196416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Temporally-aware node embeddings for evolving networks topologies 针对不断演变的网络拓扑结构的时间感知节点嵌入
IF 0.8 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-01-02 DOI: 10.3233/aic-230028
K. B. Enes, Matheus Nunes, Fabricio Murai, Gisele L. Pappa
Static node embedding algorithms applied to snapshots of real-world applications graphs are unable to capture their evolving process. As a result, the absence of information about the dynamics in these node representations can harm the accuracy and increase processing time of machine learning tasks related to these applications. Aiming at fill the gap regarding the inability of static methods to capture evolving processes on dynamic networks, we propose a biased random walk method named Evolving Node Embedding (EVNE). EVNE leverages the sequential relationship of graph snapshots by incorporating historic information when generating embeddings for the next snapshot. It learns node representations through a neural network, but differs from existing methods as it: (i) incorporates previously run walks at each step; (ii) starts the optimization of the current embedding from the parameters obtained in the previous iteration; and (iii) uses two time-varying parameters to regulate the behavior of the biased random walks over the process of graph exploration. Through a wide set of experiments we show that our approach generates better embeddings, outperforming baselines by up to 20% in a downstream node classification task. EVNE’s embeddings achieve better performance than others, based on experiments with four classifiers and five datasets. In addition, we present seven variations of our model to show the impact of each of EVNE’s mechanisms.
应用于现实世界应用图快照的静态节点嵌入算法无法捕捉其演变过程。因此,这些节点表示法中动态信息的缺失会损害与这些应用相关的机器学习任务的准确性,并增加处理时间。为了填补静态方法无法捕捉动态网络中演化过程的空白,我们提出了一种名为 "演化节点嵌入(EVNE)"的偏向随机游走方法。EVNE 利用图快照的顺序关系,在生成下一个快照的嵌入时纳入历史信息。它通过神经网络学习节点表征,但与现有方法不同,因为它(i)在每一步都纳入以前运行的行走;(ii)根据上一次迭代获得的参数开始优化当前的嵌入;(iii)使用两个时变参数来调节图探索过程中偏向随机行走的行为。通过一系列广泛的实验,我们发现我们的方法能生成更好的嵌入,在下游节点分类任务中,我们的嵌入比基准方法高出 20%。基于四种分类器和五个数据集的实验,EVNE 的嵌入比其他方法取得了更好的性能。此外,我们还介绍了模型的七种变体,以展示 EVNE 每种机制的影响。
{"title":"Temporally-aware node embeddings for evolving networks topologies","authors":"K. B. Enes, Matheus Nunes, Fabricio Murai, Gisele L. Pappa","doi":"10.3233/aic-230028","DOIUrl":"https://doi.org/10.3233/aic-230028","url":null,"abstract":"Static node embedding algorithms applied to snapshots of real-world applications graphs are unable to capture their evolving process. As a result, the absence of information about the dynamics in these node representations can harm the accuracy and increase processing time of machine learning tasks related to these applications. Aiming at fill the gap regarding the inability of static methods to capture evolving processes on dynamic networks, we propose a biased random walk method named Evolving Node Embedding (EVNE). EVNE leverages the sequential relationship of graph snapshots by incorporating historic information when generating embeddings for the next snapshot. It learns node representations through a neural network, but differs from existing methods as it: (i) incorporates previously run walks at each step; (ii) starts the optimization of the current embedding from the parameters obtained in the previous iteration; and (iii) uses two time-varying parameters to regulate the behavior of the biased random walks over the process of graph exploration. Through a wide set of experiments we show that our approach generates better embeddings, outperforming baselines by up to 20% in a downstream node classification task. EVNE’s embeddings achieve better performance than others, based on experiments with four classifiers and five datasets. In addition, we present seven variations of our model to show the impact of each of EVNE’s mechanisms.","PeriodicalId":50835,"journal":{"name":"AI Communications","volume":"135 15","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139453103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evolutionary multi-objective physics-informed neural networks: The MOPINNs approach 进化多目标物理信息神经网络:MOPINNs 方法
IF 0.8 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-12-15 DOI: 10.3233/aic-230073
Hugo Carrillo, Taco de Wolff, Luis Martí, Nayat Sanchez-Pi
Physics-informed neural networks formulation allows the neural network to be trained by both the training data and prior domain knowledge about the physical system that models the data. In particular, it has a loss function for the data and the physics, where the latter is the deviation from a partial differential equation describing the system. Conventionally, both loss functions are combined by a weighted sum, whose weights are usually chosen manually. It is known that balancing between different loss terms can make the training process more efficient. In addition, it is necessary to find the optimal architecture of the neural network in order to find a hypothesis set in which is easier to train the PINN. In our work, we propose a multi-objective optimization approach to find the optimal value for the loss function weighting, as well as the optimal activation function, number of layers, and number of neurons for each layer. We validate our results on the Poisson, Burgers, and advection-diffusion equations and show that we are able to find accurate approximations of the solutions using optimal hyperparameters.
物理信息神经网络的表述方式允许神经网络通过训练数据和模拟数据的物理系统的先验领域知识进行训练。特别是,它有一个数据和物理的损失函数,后者是描述系统的偏微分方程的偏差。传统上,这两个损失函数通过加权和来组合,其权重通常由人工选择。众所周知,平衡不同的损失项可以提高训练过程的效率。此外,有必要找到神经网络的最佳架构,以便找到更容易训练 PINN 的假设集。在我们的工作中,我们提出了一种多目标优化方法,以找到损失函数加权的最优值,以及最优激活函数、层数和每层神经元数。我们在泊松方程、伯格斯方程和平流扩散方程上验证了我们的结果,并表明我们能够利用最优超参数找到精确的近似解。
{"title":"Evolutionary multi-objective physics-informed neural networks: The MOPINNs approach","authors":"Hugo Carrillo, Taco de Wolff, Luis Martí, Nayat Sanchez-Pi","doi":"10.3233/aic-230073","DOIUrl":"https://doi.org/10.3233/aic-230073","url":null,"abstract":"Physics-informed neural networks formulation allows the neural network to be trained by both the training data and prior domain knowledge about the physical system that models the data. In particular, it has a loss function for the data and the physics, where the latter is the deviation from a partial differential equation describing the system. Conventionally, both loss functions are combined by a weighted sum, whose weights are usually chosen manually. It is known that balancing between different loss terms can make the training process more efficient. In addition, it is necessary to find the optimal architecture of the neural network in order to find a hypothesis set in which is easier to train the PINN. In our work, we propose a multi-objective optimization approach to find the optimal value for the loss function weighting, as well as the optimal activation function, number of layers, and number of neurons for each layer. We validate our results on the Poisson, Burgers, and advection-diffusion equations and show that we are able to find accurate approximations of the solutions using optimal hyperparameters.","PeriodicalId":50835,"journal":{"name":"AI Communications","volume":"5 4","pages":""},"PeriodicalIF":0.8,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138998848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Some thoughts about artificial stupidity and artificial dumbness 关于人工愚蠢和人工笨拙的一些想法
IF 0.8 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-12-14 DOI: 10.3233/aic-220322
Jean Lieber, Jean-Guy Mailly, Pierre Marquis, Henri Prade, François Rollin
In a recently published book, the French writer and comedian François Rollin has discussed various aspects of the notion of stupidity, including artificial stupidity, the stupid counterpart of artificial intelligence. His claim is that a system of artificial stupidity is a system that provides wrong answers to any task it should solve, leading to absurd solutions in most cases. We believe that this claim is (at least partially) false and that designing artificial stupidity is not as trivial as it seems. In this article, we discuss why and how one could design a system of artificial stupidity. We believe that such a reflection on (artificial) stupidity can bring about some interesting insights about (artificial) intelligence.
在最近出版的一本书中,法国作家兼喜剧演员弗朗索瓦-罗林讨论了 "愚蠢 "概念的各个方面,包括人工智能的 "愚蠢 "对应物--"人工愚蠢"。他声称,人工愚蠢系统是一种能对任何它应该解决的任务提供错误答案的系统,在大多数情况下会导致荒谬的解决方案。我们认为,这种说法(至少部分)是错误的,设计人工愚蠢并不像看起来那么微不足道。在本文中,我们将讨论为什么以及如何设计人工愚蠢系统。我们相信,这种对(人工)愚蠢的反思可以为(人工)智能带来一些有趣的启示。
{"title":"Some thoughts about artificial stupidity and artificial dumbness","authors":"Jean Lieber, Jean-Guy Mailly, Pierre Marquis, Henri Prade, François Rollin","doi":"10.3233/aic-220322","DOIUrl":"https://doi.org/10.3233/aic-220322","url":null,"abstract":"In a recently published book, the French writer and comedian François Rollin has discussed various aspects of the notion of stupidity, including artificial stupidity, the stupid counterpart of artificial intelligence. His claim is that a system of artificial stupidity is a system that provides wrong answers to any task it should solve, leading to absurd solutions in most cases. We believe that this claim is (at least partially) false and that designing artificial stupidity is not as trivial as it seems. In this article, we discuss why and how one could design a system of artificial stupidity. We believe that such a reflection on (artificial) stupidity can bring about some interesting insights about (artificial) intelligence.","PeriodicalId":50835,"journal":{"name":"AI Communications","volume":"6 5","pages":""},"PeriodicalIF":0.8,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138972268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AI Communications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1