首页 > 最新文献

Artificial Intelligence Review最新文献

英文 中文
A systematic literature review of recent advances on context-aware recommender systems 关于情境感知推荐系统最新进展的系统性文献综述
IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-16 DOI: 10.1007/s10462-024-10939-4
Pablo Mateos, Alejandro Bellogín

Recommender systems are software mechanisms whose usage is to offer suggestions for different types of entities like products, services, or contacts that could be useful or interesting for a specific user. Other ways have been explored in the field to enhance the power of these systems by integrating the context as an additional attribute. This inclusion tries to extract the user preferences more accurately taking into account multiple components such as temporal, spatial, or social ones. Notwithstanding the magnitude of context-awareness in this area, the research community is in agreement with the lack of framework for context information and how to integrate it into recommender systems. Under this premise, this paper focuses on a comprehensive systematic literature review of the state-of-the-art recommendation techniques and their characteristics to benefit from contextual information. The following survey presents the following contributions as outcomes of our study: (i) determine a framework where multiple aspects are taken into account to have a clear definition of context representation, (ii) the techniques used to incorporate context, and (iii) the evaluation of these methods in terms of reproducibility and effectiveness. Our review also covers some crucial topics about context integration, classification of the contexts, application domains, and evaluation of the used datasets, metrics, and code implementations, where we observed clear shiftings in algorithmic and evaluation trends towards Neural Network approaches and ranking metrics, respectively. Just as importantly, future research opportunities and directions are exposed as final closure, standing out the exploitation of various data sources and the scalability and customization of existing solutions.

推荐系统是一种软件机制,其用途是为不同类型的实体(如产品、服务或联系人)提供对特定用户有用或有趣的建议。该领域还探索了其他方法,通过整合上下文作为附加属性来增强这些系统的功能。这种整合试图在考虑时间、空间或社交等多种因素的情况下,更准确地提取用户偏好。尽管情境感知在这一领域非常重要,但研究界一致认为缺乏情境信息框架以及如何将其整合到推荐系统中。在此前提下,本文重点对最先进的推荐技术及其特点进行了全面系统的文献综述,以便从情境信息中获益。以下调查报告介绍了我们的研究成果:(i) 确定一个框架,其中考虑到多个方面,以便对上下文表示法有一个明确的定义;(ii) 用于整合上下文的技术;(iii) 从可重复性和有效性的角度对这些方法进行评估。我们的综述还涵盖了有关上下文整合、上下文分类、应用领域以及对所用数据集、度量标准和代码实现进行评估的一些关键主题,我们观察到算法和评估趋势明显转向神经网络方法和排名度量标准。同样重要的是,未来的研究机会和方向在最后的总结中被揭示出来,突出了对各种数据源的利用以及现有解决方案的可扩展性和定制化。
{"title":"A systematic literature review of recent advances on context-aware recommender systems","authors":"Pablo Mateos,&nbsp;Alejandro Bellogín","doi":"10.1007/s10462-024-10939-4","DOIUrl":"10.1007/s10462-024-10939-4","url":null,"abstract":"<div><p>Recommender systems are software mechanisms whose usage is to offer suggestions for different types of entities like products, services, or contacts that could be useful or interesting for a specific user. Other ways have been explored in the field to enhance the power of these systems by integrating the context as an additional attribute. This inclusion tries to extract the user preferences more accurately taking into account multiple components such as temporal, spatial, or social ones. Notwithstanding the magnitude of context-awareness in this area, the research community is in agreement with the lack of framework for context information and how to integrate it into recommender systems. Under this premise, this paper focuses on a comprehensive systematic literature review of the state-of-the-art recommendation techniques and their characteristics to benefit from contextual information. The following survey presents the following contributions as outcomes of our study: (i) determine a framework where multiple aspects are taken into account to have a clear definition of context representation, (ii) the techniques used to incorporate context, and (iii) the evaluation of these methods in terms of reproducibility and effectiveness. Our review also covers some crucial topics about context integration, classification of the contexts, application domains, and evaluation of the used datasets, metrics, and code implementations, where we observed clear shiftings in algorithmic and evaluation trends towards Neural Network approaches and ranking metrics, respectively. Just as importantly, future research opportunities and directions are exposed as final closure, standing out the exploitation of various data sources and the scalability and customization of existing solutions.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 1","pages":""},"PeriodicalIF":10.7,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-10939-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142645776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated learning design and functional models: survey 联合学习设计和功能模式:调查
IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-16 DOI: 10.1007/s10462-024-10969-y
John Ayeelyan, Sapdo Utomo, Adarsh Rouniyar, Hsiu-Chun Hsu, Pao-Ann Hsiung

Federated learning is a multiple device collaboration setup designed to solve machine learning problems under framework for aggregation and knowledge transfer in distributed local data. This distributed model ensures the privacy of data at each local node. Owing to its relevance, there has been extensive research activities and outcomes in federated learning with expanded applicability to different areas by the research community. As such, there is a vast research archive made available by the community with research work and articles related to the various aspects of federated learning such as applications, challenges, privacy, functionalities, and design. With respect to the function and design of federated learning, client selection, aggregation, knowledge transfer, management of distributed data (Non-IID), Incentive of data and communication cost are of paramount importance. Any effective design of federated learning requires these aspects to be well considered. There are numerous survey articles found among the available literature that focus on its application and challenges, opportunities, data privacy and protection, as well as on federated learning on internet of things, federated learning on edge computing, etc. In this paper, a review of the available literature on the various elements of design and functionalities in federated learning has been carried out with an aim to lay emphasis on the important challenges and research opportunities. More specifically, this work has endeavored to understand and summarize the various functional methods available, along with their techniques and goals. Additionally, it has strived to get a bird’s eye view of how various functions and designs of federated learning have been used in applications, and how it has helped uncover challenges and promising research directions for the future.

联盟学习是一种多设备协作设置,旨在解决分布式本地数据聚合和知识转移框架下的机器学习问题。这种分布式模型可确保每个本地节点的数据隐私。由于其相关性,研究界在联合学习方面开展了广泛的研究活动并取得了丰硕的研究成果,并将其应用扩展到不同的领域。因此,研究界提供了大量与联合学习各方面(如应用、挑战、隐私、功能和设计)相关的研究工作和文章。关于联合学习的功能和设计,客户端选择、聚合、知识转移、分布式数据(非 IID)管理、数据激励和通信成本都至关重要。联合学习的任何有效设计都需要充分考虑这些方面。在现有文献中,有许多调查文章关注其应用和挑战、机遇、数据隐私和保护,以及物联网上的联合学习、边缘计算上的联合学习等。本文对现有文献中有关联合学习的各种设计和功能要素进行了综述,旨在强调重要的挑战和研究机遇。更具体地说,这项工作致力于了解和总结现有的各种功能方法及其技术和目标。此外,这项工作还努力了解联合学习的各种功能和设计是如何应用的,以及如何帮助发现未来的挑战和有前途的研究方向。
{"title":"Federated learning design and functional models: survey","authors":"John Ayeelyan,&nbsp;Sapdo Utomo,&nbsp;Adarsh Rouniyar,&nbsp;Hsiu-Chun Hsu,&nbsp;Pao-Ann Hsiung","doi":"10.1007/s10462-024-10969-y","DOIUrl":"10.1007/s10462-024-10969-y","url":null,"abstract":"<div><p>Federated learning is a multiple device collaboration setup designed to solve machine learning problems under framework for aggregation and knowledge transfer in distributed local data. This distributed model ensures the privacy of data at each local node. Owing to its relevance, there has been extensive research activities and outcomes in federated learning with expanded applicability to different areas by the research community. As such, there is a vast research archive made available by the community with research work and articles related to the various aspects of federated learning such as applications, challenges, privacy, functionalities, and design. With respect to the function and design of federated learning, client selection, aggregation, knowledge transfer, management of distributed data (Non-IID), Incentive of data and communication cost are of paramount importance. Any effective design of federated learning requires these aspects to be well considered. There are numerous survey articles found among the available literature that focus on its application and challenges, opportunities, data privacy and protection, as well as on federated learning on internet of things, federated learning on edge computing, etc. In this paper, a review of the available literature on the various elements of design and functionalities in federated learning has been carried out with an aim to lay emphasis on the important challenges and research opportunities. More specifically, this work has endeavored to understand and summarize the various functional methods available, along with their techniques and goals. Additionally, it has strived to get a bird’s eye view of how various functions and designs of federated learning have been used in applications, and how it has helped uncover challenges and promising research directions for the future.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 1","pages":""},"PeriodicalIF":10.7,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-10969-y.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142645658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Escape: an optimization method based on crowd evacuation behaviors 逃生:基于人群疏散行为的优化方法
IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-16 DOI: 10.1007/s10462-024-11008-6
Kaichen Ouyang, Shengwei Fu, Yi Chen, Qifeng Cai, Ali Asghar Heidari, Huiling Chen

Meta-heuristic algorithms, particularly those based on swarm intelligence, are highly effective for solving black-box optimization problems. However, maintaining a balance between exploration and exploitation within these algorithms remains a significant challenge. This paper introduces a useful algorithm, called Escape or Escape Algorithm (ESC), inspired by crowd evacuation behavior, to solve real-world cases and benchmark problems. The ESC algorithm simulates the behavior of crowds during the evacuation, where the population is divided into calm, herding, and panic groups during the exploration phase, reflecting different levels of decision-making and emotional states. Calm individuals guide the crowd toward safety, herding individuals imitate others in less secure areas, and panic individuals make volatile decisions in the most dangerous zones. As the algorithm transitions into the exploitation phase, the population converges toward optimal solutions, akin to finding the safest exit. The effectiveness of the ESC algorithm is validated on two adjustable problem size test suites, CEC 2017 and CEC 2022. ESC ranked first in the 10-dimensional, 30-dimensional tests of CEC 2017, and the 10-dimensional and 20-dimensional tests of CEC 2022, and second in the 50-dimensional and 100-dimensional tests of CEC 2017. Additionally, ESC performed exceptionally well, ranking first in the engineering problems of pressure vessel design, tension/compression spring design, and rolling element bearing design, as well as in two 3D UAV path planning problems, demonstrating its efficiency in solving real-world complex problems, particularly complex problems like 3D UAV path planning. Compared with 12 other high-performance, classical, and advanced algorithms, ESC exhibited superior performance in complex optimization problems. The source codes of ESC algorithm will be shared at https://aliasgharheidari.com/ESC.html and other websites.

元启发式算法,尤其是基于蜂群智能的算法,对于解决黑箱优化问题非常有效。然而,在这些算法中保持探索与开发之间的平衡仍然是一个重大挑战。本文受人群疏散行为的启发,介绍了一种有用的算法,名为 "逃离或逃逸算法(ESC)",用于解决现实世界中的案例和基准问题。ESC算法模拟了人群在疏散过程中的行为,在探索阶段,人群被分为冷静组、羊群组和恐慌组,反映了不同的决策水平和情绪状态。冷静的个体会引导人群走向安全地带,群居的个体会在不太安全的区域模仿其他人,而恐慌的个体则会在最危险的区域做出不稳定的决定。当算法过渡到开发阶段时,人群会向最优解靠拢,类似于寻找最安全的出口。ESC算法的有效性在两个可调整问题规模的测试套件(CEC 2017和CEC 2022)上得到了验证。ESC在CEC 2017的10维、30维测试和CEC 2022的10维、20维测试中排名第一,在CEC 2017的50维、100维测试中排名第二。此外,ESC表现优异,在压力容器设计、拉伸/压缩弹簧设计、滚动轴承设计等工程问题以及两个三维无人机路径规划问题中均排名第一,展示了其解决现实世界复杂问题,尤其是三维无人机路径规划等复杂问题的高效性。与其他 12 种高性能、经典和先进算法相比,ESC 算法在复杂优化问题中表现出更优越的性能。ESC 算法的源代码将在 https://aliasgharheidari.com/ESC.html 和其他网站上共享。
{"title":"Escape: an optimization method based on crowd evacuation behaviors","authors":"Kaichen Ouyang,&nbsp;Shengwei Fu,&nbsp;Yi Chen,&nbsp;Qifeng Cai,&nbsp;Ali Asghar Heidari,&nbsp;Huiling Chen","doi":"10.1007/s10462-024-11008-6","DOIUrl":"10.1007/s10462-024-11008-6","url":null,"abstract":"<div><p>Meta-heuristic algorithms, particularly those based on swarm intelligence, are highly effective for solving black-box optimization problems. However, maintaining a balance between exploration and exploitation within these algorithms remains a significant challenge. This paper introduces a useful algorithm, called Escape or Escape Algorithm (ESC), inspired by crowd evacuation behavior, to solve real-world cases and benchmark problems. The ESC algorithm simulates the behavior of crowds during the evacuation, where the population is divided into calm, herding, and panic groups during the exploration phase, reflecting different levels of decision-making and emotional states. Calm individuals guide the crowd toward safety, herding individuals imitate others in less secure areas, and panic individuals make volatile decisions in the most dangerous zones. As the algorithm transitions into the exploitation phase, the population converges toward optimal solutions, akin to finding the safest exit. The effectiveness of the ESC algorithm is validated on two adjustable problem size test suites, CEC 2017 and CEC 2022. ESC ranked first in the 10-dimensional, 30-dimensional tests of CEC 2017, and the 10-dimensional and 20-dimensional tests of CEC 2022, and second in the 50-dimensional and 100-dimensional tests of CEC 2017. Additionally, ESC performed exceptionally well, ranking first in the engineering problems of pressure vessel design, tension/compression spring design, and rolling element bearing design, as well as in two 3D UAV path planning problems, demonstrating its efficiency in solving real-world complex problems, particularly complex problems like 3D UAV path planning. Compared with 12 other high-performance, classical, and advanced algorithms, ESC exhibited superior performance in complex optimization problems. The source codes of ESC algorithm will be shared at https://aliasgharheidari.com/ESC.html and other websites.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 1","pages":""},"PeriodicalIF":10.7,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-11008-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142645777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A multi-strategy boosted bald eagle search algorithm for global optimization and constrained engineering problems: case study on MLP classification problems 针对全局优化和受限工程问题的多策略秃鹰搜索算法:MLP 分类问题案例研究
IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-15 DOI: 10.1007/s10462-024-10957-2
Rong Zheng, Ruikang Li, Abdelazim G. Hussien, Qusay Shihab Hamad, Mohammed Azmi Al-Betar, Yan Che, Hui Wen

The Bald Eagle Search (BES) algorithm is an innovative population-based method inspired by the intelligent hunting behavior of bald eagles. While BES shows promise, it faces challenges such as susceptibility to local optima and imbalances between exploration and exploitation phases. To address these limitations, this paper introduces the Multi-Strategy Boosted Bald Eagle Search (MBBES) algorithm. MBBES enhances the original BES by incorporating an adaptive parameter, two distinct mutation strategies, and replacing the swoop stage with a fall stage. We rigorously evaluate MBBES against classic and improved algorithms using the CEC2014 and CEC2017 test sets. The experimental results demonstrate that MBBES significantly improves the ability to escape local optima and achieves superior convergence accuracy. Moreover, MBBES ranks first according to the Friedman test, outperforming its counterparts in solving five practical engineering problems and three MLP classification problems, underscoring its effectiveness in real-world optimization scenarios. These findings indicate that MBBES not only surpasses BES but also sets a new benchmark in optimization performance.

秃鹰搜索(BES)算法是一种基于种群的创新方法,其灵感来源于秃鹰的智能狩猎行为。虽然 BES 算法前景广阔,但它也面临着一些挑战,例如容易出现局部最优以及探索和利用阶段之间的不平衡。为了解决这些局限性,本文介绍了多策略助推秃鹰搜索(MBBES)算法。MBBES 加入了一个自适应参数、两种不同的突变策略,并用下降阶段取代了俯冲阶段,从而增强了原始 BES 算法。我们使用 CEC2014 和 CEC2017 测试集对 MBBES 与经典算法和改进算法进行了严格评估。实验结果表明,MBBES 显著提高了摆脱局部最优的能力,并实现了更高的收敛精度。此外,根据弗里德曼测试,MBBES 在解决五个实际工程问题和三个 MLP 分类问题方面的表现优于同类算法,排名第一,这凸显了其在实际优化场景中的有效性。这些研究结果表明,MBBES 不仅超越了 BES,而且在优化性能方面树立了新的标杆。
{"title":"A multi-strategy boosted bald eagle search algorithm for global optimization and constrained engineering problems: case study on MLP classification problems","authors":"Rong Zheng,&nbsp;Ruikang Li,&nbsp;Abdelazim G. Hussien,&nbsp;Qusay Shihab Hamad,&nbsp;Mohammed Azmi Al-Betar,&nbsp;Yan Che,&nbsp;Hui Wen","doi":"10.1007/s10462-024-10957-2","DOIUrl":"10.1007/s10462-024-10957-2","url":null,"abstract":"<div><p>The Bald Eagle Search (BES) algorithm is an innovative population-based method inspired by the intelligent hunting behavior of bald eagles. While BES shows promise, it faces challenges such as susceptibility to local optima and imbalances between exploration and exploitation phases. To address these limitations, this paper introduces the Multi-Strategy Boosted Bald Eagle Search (MBBES) algorithm. MBBES enhances the original BES by incorporating an adaptive parameter, two distinct mutation strategies, and replacing the swoop stage with a fall stage. We rigorously evaluate MBBES against classic and improved algorithms using the CEC2014 and CEC2017 test sets. The experimental results demonstrate that MBBES significantly improves the ability to escape local optima and achieves superior convergence accuracy. Moreover, MBBES ranks first according to the Friedman test, outperforming its counterparts in solving five practical engineering problems and three MLP classification problems, underscoring its effectiveness in real-world optimization scenarios. These findings indicate that MBBES not only surpasses BES but also sets a new benchmark in optimization performance.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 1","pages":""},"PeriodicalIF":10.7,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-10957-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142636751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Innovative solution suggestions for financing electric vehicle charging infrastructure investments with a novel artificial intelligence-based fuzzy decision-making modelling 利用基于人工智能的新型模糊决策模型为电动汽车充电基础设施投资融资提出创新解决方案建议
IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-14 DOI: 10.1007/s10462-024-11012-w
Gang Kou, Serkan Eti, Serhat Yüksel, Hasan Dinçer, Edanur Ergün, Yaşar Gökalp

The right methods for effective financing of electric vehicle charging infrastructure investments should be identified. However, in the literature, there is no consensus on which funding source would be right for these projects. There is a need for a new study to recommend the most appropriate financing strategy for these projects. Accordingly, the purpose of this study is to identify innovative solutions for financing electric vehicle charging infrastructure investments. A novel fuzzy decision-making model is introduced to reach this objective. Firstly, the weights of experts are calculated using dimension reduction. Secondly, Spherical fuzzy decision matrix is obtained. Thirdly, the criteria in charging infrastructure for electric vehicles are weighted using Spherical fuzzy criteria importance through intercriteria correlation (CRITIC). Fourthly, innovative solutions for financing electric vehicles charging infrastructure are ranked via Spherical fuzzy ranking technique by geometric mean of similarity ratio to optimal solution (RATGOS). The main contribution of this study is that effective strategies can be identified for financing electric vehicle charging infrastructure investments by establishing a novel decision-making model. Most of the existing models in the literature could not consider the weights of the experts. This condition is criticized by different scholar because these experts can have different qualifications. To satisfy this problem, in this study, dimension reduction algorithm with machine learning is taken into consideration to compute thee weights of the experts. The findings demonstrate that the most effective criterion in the innovative financial solution for the charging infrastructure of electric vehicles is determined as “potential income”. According to the ranking results, it is also defined that the most sustainable solution among the innovative strategies for financing the charging infrastructure of electric vehicles is “blockchain technology”.

应确定为电动汽车充电基础设施投资提供有效融资的正确方法。然而,在文献中,对于哪种资金来源适合这些项目并没有达成共识。有必要开展一项新的研究,为这些项目推荐最合适的融资策略。因此,本研究的目的是找出电动汽车充电基础设施投资融资的创新解决方案。为实现这一目标,我们引入了一个新颖的模糊决策模型。首先,通过降维计算专家权重。其次,得到球形模糊决策矩阵。第三,利用球形模糊标准重要性和标准间相关性(CRITIC)对电动汽车充电基础设施的标准进行加权。第四,利用球形模糊排序技术,通过与最优方案的相似度几何平均值(RATGOS),对电动汽车充电基础设施融资的创新方案进行排序。本研究的主要贡献在于,通过建立新颖的决策模型,可以确定电动汽车充电基础设施投资融资的有效策略。现有文献中的大多数模型都没有考虑专家的权重。这种情况受到不同学者的批评,因为这些专家可能具有不同的资质。为了解决这一问题,本研究采用了机器学习的降维算法来计算专家权重。研究结果表明,电动汽车充电基础设施创新金融解决方案中最有效的标准是 "潜在收入"。根据排名结果,在电动汽车充电基础设施融资创新战略中,最具可持续性的解决方案是 "区块链技术"。
{"title":"Innovative solution suggestions for financing electric vehicle charging infrastructure investments with a novel artificial intelligence-based fuzzy decision-making modelling","authors":"Gang Kou,&nbsp;Serkan Eti,&nbsp;Serhat Yüksel,&nbsp;Hasan Dinçer,&nbsp;Edanur Ergün,&nbsp;Yaşar Gökalp","doi":"10.1007/s10462-024-11012-w","DOIUrl":"10.1007/s10462-024-11012-w","url":null,"abstract":"<div><p>The right methods for effective financing of electric vehicle charging infrastructure investments should be identified. However, in the literature, there is no consensus on which funding source would be right for these projects. There is a need for a new study to recommend the most appropriate financing strategy for these projects. Accordingly, the purpose of this study is to identify innovative solutions for financing electric vehicle charging infrastructure investments. A novel fuzzy decision-making model is introduced to reach this objective. Firstly, the weights of experts are calculated using dimension reduction. Secondly, Spherical fuzzy decision matrix is obtained. Thirdly, the criteria in charging infrastructure for electric vehicles are weighted using Spherical fuzzy criteria importance through intercriteria correlation (CRITIC). Fourthly, innovative solutions for financing electric vehicles charging infrastructure are ranked via Spherical fuzzy ranking technique by geometric mean of similarity ratio to optimal solution (RATGOS). The main contribution of this study is that effective strategies can be identified for financing electric vehicle charging infrastructure investments by establishing a novel decision-making model. Most of the existing models in the literature could not consider the weights of the experts. This condition is criticized by different scholar because these experts can have different qualifications. To satisfy this problem, in this study, dimension reduction algorithm with machine learning is taken into consideration to compute thee weights of the experts. The findings demonstrate that the most effective criterion in the innovative financial solution for the charging infrastructure of electric vehicles is determined as “potential income”. According to the ranking results, it is also defined that the most sustainable solution among the innovative strategies for financing the charging infrastructure of electric vehicles is “blockchain technology”.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 1","pages":""},"PeriodicalIF":10.7,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-11012-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142636934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving ranking-based question answering with weak supervision for low-resource Qur’anic texts 改进基于排序的问题解答,弱化对低资源《古兰经》文本的监督
IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-14 DOI: 10.1007/s10462-024-10964-3
Mohammed ElKoumy, Amany Sarhan

This work tackles the challenge of ranking-based machine reading comprehension (MRC), where a question answering (QA) system generates a ranked list of relevant answers for each question instead of simply extracting a single answer. We highlight the limitations of traditional learning methods in this setting, particularly under limited training data. To address these issues, we propose a novel ranking-inspired learning method that focuses on ranking multiple answer spans instead of single answer extraction. This method leverages lexical overlap as weak supervision to guide the ranking process. We evaluate our approach on the Qur’an Reading Comprehension Dataset (QRCD), a low-resource Arabic dataset over the Holy Qur’an. We employ transfer learning with external resources to fine-tune various transformer-based models, mitigating the low-resource challenge. Experimental results demonstrate that our proposed method significantly outperforms standard mechanisms across different models. Furthermore, we show its better alignment with the ranking-based MRC task and the effectiveness of external resources for this low-resource dataset. Our best performing model achieves a state-of-the-art partial Reciprocal Rank (pRR) score of 63.82%, surpassing the previous best-known score of 58.60%. To foster further research, we release code [GitHub repository:github.com/mohammed-elkomy/weakly-supervised-mrc], trained models, and detailed experiments to the community.

这项研究解决了基于排序的机器阅读理解(MRC)的难题,即问题解答(QA)系统为每个问题生成一个相关答案的排序列表,而不是简单地提取一个答案。我们强调了传统学习方法在这种情况下的局限性,尤其是在训练数据有限的情况下。为了解决这些问题,我们提出了一种新颖的受排名启发的学习方法,该方法侧重于对多个答案跨度进行排名,而不是提取单一答案。这种方法利用词汇重叠作为弱监督来指导排序过程。我们在《古兰经》阅读理解数据集(Qur'an Reading Comprehension Dataset,QRCD)上对我们的方法进行了评估,这是一个关于《古兰经》的低资源阿拉伯语数据集。我们利用外部资源的迁移学习来微调各种基于转换器的模型,从而减轻了低资源挑战。实验结果表明,在不同的模型中,我们提出的方法明显优于标准机制。此外,我们还展示了该方法与基于排名的 MRC 任务的更好契合,以及外部资源在低资源数据集上的有效性。我们性能最好的模型达到了最先进的部分互易排名(pRR)得分率 63.82%,超过了之前已知的最佳得分率 58.60%。为了促进进一步的研究,我们向社区发布了代码[GitHub 代码库:github.com/mohammed-elkomy/weakly-supervised-mrc]、训练好的模型和详细的实验。
{"title":"Improving ranking-based question answering with weak supervision for low-resource Qur’anic texts","authors":"Mohammed ElKoumy,&nbsp;Amany Sarhan","doi":"10.1007/s10462-024-10964-3","DOIUrl":"10.1007/s10462-024-10964-3","url":null,"abstract":"<div><p>This work tackles the challenge of ranking-based machine reading comprehension (MRC), where a question answering (QA) system generates a ranked list of relevant answers for each question instead of simply extracting a single answer. We highlight the limitations of traditional learning methods in this setting, particularly under limited training data. To address these issues, we propose a novel ranking-inspired learning method that focuses on ranking multiple answer spans instead of single answer extraction. This method leverages lexical overlap as weak supervision to guide the ranking process. We evaluate our approach on the Qur’an Reading Comprehension Dataset (QRCD), a low-resource Arabic dataset over the Holy Qur’an. We employ transfer learning with external resources to fine-tune various transformer-based models, mitigating the low-resource challenge. Experimental results demonstrate that our proposed method significantly outperforms standard mechanisms across different models. Furthermore, we show its better alignment with the ranking-based MRC task and the effectiveness of external resources for this low-resource dataset. Our best performing model achieves a state-of-the-art partial Reciprocal Rank (pRR) score of 63.82%, surpassing the previous best-known score of 58.60%. To foster further research, we release code [GitHub repository:github.com/mohammed-elkomy/weakly-supervised-mrc], trained models, and detailed experiments to the community.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 1","pages":""},"PeriodicalIF":10.7,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-10964-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142636968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning techniques for coffee classification: a comprehensive review of scientific research 用于咖啡分类的机器学习技术:科学研究综述
IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-11 DOI: 10.1007/s10462-024-11004-w
Isabela V. C. Motta, Nicolas Vuillerme, Huy-Hieu Pham, Felipe A. P. de Figueiredo

In the realm of agribusiness, transformative shifts are underway, propelled by the growing demands and expanding scales of grain production. This evolution calls for a critical reevaluation of the existing paradigms in coffee production and marketing paradigms, with a specific focus on integrating Artificial Intelligence (AI). This work aims to review, synthesize, and summarize the available data regarding how Machine Learning (ML) has been used to detect and classify characteristics in coffee beans and leaves. For this purpose, a comprehensive literature review of the most significant research contributions describing the application of AI for advanced classification techniques in coffee agriculture has been carried out. Our analysis suggests that implementing AI technologies allows the classification of coffee, encompassing various attributes such as maturity, roast intensity, disease identification, flavor profiles, and overall quality. More largely, this technological advancement holds the potential to revolutionize coffee farming by providing producers and agricultural specialists with sophisticated tools to enhance production efficiency, minimize costs, and improve the accuracy and confidence of their decision-making processes. The motivation for the literature review is to address the increasing global demands and evolving scales of grain production, particularly in coffee farming, by critically reevaluating existing paradigms and integrating AI techniques. This review aims to synthesize and summarize how ML has been utilized to detect and classify various characteristics of coffee beans and leaves, thereby highlighting the potential of AI to revolutionize coffee farming by enhancing production efficiency, minimizing costs, and improving decision-making accuracy. This article presents the latest studies in ML in the coffee area, observes the methodology used, and allows researchers to develop new solutions that cover gaps in the literature, open problems, challenges, and future trends, bringing a real contribution to the scientific field. Finally, this article gathers and presents the databases used in many studies, which may be useful for future ML projects.

在农业综合企业领域,谷物生产的需求不断增长、规模不断扩大,推动了变革性转变。这种演变要求对现有的咖啡生产和营销模式进行批判性的重新评估,并特别关注人工智能(AI)的整合。这项工作旨在回顾、综合和总结有关机器学习(ML)如何用于检测咖啡豆和咖啡叶特征并对其进行分类的现有数据。为此,我们对人工智能在咖啡农业高级分类技术应用方面最重要的研究成果进行了全面的文献综述。我们的分析表明,采用人工智能技术可以对咖啡进行分类,包括成熟度、烘焙强度、病害识别、风味特征和整体质量等各种属性。更重要的是,这一技术进步有可能彻底改变咖啡种植业,为生产者和农业专家提供先进的工具,提高生产效率,最大限度地降低成本,提高决策过程的准确性和可信度。文献综述的动机是通过批判性地重新评估现有范例并整合人工智能技术,来应对全球日益增长的需求和不断变化的谷物生产规模,尤其是咖啡种植业。本综述旨在归纳和总结如何利用人工智能检测咖啡豆和咖啡叶的各种特征并对其进行分类,从而突出人工智能通过提高生产效率、降低成本和提高决策准确性来彻底改变咖啡种植业的潜力。本文介绍了咖啡领域在人工智能方面的最新研究,观察了所使用的方法,并允许研究人员开发新的解决方案,涵盖文献中的空白、开放性问题、挑战和未来趋势,为科学领域带来真正的贡献。最后,本文收集并介绍了许多研究中使用的数据库,这些数据库可能对未来的 ML 项目有用。
{"title":"Machine learning techniques for coffee classification: a comprehensive review of scientific research","authors":"Isabela V. C. Motta,&nbsp;Nicolas Vuillerme,&nbsp;Huy-Hieu Pham,&nbsp;Felipe A. P. de Figueiredo","doi":"10.1007/s10462-024-11004-w","DOIUrl":"10.1007/s10462-024-11004-w","url":null,"abstract":"<div><p>In the realm of agribusiness, transformative shifts are underway, propelled by the growing demands and expanding scales of grain production. This evolution calls for a critical reevaluation of the existing paradigms in coffee production and marketing paradigms, with a specific focus on integrating Artificial Intelligence (AI). This work aims to review, synthesize, and summarize the available data regarding how Machine Learning (ML) has been used to detect and classify characteristics in coffee beans and leaves. For this purpose, a comprehensive literature review of the most significant research contributions describing the application of AI for advanced classification techniques in coffee agriculture has been carried out. Our analysis suggests that implementing AI technologies allows the classification of coffee, encompassing various attributes such as maturity, roast intensity, disease identification, flavor profiles, and overall quality. More largely, this technological advancement holds the potential to revolutionize coffee farming by providing producers and agricultural specialists with sophisticated tools to enhance production efficiency, minimize costs, and improve the accuracy and confidence of their decision-making processes. The motivation for the literature review is to address the increasing global demands and evolving scales of grain production, particularly in coffee farming, by critically reevaluating existing paradigms and integrating AI techniques. This review aims to synthesize and summarize how ML has been utilized to detect and classify various characteristics of coffee beans and leaves, thereby highlighting the potential of AI to revolutionize coffee farming by enhancing production efficiency, minimizing costs, and improving decision-making accuracy. This article presents the latest studies in ML in the coffee area, observes the methodology used, and allows researchers to develop new solutions that cover gaps in the literature, open problems, challenges, and future trends, bringing a real contribution to the scientific field. Finally, this article gathers and presents the databases used in many studies, which may be useful for future ML projects.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 1","pages":""},"PeriodicalIF":10.7,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-11004-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142598956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fuzzy task assignment in heterogeneous distributed multi-robot system 异构分布式多机器人系统中的模糊任务分配
IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-08 DOI: 10.1007/s10462-024-10977-y
Rechache Khelifa, Teggar Hamza, Boufera Fatma

This study addresses the problem of coordination in cooperative multi-robot systems performing complex tasks. An analysis of cooperative behavior in mobile multi-robot systems in terms of task execution accuracy by heterogeneous robots is carried out. In addition, we evaluate the capacity and compatibility of tasks assigned to robots to optimize task execution without using direct communication with the robots or a central decision-making unit. A model for task selection in heterogeneous distributed multi-robot systems is proposed. It is based on two processes: the first decomposes complex tasks into elementary tasks, and the second assigns elementary tasks to mobile robots for real-time execution. The distribution of elementary tasks is NP-hard, which leads us to recommend approximate solutions. A fuzzy system called Fuzzy Decision Making in Task Selection is proposed, which uses fuzzy logic to solve this problem. This system allows robots to choose to perform any task in the future. An approach is presented that uses two cascading fuzzy systems. The first calculates the utility of the robot and then activates the second fuzzy system to calculate the utility of the task. By using the output of the fuzzy decision system in our model, each robot will be able to decide for itself which tasks to perform. The results of a simulation of mobile robots transporting goods demonstrate the effectiveness of this fuzzy decision-maker.

本研究探讨了执行复杂任务的合作式多机器人系统中的协调问题。从异构机器人执行任务的准确性角度,对移动多机器人系统中的合作行为进行了分析。此外,我们还评估了分配给机器人的任务的能力和兼容性,以便在不与机器人或中央决策单元直接通信的情况下优化任务执行。我们提出了异构分布式多机器人系统中的任务选择模型。该模型基于两个过程:第一个过程将复杂任务分解为基本任务,第二个过程将基本任务分配给移动机器人实时执行。基本任务的分配是 NP 难题,因此我们推荐近似解决方案。我们提出了一种名为 "任务选择中的模糊决策 "的模糊系统,利用模糊逻辑来解决这一问题。该系统允许机器人选择在未来执行任何任务。本文提出了一种使用两个级联模糊系统的方法。第一个系统计算机器人的效用,然后激活第二个模糊系统计算任务的效用。通过在我们的模型中使用模糊决策系统的输出,每个机器人将能够自行决定执行哪些任务。移动机器人运输货物的模拟结果证明了这种模糊决策系统的有效性。
{"title":"Fuzzy task assignment in heterogeneous distributed multi-robot system","authors":"Rechache Khelifa,&nbsp;Teggar Hamza,&nbsp;Boufera Fatma","doi":"10.1007/s10462-024-10977-y","DOIUrl":"10.1007/s10462-024-10977-y","url":null,"abstract":"<div><p>This study addresses the problem of coordination in cooperative multi-robot systems performing complex tasks. An analysis of cooperative behavior in mobile multi-robot systems in terms of task execution accuracy by heterogeneous robots is carried out. In addition, we evaluate the capacity and compatibility of tasks assigned to robots to optimize task execution without using direct communication with the robots or a central decision-making unit. A model for task selection in heterogeneous distributed multi-robot systems is proposed. It is based on two processes: the first decomposes complex tasks into elementary tasks, and the second assigns elementary tasks to mobile robots for real-time execution. The distribution of elementary tasks is NP-hard, which leads us to recommend approximate solutions. A fuzzy system called Fuzzy Decision Making in Task Selection is proposed, which uses fuzzy logic to solve this problem. This system allows robots to choose to perform any task in the future. An approach is presented that uses two cascading fuzzy systems. The first calculates the utility of the robot and then activates the second fuzzy system to calculate the utility of the task. By using the output of the fuzzy decision system in our model, each robot will be able to decide for itself which tasks to perform. The results of a simulation of mobile robots transporting goods demonstrate the effectiveness of this fuzzy decision-maker.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 1","pages":""},"PeriodicalIF":10.7,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-10977-y.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142595370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robustness in deep learning models for medical diagnostics: security and adversarial challenges towards robust AI applications 用于医疗诊断的深度学习模型的鲁棒性:实现鲁棒人工智能应用的安全性和对抗性挑战
IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-08 DOI: 10.1007/s10462-024-11005-9
Haseeb Javed, Shaker El-Sappagh, Tamer Abuhmed

The current study investigates the robustness of deep learning models for accurate medical diagnosis systems with a specific focus on their ability to maintain performance in the presence of adversarial or noisy inputs. We examine factors that may influence model reliability, including model complexity, training data quality, and hyperparameters; we also examine security concerns related to adversarial attacks that aim to deceive models along with privacy attacks that seek to extract sensitive information. Researchers have discussed various defenses to these attacks to enhance model robustness, such as adversarial training and input preprocessing, along with mechanisms like data augmentation and uncertainty estimation. Tools and packages that extend the reliability features of deep learning frameworks such as TensorFlow and PyTorch are also being explored and evaluated. Existing evaluation metrics for robustness are additionally being discussed and evaluated. This paper concludes by discussing limitations in the existing literature and possible future research directions to continue enhancing the status of this research topic, particularly in the medical domain, with the aim of ensuring that AI systems are trustworthy, reliable, and stable.

当前的研究调查了用于精确医疗诊断系统的深度学习模型的鲁棒性,特别关注它们在存在对抗性或噪声输入的情况下保持性能的能力。我们研究了可能影响模型可靠性的因素,包括模型复杂性、训练数据质量和超参数;我们还研究了与旨在欺骗模型的对抗性攻击和试图提取敏感信息的隐私攻击有关的安全问题。研究人员讨论了针对这些攻击的各种防御措施,以增强模型的鲁棒性,如对抗性训练和输入预处理,以及数据增强和不确定性估计等机制。此外,还对扩展 TensorFlow 和 PyTorch 等深度学习框架可靠性功能的工具和软件包进行了探索和评估。此外,还对现有的鲁棒性评估指标进行了讨论和评估。本文最后讨论了现有文献的局限性和未来可能的研究方向,以继续提升这一研究课题的地位,尤其是在医疗领域,从而确保人工智能系统的可信、可靠和稳定。
{"title":"Robustness in deep learning models for medical diagnostics: security and adversarial challenges towards robust AI applications","authors":"Haseeb Javed,&nbsp;Shaker El-Sappagh,&nbsp;Tamer Abuhmed","doi":"10.1007/s10462-024-11005-9","DOIUrl":"10.1007/s10462-024-11005-9","url":null,"abstract":"<div><p>The current study investigates the robustness of deep learning models for accurate medical diagnosis systems with a specific focus on their ability to maintain performance in the presence of adversarial or noisy inputs. We examine factors that may influence model reliability, including model complexity, training data quality, and hyperparameters; we also examine security concerns related to adversarial attacks that aim to deceive models along with privacy attacks that seek to extract sensitive information. Researchers have discussed various defenses to these attacks to enhance model robustness, such as adversarial training and input preprocessing, along with mechanisms like data augmentation and uncertainty estimation. Tools and packages that extend the reliability features of deep learning frameworks such as TensorFlow and PyTorch are also being explored and evaluated. Existing evaluation metrics for robustness are additionally being discussed and evaluated. This paper concludes by discussing limitations in the existing literature and possible future research directions to continue enhancing the status of this research topic, particularly in the medical domain, with the aim of ensuring that AI systems are trustworthy, reliable, and stable.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 1","pages":""},"PeriodicalIF":10.7,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-11005-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142595372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extra dimension algorithm: a breakthrough for optimization and enhancing DNN efficiency 额外维度算法:优化和提高 DNN 效率的突破口
IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-08 DOI: 10.1007/s10462-024-10991-0
Eghbal Hosseini, Abbas M. Al-Ghaili, Dler Hussein Kadir, Norziana Jamil, Muhammet Deveci, Saraswathy Shamini Gunasekaran, Rina Azlin Razali

Proposing an efficient meta-heuristic to improve the inputs of a trainer in deep neural network (DNNs) is significant. According to the Kaluza’s theory, there exists an extra dimension in the universe. This paper proposes a novel algorithm, extra dimension algorithm (EDA), which is simulated based on this theory. The proposed algorithm utilizes the extra dimension to evaluate the current region of solutions and determine the best direction to follow for the next step of the process. Finally, EDA is used to improve inputs of DNN in the process of solving optimization test problems. The same DNN with and without EDA is used to solve extensive optimization problems, including energy-related tasks. The efficiency of EDA in DNN is assessed by solving some test problems in references, the feasibility and efficiency of solutions, within a suitable number of iterations are demonstrated according to the results. The contributions of this paper are as follows: (1) Introduction of the EDA based on Kaluza’s theory. (2) Application of EDA to enhance the performance of DNNs. (3) Demonstration of EDA’s effectiveness in solving complex optimization problems. (4) Comprehensive evaluation of EDA’s impact on energy optimization problems and other test cases. (5) EDA achieved an average improvement of 15% in optimization accuracy and reduced convergence time compared to the best-performing alternatives.

提出一种改进深度神经网络(DNN)训练器输入的高效元启发式意义重大。根据卡鲁扎理论,宇宙中存在一个额外维度。本文提出了一种基于该理论模拟的新算法--额外维度算法(EDA)。所提出的算法利用额外维度来评估当前的解区域,并确定下一步流程的最佳方向。最后,在解决优化测试问题的过程中,EDA 被用来改进 DNN 的输入。使用和不使用 EDA 的 DNN 被用于解决广泛的优化问题,包括与能源相关的任务。通过求解参考文献中的一些测试问题,评估了 EDA 在 DNN 中的效率,并根据结果证明了在适当的迭代次数内求解的可行性和效率。本文的贡献如下:(1) 基于 Kaluza 理论的 EDA 介绍。(2) 应用 EDA 提高 DNN 的性能。(3) 展示 EDA 在解决复杂优化问题中的有效性。(4) 综合评估 EDA 对能源优化问题和其他测试案例的影响。(5) 与性能最佳的替代方案相比,EDA 平均提高了 15%的优化精度,并缩短了收敛时间。
{"title":"Extra dimension algorithm: a breakthrough for optimization and enhancing DNN efficiency","authors":"Eghbal Hosseini,&nbsp;Abbas M. Al-Ghaili,&nbsp;Dler Hussein Kadir,&nbsp;Norziana Jamil,&nbsp;Muhammet Deveci,&nbsp;Saraswathy Shamini Gunasekaran,&nbsp;Rina Azlin Razali","doi":"10.1007/s10462-024-10991-0","DOIUrl":"10.1007/s10462-024-10991-0","url":null,"abstract":"<div><p>Proposing an efficient meta-heuristic to improve the inputs of a trainer in deep neural network (DNNs) is significant. According to the Kaluza’s theory, there exists an extra dimension in the universe. This paper proposes a novel algorithm, extra dimension algorithm (EDA), which is simulated based on this theory. The proposed algorithm utilizes the extra dimension to evaluate the current region of solutions and determine the best direction to follow for the next step of the process. Finally, EDA is used to improve inputs of DNN in the process of solving optimization test problems. The same DNN with and without EDA is used to solve extensive optimization problems, including energy-related tasks. The efficiency of EDA in DNN is assessed by solving some test problems in references, the feasibility and efficiency of solutions, within a suitable number of iterations are demonstrated according to the results. The contributions of this paper are as follows: (1) Introduction of the EDA based on Kaluza’s theory. (2) Application of EDA to enhance the performance of DNNs. (3) Demonstration of EDA’s effectiveness in solving complex optimization problems. (4) Comprehensive evaluation of EDA’s impact on energy optimization problems and other test cases. (5) EDA achieved an average improvement of 15% in optimization accuracy and reduced convergence time compared to the best-performing alternatives.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 1","pages":""},"PeriodicalIF":10.7,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-10991-0.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142595371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence Review
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1