Recognition of human emotions holds significant value in numerous real-world scenarios. This paper focuses on the multimodal fusion of speech and text for emotion recognition. Thirty-nine-dimensional Mel-frequency cepstral coefficients (MFCCs) were used as speech emotion features, and 300-dimensional word vectors obtained with the GloVe algorithm were used as text emotion features. A bidirectional gated recurrent unit (BiGRU) was added to extract deep features, and it was combined with a multi-head self-attention (MHA) mechanism and the improved sparrow search algorithm (ISSA) to obtain the ISSA-BiGRU-MHA method for emotion recognition. The method was validated on the IEMOCAP and MELD datasets, where MFCCs and GloVe word vectors exhibited superior recognition performance as features. Comparisons with support vector machine and convolutional neural network methods showed that the ISSA-BiGRU-MHA method achieved the highest weighted and unweighted accuracies. Multimodal fusion achieved weighted accuracies of 76.52%, 71.84%, 66.72%, and 62.12% on the IEMOCAP, MELD, MOSI, and MOSEI datasets, outperforming unimodal recognition. These results affirm the reliability and practical applicability of the multimodal fusion recognition method.
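As a concrete illustration of the architecture described above, the following is a minimal PyTorch sketch of a BiGRU followed by multi-head self-attention over 39-dimensional MFCC frames. All layer sizes, the number of heads, and the pooling choice are illustrative assumptions; the ISSA hyperparameter search and the GloVe text branch are omitted.

```python
# Hypothetical BiGRU + multi-head self-attention (MHA) emotion classifier
# over 39-dim MFCC frames; sizes are assumptions, not the paper's settings.
import torch
import torch.nn as nn

class BiGRUMHAClassifier(nn.Module):
    def __init__(self, feat_dim=39, hidden=128, heads=4, n_classes=4):
        super().__init__()
        self.bigru = nn.GRU(feat_dim, hidden, batch_first=True,
                            bidirectional=True)
        self.mha = nn.MultiheadAttention(embed_dim=2 * hidden,
                                         num_heads=heads, batch_first=True)
        self.fc = nn.Linear(2 * hidden, n_classes)

    def forward(self, x):              # x: (batch, frames, 39)
        h, _ = self.bigru(x)           # deep features, (batch, frames, 256)
        a, _ = self.mha(h, h, h)       # self-attention across frames
        return self.fc(a.mean(dim=1))  # temporal pooling -> class logits

logits = BiGRUMHAClassifier()(torch.randn(8, 200, 39))  # toy MFCC batch
```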
Combining recommendation systems and AI in e-commerce can improve the user experience and decision-making. This study uses bibliometric analysis to examine how these systems and artificial intelligence are evolving. Of 120 retrieved documents, 91 were analyzed, indicating 97.16% growth in research on the topic. The most influential authors were Paraschakis and Nilsson, with three publications and 43 citations. The journal Electronic Commerce Research leads with four publications and 60 citations. China is the top country by citations, with 120, while India leads in publications with 25. The results show that research output increased in 2021 and 2022, reflecting a shift toward sentiment analysis and convolutional neural networks. The identification of emerging keywords, such as content-based image retrieval and knowledge graph, points to promising areas for future research. This study provides a solid foundation for future work on e-commerce recommender systems.
Deep learning applications have far-reaching implications for people's daily lives. Disaster management professionals are becoming increasingly interested in applying deep learning to prepare for and respond to natural disasters. In this paper, we aim to assist natural disaster management professionals by developing a framework that accurately classifies natural disasters and interprets the results, combining a deep learning model with explainable AI (XAI) methods to ensure reliability and ease of interpretation without a technical background. The novelty of our work has two main aspects. The first is utilizing pre-trained models such as VGGNet19, ResNet50, and ViT for accurate classification of natural disaster images. The second is implementing three explainable AI techniques, Gradient-weighted Class Activation Mapping (Grad-CAM), Grad-CAM++, and Local Interpretable Model-agnostic Explanations (LIME), to make the model's predictions interpretable and its decision-making process transparent and reliable. Experiments on the natural disaster dataset (Niloy et al. 2021) and MEDIC with a ViT-B-32 model achieved a high accuracy of 95.23%. LIME, Grad-CAM, and Grad-CAM++ are further used to evaluate model performance and visualize decision-making. Our code is publicly available.1
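To make the Grad-CAM step concrete, here is a hedged, self-contained sketch of the technique on a pre-trained ResNet50, one of the backbones named above. The random input tensor stands in for a preprocessed disaster image, and targeting the last convolutional block is a common convention, not necessarily the paper's exact configuration.

```python
# Sketch of Grad-CAM on ResNet50: weight activation maps by the gradient
# of the top class score. Input and target layer are placeholder choices.
import torch
from torchvision.models import resnet50, ResNet50_Weights

model = resnet50(weights=ResNet50_Weights.DEFAULT).eval()
acts, grads = {}, {}
layer = model.layer4[-1]                          # last conv block
layer.register_forward_hook(lambda m, i, o: acts.update(v=o))
layer.register_full_backward_hook(lambda m, gi, go: grads.update(v=go[0]))

x = torch.randn(1, 3, 224, 224, requires_grad=True)   # placeholder image
score = model(x)[0].max()                         # top predicted class score
score.backward()                                  # gradients w.r.t. activations

w = grads["v"].mean(dim=(2, 3), keepdim=True)     # channel importance weights
cam = torch.relu((w * acts["v"]).sum(dim=1))      # weighted activation map
cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)  # normalize to [0,1]
```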
This paper presents a novel approach to the essential challenge of accurately determining the total weight of shrimp within aquaculture ponds. Precise weight estimation is crucial for mitigating overfeeding and underfeeding, thereby enhancing efficiency and productivity in shrimp farming. The proposed system leverages image processing techniques to detect individual live shrimp and extract pertinent features for weight estimation within a clay pond environment. Specifically, an automated feed tray captures images of live shrimp, which are then processed using a combination of Detectron2, PyTorch, and CUDA (Compute Unified Device Architecture) for individual shrimp detection. Essential features such as area, perimeter, width, length, and posture are extracted through image analysis, enabling accurate estimation of shrimp weight. An Artificial Neural Network (ANN) model built on these features predicts shrimp weight with a coefficient of determination (R²) of 94.50% when all extracted features are incorporated. Furthermore, our system integrates a user-friendly web application that lets farmers monitor shrimp weight trends, facilitating precision feeding strategies and effective farm management. This study contributes a low-cost, deep-learning-based solution for estimating the weight of live Pacific white shrimp in clay ponds, enabling daily weight calculations, helping farmers optimize feed quantities, providing shrimp size distribution insights, and reducing the Feed Conversion Ratio (FCR) for greater profitability. The procedure for shrimp feature extraction is also provided, including the calculation of shrimp length and width and the classification of shrimp posture.
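As a rough sketch of the final regression step, the snippet below trains a small multilayer perceptron (scikit-learn's MLPRegressor, standing in for the paper's ANN) on the five extracted features to predict weight. The synthetic data, toy weight rule, and network size are placeholders, not the paper's measurements or architecture.

```python
# Toy weight regression from the five extracted shrimp features
# (area, perimeter, width, length, posture); all values are synthetic.
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import r2_score

rng = np.random.default_rng(0)
X = rng.random((500, 5))                     # area, perimeter, width, length, posture
y = 30 * X[:, 0] + 5 * X[:, 3] + rng.normal(0, 0.5, 500)  # toy weight (g)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
ann = MLPRegressor(hidden_layer_sizes=(32, 16), max_iter=2000,
                   random_state=0).fit(X_tr, y_tr)
print("R^2:", r2_score(y_te, ann.predict(X_te)))  # goodness of fit
```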
Obesity is a critical health issue associated with severe medical conditions. To enhance public health and well-being, early prediction of obesity risk is crucial. This study introduces an innovative approach to predicting obesity levels using explainable artificial intelligence, focusing on lifestyle factors rather than traditional BMI measures. Our best-performing machine learning model, free of BMI parameters, achieved 86.5% accuracy using the Random Forest algorithm. Explainability techniques, including SHAP, partial dependence plots (PDP), and feature importance, are employed to gain insight into the impact of lifestyle factors on obesity. Key findings indicate the importance of meal frequency and technology usage. This work demonstrates the significance of lifestyle factors in obesity risk and the power of model-agnostic methods to uncover these relationships.
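The snippet below sketches the SHAP part of such a pipeline on a Random Forest, using synthetic lifestyle features; the column names and toy label rule are assumptions for illustration, not the study's dataset.

```python
# Hedged sketch: Random Forest + SHAP on synthetic lifestyle features.
import numpy as np
import pandas as pd
import shap
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
X = pd.DataFrame({
    "meal_frequency": rng.integers(1, 5, 300),
    "tech_use_hours": rng.random(300) * 10,
    "physical_activity": rng.random(300) * 3,
})
y = ((X["meal_frequency"] > 3) | (X["tech_use_hours"] > 7)).astype(int)

rf = RandomForestClassifier(random_state=0).fit(X, y)
sv = shap.TreeExplainer(rf).shap_values(X)      # per-feature contributions
# Older shap returns one array per class; newer returns a 3-D array.
sv_pos = sv[1] if isinstance(sv, list) else sv[..., 1]
importance = pd.Series(np.abs(sv_pos).mean(axis=0), index=X.columns)
print(importance.sort_values(ascending=False))  # global lifestyle impact
```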
Large Language Models (LLMs) have become a hot topic in AI due to their ability to mimic human conversation. This study compares OpenAI's Generative Pre-trained Transformer 4 (GPT-4) model and Google's artificial intelligence (AI) model, which is based on the Bidirectional Encoder Representations from Transformers (BERT) framework, in terms of defined capabilities and built-in architecture. Both LLMs are prominent in AI applications. First, eight capabilities were identified for evaluating the models: translation accuracy, text generation, factuality, creativity, intellect, deception avoidance, sentiment classification, and sarcasm detection. Next, each capability was assessed using instructions. Additionally, a categorized LLM evaluation system was developed using ten research questions per category, based on this paper's main contributions from a prompt engineering perspective. GPT-4 and Google AI successfully answered 85% and 68.7% of the study prompts, respectively. GPT-4 understood prompts better than Google AI, even those with verbal flaws, and tolerated grammatical errors. Moreover, GPT-4 was more precise, accurate, and succinct than Google AI, which was sometimes verbose and less realistic. While GPT-4 beat Google AI in translation accuracy, text generation, factuality, intellect, creativity, and deception avoidance, Google AI outperformed GPT-4 in sarcasm detection. Both models performed sentiment classification properly. More importantly, a human panel of judges was used to assess and evaluate the model comparisons. Statistical analysis of the judges' ratings yielded more robust results by examining the specific uses, limitations, and expectations of both the GPT-4 and Google AI-based approaches. Finally, the two approaches' transformers, parameter sizes, and attention mechanisms were examined.
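For illustration, overall and per-capability success rates like the 85% and 68.7% figures above could be tallied from judge ratings as sketched below; the records and column names are toy assumptions, not the study's actual rating sheet.

```python
# Hedged sketch of tallying prompt success rates from judge ratings.
import pandas as pd

ratings = pd.DataFrame([
    {"model": "GPT-4",     "capability": "translation", "passed": 1},
    {"model": "GPT-4",     "capability": "sarcasm",     "passed": 0},
    {"model": "Google AI", "capability": "translation", "passed": 1},
    {"model": "Google AI", "capability": "sarcasm",     "passed": 1},
])

overall = ratings.groupby("model")["passed"].mean() * 100  # % prompts passed
per_cap = ratings.pivot_table(index="capability", columns="model",
                              values="passed", aggfunc="mean") * 100
print(overall, per_cap, sep="\n")
```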
Reference image-based Super-Resolution (RefSR) improves the quality of a Low-Resolution (LR) input image by leveraging the additional information provided by a High-Resolution (HR) reference image (Ref). Existing RefSR methods handle thermal or visible flows separately and often struggle to enhance the resolution of small objects such as Mini/Micro UAVs (Unmanned Aerial Vehicles) because of the resolution disparities between the input and reference images. To cope with these challenges in UAV early detection for video surveillance, we propose ThermoVisSR, a multiscale texture transformer for enhancing the Super-Resolution (SR) of visible and thermal images of Mini/Micro UAVs. Our approach reconstructs the fine details of these objects while preserving the approximation (the body form and color of the scene objects) already contained in the LR image. Hence, our model is divided into two streams that deal separately with approximation and detail reconstruction. In the first, we introduce a Convolutional Neural Network (CNN) fusion backbone to extract the Low-Frequency (LF) approximation from the original LR image pairs. In the second, to extract details from the Ref image, we blend features from the visible and thermal sources to make the most of what each offers. We then apply the High-Frequency Texture Transformer (HFTT) across the various resolutions of the merged features to ensure accurate correspondence matching and significant transfer of High-Frequency (HF) patches from the Ref to the LR images. Moreover, to adapt the injection to the different bands, we incorporate the separable software decoder (SSD) into the HFTT, allowing it to capture channel-specific details during the reconstruction phase. We validated our approach on a newly created dataset of aerial images of Mini/Micro UAVs. Experimental results demonstrate that the proposed model consistently outperforms state-of-the-art approaches in both qualitative and quantitative assessments.
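The correspondence-matching idea at the heart of a texture transformer can be sketched compactly: each LR feature patch searches for its most similar Ref patch, and the matching HF texture is transferred. The snippet below is a toy, single-scale version under assumed shapes; ThermoVisSR's multiscale HFTT, fusion backbone, and separable decoder are not reproduced.

```python
# Toy hard-attention patch transfer, the core step of texture transformers.
import torch
import torch.nn.functional as F

def transfer_hf_patches(lr_feat, ref_feat, ref_hf, k=3):
    # Unfold feature maps into flattened k x k patches: (B, C*k*k, N)
    q = F.unfold(lr_feat, k, padding=k // 2)    # queries from LR
    kk = F.unfold(ref_feat, k, padding=k // 2)  # keys from Ref
    v = F.unfold(ref_hf, k, padding=k // 2)     # HF values from Ref
    q, kk = F.normalize(q, dim=1), F.normalize(kk, dim=1)
    rel = torch.einsum("bcn,bcm->bnm", q, kk)   # cosine similarity
    idx = rel.argmax(dim=-1)                    # hard attention index
    picked = torch.gather(v, 2, idx.unsqueeze(1).expand(-1, v.size(1), -1))
    # Fold transferred patches back; overlaps are summed (a divisor map
    # would average them in a full implementation).
    return F.fold(picked, lr_feat.shape[-2:], k, padding=k // 2)

out = transfer_hf_patches(torch.randn(1, 8, 16, 16),
                          torch.randn(1, 8, 16, 16),
                          torch.randn(1, 8, 16, 16))
```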
To address the problems of low production efficiency and underutilized capacity in current semiconductor enterprises, a production efficiency optimization model for semiconductor enterprises is constructed. The model reformulates the low-production-efficiency problem as one of locating and solving the customer order decoupling point, and it comprehensively accounts for sudden changes in production orders when doing so. A three-layer coding (TLC) mechanism is proposed to improve the biogeography-based optimization algorithm, and the resulting improved biogeography-based optimization (IBBO) algorithm, denoted IBBO-TLC, is used to solve the production efficiency optimization problem of semiconductor enterprises. The results show that the proposed IBBO-TLC algorithm converges quickly, attains the minimum root mean square error after convergence, and accurately locates the customer order decoupling point for semiconductor enterprises. The proposed method effectively improves the production efficiency of semiconductor enterprises and offers guidance for optimizing enterprise structure.
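For context, the snippet below sketches a plain biogeography-based optimization (BBO) loop of the kind the paper improves: fitter habitats emigrate solution features to less fit ones, with occasional mutation. The sphere objective, bounds, and rates are illustrative; the paper's three-layer coding and decoupling-point objective are not reproduced.

```python
# Minimal BBO sketch: rank-based immigration/emigration plus mutation.
import numpy as np

def bbo(fitness, dim=5, pop=20, iters=100, p_mut=0.05, seed=0):
    rng = np.random.default_rng(seed)
    H = rng.uniform(-5, 5, (pop, dim))       # habitats (candidate solutions)
    for _ in range(iters):
        H = H[np.argsort([fitness(h) for h in H])]  # best habitat first
        mu = np.linspace(1, 0, pop)          # emigration rates (rank-based)
        lam = 1 - mu                         # immigration rates
        new = H.copy()
        for i in range(pop):
            for d in range(dim):
                if rng.random() < lam[i]:    # immigrate feature d
                    src = rng.choice(pop, p=mu / mu.sum())
                    new[i, d] = H[src, d]    # migration from an emitter
                if rng.random() < p_mut:     # random mutation
                    new[i, d] = rng.uniform(-5, 5)
        H = new
    return min(H, key=fitness)

best = bbo(lambda x: np.sum(x ** 2))         # toy sphere objective
print(best)
```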