MoRGH: movie recommender system using GNNs on heterogeneous graphs
Pub Date: 2024-08-12 | DOI: 10.1007/s10115-024-02196-2
Seyed Sina Ziaee, Hossein Rahmani, Mohammad Nazari
Nowadays, with the abundance of movies and TV shows and the competition between movie streaming companies and movie databases to attract more users, movie recommenders have become a major prerequisite for customer satisfaction. Most previously introduced methods used collaborative, content-based, and hybrid filtering techniques, while neural network-based approaches and matrix completion dominate the most recent movie recommender systems. The major drawbacks of previous systems are that they do not consider side information, such as plot synopses, and that they suffer from the cold-start problem. In this paper, we propose a novel inductive approach called MoRGH, which first constructs a graph of similar movies by considering the information available in movies’ plot synopses and genres. Second, we construct a heterogeneous graph that includes two types of nodes, movies and users, built from the MovieLens dataset and the similarity graph generated in the first stage: each edge between a user and a movie represents the user’s rating for that movie, and each edge between two movies represents the similarity between them. Third, MoRGH mitigates the drawbacks of previous methods by employing a GNN- and GAE-based model that combines collaborative and content-based approaches. This hybrid approach allows MoRGH to provide accurate and more personalized recommendations for each user, outperforming previous state-of-the-art models in terms of RMSE and demonstrating its ability to deliver enhanced recommendations compared to existing models.
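For readers unfamiliar with heterogeneous graph construction, the sketch below shows how a graph of the shape described (user and movie nodes, user-movie rating edges, movie-movie similarity edges) can be assembled. It is a minimal sketch assuming the PyTorch Geometric library; all sizes, indices, and feature dimensions are illustrative placeholders, not values from the paper.

```python
# Minimal heterogeneous graph of users and movies with PyTorch Geometric.
import torch
from torch_geometric.data import HeteroData

data = HeteroData()

# Node features: 100 users and 50 movies with random 16-d embeddings (placeholders).
data['user'].x = torch.randn(100, 16)
data['movie'].x = torch.randn(50, 16)

# User -> movie rating edges: edge_index holds (user_idx, movie_idx) pairs,
# edge_attr holds the rating each user gave that movie.
data['user', 'rates', 'movie'].edge_index = torch.tensor([[0, 1, 2],
                                                          [3, 3, 7]])
data['user', 'rates', 'movie'].edge_attr = torch.tensor([[4.0], [3.5], [5.0]])

# Movie -> movie similarity edges, as would be derived from plot-synopsis and
# genre similarity (hard-coded here); edge_attr holds the similarity score.
data['movie', 'similar_to', 'movie'].edge_index = torch.tensor([[3, 7],
                                                                [7, 3]])
data['movie', 'similar_to', 'movie'].edge_attr = torch.tensor([[0.82], [0.82]])

print(data)
```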
{"title":"MoRGH: movie recommender system using GNNs on heterogeneous graphs","authors":"Seyed Sina Ziaee, Hossein Rahmani, Mohammad Nazari","doi":"10.1007/s10115-024-02196-2","DOIUrl":"https://doi.org/10.1007/s10115-024-02196-2","url":null,"abstract":"<p>Nowadays, with the advent of movies and TV shows and the competition between different movie streamer companies and movie databases to attract more users, movie recommenders have become a major prerequisite for customer satisfaction. Most of the previously introduced methods used collaborative, content-based, and hybrid filtering techniques, where neural network-based approaches and matrix completion are the major approaches of most recent movie recommender systems. The major drawbacks of previous systems are not considering side information, such as plot synopsis and cold start problem, in the context of movie recommendations. In this paper, we propose a novel inductive approach called MoRGH which first constructs a graph of similar movies by considering the information available in movies’ plot synopsis and genres. Second, we construct a heterogeneous graph that includes two types of nodes: movies and users. This graph is built using the MovieLens dataset and the similarity graph generated in the first stage, where each edge between a user and a movie represents the user’s rating for that movie, and each edge between two movies represents the similarity between them. Third, MoRGH mitigates the drawbacks of previous methods by employing a GNN and GAE-based model that combines collaborative and content-based approaches. This hybrid approach allows MoRGH to provide accurate and more personalized recommendations for each user, outperforming previous state-of-the-art models in terms of RMSE scores. The achieved improvement in RMSE scores demonstrates MoRGH’s superior performance and its ability to deliver enhanced recommendations compared to existing models.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"118 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142203969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Disease outbreak prediction using natural language processing: a review
Pub Date: 2024-08-06 | DOI: 10.1007/s10115-024-02192-6
Avneet Singh Gautam, Zahid Raza
Research on disease outbreak prediction has received enormous interest owing to the COVID-19 pandemic, and natural language processing using user-generated text data has proven quite effective for this task. Disease outbreaks that occur frequently can be predicted with relative ease, but novel disease outbreaks are difficult to predict. This review summarizes the research concerning disease outbreaks and the use of datasets such as news headlines, tweets, and search engine queries with natural language processing techniques. Existing state-of-the-art systems are discussed analytically, together with their contributions and limitations, providing insight into the existing research in the domain of disease outbreak prediction. A total of 146 articles were reviewed in this study, and the results show that news and Twitter datasets are used most often to predict disease outbreaks. This review underlines the fact that numerous works in the literature are based on specific outbreak-related Internet-sourced text data, viz. news, tweets, and search engine queries. However, this becomes a limitation for any disease outbreak prediction system, as such a system can predict only specific disease outbreaks; this motivates the development of systems capable of disease outbreak prediction without any bias.
{"title":"Disease outbreak prediction using natural language processing: a review","authors":"Avneet Singh Gautam, Zahid Raza","doi":"10.1007/s10115-024-02192-6","DOIUrl":"https://doi.org/10.1007/s10115-024-02192-6","url":null,"abstract":"<p>Research on disease outbreak prediction has suddenly received an enormous interest owing to the COVID-19 pandemic. Natural language processing using user-generated text data has proven to be quite effective for the same. Disease outbreaks that occur frequently can be easily predicted, but novel disease outbreaks are difficult to predict. This review work attempts to summarize the research concerning disease outbreaks and the use of datasets such as news headlines, tweets, and search engine queries using natural language processing techniques. Existing state-of-the-art systems have been analytically discussed with their contributions and limitations. This work is an insight into the existing research in the domain of disease outbreak prediction. A total of 146 articles were reviewed in this study, and results show that news and Twitter datasets are being used most to predict disease outbreaks. This research underlines the fact that numerous works are available in the literature based on specific outbreak-related Internet-sourced text data, viz. news, tweets, and search engine queries. However, this becomes a limitation for any disease outbreak prediction system as it can predict only specific disease outbreaks and motivates the development of systems capable of disease outbreak prediction without any bias.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"43 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141939783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A multi-view mask contrastive learning graph convolutional neural network for age estimation
Pub Date: 2024-08-06 | DOI: 10.1007/s10115-024-02193-5
Yiping Zhang, Yuntao Shou, Tao Meng, Wei Ai, Keqin Li
The age estimation task aims to use facial features to predict a person's age and is widely used in public security, marketing, identification, and other fields. However, these features are mainly concentrated around facial keypoints, and existing CNN- and Transformer-based methods exhibit inflexibility and redundancy when modeling complex irregular structures. Therefore, this paper proposes a multi-view mask contrastive learning graph convolutional neural network (MMCL-GCN) for age estimation. Specifically, the overall MMCL-GCN network contains a feature extraction stage and an age estimation stage. In the feature extraction stage, we introduce a graph structure to represent face images as input and then design a multi-view mask contrastive learning (MMCL) mechanism to learn complex structural and semantic information about face images. The learning mechanism employs an asymmetric Siamese network architecture, which uses an online encoder-decoder structure to reconstruct the missing information from the original graph and a target encoder to learn latent representations for contrastive learning. Furthermore, to make the two learning mechanisms more compatible and complementary, we adopt two augmentation strategies and optimize the joint losses. In the age estimation stage, we design a multi-layer extreme learning machine (ML-IELM) with identity mapping to fully use the features extracted by the online encoder. A classifier and a regressor are then constructed on top of ML-IELM to identify the age group interval and accurately estimate the final age. Extensive experiments show that MMCL-GCN effectively reduces age estimation error on benchmark datasets such as Adience, MORPH-II, and LAP-2016.
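As a rough illustration of how a masked-reconstruction objective and a contrastive objective can be trained jointly across two branches, here is a minimal PyTorch sketch. The stand-in linear encoders, the InfoNCE formulation, and the loss weight `lam` are illustrative assumptions, not the paper's exact MMCL design or its GCN layers.

```python
# Joint masked-reconstruction + contrastive loss over an online and a target branch.
import torch
import torch.nn as nn
import torch.nn.functional as F

def joint_loss(x, mask, online_enc, decoder, target_enc, lam=0.5, tau=0.2):
    x_masked = x.clone()
    x_masked[mask] = 0.0                         # hide the masked node features

    z_online = online_enc(x_masked)              # online branch sees masked input
    x_rec = decoder(z_online)
    rec_loss = F.mse_loss(x_rec[mask], x[mask])  # reconstruct only what was hidden

    with torch.no_grad():                        # target branch gets no gradients
        z_target = target_enc(x)

    # InfoNCE: the same node across the two branches forms the positive pair.
    z1 = F.normalize(z_online, dim=-1)
    z2 = F.normalize(z_target, dim=-1)
    logits = z1 @ z2.t() / tau
    ctr_loss = F.cross_entropy(logits, torch.arange(x.size(0)))

    return rec_loss + lam * ctr_loss

# Toy usage with stand-in linear encoders on 32 "nodes" of 16-d features.
enc, dec, tgt = nn.Linear(16, 8), nn.Linear(8, 16), nn.Linear(16, 8)
x = torch.randn(32, 16)
mask = torch.zeros(32, dtype=torch.bool)
mask[:8] = True                                  # mask the first 8 nodes
print(joint_loss(x, mask, enc, dec, tgt))
```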
{"title":"A multi-view mask contrastive learning graph convolutional neural network for age estimation","authors":"Yiping Zhang, Yuntao Shou, Tao Meng, Wei Ai, Keqin Li","doi":"10.1007/s10115-024-02193-5","DOIUrl":"https://doi.org/10.1007/s10115-024-02193-5","url":null,"abstract":"<p>The age estimation task aims to use facial features to predict the age of people and is widely used in public security, marketing, identification, and other fields. However, the features are mainly concentrated in facial keypoints, and existing CNN and Transformer-based methods have inflexibility and redundancy for modeling complex irregular structures. Therefore, this paper proposes a multi-view mask contrastive learning graph convolutional neural network (MMCL-GCN) for age estimation. Specifically, the overall structure of the MMCL-GCN network contains a feature extraction stage and an age estimation stage. In the feature extraction stage, we introduce a graph structure to construct face images as input and then design a multi-view mask contrastive learning (MMCL) mechanism to learn complex structural and semantic information about face images. The learning mechanism employs an asymmetric Siamese network architecture, which utilizes an online encoder–decoder structure to reconstruct the missing information from the original graph and utilizes the target encoder to learn latent representations for contrastive learning. Furthermore, to promote the two learning mechanisms better compatible and complementary, we adopt two augmentation strategies and optimize the joint losses. In the age estimation stage, we design a multi-layer extreme learning machine (ML-IELM) with identity mapping to fully use the features extracted by the online encoder. Then, a classifier and a regressor were constructed based on ML-IELM, which were used to identify the age grouping interval and accurately estimate the final age. Extensive experiments show that MMCL-GCN can effectively reduce the error of age estimation on benchmark datasets such as Adience, MORPH-II, and LAP-2016.\u0000</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"2012 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141939782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Crop health assessment through hierarchical fuzzy rule-based status maps
Pub Date: 2024-08-05 | DOI: 10.1007/s10115-024-02180-w
Danilo Cavaliere, Sabrina Senatore, Vincenzo Loia
Precision agriculture is evolving toward a contemporary approach that involves multiple sensing techniques to monitor and enhance crop quality while minimizing losses and the waste of resources no longer considered inexhaustible, such as soil and water. To understand crop status, it is necessary to integrate data from heterogeneous sensors and to employ advanced sensing devices that can assess crop and water status. This study presents a smart monitoring approach in agriculture involving sensors that can be both stationary (such as soil moisture sensors) and mobile (such as sensor-equipped unmanned aerial vehicles). These sensors collect information from visual maps of crop production and water conditions to comprehensively understand the crop area and spot any potential vegetation problems. A modular fuzzy control scheme is designed to interpret spectral indices and vegetative parameters and, by applying fuzzy rules, return maps of vegetation status. The rules are applied incrementally, following a hierarchical design, to correlate lower-level data (e.g., temperature, vegetation indices) with higher-level data (e.g., vapor pressure deficit) and thus robustly determine the vegetation status and the main parameters that led to it. A case study involving satellite images of artichoke crops in Salerno, Italy, demonstrates the potential of incremental design and information integration in crop health monitoring. Subsequently, tests were conducted on vineyard regions of interest in Teano, Italy, to assess the efficacy of the framework in evaluating plant status and water stress. Comparing the outcomes of our maps with those of cutting-edge machine learning (ML) semantic segmentation reveals a promising level of accuracy: our approach is consistent with conventional ML methods and achieves an accuracy of over 90% across various seasons of the year.
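To make the rule mechanism concrete, the sketch below implements a toy fuzzy layer in plain Python: triangular memberships over NDVI and vapor pressure deficit, min for rule conjunction, and max for aggregation. The breakpoints and rules are illustrative assumptions, not the paper's calibrated rule base.

```python
# Toy hierarchical fuzzy rule layer over two vegetation parameters.
def tri(x, a, b, c):
    """Triangular membership with peak at b and feet at a and c."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

def crop_status(ndvi, vpd_kpa):
    ndvi_low  = tri(ndvi, -0.1, 0.2, 0.45)
    ndvi_high = tri(ndvi, 0.4, 0.75, 1.1)
    vpd_ok    = tri(vpd_kpa, 0.2, 0.9, 1.6)
    vpd_high  = tri(vpd_kpa, 1.2, 2.2, 3.5)

    # Rules: healthy = high NDVI AND moderate VPD; stressed = low NDVI
    # OR (high NDVI AND high VPD, i.e. water stress despite a green canopy).
    healthy  = min(ndvi_high, vpd_ok)
    stressed = max(ndvi_low, min(ndvi_high, vpd_high))
    return {'healthy': healthy, 'stressed': stressed}

print(crop_status(ndvi=0.7, vpd_kpa=2.4))   # mostly 'stressed' (water stress)
```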
{"title":"Crop health assessment through hierarchical fuzzy rule-based status maps","authors":"Danilo Cavaliere, Sabrina Senatore, Vincenzo Loia","doi":"10.1007/s10115-024-02180-w","DOIUrl":"https://doi.org/10.1007/s10115-024-02180-w","url":null,"abstract":"<p>Precision agriculture is evolving toward a contemporary approach that involves multiple sensing techniques to monitor and enhance crop quality while minimizing losses and waste of no longer considered inexhaustible resources, such as soil and water supplies. To understand crop status, it is necessary to integrate data from heterogeneous sensors and employ advanced sensing devices that can assess crop and water status. This study presents a smart monitoring approach in agriculture, involving sensors that can be both stationary (such as soil moisture sensors) and mobile (such as sensor-equipped unmanned aerial vehicles). These sensors collect information from visual maps of crop production and water conditions, to comprehensively understand the crop area and spot any potential vegetation problems. A modular fuzzy control scheme has been designed to interpret spectral indices and vegetative parameters and, by applying fuzzy rules, return status maps about vegetation status. The rules are applied incrementally per a hierarchical design to correlate lower-level data (e.g., temperature, vegetation indices) with higher-level data (e.g., vapor pressure deficit) to robustly determine the vegetation status and the main parameters that have led to it. A case study was conducted, involving the collection of satellite images from artichoke crops in Salerno, Italy, to demonstrate the potential of incremental design and information integration in crop health monitoring. Subsequently, tests were conducted on vineyard regions of interest in Teano, Italy, to assess the efficacy of the framework in the assessment of plant status and water stress. Indeed, comparing the outcomes of our maps with those of cutting-edge machine learning (ML) semantic segmentation has indeed revealed a promising level of accuracy. Specifically, classification performance was compared to the output of conventional ML methods, demonstrating that our approach is consistent and achieves an accuracy of over 90% throughout various seasons of the year.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"22 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141939784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An efficient approach for incremental erasable utility pattern mining from non-binary data
Pub Date: 2024-08-04 | DOI: 10.1007/s10115-024-02185-5
Yoonji Baek, Hanju Kim, Myungha Cho, Hyeonmo Kim, Chanhee Lee, Taewoong Ryu, Heonho Kim, Bay Vo, Vincent W. Gan, Philippe Fournier-Viger, Jerry Chun-Wei Lin, Witold Pedrycz, Unil Yun
Large amounts of real-life data are generated incrementally around the world, and efficiently processing this continuously accumulating data is a recent issue of interest. Mining and recognizing removable patterns in such data is a challenging task. Erasable pattern mining confronts this challenge by discovering removable patterns with low gain. In various real-world applications, data are stored in non-binary databases, which record item information in quantity form. Since the items in a database can each have different characteristics, such as quantities, considering their relative features makes the mined patterns more meaningful. For these reasons, we propose an erasable utility pattern mining algorithm for incremental non-binary databases. The suggested technique recognizes removable patterns by considering the relative utility of items and the profit of products in an incremental database, and it utilizes a list structure for efficiently extracting erasable utility patterns. Several experiments comparing the suggested algorithm with state-of-the-art techniques on real and synthetic datasets demonstrate the effectiveness of the proposed method.
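For context, the sketch below illustrates the basic erasable-pattern criterion that utility-based variants extend: an itemset is erasable when the total profit of the products containing any of its items stays within a threshold fraction of overall profit. The product table, threshold, and naive enumeration are illustrative assumptions; the paper's incremental list structure is not reproduced here.

```python
# Naive erasable-pattern mining over a toy product-profit database.
from itertools import combinations

# Each product: (set of component items, profit).
products = [({'a', 'b'}, 100), ({'b', 'c'}, 50), ({'c', 'd'}, 30), ({'d'}, 20)]
total_profit = sum(p for _, p in products)

def gain(itemset):
    """Profit lost if all items in `itemset` stop being stocked."""
    return sum(p for items, p in products if items & itemset)

def erasable_patterns(all_items, threshold=0.3, max_size=2):
    out = []
    for k in range(1, max_size + 1):
        for combo in combinations(sorted(all_items), k):
            if gain(set(combo)) <= threshold * total_profit:
                out.append(combo)
    return out

items = set().union(*(it for it, _ in products))
print(erasable_patterns(items))   # -> [('d',)] with these toy numbers
```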
{"title":"An efficient approach for incremental erasable utility pattern mining from non-binary data","authors":"Yoonji Baek, Hanju Kim, Myungha Cho, Hyeonmo Kim, Chanhee Lee, Taewoong Ryu, Heonho Kim, Bay Vo, Vincent W. Gan, Philippe Fournier-Viger, Jerry Chun-Wei Lin, Witold Pedrycz, Unil Yun","doi":"10.1007/s10115-024-02185-5","DOIUrl":"https://doi.org/10.1007/s10115-024-02185-5","url":null,"abstract":"<p>There are many real-life data incrementally generated around the world. One of the recent interesting issues is the efficient processing real-world data that is continuously accumulated. Mining and recognizing removable patterns in such data is a challenging task. Erasable pattern mining confronts this challenge by discovering removable patterns with low gain. In various real-world applications, data are stored in the form of non-binary databases. These databases store item information in a quantity form. Since items in the database can each have different characteristics, such as quantities, considering their relative features makes the mined patterns more meaningful. For these reasons, we propose an erasable utility pattern mining algorithm for incremental non-binary databases. The suggested technique can recognize removable patterns by considering the relative utility of items and the profit of products in an incremental database. The proposed algorithm utilizes a list structure for efficiently extracting erasable utility patterns. Several experiments have been conducted to compare the performance between the suggested algorithm and state-of-the-art techniques using real and synthetic datasets, and the results demonstrate the effectiveness of the proposed method.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"1 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141939785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An empirical study of a novel multimodal dataset for low-resource machine translation
Pub Date: 2024-07-29 | DOI: 10.1007/s10115-024-02087-6
Loitongbam Sanayai Meetei, Thoudam Doren Singh, Sivaji Bandyopadhyay
Cues from multiple modalities have been successfully applied in several fields of natural language processing, including machine translation (MT). However, the application of multimodal cues in low-resource MT (LRMT) is still an open research problem. The main challenge of LRMT is the lack of abundant parallel data, which makes it difficult to build MT systems that produce reasonable output. Multimodal cues can provide additional context and information that help to mitigate this challenge. To address it, we present a multimodal machine translation (MMT) dataset for a low-resource language pair, Manipuri–English, consisting of images, audio, and the corresponding parallel text. The text was collected from news articles in local daily newspapers and subsequently translated into the target language by native-speaker translators. An audio version of the Manipuri text, recorded by native speakers, is used in the experiments. The study also investigates whether correlated audio-visual cues enhance the performance of the machine translation system. Several experiments are conducted for a systematic evaluation of the effectiveness of utilizing multiple modalities. With the help of automatic metrics and human evaluation, a detailed analysis of MT systems trained with text-only and multimodal inputs is carried out. Experimental results attest that MT systems in low-resource settings can be significantly improved, by up to +2.7 BLEU, by incorporating correlated modalities. The human evaluation reveals that the type of correlated auxiliary modality affects adequacy and fluency in MMT systems. Our results emphasize the potential of using cues from auxiliary modalities to enhance machine translation systems, particularly in situations with limited resources.
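As a pointer for reproducing the automatic evaluation, a corpus-level BLEU score can be computed with the sacrebleu package as sketched below; the sentences are toy placeholders, not items from the Manipuri–English dataset.

```python
# Corpus-level BLEU with sacrebleu on toy hypothesis/reference pairs.
import sacrebleu

hypotheses = ["the market opens early in the morning"]
references = [["the market opens early every morning"]]  # one reference stream

bleu = sacrebleu.corpus_bleu(hypotheses, references)
print(f"BLEU = {bleu.score:.1f}")   # a +2.7 gain means this score rises by 2.7
```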
{"title":"An empirical study of a novel multimodal dataset for low-resource machine translation","authors":"Loitongbam Sanayai Meetei, Thoudam Doren Singh, Sivaji Bandyopadhyay","doi":"10.1007/s10115-024-02087-6","DOIUrl":"https://doi.org/10.1007/s10115-024-02087-6","url":null,"abstract":"<p>Cues from multiple modalities have been successfully applied in several fields of natural language processing including machine translation (MT). However, the application of multimodal cues in low-resource MT (LRMT) is still an open research problem. The main challenge of LRMT is the lack of abundant parallel data which makes it difficult to build MT systems for a reasonable output. Using multimodal cues can provide additional context and information that can help to mitigate this challenge. To address this challenge, we present a multimodal machine translation (MMT) dataset of low-resource languages. The dataset consists of images, audio and corresponding parallel text for a low-resource language pair that is Manipuri–English. The text dataset is collected from the news articles of local daily newspapers and subsequently translated into the target language by translators of the native speakers. The audio version by native speakers for the Manipuri text is recorded for the experiments. The study also investigates whether the correlated audio-visual cues enhance the performance of the machine translation system. Several experiments are conducted for a systematic evaluation of the effectiveness utilizing multiple modalities. With the help of automatic metrics and human evaluation, a detailed analysis of the MT systems trained with text-only and multimodal inputs is carried out. Experimental results attest that the MT systems in low-resource settings could be significantly improved up to +2.7 BLEU score by incorporating correlated modalities. The human evaluation reveals that the type of correlated auxiliary modality affects the adequacy and fluency performance in the MMT systems. Our results emphasize the potential of using cues from auxiliary modalities to enhance machine translation systems, particularly in situations with limited resources.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"3 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141867047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Wave Hedges distance-based feature fusion and hybrid optimization-enabled deep learning for cyber credit card fraud detection
Pub Date: 2024-07-24 | DOI: 10.1007/s10115-024-02177-5
Venkata Ratnam Ganji, Aparna Chaparala
With the emerging trend in e-commerce, an increasing number of people have adopted cashless payment methods, especially credit cards, for buying products online. However, this ever-rising usage of credit cards has also led to an increase in malicious users attempting to gain financial profit through fraudulent activities, resulting in huge losses to the card issuer as well as the customer. Credit Card Frauds (CCFs) are pervasive worldwide, so efficient methods are required to detect CCFs and minimize financial losses. This research presents an efficient CCF Detection (CCFD) approach based on deep learning. In this work, CCFD is performed on credit card features fused using the Wave Hedges distance, where the Wave Hedges coefficient utilized for fusion is estimated by a Deep Neuro-Fuzzy Network. Detection is then performed using the Zeiler and Fergus Network (ZFNet), whose trainable factors are adjusted with the Dwarf Mongoose–Shuffled Shepherd Political Optimization (DMSSPO) algorithm. The DMSSPO_ZFNet is analyzed in terms of accuracy, sensitivity, and specificity, and the experimental outcomes reveal values of 0.961, 0.961, and 0.951, respectively.
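The Wave Hedges distance mentioned above has a simple closed form, d(x, y) = sum over i of |x_i - y_i| / max(x_i, y_i), scoring the dissimilarity of two non-negative feature vectors. A minimal implementation follows, with toy vectors standing in for the fused credit card features.

```python
# Wave Hedges distance between two non-negative feature vectors.
def wave_hedges(x, y, eps=1e-12):
    # d(x, y) = sum_i |x_i - y_i| / max(x_i, y_i); eps guards max(0, 0).
    return sum(abs(a - b) / (max(a, b) + eps) for a, b in zip(x, y))

u = [0.2, 0.9, 0.4]
v = [0.1, 0.8, 0.4]
print(wave_hedges(u, v))   # ~0.611; equals 0.0 only for identical vectors
```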
{"title":"Wave Hedges distance-based feature fusion and hybrid optimization-enabled deep learning for cyber credit card fraud detection","authors":"Venkata Ratnam Ganji, Aparna Chaparala","doi":"10.1007/s10115-024-02177-5","DOIUrl":"https://doi.org/10.1007/s10115-024-02177-5","url":null,"abstract":"<p>With the emerging trend in e-commerce, an increasing number of people have adopted cashless payment methods, especially credit cards for buying products online. However, this ever-rising usage of credit cards has also led to an increase in the malicious users attempting to gain financial profits by committing fraudulent activities resulting in huge losses to the card issuer as well as the customer. Credit Card Frauds (CCFs) are pervasive worldwide, and so efficient methods are required to detect CCFs to minimize financial losses. This research presents an efficient CCF Detection (CCFD) approach based on Deep Learning. In this work, CCFD is performed based on the features obtained from the credit card fused based on Wave Hedge distance, and the Wave Hedge coefficient utilized for fusion is estimated using the Deep Neuro-Fuzzy Network. Further, detection is performed using the Zeiler and Fergus Network (ZFNet), whose trainable factors are adjusted using the Dwarf Mongoose–Shuffled Shepherd Political Optimization (DMSSPO) algorithm. Moreover, the DMSSPO_ZFNet is analyzed based on accuracy, sensitivity, and specificity, and the experimental outcomes reveal that the values attained are 0.961, 0.961, and 0.951.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"42 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141782922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Caption matters: a new perspective for knowledge-based visual question answering
Pub Date: 2024-07-22 | DOI: 10.1007/s10115-024-02166-8
Bin Feng, Shulan Ruan, Likang Wu, Huijie Liu, Kai Zhang, Kun Zhang, Qi Liu, Enhong Chen
Knowledge-based visual question answering (KB-VQA) requires answering questions about a given image with the assistance of external knowledge. Recently, researchers have generally tended to design different multimodal networks to extract visual and textual semantic features for KB-VQA. Despite significant progress, 'caption' information, a textual form of image semantics that can provide visually non-obvious cues for the reasoning process, is often ignored. In this paper, we introduce a novel framework, the Knowledge Based Caption Enhanced Net (KBCEN), designed to integrate caption information into the KB-VQA process. Specifically, for better knowledge reasoning, we utilize caption information comprehensively from both explicit and implicit perspectives. For the former, we explicitly link caption entities to a knowledge graph together with object tags and question entities. For the latter, a pre-trained multimodal BERT with natural implicit knowledge is leveraged to co-represent caption tokens, object regions, and question tokens. Moreover, we develop a mutual correlation module to discern intricate correlations between explicit and implicit representations, thereby facilitating knowledge integration and final prediction. We conduct extensive experiments on three publicly available datasets (OK-VQA v1.0, OK-VQA v1.1, and A-OKVQA). Both quantitative and qualitative results demonstrate the superiority and rationality of our proposed KBCEN.
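To illustrate the implicit co-representation step on its textual side, the sketch below encodes a question-caption pair with a pre-trained BERT via Hugging Face transformers. The object-region features that KBCEN's multimodal BERT also ingests are omitted, and the sample texts are invented; this is a sketch of the general co-encoding pattern, not the paper's model.

```python
# Co-encode a question and a caption as a BERT sentence pair.
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")

question = "What sport can be played with this equipment?"
caption = "A man holding a tennis racket on a court."

# Encoded as: [CLS] question [SEP] caption [SEP]
inputs = tokenizer(question, caption, return_tensors="pt")
outputs = model(**inputs)
token_states = outputs.last_hidden_state   # one contextual vector per token
print(token_states.shape)                  # (1, seq_len, 768)
```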
{"title":"Caption matters: a new perspective for knowledge-based visual question answering","authors":"Bin Feng, Shulan Ruan, Likang Wu, Huijie Liu, Kai Zhang, Kun Zhang, Qi Liu, Enhong Chen","doi":"10.1007/s10115-024-02166-8","DOIUrl":"https://doi.org/10.1007/s10115-024-02166-8","url":null,"abstract":"<p>Knowledge-based visual question answering (KB-VQA) requires to answer questions according to the given image with the assistance of external knowledge. Recently, researchers generally tend to design different multimodal networks to extract visual and text semantic features for KB-VQA. Despite the significant progress, ‘caption’ information, a textual form of image semantics, which can also provide visually non-obvious cues for the reasoning process, is often ignored. In this paper, we introduce a novel framework, the Knowledge Based Caption Enhanced Net (KBCEN), designed to integrate caption information into the KB-VQA process. Specifically, for better knowledge reasoning, we make utilization of caption information comprehensively from both explicit and implicit perspectives. For the former, we explicitly link caption entities to knowledge graph together with object tags and question entities. While for the latter, a pre-trained multimodal BERT with natural implicit knowledge is leveraged to co-represent caption tokens, object regions as well as question tokens. Moreover, we develop a mutual correlation module to discern intricate correlations between explicit and implicit representations, thereby facilitating knowledge integration and final prediction. We conduct extensive experiments on three publicly available datasets (i.e., OK-VQA v1.0, OK-VQA v1.1 and A-OKVQA). Both quantitative and qualitative results demonstrate the superiority and rationality of our proposed KBCEN.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"11 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141743155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An adaptive and late multifusion framework in contextual representation based on evidential deep learning and Dempster–Shafer theory
Pub Date: 2024-07-22 | DOI: 10.1007/s10115-024-02150-2
Doaa Mohey El-Din, Aboul Ella Hassanein, Ehab E. Hassanien
There is growing interest in multidisciplinary research on multimodal synthesis technology to stimulate diversity of modal interpretation in different application contexts. The real requirement for modality diversity across multiple contextual representation fields stems from the conflicting nature of data in multitarget sensors, which introduces obstacles including ambiguity, uncertainty, imbalance, and redundancy in multiobject classification. This paper proposes a new adaptive, late multimodal fusion framework using evidence-enhanced deep learning guided by Dempster–Shafer theory and a concatenation strategy to interpret multiple modalities and contextual representations, yielding a larger number of features for interpreting unstructured multimodality types based on late fusion. Furthermore, it is designed as a multifusion learning solution for modality- and context-based fusion, leading to improved decisions. It creates a fully automated selective deep neural network and constructs an adaptive fusion model for all modalities based on the input type. The proposed framework is implemented in five layers: a software-defined fusion layer, a preprocessing layer, a dynamic classification layer, an adaptive fusion layer, and an evaluation layer. The framework formalizes the modality/context-based problem into an adaptive multifusion framework at the late fusion level. Particle swarm optimization is used in multiple smart context systems to improve the final classification layer with optimal parameters, tracing 30 changes in the hyperparameters of the deep learning training models. Multiple experiments with multimodal inputs in multiple contexts show the behavior of the proposed multifusion framework. Experimental results on four challenging datasets, covering military, agricultural, COVID-19, and food health data, are impressive compared to other state-of-the-art multiple-fusion models. The main strengths of the proposed adaptive fusion framework are that it automatically classifies multiple objects with reduced features and resolves ambiguity and inconsistency in the fused data. In addition, it increases certainty and reduces redundancy while mitigating data imbalance. The multimodal, multicontext experiments using the proposed fusion framework achieve an accuracy of 98.45%.
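Since the framework leans on Dempster–Shafer theory, the sketch below implements the standard Dempster rule of combination for two mass functions over a small frame of discernment. The toy masses stand in for per-modality evidence and are not outputs of the paper's networks.

```python
# Dempster's rule of combination for two mass functions over a frame {A, B}.
from itertools import product

def dempster_combine(m1, m2):
    """Combine two mass functions keyed by frozenset focal elements."""
    combined, conflict = {}, 0.0
    for (b, mb), (c, mc) in product(m1.items(), m2.items()):
        inter = b & c
        if inter:
            combined[inter] = combined.get(inter, 0.0) + mb * mc
        else:
            conflict += mb * mc                 # mass assigned to conflict
    if conflict >= 1.0:
        raise ValueError("total conflict: sources are incompatible")
    return {a: m / (1.0 - conflict) for a, m in combined.items()}

A, B = frozenset({'A'}), frozenset({'B'})
theta = A | B                                   # frame of discernment
m_visual = {A: 0.6, B: 0.1, theta: 0.3}         # evidence from one modality
m_text   = {A: 0.5, B: 0.2, theta: 0.3}         # evidence from another
print(dempster_combine(m_visual, m_text))      # mass on A rises to ~0.76
```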
{"title":"An adaptive and late multifusion framework in contextual representation based on evidential deep learning and Dempster–Shafer theory","authors":"Doaa Mohey El-Din, Aboul Ella Hassanein, Ehab E. Hassanien","doi":"10.1007/s10115-024-02150-2","DOIUrl":"https://doi.org/10.1007/s10115-024-02150-2","url":null,"abstract":"<p>There is a growing interest in multidisciplinary research in multimodal synthesis technology to stimulate diversity of modal interpretation in different application contexts. The real requirement for modality diversity across multiple contextual representation fields is due to the conflicting nature of data in multitarget sensors, which introduces other obstacles including ambiguity, uncertainty, imbalance, and redundancy in multiobject classification. This paper proposes a new adaptive and late multimodal fusion framework using evidence-enhanced deep learning guided by Dempster–Shafer theory and concatenation strategy to interpret multiple modalities and contextual representations that achieves a bigger number of features for interpreting unstructured multimodality types based on late fusion. Furthermore, it is designed based on a multifusion learning solution to solve the modality and context-based fusion that leads to improving decisions. It creates a fully automated selective deep neural network and constructs an adaptive fusion model for all modalities based on the input type. The proposed framework is implemented based on five layers which are a software-defined fusion layer, a preprocessing layer, a dynamic classification layer, an adaptive fusion layer, and an evaluation layer. The framework is formalizing the modality/context-based problem into an adaptive multifusion framework based on a late fusion level. The particle swarm optimization was used in multiple smart context systems to improve the final classification layer with the best optimal parameters that tracing 30 changes in hyperparameters of deep learning training models. This paper applies multiple experimental with multimodalities inputs in multicontext to show the behaviors the proposed multifusion framework. Experimental results on four challenging datasets including military, agricultural, COIVD-19, and food health data provide impressive results compared to other state-of-the-art multiple fusion models. The main strengths of proposed adaptive fusion framework can classify multiobjects with reduced features automatically and solves the fused data ambiguity and inconsistent data. In addition, it can increase the certainty and reduce the redundancy data with improving the unbalancing data. The experimental results of multimodalities experiment in multicontext using the proposed multimodal fusion framework achieve 98.45% of accuracy.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"13 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141743298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Optimal intelligent information retrieval and reliable storage scheme for cloud environment and E-learning big data analytics
Pub Date: 2024-07-22 | DOI: 10.1007/s10115-024-02152-0
Chandrasekar Venkatachalam, Shanmugavalli Venkatachalam
Currently, online learning systems in the education sector are widely used and have become a new trend, generating large amounts of educational data based on students' activities. Sophisticated data analysis techniques are required to improve online learning experiences, and big data technology makes it possible to add value to E-learning platforms through the efficient processing of large volumes of learning data. Over time, the E-learning management system's repository expands and becomes a rich source of learning materials. Subject matter experts may benefit from reusing previously created E-learning resources when creating online content, and students benefit from access to the documents pertinent to achieving their learning objectives effectively. An optimal intelligent information retrieval and reliable storage (OIIRS) scheme is proposed for E-learning using hybrid deep learning techniques. We assume that relevant E-learning documents are stored in the cloud and dynamically updated according to users' status. First, we present a highly robust and lightweight cipher, i.e., optimized CLEFIA, for securely storing data in local repositories, which improves the reliability of data loading. We develop an improved butterfly optimization algorithm that selects private keys to provide an optimal solution for CLEFIA. In addition, a hybrid deep learning method, a backward diagonal search-based deep recurrent neural network (BD-DRNN), is introduced for optimal intelligent information retrieval based on keywords rather than semantics. Here, feature extraction and key feature matching are performed by a modified Hungarian optimization (MHO) algorithm that improves searching accuracy. Finally, we test the proposed OIIRS scheme with different benchmark datasets and use simulation results to evaluate its performance.
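For orientation, the sketch below implements the standard butterfly optimization algorithm, which the paper's improved variant builds on, minimizing a toy objective. The coefficients are common textbook defaults, and the sphere objective is a placeholder for the paper's key-selection criterion, so this is a sketch of the base metaheuristic only.

```python
# Standard butterfly optimization algorithm (BOA) on a toy sphere objective.
import random

def boa(fitness, dim=8, n=20, iters=200, c=0.01, a=0.1, p=0.8):
    pop = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(n)]
    best = min(pop, key=fitness)
    for _ in range(iters):
        for i, x in enumerate(pop):
            f = c * (fitness(x) + 1e-12) ** a          # fragrance of butterfly i
            r = random.random()
            if random.random() < p:                    # global phase: move toward best
                step = [(r * r * gb - xi) * f for gb, xi in zip(best, x)]
            else:                                      # local phase: two random peers
                xj, xk = random.sample(pop, 2)
                step = [(r * r * j - k) * f for j, k in zip(xj, xk)]
            pop[i] = [xi + s for xi, s in zip(x, step)]
        best = min(pop + [best], key=fitness)          # keep best-so-far
    return best

sphere = lambda x: sum(v * v for v in x)
print(sphere(boa(sphere)))    # objective value shrinks toward 0 over iterations
```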
{"title":"Optimal intelligent information retrieval and reliable storage scheme for cloud environment and E-learning big data analytics","authors":"Chandrasekar Venkatachalam, Shanmugavalli Venkatachalam","doi":"10.1007/s10115-024-02152-0","DOIUrl":"https://doi.org/10.1007/s10115-024-02152-0","url":null,"abstract":"<p>Currently, online learning systems in the education sector are widely used and have become a new trend, generating large amounts of educational data based on students’ activities. In order to improve online learning experiences, sophisticated data analysis techniques are required. Adding value to E-learning platforms through the efficient processing of big learning data is possible with Big Data. With time, the E-learning management system’s repository expands and becomes a rich source of learning materials. Subject matter experts may benefit from using E-learning resources to reuse previously created content when creating online content. In addition, it might be beneficial to the students by giving them access to the pertinent documents for achieving their learning objectives effectively. An improved intelligent information retrieval and reliable storage (OIIRS) scheme is proposed for E-learning using hybrid deep learning techniques. Assume that relevant E-learning documents are stored in cloud and dynamically updated according to users’ status. First, we present a highly robust and lightweight crypto, i.e., optimized CLEFIA, for securely storing data in local repositories that improve the reliability of data loading. We develop an improved butterfly optimization algorithm to provide an optimal solution for CLEFIA that selects private keys. In addition, a hybrid deep learning method, i.e., backward diagonal search-based deep recurrent neural network (BD-DRNN) is introduced for optimal intelligent information retrieval based on keywords rather than semantics. Here, feature extraction and key feature matching are performed by the modified Hungarian optimization (MHO) algorithm that improves searching accuracy. Finally, we test our proposed OIIRS scheme with different benchmark datasets and use simulation results to test the performance.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"25 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141783057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}