Pub Date: 2024-09-06 | DOI: 10.1007/s10115-024-02216-1
Xiao Li, Sichen Liu, Yin Zhu, Gong Cheng
After recent gains achieved by large language models (LLMs) on numerical reasoning tasks, it has become of interest to have LLMs teach small models to improve on numerical reasoning. Instructing LLMs to generate chains of thought to fine-tune small models is an established approach. However, small models are passive in this line of work and may not be able to exploit the provided training data. In this paper, we propose a novel targeted training strategy that matches the LLM’s assistance with the small model’s capacities. The small model proactively requests the LLM’s assistance when it sifts out confusing training data. The LLM then refines such data by successively revising reasoning steps and reducing question complexity before feeding it to the small model. Experiments show that this targeted training approach remarkably improves the performance of small models on a range of numerical reasoning datasets by 12–25%, making small models even competitive with some LLMs.
Title: Targeted training for numerical reasoning with large language models
Journal: Knowledge and Information Systems (IF 2.7)
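The sift-and-refine loop described in the abstract above can be sketched as follows; `model_confidence` and `llm_refine` are hypothetical stand-ins for the paper's components, not the authors' API.

```python
def sift_confusing(dataset, model_confidence, threshold=0.5):
    """Split training items by the small model's confidence score."""
    confusing, clear = [], []
    for item in dataset:
        (confusing if model_confidence(item) < threshold else clear).append(item)
    return confusing, clear

def targeted_training_round(dataset, model_confidence, llm_refine):
    """One round: the small model sifts out confusing data, the LLM refines it."""
    confusing, clear = sift_confusing(dataset, model_confidence)
    # Only the flagged data is sent to the LLM, which revises reasoning
    # steps and reduces question complexity before it is returned.
    refined = [llm_refine(item) for item in confusing]
    return clear + refined  # fine-tuning mix for the small model
```

The key design point is that the small model, not the LLM, decides which data needs assistance.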
Pub Date: 2024-09-04 | DOI: 10.1007/s10115-024-02212-5
Anastasia Zhukova, Lukas von Sperl, Christian E. Matt, Bela Gipp
User experience (UX) is a part of human–computer interaction research and focuses on increasing intuitiveness, transparency, simplicity, and trust for system users. Most UX research for machine learning or natural language processing (NLP) follows a data-driven methodology and engages domain users mainly for usability evaluation. Moreover, typical UX methods tailor systems toward usability rather than first learning about user needs. This paper proposes a new methodology for integrating generative UX research into the development of domain NLP applications. Generative UX research involves domain users at the initial stages of prototype development, i.e., ideation and concept evaluation, and at the last stage, evaluating system usefulness and user utility. The methodology emerged from, and is evaluated on, a case study of the full-cycle prototype development of a domain-specific semantic search for daily operations in the process industry. A key finding of our case study is that involving domain experts increases their interest and trust in the final NLP application. The combined UX+NLP research of the proposed method efficiently weighs data- and user-driven opportunities and constraints, which can be crucial for developing NLP applications.
Title: Generative user-experience research for developing domain-specific natural language processing applications
Commercially applicable recommendation systems (RS) exploit multi-criteria rating-based user–item interactions to learn and personalize user preferences through a multi-criteria recommendation system (MCRS). Existing MCRS techniques use similarity- or aggregation-function-based modeling to improve prediction accuracy. However, these methods do not investigate item-aspect-based latent user preferences or criteria-based implicit user–item relationships. Prediction reliability also suffers from highly sparse user–item interactions and from ignoring auxiliary information. Hence, this study proposes an ensembled approach that jointly develops a similarity-based and an aggregation-function-based MCRS model (SimAgg-MCRS) and aggregates their predicted user–item preferences into a cumulative preference matrix to generate the final recommendation. First, the proposed model develops a deep neural network (DNN)-based model to aggregate the criteria-based similarity and predicts the overall rating from the aggregated similarity by merging user- and item-based predictions. Second, the preference-relation-based aggregation function approach develops deep autoencoder-based modeling to exploit the latent relationship among criteria, obtaining a user’s overall preference over an item by aggregating criteria-wise preferences. Finally, the third phase develops a DNN-based ensemble model that integrates the preference matrices of the similarity and aggregation function approaches into the overall aggregated matrix used for recommendation. SimAgg-MCRS integrates user and item side information to learn user preferences better. Experimental and prediction-accuracy-based comparative evaluations on the Yahoo! Movies and Trip Advisor multi-criteria datasets validate the proposed model’s performance over baseline MCRS methods.
Title: Deep ensembled multi-criteria recommendation system for enhancing and personalizing the user experience on e-commerce platforms
Authors: Rahul Shrivastava, Dilip Singh Sisodia, Naresh Kumar Nagwani
Pub Date: 2024-09-04 | DOI: 10.1007/s10115-024-02187-3
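The final ensemble stage described above merges two user–item prediction matrices into one cumulative preference matrix. The paper learns this fusion with a DNN; the sketch below substitutes a single learned convex weight as a simplifying assumption, fitted against observed ratings only.

```python
import numpy as np

def ensemble_predictions(P_sim, P_agg, ratings, mask):
    """Blend similarity- and aggregation-based predictions into one matrix.

    A grid search picks the convex weight w minimizing squared error on
    the observed (mask == 1) entries; the paper's DNN ensemble replaces
    this scalar weight with a learned nonlinear combination.
    """
    ws = np.linspace(0.0, 1.0, 101)
    errs = [np.sum(mask * (w * P_sim + (1 - w) * P_agg - ratings) ** 2) for w in ws]
    w = ws[int(np.argmin(errs))]
    return w * P_sim + (1 - w) * P_agg
```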
Pub Date: 2024-09-04 | DOI: 10.1007/s10115-024-02188-2
Ziyue Yu, Jiayi Wang, Wuman Luo, Rita Tse, Giovanni Pau
Patient representation learning based on electronic health records (EHR) is a critical task for disease prediction. This task aims to effectively extract useful information on dynamic features. Although various existing works have achieved remarkable progress, model performance can be further improved by fully extracting the trends, the variations, and the correlation between trends and variations in dynamic features. In addition, sparse visit records limit the performance of deep learning models. To address these issues, we propose the multi-perspective patient representation extractor (MPRE) for disease prediction. Specifically, we propose the frequency transformation module (FTM) to extract trend and variation information of dynamic features in the time–frequency domain, which enhances the feature representation. In the 2D multi-extraction network (2D MEN), we form a 2D temporal tensor based on trend and variation; the correlations between them are then captured by the proposed dilated operation. Moreover, we propose the first-order difference attention mechanism (FODAM) to adaptively calculate the contributions of differences in adjacent variations to the disease diagnosis. To evaluate the performance of MPRE and baseline methods, we conduct extensive experiments on two real-world public datasets. The experiment results show that MPRE outperforms state-of-the-art baseline methods in terms of AUROC and AUPRC.
Title: Multi-perspective patient representation learning for disease prediction on electronic health records
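A parameter-free sketch of the FODAM idea above: attention weights are derived from the first-order differences of adjacent variation values. This is an assumption-laden simplification — the actual FODAM learns its scoring function rather than using raw difference magnitudes.

```python
import numpy as np

def fodam_weights(variations):
    """Attention weights over first-order differences of adjacent variations.

    The magnitude of each adjacent difference is softmax-normalized, so
    sharper changes in the dynamic feature receive larger weights.
    """
    diffs = np.diff(np.asarray(variations, dtype=float))  # adjacent differences
    scores = np.abs(diffs)
    e = np.exp(scores - scores.max())                     # numerically stable softmax
    return e / e.sum()
```

For a series of n variation values this yields n - 1 weights summing to 1, one per adjacent pair.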
Recommendation systems are ubiquitous in various domains, facilitating users in finding relevant items according to their preferences. Identifying pertinent items that meet their preferences enables users to target the right items. To predict ratings for more accurate forecasts, recommender systems often apply collaborative filtering (CF) approaches to sparse user-rated item matrices. Due to a lack of knowledge regarding newly formed entities, the data sparsity of the user-rated item matrix has an enormous effect on collaborative filtering algorithms, which frequently face lazy learning issues. Real-world datasets with exponentially increasing users and reviews make this situation worse. Matrix factorization (MF) stands out as a key strategy in recommender systems, especially for CF tasks. This paper presents a neural network matrix factorization (NNMF) model to overcome data sparsity challenges. This approach aims to enhance recommendation quality while mitigating the impact of data sparsity, a common issue in CF algorithms. A thorough comparative analysis was conducted on the well-known MovieLens dataset, spanning from 1.6 to 9.6 M records. The outcomes consistently favored the NNMF algorithm, showcasing superior performance compared to the state-of-the-art methods in this domain in terms of precision, recall, F1-score, MAE, and RMSE.
Title: Lazy learning and sparsity handling in recommendation systems
Authors: Suryanshi Mishra, Tinku Singh, Manish Kumar, Satakshi
Pub Date: 2024-09-02 | DOI: 10.1007/s10115-024-02218-z
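The core mechanism — latent factors reconstructing a sparse rating matrix — can be shown with plain gradient-descent matrix factorization. This is a minimal stand-in, not the paper's neural NNMF architecture.

```python
import numpy as np

def factorize(R, mask, k=2, lr=0.02, epochs=2000, seed=0):
    """Gradient-descent matrix factorization fitted on observed entries only.

    Latent user factors U and item factors V are trained so that U @ V.T
    matches R wherever mask == 1; the resulting dense product fills in
    the unrated (sparse) cells.
    """
    rng = np.random.default_rng(seed)
    m, n = R.shape
    U = rng.normal(scale=0.1, size=(m, k))
    V = rng.normal(scale=0.1, size=(n, k))
    for _ in range(epochs):
        E = mask * (R - U @ V.T)   # error restricted to observed ratings
        U += lr * (E @ V)
        V += lr * (E.T @ U)
    return U @ V.T
```

An NNMF-style model replaces the inner product `U @ V.T` with a neural network over the concatenated embeddings.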
Pub Date: 2024-09-02 | DOI: 10.1007/s10115-024-02215-2
Bo Xing, Yuting Tan, Junfeng Zhou, Ming Du
Given an uncertain graph, community search is used to return dense subgraphs that contain the query vertex and satisfy the probability constraint. With the proliferation of uncertain graphs in practical applications, community search has become increasingly important for helping users make decisions in advertising recommendation, conference organization, etc. However, existing approaches for community search still suffer from two problems. First, they may return subgraphs that cannot meet users’ expectations on structural cohesiveness, due to the existence of cut-vertices/edges. Second, they use floating-point division to update the probability of each edge during computation, resulting in inaccurate results. In this paper, we study community search on uncertain graphs and propose efficient algorithms to address the above two problems. We first propose a novel community model, namely the triangle-connected (k, γ)-truss community, to return communities with enhanced cohesiveness. Then, we propose an online algorithm that uses a batch-recalculation strategy to guarantee accuracy. To improve the performance of community search, we propose an index-based approach. This index organizes all the triangle-connected (k, γ)-truss communities in a forest structure and maintains the mapping from vertices in the uncertain graph to communities in the index. Based on this index, we can obtain community search results easily, without the costly computation required by the online approach. Finally, we conduct rich experiments on 10 real-world graphs. The experimental results verified the effectiveness and efficiency of our approaches.
Title: Truss community search in uncertain graphs
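The deterministic backbone of the community model above is the k-truss: iteratively peel edges whose support (number of triangles through the edge) drops below k - 2. The sketch below omits the paper's probabilistic γ constraint and triangle-connectivity grouping.

```python
def edge_support(edges, adj):
    """Support of an edge = number of triangles it participates in."""
    return {e: len(adj[e[0]] & adj[e[1]]) for e in edges}

def k_truss(edges, k):
    """Return the maximal subgraph where every edge has support >= k - 2."""
    edges = {tuple(sorted(e)) for e in edges}
    while True:
        adj = {}
        for u, v in edges:
            adj.setdefault(u, set()).add(v)
            adj.setdefault(v, set()).add(u)
        sup = edge_support(edges, adj)
        keep = {e for e in edges if sup[e] >= k - 2}
        if keep == edges:
            return keep
        edges = keep  # peeling may lower other edges' support, so iterate
```

For example, a complete graph on four vertices is a 4-truss (every edge lies in two triangles), while a pendant edge attached to it is peeled away even at k = 3.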
Pub Date: 2024-08-31 | DOI: 10.1007/s10115-024-02217-0
Ensieh Davoodijam, Mohsen Alambardar Meybodi
Automatic text summarization is the process of shortening a large document into a summary text that preserves the main concepts and key points of the original document. Due to the wide applications of text summarization, many studies have been conducted on it, but evaluating the quality of generated summaries poses significant challenges. Selecting appropriate evaluation metrics to capture various aspects of summarization quality, including content, structure, coherence, readability, novelty, and semantic relevance, plays a crucial role in text summarization applications. To address this challenge, the main focus of this study is on gathering and investigating a comprehensive set of evaluation metrics. Analyzing these metrics can enhance understanding of evaluation methods and help select appropriate evaluation schemes for text summarization systems in the future. After a short review of various automatic text summarization methods, we thoroughly analyze 42 prominent metrics, categorizing them into six distinct categories to provide insights into their strengths, limitations, and applicability.
Title: Evaluation metrics on text summarization: comprehensive survey
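To make the metric families concrete, here is a sketch of one widely used lexical-overlap metric, ROUGE-N recall (reference n-grams found in the candidate summary). Production implementations add stemming, stopword handling, and multi-reference aggregation, which are omitted here.

```python
from collections import Counter

def rouge_n_recall(candidate, reference, n=1):
    """ROUGE-N recall: fraction of reference n-grams matched in the candidate."""
    def ngrams(text):
        toks = text.lower().split()
        return Counter(tuple(toks[i:i + n]) for i in range(len(toks) - n + 1))
    cand, ref = ngrams(candidate), ngrams(reference)
    overlap = sum((cand & ref).values())  # clipped n-gram matches
    return overlap / max(sum(ref.values()), 1)
```

For instance, the candidate "the cat sat" against the reference "the cat sat on the mat" matches 3 of 6 reference unigrams, giving a ROUGE-1 recall of 0.5.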
Pub Date: 2024-08-30 | DOI: 10.1007/s10115-024-02203-6
Zhongyuan Chen, Yongji Wang
Social networks, characterized by their dynamic and continually evolving nature, present challenges for effective link prediction (LP) due to the constant addition of nodes and connections. In response to this, we propose a novel approach to LP in social networks through Node Embedding and Ensemble Learning (LP-NEEL). Our method constructs a transition matrix from the network’s adjacency matrix and computes similarity measures between node pairs. Utilizing node2vec embedding, we extract features from nodes and generate edge embeddings by computing the inner product of node embeddings for each edge. This process yields a well-labeled dataset suitable for LP tasks. To mitigate overfitting, we balance the dataset by ensuring an equal number of positive and negative edge samples during both the training and testing phases. Leveraging this balanced dataset, we employ the XGBoost machine learning algorithm for final link prediction. Extensive experimentation across six social network datasets validates the efficacy of our approach, demonstrating improved predictive performance compared to existing methods.
Title: Enhancing link prediction through node embedding and ensemble learning
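The edge-labeling step described above — inner products of endpoint embeddings plus balanced positive/negative labels — can be sketched as follows. The identity matrix stands in for learned node2vec embeddings, and the downstream XGBoost classifier is not shown.

```python
import numpy as np

def edge_features(emb, edges):
    """One scalar feature per edge: inner product of endpoint embeddings."""
    return np.array([emb[u] @ emb[v] for u, v in edges])

def build_edge_dataset(emb, pos_edges, neg_edges):
    """Labeled edge dataset for a downstream classifier.

    Passing equally sized positive and negative edge lists reproduces the
    balancing the paper applies to mitigate overfitting.
    """
    X = np.concatenate([edge_features(emb, pos_edges),
                        edge_features(emb, neg_edges)])
    y = np.array([1] * len(pos_edges) + [0] * len(neg_edges))
    return X, y
```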
Pub Date: 2024-08-29 | DOI: 10.1007/s10115-024-02207-2
Yanjin Li, Linchuan Xu, Kenji Yamanishi
Graph data augmentation (GDA), which manipulates graph structure and/or attributes, has been demonstrated as an effective method for improving the generalization of graph neural networks on semi-supervised node classification. As a data augmentation technique, label preservation is critical, that is, node labels should not change after data manipulation. However, most existing methods overlook the label preservation requirement. Determining the label-preserving nature of a GDA method is highly challenging, owing to the non-Euclidean nature of the graph structure. In this study, for the first time, we formulate a label-preserving problem (LPP) in the context of GDA. The LPP is formulated as an optimization problem in which, given a fixed augmentation budget, the objective is to find an augmented graph with minimal difference in data distribution compared to the original graph. To solve the LPP, we propose GMMDA, a generative data augmentation (DA) method based on Gaussian mixture modeling (GMM) of a graph in a latent space. We designed a novel learning objective that jointly learns a low-dimensional graph representation and estimates the GMM. The learning is followed by sampling from the GMM, and the samples are converted back to the graph as additional nodes. To uphold label preservation, we designed a minimum description length (MDL)-based method to select a set of samples that produces the minimum shift in the data distribution captured by the GMM. Through experiments, we demonstrate that GMMDA can improve the performance of graph convolutional networks on Cora, Citeseer and Pubmed by as much as 7.75%, 8.75% and 5.87%, respectively, significantly outperforming the state-of-the-art methods.
Title: GMMDA: Gaussian mixture modeling of graph in latent space for graph data augmentation
Pub Date : 2024-08-24 DOI: 10.1007/s10115-024-02211-6
Saumya Singh, Smriti Srivastava
Deep learning models (DLMs), such as recurrent neural networks (RNNs), long short-term memory (LSTM), bidirectional long short-term memory (Bi-LSTM), and gated recurrent units (GRUs), excel at sequential data analysis owing to their ability to learn complex patterns. This paper proposes enhancing the performance of these models by applying fuzzy c-means (FCM) clustering to sequential data from a nonlinear plant and the stock market. FCM clustering organizes the data into clusters based on similarity, which improves model performance. The resulting fuzzy c-means recurrent neural network (FCM-RNN), fuzzy c-means long short-term memory (FCM-LSTM), fuzzy c-means bidirectional long short-term memory (FCM-Bi-LSTM), and fuzzy c-means gated recurrent unit (FCM-GRU) models achieved better prediction results than the RNN, LSTM, Bi-LSTM, and GRU models, respectively. This improvement is validated using performance metrics such as root-mean-square error and mean absolute error, and is further illustrated by scatter plots comparing actual and predicted values on the training, validation, and testing data. The experimental results confirm that integrating FCM clustering with DLMs yields superior models.
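The FCM preprocessing step can be sketched as a minimal hand-rolled fuzzy c-means in NumPy. This is our own illustrative sketch, not the paper's code: the function `fcm` and its defaults are assumptions, and the soft membership matrix it returns is what would be combined with the raw sequence features before training a DLM.

```python
import numpy as np

def fcm(X, c=2, m=2.0, n_iter=100, seed=0):
    """Fuzzy c-means on X (n_samples x n_features).
    Returns (centers, U) where U[i, j] is sample i's degree of
    membership in cluster j; each row of U sums to 1."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], c))
    U /= U.sum(axis=1, keepdims=True)          # random soft partition
    for _ in range(n_iter):
        Um = U ** m                            # fuzzified memberships
        centers = (Um.T @ X) / Um.sum(axis=0)[:, None]
        # distances of every sample to every center (n x c)
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2) + 1e-10
        inv = d ** (-2.0 / (m - 1.0))          # standard FCM update
        U = inv / inv.sum(axis=1, keepdims=True)
    return centers, U
```

The membership matrix can then be appended to the inputs, e.g. `X_aug = np.hstack([X, U])`, before windowing the sequence for the RNN/LSTM/Bi-LSTM/GRU.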
{"title":"Enhancing the performance of deep learning models with fuzzy c-means clustering","authors":"Saumya Singh, Smriti Srivastava","doi":"10.1007/s10115-024-02211-6","DOIUrl":"https://doi.org/10.1007/s10115-024-02211-6","url":null,"abstract":"<p>Deep learning models (DLMs), such as recurrent neural networks (RNN), long short-term memory (LSTM), bidirectional long short-term memory (Bi-LSTM), and gated recurrent unit (GRU), are superior for sequential data analysis due to their ability to learn complex patterns. This paper proposes enhancing performance of these models by applying fuzzy c-means (FCM) clustering on sequential data from a nonlinear plant and the stock market. FCM clustering helps to organize the data into clusters based on similarity, which improves the performance of the models. Thus, the proposed fuzzy c-means recurrent neural network (FCM-RNN), fuzzy c-means long short-term memory (FCM-LSTM), fuzzy c-means bidirectional long short-term memory (FCM-Bi-LSTM), and fuzzy c-means gated recurrent unit (FCM-GRU) models showed enhanced prediction results than RNN, LSTM, Bi-LSTM, and GRU models, respectively. This enhancement is validated using performance metrics such as root-mean-square error and mean absolute error and is further illustrated by scatter plots comparing actual versus predicted values for training, validation, and testing data. The experiment results confirm that integrating FCM clustering with DLMs shows the superiority of the proposed models.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"36 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142203961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}