Frontiers of Computer Science最新文献

英文中文

A comprehensive survey of federated transfer learning: challenges, methods and applications 联合转移学习综合调查：挑战、方法和应用

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2024-07-23 DOI: 10.1007/s11704-024-40065-x

Wei Guo, Fuzhen Zhuang, Xiao Zhang, Yiqi Tong, Jin Dong

Federated learning (FL) is a novel distributed machine learning paradigm that enables participants to collaboratively train a centralized model with privacy preservation by eliminating the requirement of data sharing. In practice, FL often involves multiple participants and requires the third party to aggregate global information to guide the update of the target participant. Therefore, many FL methods do not work well due to the training and test data of each participant may not be sampled from the same feature space and the same underlying distribution. Meanwhile, the differences in their local devices (system heterogeneity), the continuous influx of online data (incremental data), and labeled data scarcity may further influence the performance of these methods. To solve this problem, federated transfer learning (FTL), which integrates transfer learning (TL) into FL, has attracted the attention of numerous researchers. However, since FL enables a continuous share of knowledge among participants with each communication round while not allowing local data to be accessed by other participants, FTL faces many unique challenges that are not present in TL. In this survey, we focus on categorizing and reviewing the current progress on federated transfer learning, and outlining corresponding solutions and applications. Furthermore, the common setting of FTL scenarios, available datasets, and significant related research are summarized in this survey.

联合学习（FL）是一种新颖的分布式机器学习范式，它通过消除数据共享要求，使参与者能够在保护隐私的前提下协作训练一个集中模型。在实践中，FL 通常涉及多个参与者，需要第三方汇总全局信息来指导目标参与者的更新。因此，由于每个参与者的训练数据和测试数据可能不是从相同的特征空间和相同的底层分布中采样，许多 FL 方法都不能很好地发挥作用。同时，他们本地设备的差异（系统异质性）、在线数据（增量数据）的不断涌入以及标记数据的稀缺性可能会进一步影响这些方法的性能。为解决这一问题，将迁移学习（TL）集成到 FL 中的联合迁移学习（FTL）吸引了众多研究人员的关注。然而，由于 FL 能够让参与者在每一轮交流中持续共享知识，同时又不允许其他参与者访问本地数据，因此 FTL 面临着许多 TL 中不存在的独特挑战。在本调查报告中，我们将重点对当前联合迁移学习的进展进行分类和回顾，并概述相应的解决方案和应用。此外，本调查还总结了 FTL 的常见场景设置、可用数据集和重要的相关研究。

{"title":"A comprehensive survey of federated transfer learning: challenges, methods and applications","authors":"Wei Guo, Fuzhen Zhuang, Xiao Zhang, Yiqi Tong, Jin Dong","doi":"10.1007/s11704-024-40065-x","DOIUrl":"https://doi.org/10.1007/s11704-024-40065-x","url":null,"abstract":"Federated learning (FL) is a novel distributed machine learning paradigm that enables participants to collaboratively train a centralized model with privacy preservation by eliminating the requirement of data sharing. In practice, FL often involves multiple participants and requires the third party to aggregate global information to guide the update of the target participant. Therefore, many FL methods do not work well due to the training and test data of each participant may not be sampled from the same feature space and the same underlying distribution. Meanwhile, the differences in their local devices (system heterogeneity), the continuous influx of online data (incremental data), and labeled data scarcity may further influence the performance of these methods. To solve this problem, federated transfer learning (FTL), which integrates transfer learning (TL) into FL, has attracted the attention of numerous researchers. However, since FL enables a continuous share of knowledge among participants with each communication round while not allowing local data to be accessed by other participants, FTL faces many unique challenges that are not present in TL. In this survey, we focus on categorizing and reviewing the current progress on federated transfer learning, and outlining corresponding solutions and applications. Furthermore, the common setting of FTL scenarios, available datasets, and significant related research are summarized in this survey.","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"93 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141770853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DMFVAE: miRNA-disease associations prediction based on deep matrix factorization method with variational autoencoder DMFVAE：基于变异自动编码器的深度矩阵因式分解方法的 miRNA-疾病关联预测

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2024-07-12 DOI: 10.1007/s11704-023-3610-y

Pijing Wei, Qianqian Wang, Zhen Gao, Ruifen Cao, Chunhou Zheng

MicroRNAs (miRNAs) are closely related to numerous complex human diseases, therefore, exploring miRNA-disease associations (MDAs) can help people gain a better understanding of complex disease mechanism. An increasing number of computational methods have been developed to predict MDAs. However, the sparsity of the MDAs may hinder the performance of many methods. In addition, many methods fail to capture the nonlinear relationships of miRNA-disease network and inadequately leverage the features of network and neighbor nodes. In this study, we propose a deep matrix factorization model with variational autoencoder (DMFVAE) to predict potential MDAs. DMFVAE first decomposes the original association matrix and the enhanced association matrix, in which the enhanced association matrix is enhanced by self-adjusting the nearest neighbor method, to obtain sparse vectors and dense vectors, respectively. Then, the variational encoder is employed to obtain the nonlinear latent vectors of miRNA and disease for the sparse vectors, and meanwhile, node2vec is used to obtain the network structure embedding vectors of miRNA and disease for the dense vectors. Finally, sample features are acquired by combining the latent vectors and network structure embedding vectors, and the final prediction is implemented by convolutional neural network with channel attention. To evaluate the performance of DMFVAE, we conduct five-fold cross validation on the HMDD v2.0 and HMDD v3.2 datasets and the results show that DMFVAE performs well. Furthermore, case studies on lung neoplasms, colon neoplasms, and esophageal neoplasms confirm the ability of DMFVAE in identifying potential miRNAs for human diseases.

微RNA（miRNA）与人类多种复杂疾病密切相关，因此，探索miRNA与疾病的关联（MDAs）有助于人们更好地了解复杂的疾病机制。目前已开发出越来越多的计算方法来预测 MDAs。然而，MDAs 的稀疏性可能会阻碍许多方法的性能。此外，许多方法未能捕捉到 miRNA-疾病网络的非线性关系，也未能充分利用网络和邻近节点的特征。在这项研究中，我们提出了一种带有变异自动编码器（DMFVAE）的深度矩阵因式分解模型来预测潜在的 MDAs。DMFVAE 首先分解原始关联系数矩阵和增强关联系数矩阵，其中增强关联系数矩阵通过自调整近邻法增强，分别得到稀疏向量和稠密向量。然后，利用变异编码器获取稀疏向量中 miRNA 和疾病的非线性潜向量，同时利用 node2vec 获取密集向量中 miRNA 和疾病的网络结构嵌入向量。最后，结合潜向量和网络结构嵌入向量获得样本特征，并通过具有通道注意的卷积神经网络实现最终预测。为了评估 DMFVAE 的性能，我们在 HMDD v2.0 和 HMDD v3.2 数据集上进行了五倍交叉验证，结果表明 DMFVAE 性能良好。此外，对肺部肿瘤、结肠肿瘤和食管肿瘤的案例研究也证实了 DMFVAE 识别人类疾病潜在 miRNA 的能力。

{"title":"DMFVAE: miRNA-disease associations prediction based on deep matrix factorization method with variational autoencoder","authors":"Pijing Wei, Qianqian Wang, Zhen Gao, Ruifen Cao, Chunhou Zheng","doi":"10.1007/s11704-023-3610-y","DOIUrl":"https://doi.org/10.1007/s11704-023-3610-y","url":null,"abstract":"MicroRNAs (miRNAs) are closely related to numerous complex human diseases, therefore, exploring miRNA-disease associations (MDAs) can help people gain a better understanding of complex disease mechanism. An increasing number of computational methods have been developed to predict MDAs. However, the sparsity of the MDAs may hinder the performance of many methods. In addition, many methods fail to capture the nonlinear relationships of miRNA-disease network and inadequately leverage the features of network and neighbor nodes. In this study, we propose a deep matrix factorization model with variational autoencoder (DMFVAE) to predict potential MDAs. DMFVAE first decomposes the original association matrix and the enhanced association matrix, in which the enhanced association matrix is enhanced by self-adjusting the nearest neighbor method, to obtain sparse vectors and dense vectors, respectively. Then, the variational encoder is employed to obtain the nonlinear latent vectors of miRNA and disease for the sparse vectors, and meanwhile, node2vec is used to obtain the network structure embedding vectors of miRNA and disease for the dense vectors. Finally, sample features are acquired by combining the latent vectors and network structure embedding vectors, and the final prediction is implemented by convolutional neural network with channel attention. To evaluate the performance of DMFVAE, we conduct five-fold cross validation on the HMDD v2.0 and HMDD v3.2 datasets and the results show that DMFVAE performs well. Furthermore, case studies on lung neoplasms, colon neoplasms, and esophageal neoplasms confirm the ability of DMFVAE in identifying potential miRNAs for human diseases.","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"218 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141610926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Graph foundation model 图形基础模型

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2024-07-05 DOI: 10.1007/s11704-024-40046-0

Chuan Shi, Junze Chen, Jiawei Liu, Cheng Yang

Graph Foundation Models represent an evolving direction in graph machine learning. Drawing inspiration from the success of Large Language Models in NLP, GFMs are designed to be trained on extensive graph data and adapted for a diverse array of downstream tasks. In this article, we have explained and introduced the concept of GFMs, comparing them with Language Foundation Models to highlight their similarities and differences. We identified the key technologies in building GFMs as the pre-train and adaptation techniques from the fields of GNNs and LLMs. Additionally, we discussed the potential for GFMs to have significant applications in various domains, ranging from social network analysis to bioinformatics and beyond.

图形基础模型代表了图形机器学习的发展方向。图基础模型从大型语言模型在 NLP 领域的成功中汲取灵感，旨在对大量图数据进行训练，并适用于各种下游任务。在本文中，我们解释并介绍了 GFMs 的概念，并将其与语言基础模型进行了比较，以突出它们之间的异同。我们确定了构建 GFMs 的关键技术，即来自 GNN 和 LLMs 领域的预训练和适应技术。此外，我们还讨论了 GFMs 在从社交网络分析到生物信息学等各个领域的重要应用潜力。

引用次数: 0

SEOE: an option graph based semantically embedding method for prenatal depression detection SEOE：基于选项图的产前抑郁症语义嵌入检测方法

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2024-06-27 DOI: 10.1007/s11704-024-3612-4

Xiaosong Han, Mengchen Cao, Dong Xu, Xiaoyue Feng, Yanchun Liang, Xiaoduo Lang, Renchu Guan

Prenatal depression, which can affect pregnant women’s physical and psychological health and cause postpartum depression, is increasing dramatically. Therefore, it is essential to detect prenatal depression early and conduct an attribution analysis. Many studies have used questionnaires to screen for prenatal depression, but the existing methods lack attributability. To diagnose the early signs of prenatal depression and identify the key factors that may lead to prenatal depression from questionnaires, we present the semantically enhanced option embedding (SEOE) model to represent questionnaire options. It can quantitatively determine the relationship and patterns between options and depression. SEOE first quantifies options and resorts them, gathering options with little difference, since Word2Vec is highly dependent on context. The resort task is transformed into an optimization problem involving the traveling salesman problem. Moreover, all questionnaire samples are used to train the options’ vector using Word2Vec. Finally, an LSTM and GRU fused model incorporating the cycle learning rate is constructed to detect whether a pregnant woman is suffering from depression. To verify the model, we compare it with other deep learning and traditional machine learning methods. The experiment results show that our proposed model can accurately identify pregnant women with depression and reach an F1 score of 0.8. The most relevant factors of depression found by SEOE are also verified in the literature. In addition, our model is of low computational complexity and strong generalization, which can be widely applied to other questionnaire analyses of psychiatric disorders.

产前抑郁症会影响孕妇的身心健康并导致产后抑郁，其发病率正急剧上升。因此，及早发现产前抑郁症并进行归因分析至关重要。许多研究采用问卷调查来筛查产前抑郁症，但现有方法缺乏可归因性。为了诊断产前抑郁症的早期征兆，并从问卷中找出可能导致产前抑郁症的关键因素，我们提出了语义增强选项嵌入（SEOE）模型来表示问卷选项。它可以定量确定选项与抑郁之间的关系和模式。由于 Word2Vec 高度依赖于上下文，因此 SEOE 首先对选项进行量化并对其进行排序，收集差异不大的选项。将排序任务转化为涉及旅行推销员问题的优化问题。此外，所有问卷样本都将用于使用 Word2Vec 训练选项向量。最后，我们构建了一个包含周期学习率的 LSTM 和 GRU 融合模型，用于检测孕妇是否患有抑郁症。为了验证该模型，我们将其与其他深度学习方法和传统机器学习方法进行了比较。实验结果表明，我们提出的模型可以准确地识别出患有抑郁症的孕妇，F1 得分为 0.8。SEOE 发现的抑郁症最相关因素也在文献中得到了验证。此外，我们的模型计算复杂度低，泛化能力强，可广泛应用于其他精神疾病的问卷分析。

{"title":"SEOE: an option graph based semantically embedding method for prenatal depression detection","authors":"Xiaosong Han, Mengchen Cao, Dong Xu, Xiaoyue Feng, Yanchun Liang, Xiaoduo Lang, Renchu Guan","doi":"10.1007/s11704-024-3612-4","DOIUrl":"https://doi.org/10.1007/s11704-024-3612-4","url":null,"abstract":"Prenatal depression, which can affect pregnant women’s physical and psychological health and cause postpartum depression, is increasing dramatically. Therefore, it is essential to detect prenatal depression early and conduct an attribution analysis. Many studies have used questionnaires to screen for prenatal depression, but the existing methods lack attributability. To diagnose the early signs of prenatal depression and identify the key factors that may lead to prenatal depression from questionnaires, we present the semantically enhanced option embedding (SEOE) model to represent questionnaire options. It can quantitatively determine the relationship and patterns between options and depression. SEOE first quantifies options and resorts them, gathering options with little difference, since Word2Vec is highly dependent on context. The resort task is transformed into an optimization problem involving the traveling salesman problem. Moreover, all questionnaire samples are used to train the options’ vector using Word2Vec. Finally, an LSTM and GRU fused model incorporating the cycle learning rate is constructed to detect whether a pregnant woman is suffering from depression. To verify the model, we compare it with other deep learning and traditional machine learning methods. The experiment results show that our proposed model can accurately identify pregnant women with depression and reach an F1 score of 0.8. The most relevant factors of depression found by SEOE are also verified in the literature. In addition, our model is of low computational complexity and strong generalization, which can be widely applied to other questionnaire analyses of psychiatric disorders.","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"3 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141501822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

WPIA: accelerating DNN warm-up in Web browsers by precompiling WebGL programs WPIA：通过预编译 WebGL 程序加速 DNN 在网络浏览器中的预热

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2024-06-25 DOI: 10.1007/s11704-024-40066-w

Deyu Tian, Yun Ma, Yudong Han, Qi Yang, Haochen Yang, Gang Huang

In this paper, we study the long warm-up time of GPU acceleration of DNN inference in Web browsers. We analyzed the reason behind the long warm-up time through a measurement study and revealed that compiling WebGL programs takes most of the warm-up time. Inspired by this finding, we proposed WPIA, an approach that suggests precompiling WebGL programs on the server side to avoid compiling them in Web browsers. WPIA tackles the challenges of precompiling by merging WebGL programs and using a record-and-replay technique. Evaluation experiment results show that WPIA can accelerate the DNN warm-up time to an order of magnitude.

本文研究了网络浏览器中 DNN 推理的 GPU 加速预热时间过长的问题。我们通过测量研究分析了预热时间长的原因，发现编译 WebGL 程序占用了大部分预热时间。受这一发现的启发，我们提出了 WPIA，一种建议在服务器端预编译 WebGL 程序以避免在 Web 浏览器中编译它们的方法。WPIA 通过合并 WebGL 程序和使用记录与重放技术来应对预编译的挑战。评估实验结果表明，WPIA 可以将 DNN 预热时间加快一个数量级。

引用次数: 0

FedTop: a constraint-loosed federated learning aggregation method against poisoning attack FedTop：针对中毒攻击的限制松散联合学习聚合方法

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2024-06-25 DOI: 10.1007/s11704-024-3767-z

Che Wang, Zhenhao Wu, Jianbo Gao, Jiashuo Zhang, Junjie Xia, Feng Gao, Zhi Guan, Zhong Chen

In this paper, we developed FedTop which significantly facilitates collaboration effectiveness between normal participants without suffering significant negative impacts from malicious participants. FedTop can both be regarded as a normal aggregation method for federated learning with normal data and stand more severe poisoning attacks including targeted and untargeted attacks with more loosen preconditions. In addition, we experimentally demonstrate that this method can significantly improve the learning performance in a malicious environment. However, our work still faces much limitations on data set choosing, base model choosing and the number of malicious models. Thus, our future work will be focused on experimentation with more scenarios, such as increasing the number of participants or designing more complex poisoning attacks on more complex data sets.

在本文中，我们开发了 FedTop，它大大提高了正常参与者之间的协作效率，而不会受到恶意参与者的严重负面影响。FedTop 既可以被视为正常数据联合学习的正常聚合方法，也可以抵御更严重的中毒攻击，包括具有更宽松前提条件的定向和非定向攻击。此外，我们还通过实验证明，这种方法能显著提高恶意环境下的学习性能。然而，我们的工作在数据集选择、基础模型选择和恶意模型数量方面仍面临很多限制。因此，我们未来的工作将集中在更多场景的实验上，如增加参与者的数量或在更复杂的数据集上设计更复杂的中毒攻击。

引用次数: 0

Audio-guided self-supervised learning for disentangled visual speech representations 针对分离视觉语音表征的音频引导自监督学习

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2024-06-25 DOI: 10.1007/s11704-024-3787-8

Dalu Feng, Shuang Yang, Shiguang Shan, Xilin Chen

In this paper, we propose a novel two-branch framework to learn the disentangled visual speech representations based on two particular observations. Its main idea is to introduce the audio signal to guide the learning of speech-relevant cues and introduce a bottleneck to restrict the speech-irrelevant branch from learning high-frequency and fine-grained speech cues. Experiments on both the word-level and sentence-level audio-visual speech datasets LRW and LRS2-BBC show the effectiveness. Our future work is to explore more explicit auxiliary tasks and constraints beyond the reconstruction task of the speech-relevant and irrelevant branch to improve further its ability of capturing speech cues in the video. Meanwhile, it’s also a nice try to combine multiple types of knowledge representations [10] to further boost the obtained speech epresentations, which is also left for the future work.

在本文中，我们提出了一个新颖的双分支框架，基于两个特定的观察结果来学习分离的视觉语音表征。其主要思想是引入音频信号来引导语音相关线索的学习，并引入一个瓶颈来限制语音无关分支学习高频和细粒度语音线索。在单词级和句子级视听语音数据集 LRW 和 LRS2-BBC 上进行的实验显示了这种方法的有效性。我们未来的工作是在语音相关和不相关分支的重构任务之外，探索更明确的辅助任务和约束条件，以进一步提高其捕捉视频中语音线索的能力。同时，结合多种类型的知识表征[10]来进一步提高语音表征的效果也是一个不错的尝试，这也是未来工作的重点。

引用次数: 0

JAPO: learning join and pushdown order for cloud-native join optimization JAPO：学习连接和下推顺序，实现云原生连接优化

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2024-06-25 DOI: 10.1007/s11704-024-3937-z

Yuchen Yuan, Xiaoyue Feng, Bo Zhang, Pengyi Zhang, Jie Song

In this paper, we introduce JAPO which learn the join and pushdown order through DRL. The main idea is that the DRL agent learns better decisions based on the experiences by monitoring the rewards and latencies via trying different actions. The results show that our method can generate good plans both on join order and pushdown order. We also show that our method can select the well-performed distributed index placement via experiments. In the future, we plan to deploy JAPO to real systems execution and consider more factors in JAPO, such as different join types.

在本文中，我们介绍了通过 DRL 学习加入和下推顺序的 JAPO。其主要思想是，DRL 代理通过监控尝试不同行动的回报和延迟，根据经验学习更好的决策。结果表明，我们的方法能根据连接顺序和下推顺序生成良好的计划。我们还通过实验证明，我们的方法可以选择性能良好的分布式索引位置。未来，我们计划将 JAPO 部署到实际系统中执行，并在 JAPO 中考虑更多因素，如不同的连接类型。

引用次数: 0

TV100: a TV series dataset that pre-trained CLIP has not seen TV100：预训练 CLIP 未见过的电视剧数据集

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2024-06-06 DOI: 10.1007/s11704-024-40217-z

Da-Wei Zhou, Zhi-Hong Qi, Han-Jia Ye, De-Chuan Zhan

The era of pre-trained models has ushered in a wealth of new insights for the machine learning community. Among the myriad of questions that arise, one of paramount importance is: ‘Do pre-trained models possess comprehensive knowledge?’ This paper seeks to address this crucial inquiry. In line with our objective, we have made publicly available a novel dataset comprised of images from TV series released post-2021. This dataset holds significant potential for use in various research areas, including the evaluation of novel class iscovery and long-tailed learning, among others.

预训练模型时代为机器学习界带来了大量新见解。在出现的无数问题中，最重要的一个问题是："预训练模型是否拥有全面的知识？本文试图解决这一关键问题。根据我们的目标，我们公开了一个由 2021 年后发布的电视剧图像组成的新数据集。该数据集在多个研究领域都具有巨大的应用潜力，其中包括对新类别识别和长尾学习等的评估。

引用次数: 0

HeterMM: applying in-DRAM index to heterogeneous memory-based key-value stores HeterMM：将内存内索引应用于基于异构内存的键值存储

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2024-04-08 DOI: 10.1007/s11704-024-3713-0

Yunhong Ji, Wentao Huang, Xuan Zhou

We propose HeterMM, a versatile framework that leverages in-DRAM indexes in KV stores on heterogeneous memory. HeterMM incorporates a plug-in programming model, allowing for the integration of various types of indexes. By prioritizing the maintenance of both index and hot data in DRAM, HeterMM maximizes the utilization of the superior performance of DRAM. Our evaluation demonstrates that HeterMM outperforms existing state-of-the-art frameworks that convert in-DRAM indexes to persistent ones. Furthermore, HeterMM can surpass NVM-specific KV stores by carefully selecting the appropriate index for specific scenarios.

我们提出的 HeterMM 是一个多功能框架，可在异构内存的 KV 存储中利用内存中的索引。HeterMM 采用插件编程模型，允许集成各种类型的索引。通过优先在 DRAM 中维护索引和热数据，HeterMM 最大限度地利用了 DRAM 的卓越性能。我们的评估表明，HeterMM 优于将 DRAM 中的索引转换为持久性索引的现有先进框架。此外，HeterMM 还能通过为特定场景精心选择合适的索引，超越特定于 NVM 的 KV 存储。

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Frontiers of Computer Science

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀