Pub Date : 2024-05-24DOI: 10.1007/s11280-024-01273-4
Akif Quddus Khan, M. Matskin, R.-C. Prodan, Christoph Bussler, Dumitru Roman, A. Soylu
{"title":"Cloud storage cost: a taxonomy and survey","authors":"Akif Quddus Khan, M. Matskin, R.-C. Prodan, Christoph Bussler, Dumitru Roman, A. Soylu","doi":"10.1007/s11280-024-01273-4","DOIUrl":"https://doi.org/10.1007/s11280-024-01273-4","url":null,"abstract":"","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141099103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Deep neural networks (DNN) exhibit powerful feature extraction capabilities, making them highly advantageous in numerous tasks. DNN-based techniques have become widely adopted in the field of speaker recognition. However, imperceptible adversarial perturbations can severely disrupt the decisions made by DNNs. In addition, researchers identified universal adversarial perturbations that can efficiently and significantly attack deep neural networks. In this paper, we propose an algorithm for conducting effective universal adversarial attacks by investigating the dominant features in the speaker recognition task. Through experiments in various scenarios, we find that our perturbations are not only more effective and undetectable but also exhibit a certain degree of transferablity across different datasets and models.
{"title":"Transferable universal adversarial perturbations against speaker recognition systems","authors":"Xiaochen Liu, Hao Tan, Junjian Zhang, Aiping Li, Zhaoquan Gu","doi":"10.1007/s11280-024-01274-3","DOIUrl":"https://doi.org/10.1007/s11280-024-01274-3","url":null,"abstract":"<p>Deep neural networks (DNN) exhibit powerful feature extraction capabilities, making them highly advantageous in numerous tasks. DNN-based techniques have become widely adopted in the field of speaker recognition. However, imperceptible adversarial perturbations can severely disrupt the decisions made by DNNs. In addition, researchers identified universal adversarial perturbations that can efficiently and significantly attack deep neural networks. In this paper, we propose an algorithm for conducting effective universal adversarial attacks by investigating the dominant features in the speaker recognition task. Through experiments in various scenarios, we find that our perturbations are not only more effective and undetectable but also exhibit a certain degree of transferablity across different datasets and models.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140939286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-08DOI: 10.1007/s11280-024-01269-0
Mark Klein, Nouhayla Majdoubi
Humanity needs to deliberate effectively at scale about highly complex and contentious problems. Current online deliberation tools—such as email, chatrooms, and forums—are however plagued by levels of discussion toxicity that deeply undercut the willingness and ability of the participants to engage in thoughtful, meaningful, deliberations. This has led many organizations to either shut down their forums or invest in expensive, frequently unreliable, and ethically fraught moderation of people's contributions in their forums. This paper includes a comprehensive review on online toxicity, and describes how a structured deliberation process can substantially reduce toxicity compared to current approaches. The key underlying insight is that unstructured conversations create, especially at scale, an “attention wars” dynamic wherein people are often incented to resort to extremified language in order to get visibility for their postings. A structured deliberation process wherein people collaboratively create a compact organized collection of answers and arguments removes this underlying incentive, and results, in our evaluation, in a 50% reduction of high-toxicity posts.
{"title":"The medium is the message: toxicity declines in structured vs unstructured online deliberations","authors":"Mark Klein, Nouhayla Majdoubi","doi":"10.1007/s11280-024-01269-0","DOIUrl":"https://doi.org/10.1007/s11280-024-01269-0","url":null,"abstract":"<p>Humanity needs to deliberate effectively <i>at scale</i> about highly complex and contentious problems. Current online deliberation tools—such as email, chatrooms, and forums—are however plagued by levels of discussion toxicity that deeply undercut the willingness and ability of the participants to engage in thoughtful, meaningful, deliberations. This has led many organizations to either shut down their forums or invest in expensive, frequently unreliable, and ethically fraught moderation of people's contributions in their forums. This paper includes a comprehensive review on online toxicity, and describes how a structured deliberation process can substantially reduce toxicity compared to current approaches. The key underlying insight is that unstructured conversations create, especially at scale, an “attention wars” dynamic wherein people are often incented to resort to extremified language in order to get visibility for their postings. A structured deliberation process wherein people collaboratively create a compact organized collection of answers and arguments <i>removes</i> this underlying incentive, and results, in our evaluation, in a 50% reduction of high-toxicity posts.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140939246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Graph Neural Networks (GNNs) have achieved remarkable success in diverse real-world applications. Traditional GNNs are designed based on homophily, which leads to poor performance under heterophily scenarios. Most current solutions deal with heterophily mainly by modeling the heterophily edges as data noises or high-frequency signals, treating all heterophilic edges as being of the same semantic. Consequently, they ignore the rich semantic information of these edges in heterophily graphs. To overcome this critic problem, we propose a novel GNN model based on relation vector translation named as Variational Relation Vector Graph Neural Network (VR-GNN). VR-GNN models relation generation and graph aggregation into an end-to-end model based on a variational inference framework. To be specific, the encoder utilizes the structure, feature and label to generate a fine-grained relation vector for each edge, which aims to infer its implicit semantic information. The decoder incorporates the generated relation vectors into the message-passing framework for deriving better node representations. We conduct extensive experiments on eight real-world datasets with different homophily-heterophily properties to verify model effectiveness. Extensive experimental results show that VR-GNN gains consistent and significant improvements against existing strong GNN methods under heterophily and competitive performance under homophily.
{"title":"VR-GNN: variational relation vector graph neural network for modeling homophily and heterophily","authors":"Fengzhao Shi, Yanan Cao, Ren Li, Xixun Lin, Yanmin Shang, Chuan Zhou, Jia Wu, Shirui Pan","doi":"10.1007/s11280-024-01261-8","DOIUrl":"https://doi.org/10.1007/s11280-024-01261-8","url":null,"abstract":"<p>Graph Neural Networks (GNNs) have achieved remarkable success in diverse real-world applications. Traditional GNNs are designed based on homophily, which leads to poor performance under heterophily scenarios. Most current solutions deal with heterophily mainly by modeling the heterophily edges as data noises or high-frequency signals, treating all heterophilic edges as being of the same semantic. Consequently, they ignore the rich semantic information of these edges in heterophily graphs. To overcome this critic problem, we propose a novel GNN model based on relation vector translation named as <b>V</b>ariational <b>R</b>elation Vector <b>G</b>raph <b>N</b>eural <b>N</b>etwork (<b>VR-GNN</b>). VR-GNN models relation generation and graph aggregation into an end-to-end model based on a variational inference framework. To be specific, the encoder utilizes the structure, feature and label to generate a fine-grained relation vector for each edge, which aims to infer its implicit semantic information. The decoder incorporates the generated relation vectors into the message-passing framework for deriving better node representations. We conduct extensive experiments on eight real-world datasets with different homophily-heterophily properties to verify model effectiveness. Extensive experimental results show that VR-GNN gains consistent and significant improvements against existing strong GNN methods under heterophily and competitive performance under homophily.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140939284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-02DOI: 10.1007/s11280-024-01266-3
Chang Liu, Yong Luo, Yongchao Xu, Bo Du
In tackling Tuberculosis (TB), a critical global health challenge, the integration of Foundation Models (FMs) into diagnostic processes represents a significant advance. FMs, with their extensive pre-training on diverse datasets, hold the promise of transforming TB diagnosis by leveraging their deep understanding and analytical capabilities. However, the application of these models in healthcare is complicated by the need to protect patient privacy, particularly when dealing with sensitive TB data from various medical centers. Our novel approach, FedARC, addresses this issue through personalized federated learning (PFL), enabling the use of private data without direct access. FedARC innovatively navigates data heterogeneity and privacy concerns by employing adaptive regularization and model-contrastive learning. This method not only aligns each center’s objective function with the global loss’s stationary point but also enhances model generalization across disparate data sources. Comprehensive evaluations on five publicly available chest X-ray image datasets demonstrate that foundation models profoundly influence outcomes, with our proposed method significantly surpassing contemporary methodologies in various scenarios.
结核病(TB)是全球健康面临的重大挑战,在应对这一挑战的过程中,将基础模型(FMs)融入诊断流程是一项重大进步。基础模型在各种数据集上进行了广泛的预训练,有望利用其深入理解和分析能力改变结核病诊断。然而,由于需要保护患者隐私,特别是在处理来自不同医疗中心的敏感结核病数据时,这些模型在医疗保健领域的应用就变得复杂起来。我们的新方法 FedARC 通过个性化联合学习 (PFL) 解决了这一问题,使私人数据的使用无需直接访问。FedARC 通过采用自适应正则化和模型对比学习,创新性地解决了数据异质性和隐私问题。这种方法不仅能使每个中心的目标函数与全局损失的静止点保持一致,还能增强不同数据源之间的模型泛化。在五个公开的胸部 X 光图像数据集上进行的综合评估表明,基础模型对结果有深远的影响,我们提出的方法在各种情况下都大大超过了当代的方法。
{"title":"Foundation models matter: federated learning for multi-center tuberculosis diagnosis via adaptive regularization and model-contrastive learning","authors":"Chang Liu, Yong Luo, Yongchao Xu, Bo Du","doi":"10.1007/s11280-024-01266-3","DOIUrl":"https://doi.org/10.1007/s11280-024-01266-3","url":null,"abstract":"<p>In tackling Tuberculosis (TB), a critical global health challenge, the integration of Foundation Models (FMs) into diagnostic processes represents a significant advance. FMs, with their extensive pre-training on diverse datasets, hold the promise of transforming TB diagnosis by leveraging their deep understanding and analytical capabilities. However, the application of these models in healthcare is complicated by the need to protect patient privacy, particularly when dealing with sensitive TB data from various medical centers. Our novel approach, FedARC, addresses this issue through personalized federated learning (PFL), enabling the use of private data without direct access. FedARC innovatively navigates data heterogeneity and privacy concerns by employing adaptive regularization and model-contrastive learning. This method not only aligns each center’s objective function with the global loss’s stationary point but also enhances model generalization across disparate data sources. Comprehensive evaluations on five publicly available chest X-ray image datasets demonstrate that foundation models profoundly influence outcomes, with our proposed method significantly surpassing contemporary methodologies in various scenarios.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140829383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Recommending medications with electronic health records (EHRs) is a challenging task for data-driven clinical decision support systems. Most existing models learnt representations for medical concepts based on EHRs and make recommendations with the learnt representations. However, most medications appear in EHR datasets for limited times (the frequency distribution of medications follows power law distribution), resulting in insufficient learning of their representations of the medications. Medical ontologies are the hierarchical classification systems for medical terms where similar terms will be in the same class on a certain level. In this paper, we propose OntoMedRec, the logically-pretrained and model-agnostic medical Ontology Encoders for Medication Recommendation that addresses data sparsity problem with medical ontologies. We conduct comprehensive experiments on real-world EHR datasets to evaluate the effectiveness of OntoMedRec by integrating it into various existing downstream medication recommendation models. The result shows the integration of OntoMedRec improves the performance of various models in both the entire EHR datasets and the admissions with few-shot medications. We provide the GitHub repository for the source code. (https://github.com/WaicongTam/OntoMedRec)
对于数据驱动的临床决策支持系统来说,利用电子健康记录(EHR)推荐药物是一项具有挑战性的任务。大多数现有模型都基于电子健康记录学习医疗概念的表征,并利用学习到的表征进行推荐。然而,大多数药物在电子病历数据集中出现的时间有限(药物的频率分布遵循幂律分布),导致对药物表征的学习不足。医学本体是医学术语的分层分类系统,相似的术语在一定层次上属于同一类别。在本文中,我们提出了用于用药推荐的经过逻辑预训练和模型无关的医学本体编码器 OntoMedRec,以解决医学本体的数据稀疏性问题。我们在真实世界的电子病历数据集上进行了综合实验,通过将 OntoMedRec 集成到现有的各种下游用药推荐模型中来评估其有效性。结果表明,OntoMedRec 的集成提高了各种模型在整个 EHR 数据集和少量药物入院中的性能。我们提供了源代码的 GitHub 代码库。(https://github.com/WaicongTam/OntoMedRec)
{"title":"OntoMedRec: Logically-pretrained model-agnostic ontology encoders for medication recommendation","authors":"Weicong Tan, Weiqing Wang, Xin Zhou, Wray Buntine, Gordon Bingham, Hongzhi Yin","doi":"10.1007/s11280-024-01268-1","DOIUrl":"https://doi.org/10.1007/s11280-024-01268-1","url":null,"abstract":"<p>Recommending medications with electronic health records (EHRs) is a challenging task for data-driven clinical decision support systems. Most existing models learnt representations for medical concepts based on EHRs and make recommendations with the learnt representations. However, most medications appear in EHR datasets for limited times (the frequency distribution of medications follows power law distribution), resulting in insufficient learning of their representations of the medications. Medical ontologies are the hierarchical classification systems for medical terms where similar terms will be in the same class on a certain level. In this paper, we propose <b>OntoMedRec</b>, the <i>logically-pretrained</i> and <i>model-agnostic</i> medical <b>Onto</b>logy Encoders for <b>Med</b>ication <b>Rec</b>ommendation that addresses data sparsity problem with medical ontologies. We conduct comprehensive experiments on real-world EHR datasets to evaluate the effectiveness of OntoMedRec by integrating it into various existing downstream medication recommendation models. The result shows the integration of OntoMedRec improves the performance of various models in both the entire EHR datasets and the admissions with few-shot medications. We provide the GitHub repository for the source code. (https://github.com/WaicongTam/OntoMedRec)</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140801390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-18DOI: 10.1007/s11280-024-01267-2
Jiawei Hong, Wen Yang, Pingfu Chao, Junhua Fang
Group recommendation aims to suggest desired items for a group of users. Existing methods can achieve inspiring results in predicting the group preferences in data-rich groups. However, they could be ineffective in supporting cold-start groups due to their sparsity interactions, which prevents the model from understanding their intent. Although cold-start groups can be alleviated by meta-learning, we cannot apply it by using the same initialization for all groups due to their varying preferences. To tackle this problem, this paper proposes a memory-augmented meta-optimized model for group recommendation, namely GroupMO. Specifically, we adopt a clustering method to assemble the groups with similar profiles into the same cluster and design a representative group profile memory to guide the preliminary initialization of group embedding network for each group by utilizing those clusters. Besides, we also design a group shared preference memory to guide the prediction network initialization at a more refined granularity level for different groups, so that the shared knowledge can be better transferred to groups with similar preferences. Moreover, we incorporate those two memories to optimize the meta-learning process. Finally, extensive experiments on two real-world datasets demonstrate the superiority of our model.
{"title":"GroupMO: a memory-augmented meta-optimized model for group recommendation","authors":"Jiawei Hong, Wen Yang, Pingfu Chao, Junhua Fang","doi":"10.1007/s11280-024-01267-2","DOIUrl":"https://doi.org/10.1007/s11280-024-01267-2","url":null,"abstract":"<p>Group recommendation aims to suggest desired items for a group of users. Existing methods can achieve inspiring results in predicting the group preferences in data-rich groups. However, they could be ineffective in supporting cold-start groups due to their sparsity interactions, which prevents the model from understanding their intent. Although cold-start groups can be alleviated by meta-learning, we cannot apply it by using the same initialization for all groups due to their varying preferences. To tackle this problem, this paper proposes a memory-augmented meta-optimized model for group recommendation, namely GroupMO. Specifically, we adopt a clustering method to assemble the groups with similar profiles into the same cluster and design a representative group profile memory to guide the preliminary initialization of group embedding network for each group by utilizing those clusters. Besides, we also design a group shared preference memory to guide the prediction network initialization at a more refined granularity level for different groups, so that the shared knowledge can be better transferred to groups with similar preferences. Moreover, we incorporate those two memories to optimize the meta-learning process. Finally, extensive experiments on two real-world datasets demonstrate the superiority of our model.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140624831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}