{"title":"Fostering Privacy in Collaborative Data Sharing via Auto-encoder Latent Space Embedding","authors":"Vinayak Raja, Bhuvi Chopra","doi":"10.60087/jaigs.v4i1.129","DOIUrl":null,"url":null,"abstract":"Securing privacy in machine learning via collaborative data sharing is essential for organizations seeking to harness collective data while upholding confidentiality. This becomes especially vital when protecting sensitive information across the entire machine learning pipeline, from model training to inference. This paper presents an innovative framework utilizing Representation Learning via autoencoders to generate privacy-preserving embedded data. As a result, organizations can distribute these representations, enhancing the performance of machine learning models in situations where multiple data sources converge for a unified predictive task downstream.","PeriodicalId":517201,"journal":{"name":"Journal of Artificial Intelligence General science (JAIGS) ISSN:3006-4023","volume":"106 22","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Artificial Intelligence General science (JAIGS) ISSN:3006-4023","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.60087/jaigs.v4i1.129","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Securing privacy in machine learning via collaborative data sharing is essential for organizations seeking to harness collective data while upholding confidentiality. This becomes especially vital when protecting sensitive information across the entire machine learning pipeline, from model training to inference. This paper presents an innovative framework utilizing Representation Learning via autoencoders to generate privacy-preserving embedded data. As a result, organizations can distribute these representations, enhancing the performance of machine learning models in situations where multiple data sources converge for a unified predictive task downstream.