Meysam Ahangaran, Nauman Dawalatabad, Cody Karjadi, James Glass, Rhoda Au, Vijaya B. Kolachalama
{"title":"Obfuscation via pitch-shifting for balancing privacy and diagnostic utility in voice-based cognitive assessment","authors":"Meysam Ahangaran, Nauman Dawalatabad, Cody Karjadi, James Glass, Rhoda Au, Vijaya B. Kolachalama","doi":"10.1002/alz.70032","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> INTRODUCTION</h3>\n \n <p>Digital voice analysis is an emerging tool for differentiating cognitive states, but it poses privacy risks as automated systems may inadvertently identify speakers.</p>\n </section>\n \n <section>\n \n <h3> METHODS</h3>\n \n <p>We developed a computational framework to evaluate the trade-off between voice obfuscation and cognitive assessment accuracy, using pitch-shifting as a representative method. This framework was applied to voice recordings from the Framingham Heart Study (FHS, <i>n</i> = 128) and the DementiaBank Delaware (DBD, <i>n</i> = 85) corpus, both featuring responses to neuropsychological tests. Speaker obfuscation was measured via equal error rate (EER), and diagnostic utility was assessed through machine learning models distinguishing cognitive states: normal cognition (NC), mild cognitive impairment (MCI), and dementia (DE).</p>\n </section>\n \n <section>\n \n <h3> RESULTS</h3>\n \n <p>With the top 20 acoustic features, our framework achieved classification accuracies of 62.2% (EER: 0.3335) on the FHS dataset for NC, MCI, and DE differentiation, and 63.7% (EER: 0.1796) on the DBD dataset for NC and MCI differentiation, using obfuscated speech files.</p>\n </section>\n \n <section>\n \n <h3> DISCUSSION</h3>\n \n <p>Our results demonstrate the feasibility of privacy-preserving voice markers, offering a scalable solution for voice-based cognitive assessments.</p>\n </section>\n \n <section>\n \n <h3> Highlights</h3>\n \n <div>\n <ul>\n \n <li>We developed a computational framework using pitch-shifting and acoustic transformations to balance speaker privacy and diagnostic utility in voice-based cognitive assessments.</li>\n \n <li>We evaluated the framework on two independent datasets, Framingham Heart Study (FHS, <i>n</i> = 128) and DementiaBank Delaware (DBD, <i>n</i> = 85) corpus, assessing the trade-off between privacy (measured by equal error rate [EER]) and classification accuracy.</li>\n \n <li>Our framework achieved classification accuracies of 62.2% (EER: 0.3335) for distinguishing normal cognition (NC), mild cognitive impairment (MCI), and dementia in the FHS dataset and 63.7% (EER: 0.1796) for NC and MCI differentiation in the DBD dataset, using obfuscated speech files.</li>\n \n <li>Our framework demonstrates that pitch-shifting levels can preserve diagnostic utility while protecting speaker identity, offering a scalable and privacy-preserving solution.</li>\n </ul>\n </div>\n </section>\n </div>","PeriodicalId":7471,"journal":{"name":"Alzheimer's & Dementia","volume":"21 3","pages":""},"PeriodicalIF":13.0000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/alz.70032","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Alzheimer's & Dementia","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/alz.70032","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
INTRODUCTION
Digital voice analysis is an emerging tool for differentiating cognitive states, but it poses privacy risks as automated systems may inadvertently identify speakers.
METHODS
We developed a computational framework to evaluate the trade-off between voice obfuscation and cognitive assessment accuracy, using pitch-shifting as a representative method. This framework was applied to voice recordings from the Framingham Heart Study (FHS, n = 128) and the DementiaBank Delaware (DBD, n = 85) corpus, both featuring responses to neuropsychological tests. Speaker obfuscation was measured via equal error rate (EER), and diagnostic utility was assessed through machine learning models distinguishing cognitive states: normal cognition (NC), mild cognitive impairment (MCI), and dementia (DE).
RESULTS
With the top 20 acoustic features, our framework achieved classification accuracies of 62.2% (EER: 0.3335) on the FHS dataset for NC, MCI, and DE differentiation, and 63.7% (EER: 0.1796) on the DBD dataset for NC and MCI differentiation, using obfuscated speech files.
DISCUSSION
Our results demonstrate the feasibility of privacy-preserving voice markers, offering a scalable solution for voice-based cognitive assessments.
Highlights
We developed a computational framework using pitch-shifting and acoustic transformations to balance speaker privacy and diagnostic utility in voice-based cognitive assessments.
We evaluated the framework on two independent datasets, Framingham Heart Study (FHS, n = 128) and DementiaBank Delaware (DBD, n = 85) corpus, assessing the trade-off between privacy (measured by equal error rate [EER]) and classification accuracy.
Our framework achieved classification accuracies of 62.2% (EER: 0.3335) for distinguishing normal cognition (NC), mild cognitive impairment (MCI), and dementia in the FHS dataset and 63.7% (EER: 0.1796) for NC and MCI differentiation in the DBD dataset, using obfuscated speech files.
Our framework demonstrates that pitch-shifting levels can preserve diagnostic utility while protecting speaker identity, offering a scalable and privacy-preserving solution.
期刊介绍:
Alzheimer's & Dementia is a peer-reviewed journal that aims to bridge knowledge gaps in dementia research by covering the entire spectrum, from basic science to clinical trials to social and behavioral investigations. It provides a platform for rapid communication of new findings and ideas, optimal translation of research into practical applications, increasing knowledge across diverse disciplines for early detection, diagnosis, and intervention, and identifying promising new research directions. In July 2008, Alzheimer's & Dementia was accepted for indexing by MEDLINE, recognizing its scientific merit and contribution to Alzheimer's research.