Jake A. Bergquist PhD , Brian Zenger MD, PhD , James Brundage , Rob S. MacLeod PhD , T. Jared Bunch MD , Rashmee Shah MD, MS , Xiangyang Ye PhD , Ann Lyons PhD , Michael Torre PhD , Ravi Ranjan MD, PhD , Tolga Tasdizen PhD , Benjamin A. Steinberg MD, MHS
{"title":"Performance of off-the-shelf machine learning architectures and biases in low left ventricular ejection fraction detection","authors":"Jake A. Bergquist PhD , Brian Zenger MD, PhD , James Brundage , Rob S. MacLeod PhD , T. Jared Bunch MD , Rashmee Shah MD, MS , Xiangyang Ye PhD , Ann Lyons PhD , Michael Torre PhD , Ravi Ranjan MD, PhD , Tolga Tasdizen PhD , Benjamin A. Steinberg MD, MHS","doi":"10.1016/j.hroo.2024.07.009","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Artificial intelligence–machine learning (AI-ML) has demonstrated the ability to extract clinically useful information from electrocardiograms (ECGs) not available using traditional interpretation methods. There exists an extensive body of AI-ML research in fields outside of cardiology including several open-source AI-ML architectures that can be translated to new problems in an “off-the-shelf” manner.</p></div><div><h3>Objective</h3><p>We sought to address the limited investigation of which if any of these off-the-shelf architectures could be useful in ECG analysis as well as how and when these AI-ML approaches fail.</p></div><div><h3>Methods</h3><p>We applied 6 off-the-shelf AI-ML architectures to detect low left ventricular ejection fraction (LVEF) in a cohort of ECGs from 24,868 patients. We assessed LVEF classification and explored patient characteristics associated with inaccurate (false positive or false negative) LVEF prediction.</p></div><div><h3>Results</h3><p>We found that all of these network architectures produced LVEF detection area under the receiver-operating characteristic curve values above 0.9 (averaged over 5 instances per network), with the ResNet 18 network performing the highest (average area under the receiver-operating characteristic curve of 0.917). We also observed that some patient-specific characteristics such as race, sex, and presence of several comorbidities were associated with lower LVEF prediction performance.</p></div><div><h3>Conclusions</h3><p>This demonstrates the ability of off-the-shelf AI-ML architectures to detect clinically useful information from ECGs with performance matching contemporary custom-build AI-ML architectures. We also highlighted the presence of possible biases in these AI-ML approaches in the context of patient characteristics. These findings should be considered in the pursuit of efficient and equitable deployment of AI-ML technologies moving forward.</p></div>","PeriodicalId":29772,"journal":{"name":"Heart Rhythm O2","volume":"5 9","pages":"Pages 644-654"},"PeriodicalIF":2.5000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666501824002289/pdfft?md5=70091853d69b920d35ca322b9f124330&pid=1-s2.0-S2666501824002289-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Heart Rhythm O2","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666501824002289","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Artificial intelligence–machine learning (AI-ML) has demonstrated the ability to extract clinically useful information from electrocardiograms (ECGs) not available using traditional interpretation methods. There exists an extensive body of AI-ML research in fields outside of cardiology including several open-source AI-ML architectures that can be translated to new problems in an “off-the-shelf” manner.
Objective
We sought to address the limited investigation of which if any of these off-the-shelf architectures could be useful in ECG analysis as well as how and when these AI-ML approaches fail.
Methods
We applied 6 off-the-shelf AI-ML architectures to detect low left ventricular ejection fraction (LVEF) in a cohort of ECGs from 24,868 patients. We assessed LVEF classification and explored patient characteristics associated with inaccurate (false positive or false negative) LVEF prediction.
Results
We found that all of these network architectures produced LVEF detection area under the receiver-operating characteristic curve values above 0.9 (averaged over 5 instances per network), with the ResNet 18 network performing the highest (average area under the receiver-operating characteristic curve of 0.917). We also observed that some patient-specific characteristics such as race, sex, and presence of several comorbidities were associated with lower LVEF prediction performance.
Conclusions
This demonstrates the ability of off-the-shelf AI-ML architectures to detect clinically useful information from ECGs with performance matching contemporary custom-build AI-ML architectures. We also highlighted the presence of possible biases in these AI-ML approaches in the context of patient characteristics. These findings should be considered in the pursuit of efficient and equitable deployment of AI-ML technologies moving forward.