Pub Date : 2024-07-12DOI: 10.1101/2024.07.11.24310304
Shuyue Jia, Subhrangshu Bit, Edward Searls, Lindsey Claus, Pengrui Fan, Varuna H. Jasodanand, Meagan V. Lauber, Divya Veerapaneni, William M. Wang, Rhoda Au, Vijaya B Kolachalama
The proliferation of medical podcasts has generated an extensive repository of audio content, rich in specialized terminology, diverse medical topics, and expert dialogues. Here we introduce a computational framework designed to enhance large language models (LLMs) by leveraging the informational content of publicly accessible medical podcast data. This dataset, comprising over 4,300 hours of audio content, was transcribed to generate over 39 million text tokens. Our model, MedPodGPT, integrates the varied dialogue found in medical podcasts to improve understanding of natural language nuances, cultural contexts, and medical knowledge. Evaluated across multiple benchmarks, MedPodGPT demonstrated an average improvement of 2.31% over standard open-source benchmarks and showcased an improvement of 2.58% in its zero-shot multilingual transfer ability, effectively generalizing to different linguistic contexts. By harnessing the untapped potential of podcast content, MedPodGPT advances natural language processing, offering enhanced capabilities for various applications in medical research and education.
{"title":"MedPodGPT: A multilingual audio-augmented large language model for medical research and education","authors":"Shuyue Jia, Subhrangshu Bit, Edward Searls, Lindsey Claus, Pengrui Fan, Varuna H. Jasodanand, Meagan V. Lauber, Divya Veerapaneni, William M. Wang, Rhoda Au, Vijaya B Kolachalama","doi":"10.1101/2024.07.11.24310304","DOIUrl":"https://doi.org/10.1101/2024.07.11.24310304","url":null,"abstract":"The proliferation of medical podcasts has generated an extensive repository of audio content, rich in specialized terminology, diverse medical topics, and expert dialogues. Here we introduce a computational framework designed to enhance large language models (LLMs) by leveraging the informational content of publicly accessible medical podcast data. This dataset, comprising over 4,300 hours of audio content, was transcribed to generate over 39 million text tokens. Our model, MedPodGPT, integrates the varied dialogue found in medical podcasts to improve understanding of natural language nuances, cultural contexts, and medical knowledge. Evaluated across multiple benchmarks, MedPodGPT demonstrated an average improvement of 2.31% over standard open-source benchmarks and showcased an improvement of 2.58% in its zero-shot multilingual transfer ability, effectively generalizing to different linguistic contexts. By harnessing the untapped potential of podcast content, MedPodGPT advances natural language processing, offering enhanced capabilities for various applications in medical research and education.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"46 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141609846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-11DOI: 10.1101/2024.07.11.24310258
Wei Zhang
Advanced deep neural networks, when trained on extensive datasets, can outperform cardiologists in diagnosing cardiac arrhythmias. However, the availability of large-scale training data is often impractical. This study explores the use of transfer learning to identify and classify three ECG patterns. It applies knowledge gained from 2D image classification tasks to the domain of 1D time-series ECG signal classification. The research leverages various deep learning models to classify continuous wavelet transform (2D representations) of ECG signals. The effectiveness of these transferred deep learning models in classifying ECG time-series data is then evaluated.
{"title":"2D Transfer Learning for ECG Classification using Continuous Wavelet Transform","authors":"Wei Zhang","doi":"10.1101/2024.07.11.24310258","DOIUrl":"https://doi.org/10.1101/2024.07.11.24310258","url":null,"abstract":"Advanced deep neural networks, when trained on extensive datasets, can outperform cardiologists in diagnosing cardiac arrhythmias. However, the availability of large-scale training data is often impractical. This study explores the use of transfer learning to identify and classify three ECG patterns. It applies knowledge gained from 2D image classification tasks to the domain of 1D time-series ECG signal classification. The research leverages various deep learning models to classify continuous wavelet transform (2D representations) of ECG signals. The effectiveness of these transferred deep learning models in classifying ECG time-series data is then evaluated.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"18 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141609848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-10DOI: 10.1101/2024.07.06.24309995
Mukund Gupta, Edbert Victor Fandy, Krrish Ghindani
Lung cancer has become an increasingly prevalent disease, with an estimated 125,070 deaths in the United States alone in 2024 ( 5). To improve patient outcomes and assist doctors in differentiating between benign and malignant pulmonary nodules, this paper developed a Convolutional Neural Network (CNN) model for early binary detection of pulmonary nodules and assessed its effectiveness compared to other approaches. The CNN model showed an accuracy of 98.47%, while the radiomics-based SVM-LASSO model and the Lung-RADS system showed accuracies of 84.6% and 72.2% respectively. This demonstrates that the CNN model is significantly more effective for the early binary detection of pulmonary nodules than both the radiomics-based model and the Lung-RADS system. The paper also discusses the applications of Deep Learning in healthcare, concluding that although AI proves to be an effective method for early lung cancer detection, more research is needed to carefully assess the role and impact of AI in healthcare.
{"title":"EARLY LUNG CANCER SCREENING: A COMPARATIVE STUDY OF CNN AND RADIOMICS MODELS WITH PULMONARY NODULE BIOLOGIC CHARACTERIZATION","authors":"Mukund Gupta, Edbert Victor Fandy, Krrish Ghindani","doi":"10.1101/2024.07.06.24309995","DOIUrl":"https://doi.org/10.1101/2024.07.06.24309995","url":null,"abstract":"Lung cancer has become an increasingly prevalent disease, with an estimated 125,070 deaths in the\u0000United States alone in 2024 ( 5). To improve patient outcomes and assist doctors in differentiating between benign and malignant pulmonary nodules, this paper developed a Convolutional Neural Network (CNN) model for early binary detection of pulmonary nodules and assessed its effectiveness compared to other approaches. The CNN model showed an accuracy of 98.47%, while the radiomics-based SVM-LASSO model and the Lung-RADS system showed accuracies of 84.6% and 72.2%\u0000respectively. This demonstrates that the CNN model is significantly more effective for the early\u0000binary detection of pulmonary nodules than both the radiomics-based model and the Lung-RADS\u0000system. The paper also discusses the applications of Deep Learning in healthcare, concluding that\u0000although AI proves to be an effective method for early lung cancer detection, more research is needed to carefully assess the role and impact of AI in healthcare.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"70 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141566861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objective: We aimed to examine the characteristics, features, and content of suicide prevention mobile apps available in app stores in Canada and the United Kingdom. Design: Suicide prevention apps were identified from Apple and Android app stores between March-April 2023. Apps were screened against predefined inclusion criteria, and duplicate apps were removed. Data were then extracted based on descriptive (e.g., genre, app developer), security (e.g., password protection), and design features (e.g., personalization options). Content of apps were assessed using the Essential Features Framework. Extracted data were analyzed using a content analysis approach including narrative frequencies and descriptive statistics. Results: Fifty-two (n=52) suicide prevention apps were included within the review. Most were tailored for the general population and were in English language only. One app had the option to increase app accessibility by offering content presented using sign language. Many apps allowed some form of personalization by adding text content, however most did not facilitate further customization such as the ability to upload photo and audio content. All identified apps included content from at least one of the domains of the Essential Features Framework. The most commonly included domains were sources of suicide prevention support, and information about suicide. The domain least frequently included was screening tools followed by wellness content. No identified apps had the ability to be linked to patient medical records. Conclusions: The findings of this research present implications for the development of future suicide prevention apps. Development of a co-produced suicide prevention app which is accessible, allows for personalization, and can be integrated into clinical care may present an opportunity to enhance suicide prevention support for individuals experiencing suicidal thoughts and behaviours.
{"title":"Characteristics of Suicide Prevention Apps: A Content Analysis of Apps Available in Canada and the United Kingdom","authors":"Laura Bennett-Poynter, Samantha Groves, Jessica Kemp, Hwayeon Danielle Shin, Lydia Sequeira, Karen Lascelles, Gillian Strudwick","doi":"10.1101/2024.07.10.24310091","DOIUrl":"https://doi.org/10.1101/2024.07.10.24310091","url":null,"abstract":"Objective: We aimed to examine the characteristics, features, and content of suicide prevention mobile apps available in app stores in Canada and the United Kingdom.\u0000Design: Suicide prevention apps were identified from Apple and Android app stores between March-April 2023. Apps were screened against predefined inclusion criteria, and duplicate apps were removed. Data were then extracted based on descriptive (e.g., genre, app developer), security (e.g., password protection), and design features (e.g., personalization options). Content of apps were assessed using the Essential Features Framework. Extracted data were analyzed using a content analysis approach including narrative frequencies and descriptive statistics.\u0000Results: Fifty-two (n=52) suicide prevention apps were included within the review. Most were tailored for the general population and were in English language only. One app had the option to increase app accessibility by offering content presented using sign language. Many apps allowed some form of personalization by adding text content, however most did not facilitate further customization such as the ability to upload photo and audio content. All identified apps included content from at least one of the domains of the Essential Features Framework. The most commonly included domains were sources of suicide prevention support, and information about suicide. The domain least frequently included was screening tools followed by wellness content. No identified apps had the ability to be linked to patient medical records.\u0000Conclusions: The findings of this research present implications for the development of future suicide prevention apps. Development of a co-produced suicide prevention app which is accessible, allows for personalization, and can be integrated into clinical care may present an opportunity to enhance suicide prevention support for individuals experiencing suicidal thoughts and behaviours.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"434 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141587377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-10DOI: 10.1101/2024.07.09.24310088
Peter Bowman Mack, Casey Cole, Mintaek Lee, Lisa Peterson, Matthew Lundy, Karen Elizabeth Hegarty, William Espinoza
Objective: To determine whether the addition of a primary aldosteronism (PA) predictive model to a secondary hypertension decision support tool increases screening for PA in a primary care setting. Materials and Methods: 153 primary care clinics were randomized to receive a secondary hypertension decision support tool with or without an integrated predictive model between August 2023 and April 2024. Results: For patients with risk scores in the top 1 percentile, 63/2,896 (2.2%) patients where the alert was displayed in model clinics had the order set launched while 12/1,210 (1.0%) in no model clinics had the order set launched (P = 0.014). 19/2,896 (0.66%) of these highest risk patients in model clinics had an ARR ordered compared to 0/1,210 (0.0%) patients in no model clinics (P = 0.010). For patients with scores not in the top 1 percentile, 438/20,493 (2.1%) patients in model clinics had the order set launched compared to 273/17,820 (1.5%) in no model clinics (P < 0.001). 124/20,493 (0.61%) in model clinics had an ARR ordered compared to 34/17,820 (0.19%) in the no model clinics (P <