Pub Date : 2025-02-13DOI: 10.1101/2025.02.12.25322164
Naresh Doni Jayavelu, Hady Samaha, Sonia Tandon Wimalasena, Annmarie Hoch, Jeremy P Gygi, Gisela Gabernet, Al Ozonoff, Shanshan Liu, Carly E Milliren, Ofer Levy, Lindsey R Baden, Esther Melamed, Lauren I R Ehrlich, Grace A McComsey, Rafick P Sekaly, Charles B Cairns, Elias K Haddad, Joanna Schaenman, Albert C Shaw, David A Hafler, Ruth R Montgomery, David B Corry, Farrah Kheradmand, Mark A Atkinson, Scott C Brakenridge, Nelson I Agudelo Higuita, Jordan P Metcalf, Catherine L Hough, William B Messer, Bali Pulendran, Kari C Nadeau, Mark M Davis, Linda N Geng, Ana Fernandez Sesma, Viviana Simon, Florian Krammer, Monica Kraft, Chris Bime, Carolyn S Calfee, David J Erle, Charles R Langelier, Leying Guan, Holden T Maecker, Bjoern Peters, Steven H Kleinstein, Elaine F Reed, Joann Diray-Arce, Nadine Rouphael, Matthew C Altman
The post-acute sequelae of SARS-CoV-2 (PASC), also known as long COVID, remain a significant health issue that is incompletely understood. Predicting which acutely infected individuals will go on to develop long COVID is challenging due to the lack of established biomarkers, clear disease mechanisms, or well-defined sub-phenotypes. Machine learning (ML) models offer the potential to address this by leveraging clinical data to enhance diagnostic precision. We utilized clinical data, including antibody titers and viral load measurements collected at the time of hospital admission, to predict the likelihood of acute COVID-19 progressing to long COVID. Our machine learning models achieved median AUROC values ranging from 0.64 to 0.66 and AUPRC values between 0.51 and 0.54, demonstrating their predictive capabilities. Feature importance analysis revealed that low antibody titers and high viral loads at hospital admission were the strongest predictors of long COVID outcomes. Comorbidities, including chronic respiratory, cardiac, and neurologic diseases, as well as female sex, were also identified as significant risk factors for long COVID. Our findings suggest that ML models have the potential to identify patients at risk for developing long COVID based on baseline clinical characteristics. These models can help guide early interventions, improving patient outcomes and mitigating the long-term public health impacts of SARS-CoV-2.
{"title":"Machine learning models predict long COVID outcomes based on baseline clinical and immunologic factors.","authors":"Naresh Doni Jayavelu, Hady Samaha, Sonia Tandon Wimalasena, Annmarie Hoch, Jeremy P Gygi, Gisela Gabernet, Al Ozonoff, Shanshan Liu, Carly E Milliren, Ofer Levy, Lindsey R Baden, Esther Melamed, Lauren I R Ehrlich, Grace A McComsey, Rafick P Sekaly, Charles B Cairns, Elias K Haddad, Joanna Schaenman, Albert C Shaw, David A Hafler, Ruth R Montgomery, David B Corry, Farrah Kheradmand, Mark A Atkinson, Scott C Brakenridge, Nelson I Agudelo Higuita, Jordan P Metcalf, Catherine L Hough, William B Messer, Bali Pulendran, Kari C Nadeau, Mark M Davis, Linda N Geng, Ana Fernandez Sesma, Viviana Simon, Florian Krammer, Monica Kraft, Chris Bime, Carolyn S Calfee, David J Erle, Charles R Langelier, Leying Guan, Holden T Maecker, Bjoern Peters, Steven H Kleinstein, Elaine F Reed, Joann Diray-Arce, Nadine Rouphael, Matthew C Altman","doi":"10.1101/2025.02.12.25322164","DOIUrl":"https://doi.org/10.1101/2025.02.12.25322164","url":null,"abstract":"<p><p>The post-acute sequelae of SARS-CoV-2 (PASC), also known as long COVID, remain a significant health issue that is incompletely understood. Predicting which acutely infected individuals will go on to develop long COVID is challenging due to the lack of established biomarkers, clear disease mechanisms, or well-defined sub-phenotypes. Machine learning (ML) models offer the potential to address this by leveraging clinical data to enhance diagnostic precision. We utilized clinical data, including antibody titers and viral load measurements collected at the time of hospital admission, to predict the likelihood of acute COVID-19 progressing to long COVID. Our machine learning models achieved median AUROC values ranging from 0.64 to 0.66 and AUPRC values between 0.51 and 0.54, demonstrating their predictive capabilities. Feature importance analysis revealed that low antibody titers and high viral loads at hospital admission were the strongest predictors of long COVID outcomes. Comorbidities, including chronic respiratory, cardiac, and neurologic diseases, as well as female sex, were also identified as significant risk factors for long COVID. Our findings suggest that ML models have the potential to identify patients at risk for developing long COVID based on baseline clinical characteristics. These models can help guide early interventions, improving patient outcomes and mitigating the long-term public health impacts of SARS-CoV-2.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844586/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143485273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-13DOI: 10.1101/2025.02.12.25322031
Halide Bilge Türközer, Victor Zeng, Dung Hoang, Jothini Sritharan, Neha Iska, Elena I Ivleva, Brett A Clementz, Godfrey D Pearlson, Sarah Keedy, Elliot S Gershon, Carol A Tamminga, Matcheri S Keshavan, Paulo Lizano
Objective: The visual system is a significant site of pathology in psychosis spectrum disorders. However, there is limited research investigating human visual cortex (VC) subregions in this population. Using data from the Bipolar-Schizophrenia Network on Intermediate Phenotypes Consortium (BSNIP-1, BSNIP-2, PARDIP), this study examined structural measures in VC subregions in individuals with psychosis spectrum disorders.
Methods: Cortical surface area and thickness in five VC subregions (hOc1, hOc2, hOc3v, hOc4v, MT) were quantified using FreeSurfer v7.1.0 and compared between individuals with psychosis (n=1211) and healthy controls (n=734). Regional specificity was examined by controlling for total surface area or mean cortical thickness. ComBat was used to harmonize scanner effects. Associations between VC measures and symptom severity, cognition, and childhood trauma scores were assessed.
Results: Individuals with psychosis demonstrated smaller surface area in hOc1, hOc2, and hOc3v, and lower cortical thickness in all five VC subregions compared to healthy controls. Thickness reductions in hOc1, hOc4v, and MT were regionally specific. hOc4v and MT were among the top three regions exhibiting the most robust cortical thickness deficits (d = -0.38 to -0.40) across all VC and Desikan-Killiany brain regions. Lower thickness in mid-level visual subregions were associated with greater positive symptoms, poorer cognition, and higher childhood trauma scores.
Conclusions: This study demonstrates that the visual cortex is among the most profoundly affected brain regions in psychotic disorders. Different patterns of area and thickness changes across early and mid-level visual subregions, along with their varying associations with clinical measures, suggest distinct developmental and disease-related influences.
{"title":"Neuroanatomical Deficits in Visual Cortex Subregions of Individuals with Psychosis Spectrum Disorders linked to Symptoms, Cognition, and Childhood Trauma.","authors":"Halide Bilge Türközer, Victor Zeng, Dung Hoang, Jothini Sritharan, Neha Iska, Elena I Ivleva, Brett A Clementz, Godfrey D Pearlson, Sarah Keedy, Elliot S Gershon, Carol A Tamminga, Matcheri S Keshavan, Paulo Lizano","doi":"10.1101/2025.02.12.25322031","DOIUrl":"10.1101/2025.02.12.25322031","url":null,"abstract":"<p><strong>Objective: </strong>The visual system is a significant site of pathology in psychosis spectrum disorders. However, there is limited research investigating human visual cortex (VC) subregions in this population. Using data from the Bipolar-Schizophrenia Network on Intermediate Phenotypes Consortium (BSNIP-1, BSNIP-2, PARDIP), this study examined structural measures in VC subregions in individuals with psychosis spectrum disorders.</p><p><strong>Methods: </strong>Cortical surface area and thickness in five VC subregions (hOc1, hOc2, hOc3v, hOc4v, MT) were quantified using FreeSurfer v7.1.0 and compared between individuals with psychosis (<i>n</i>=1211) and healthy controls (<i>n</i>=734). Regional specificity was examined by controlling for total surface area or mean cortical thickness. ComBat was used to harmonize scanner effects. Associations between VC measures and symptom severity, cognition, and childhood trauma scores were assessed.</p><p><strong>Results: </strong>Individuals with psychosis demonstrated smaller surface area in hOc1, hOc2, and hOc3v, and lower cortical thickness in all five VC subregions compared to healthy controls. Thickness reductions in hOc1, hOc4v, and MT were regionally specific. hOc4v and MT were among the top three regions exhibiting the most robust cortical thickness deficits (<i>d</i> = -0.38 to -0.40) across all VC and Desikan-Killiany brain regions. Lower thickness in mid-level visual subregions were associated with greater positive symptoms, poorer cognition, and higher childhood trauma scores.</p><p><strong>Conclusions: </strong>This study demonstrates that the visual cortex is among the most profoundly affected brain regions in psychotic disorders. Different patterns of area and thickness changes across early and mid-level visual subregions, along with their varying associations with clinical measures, suggest distinct developmental and disease-related influences.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844584/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143485307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-13DOI: 10.1101/2025.02.12.25322179
Amulya Gupta, Christopher J Harvey, Uzair Mahmood, Jacob D Baer, Nikhil Parimi, Ashutosh Bapat, Seth H Sheldon, Madhu Reddy, Zijun Yao, Yongkuk Lee, Amit Noheria
Background: Vectorcardiographic 3D QRS voltage-time integral (VTIQRS-3D) is a novel marker of ventricular dyssynchrony pertinent for cardiac resynchronization therapy. It may have additional clinical utility but its normal reference ranges have not been established. We sought to define reference ranges for VTIQRS-3D in healthy individuals.
Methods: We retrospectively analyzed 12-lead ECGs of healthy adults (2010-2014) and compared them to patients with cardiomyopathy with reduced ejection fraction (EF) <50%. Using the Kors matrix, 12-lead ECGs with QRS duration ≤120 ms were converted to vectorcardiographic X, Y, and Z leads. VTIQRS-3D was calculated as the instantaneous root-mean-square (3D) voltage integrated over the QRS duration. Reference range limits were defined as the 2.5th to 97.5th percentiles respectively for healthy females and males in age groups 18-34, 35-54 and ≥55 years.
Results: The study included 468 healthy adults (age 44.6 ± 17.0 years; 63.9% female) and 314 patients with cardiomyopathy (age 62.1 ± 14.0 years; 34.4% female). VTIQRS-3D was significantly larger in the cardiomyopathy patients compared to the healthy population (48.2±21.4 vs. 38.1±9.3 μVs, p<0.0001). Increased age and female sex were significant predictors of lower VTIQRS-3D in the healthy population (both p<0.0001). VTIQRS-3D reference ranges for respective age groups for healthy females were 23.2-55.0, 23.9-56.4 and 19.6-50.9 μVs, and for healthy males were 29.9-57.2, 28.2-56.7 and 21.4-55.9 μVs.
Conclusion: VTIQRS-3D is higher at younger age in healthy population, male sex and in patients having cardiomyopathy with reduced EF. Age and sex need to be accounted for using VTIQRS-3D as a marker for structural heart disease.
{"title":"QRS 3D Voltage-Time Integral in Narrow QRS Complex - Establishing the Normal Reference Range.","authors":"Amulya Gupta, Christopher J Harvey, Uzair Mahmood, Jacob D Baer, Nikhil Parimi, Ashutosh Bapat, Seth H Sheldon, Madhu Reddy, Zijun Yao, Yongkuk Lee, Amit Noheria","doi":"10.1101/2025.02.12.25322179","DOIUrl":"10.1101/2025.02.12.25322179","url":null,"abstract":"<p><strong>Background: </strong>Vectorcardiographic 3D QRS voltage-time integral (VTI<sub>QRS-3D</sub>) is a novel marker of ventricular dyssynchrony pertinent for cardiac resynchronization therapy. It may have additional clinical utility but its normal reference ranges have not been established. We sought to define reference ranges for VTI<sub>QRS-3D</sub> in healthy individuals.</p><p><strong>Methods: </strong>We retrospectively analyzed 12-lead ECGs of healthy adults (2010-2014) and compared them to patients with cardiomyopathy with reduced ejection fraction (EF) <50%. Using the Kors matrix, 12-lead ECGs with QRS duration ≤120 ms were converted to vectorcardiographic X, Y, and Z leads. VTI<sub>QRS-3D</sub> was calculated as the instantaneous root-mean-square (3D) voltage integrated over the QRS duration. Reference range limits were defined as the 2.5th to 97.5th percentiles respectively for healthy females and males in age groups 18-34, 35-54 and ≥55 years.</p><p><strong>Results: </strong>The study included 468 healthy adults (age 44.6 ± 17.0 years; 63.9% female) and 314 patients with cardiomyopathy (age 62.1 ± 14.0 years; 34.4% female). VTI<sub>QRS-3D</sub> was significantly larger in the cardiomyopathy patients compared to the healthy population (48.2±21.4 vs. 38.1±9.3 μVs, p<0.0001). Increased age and female sex were significant predictors of lower VTI<sub>QRS-3D</sub> in the healthy population (both p<0.0001). VTI<sub>QRS-3D</sub> reference ranges for respective age groups for healthy females were 23.2-55.0, 23.9-56.4 and 19.6-50.9 μVs, and for healthy males were 29.9-57.2, 28.2-56.7 and 21.4-55.9 μVs.</p><p><strong>Conclusion: </strong>VTI<sub>QRS-3D</sub> is higher at younger age in healthy population, male sex and in patients having cardiomyopathy with reduced EF. Age and sex need to be accounted for using VTI<sub>QRS-3D</sub> as a marker for structural heart disease.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844597/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143485393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-13DOI: 10.1101/2025.02.12.25322169
Imen Alkuraya, Alexandra Santana Almansa, Azubuike Eleonu, Paul Avillach, Annapurna Poduri, Siddharth Srivastava
Introduction: Emerging evidence suggests that 20-30% of cases of cerebral palsy (CP) may have a genetic cause. Our group previously identified subsets of patients with CP or CP-masquerading conditions who warrant genetic testing, including those with regression or progressive neurological symptoms (CP masqueraders) and those without any known risk factors for CP (cryptogenic CP). Recognition of these subgroups in clinical settings remains challenging.
Methods: To address this challenge, we developed and evaluated a computational phenotyping approach using ICD- 9/ICD-10 billing codes to automatically identify patients with unexplained CP or CP-masquerading conditions who may benefit from genetic testing. We applied this computational phenotyping approach to a cohort of 250 participants from the Boston Children's Hospital CP Sequencing Study, aimed at identifying genetic causes in CP and CP-masquerading conditions.
Results: Manual review served as the gold standard, identifying 8% as CP masqueraders, 42% as cryptogenic CP, and 50% as non-cryptogenic CP. Computational phenotyping based on ICD-9/10 codes achieved a sensitivity of 95%, specificity of 72%, positive predictive value of 77%, and negative predictive value of 94% in identifying cases warranting genetic testing.
Conclusions: Our findings demonstrate the feasibility of using computational phenotyping to identify patients with CP or CP- masquerading conditions who warrant genetic testing. Further studies are needed to evaluate the effectiveness and real-world application of this tool in larger healthcare systems. Nonetheless, the computational phenotyping approach holds promise as a possible clinical decision support that could be integrated into electronic health record systems, enhancing clinical workflows and facilitating actionable genetic diagnoses.
{"title":"Use of Computational Phenotypes for Predicting Genetic Subgroups of Cerebral Palsy.","authors":"Imen Alkuraya, Alexandra Santana Almansa, Azubuike Eleonu, Paul Avillach, Annapurna Poduri, Siddharth Srivastava","doi":"10.1101/2025.02.12.25322169","DOIUrl":"https://doi.org/10.1101/2025.02.12.25322169","url":null,"abstract":"<p><strong>Introduction: </strong>Emerging evidence suggests that 20-30% of cases of cerebral palsy (CP) may have a genetic cause. Our group previously identified subsets of patients with CP or CP-masquerading conditions who warrant genetic testing, including those with regression or progressive neurological symptoms (CP masqueraders) and those without any known risk factors for CP (cryptogenic CP). Recognition of these subgroups in clinical settings remains challenging.</p><p><strong>Methods: </strong>To address this challenge, we developed and evaluated a computational phenotyping approach using ICD- 9/ICD-10 billing codes to automatically identify patients with unexplained CP or CP-masquerading conditions who may benefit from genetic testing. We applied this computational phenotyping approach to a cohort of 250 participants from the Boston Children's Hospital CP Sequencing Study, aimed at identifying genetic causes in CP and CP-masquerading conditions.</p><p><strong>Results: </strong>Manual review served as the gold standard, identifying 8% as CP masqueraders, 42% as cryptogenic CP, and 50% as non-cryptogenic CP. Computational phenotyping based on ICD-9/10 codes achieved a sensitivity of 95%, specificity of 72%, positive predictive value of 77%, and negative predictive value of 94% in identifying cases warranting genetic testing.</p><p><strong>Conclusions: </strong>Our findings demonstrate the feasibility of using computational phenotyping to identify patients with CP or CP- masquerading conditions who warrant genetic testing. Further studies are needed to evaluate the effectiveness and real-world application of this tool in larger healthcare systems. Nonetheless, the computational phenotyping approach holds promise as a possible clinical decision support that could be integrated into electronic health record systems, enhancing clinical workflows and facilitating actionable genetic diagnoses.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844589/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143484805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-13DOI: 10.1101/2025.02.10.25322025
A Banerjee, F Yang, J Dutta, A Cacciola, M Hornberger, M Saranathan
Introduction: Frontotemporal dementia involves progressive atrophy in deep gray matter nuclei, including the thalamus and basal ganglia (such as the caudate, putamen, nucleus accumbens, and globus pallidus), which are critical for cognition and behavior. This study examined cross-sectional and longitudinal atrophy using a state-of-the-art multi-atlas segmentation method sTHOMAS.
Methods: T1-weighted MRI scans from 274 participants at baseline and 237 at follow-up obtained from the Frontotemporal Lobar Degeneration Neuroimaging Initiative database were analyzed using sTHOMAS. Group differences were assessed using ANCOVA, adjusting for age, gender and intracranial volume as covariates.
Results: Atrophy was significant in the mediodorsal, pulvinar, anterior ventral nuclei, nucleus accumbens, and claustrum, with bvFTD most affected cross-sectionally. Longitudinally, the nucleus accumbens, mediodorsal, and pulvinar nuclei declined further. Atrophy correlated with naming (mediodorsal), working memory (ventrolateral posterior), and executive dysfunction (nucleus accumbens) neuropsychological tests.
Discussion: These findings highlight progressive, nucleus-specific atrophy in FTD and emphasize the importance of cross-sectional as well as longitudinal imaging and sex-specific analyses in understanding disease progression.
{"title":"Cross-Sectional and Longitudinal Patterns of Atrophy in Thalamic and Deep Gray Matter Nuclei in Frontotemporal Dementia.","authors":"A Banerjee, F Yang, J Dutta, A Cacciola, M Hornberger, M Saranathan","doi":"10.1101/2025.02.10.25322025","DOIUrl":"10.1101/2025.02.10.25322025","url":null,"abstract":"<p><strong>Introduction: </strong>Frontotemporal dementia involves progressive atrophy in deep gray matter nuclei, including the thalamus and basal ganglia (such as the caudate, putamen, nucleus accumbens, and globus pallidus), which are critical for cognition and behavior. This study examined cross-sectional and longitudinal atrophy using a state-of-the-art multi-atlas segmentation method sTHOMAS.</p><p><strong>Methods: </strong>T1-weighted MRI scans from 274 participants at baseline and 237 at follow-up obtained from the Frontotemporal Lobar Degeneration Neuroimaging Initiative database were analyzed using sTHOMAS. Group differences were assessed using ANCOVA, adjusting for age, gender and intracranial volume as covariates.</p><p><strong>Results: </strong>Atrophy was significant in the mediodorsal, pulvinar, anterior ventral nuclei, nucleus accumbens, and claustrum, with bvFTD most affected cross-sectionally. Longitudinally, the nucleus accumbens, mediodorsal, and pulvinar nuclei declined further. Atrophy correlated with naming (mediodorsal), working memory (ventrolateral posterior), and executive dysfunction (nucleus accumbens) neuropsychological tests.</p><p><strong>Discussion: </strong>These findings highlight progressive, nucleus-specific atrophy in FTD and emphasize the importance of cross-sectional as well as longitudinal imaging and sex-specific analyses in understanding disease progression.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844577/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143484739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-13DOI: 10.1101/2025.02.11.25322107
David Hein, Alana Christie, Michael Holcomb, Bingqing Xie, A J Jain, Joseph Vento, Neil Rakheja, Ameer Hamza Shakur, Scott Christley, Lindsay G Cowell, James Brugarolas, Andrew Jamieson, Payal Kapur
Background: Extracting structured data from free-text medical records is laborious and error-prone. Traditional rule-based and early neural network methods often struggle with domain complexity and require extensive tuning. Large language models (LLMs) offer a promising solution but must be tailored to nuanced clinical knowledge and complex, multipart entities.
Methods: We developed a flexible, end-to-end LLM pipeline to extract diagnoses, per-specimen anatomical-sites, procedures, histology, and detailed immunohistochemistry results from pathology reports. A human-in-the-loop process to create validated reference annotations for a development set of 152 kidney tumor reports guided iterative pipeline refinement. To drive nuanced assessment of performance we developed a comprehensive error ontology-categorizing by clinical significance (major vs. minor), source (LLM, manual annotation, or insufficient instructions), and contextual origin. The finalized pipeline was applied to 3,520 internal reports (of which 2,297 had pre-existing templated data available for cross referencing) and evaluated for adaptability using 53 publicly available breast cancer pathology reports.
Results: After six iterations, major LLM errors on the development set decreased to 0.99% (14/1413 entities). We identified 11 key contexts from which complications arose- including medical history integration, entity linking, and specification granularity- which provided valuable insight in understanding our research goals. Using the available templated data as a cross reference, we achieved a macro-averaged F1 score of 0.99 for identifying six kidney tumor subtypes and 0.97 for detecting metastasis. When adapted to the breast dataset, three iterations were required to align with domain-specific instructions, attaining 89% agreement with curated data.
Conclusion: This work illustrates that LLM-based extraction pipelines can achieve near expert-level accuracy with carefully constructed instructions and specific aims. Beyond raw performance metrics, the iterative process itself-balancing specificity and clinical relevance-proved essential. This approach offers a transferable blueprint for applying emerging LLM capabilities to other complex clinical information extraction tasks.
{"title":"Prompts to Table: Specification and Iterative Refinement for Clinical Information Extraction with Large Language Models.","authors":"David Hein, Alana Christie, Michael Holcomb, Bingqing Xie, A J Jain, Joseph Vento, Neil Rakheja, Ameer Hamza Shakur, Scott Christley, Lindsay G Cowell, James Brugarolas, Andrew Jamieson, Payal Kapur","doi":"10.1101/2025.02.11.25322107","DOIUrl":"10.1101/2025.02.11.25322107","url":null,"abstract":"<p><strong>Background: </strong>Extracting structured data from free-text medical records is laborious and error-prone. Traditional rule-based and early neural network methods often struggle with domain complexity and require extensive tuning. Large language models (LLMs) offer a promising solution but must be tailored to nuanced clinical knowledge and complex, multipart entities.</p><p><strong>Methods: </strong>We developed a flexible, end-to-end LLM pipeline to extract diagnoses, per-specimen anatomical-sites, procedures, histology, and detailed immunohistochemistry results from pathology reports. A human-in-the-loop process to create validated reference annotations for a development set of 152 kidney tumor reports guided iterative pipeline refinement. To drive nuanced assessment of performance we developed a comprehensive error ontology-categorizing by clinical significance (major vs. minor), source (LLM, manual annotation, or insufficient instructions), and contextual origin. The finalized pipeline was applied to 3,520 internal reports (of which 2,297 had pre-existing templated data available for cross referencing) and evaluated for adaptability using 53 publicly available breast cancer pathology reports.</p><p><strong>Results: </strong>After six iterations, major LLM errors on the development set decreased to 0.99% (14/1413 entities). We identified 11 key contexts from which complications arose- including medical history integration, entity linking, and specification granularity- which provided valuable insight in understanding our research goals. Using the available templated data as a cross reference, we achieved a macro-averaged F1 score of 0.99 for identifying six kidney tumor subtypes and 0.97 for detecting metastasis. When adapted to the breast dataset, three iterations were required to align with domain-specific instructions, attaining 89% agreement with curated data.</p><p><strong>Conclusion: </strong>This work illustrates that LLM-based extraction pipelines can achieve near expert-level accuracy with carefully constructed instructions and specific aims. Beyond raw performance metrics, the iterative process itself-balancing specificity and clinical relevance-proved essential. This approach offers a transferable blueprint for applying emerging LLM capabilities to other complex clinical information extraction tasks.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844613/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143485381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-13DOI: 10.1101/2025.02.11.25322033
Vanessa C Jacovas, Michelle Zelnick, Shannon McNulty, Justyne E Ross, Namrata Khurana, Xueyang Pan, Alejandro Nieto, Shiloh Martin, Benjamin McLean, Marwa A Elnagheeb, Morton J Cowan, Jennifer M Puck, Mike Hershfield, James Verbsky, Jolan Walter, Eric Allenspach, Alice Y Chan, Nicolai S C van Oers, Rajarshi Ghosh, Megan Piazza, Bo Yuan, Luigi D Notarangelo, Britt A Johnson, Ivan K Chinn
Purpose: This collaborative study, led by the Clinical Genome Resource Severe Combined Immunodeficiency Disease Variant Curation Expert Panel (ClinGen SCID-VCEP), implemented and adapted the American College of Medical Genetics and Genomics/Association for Molecular Pathology (ACMG/AMP) guidelines for interpreting germline variants in genes with established relationships to SCID. The effort focused on the 7 most common SCID-related genes identified by SCID newborn screening in North America: ADA , DCLRE1C , IL2RG , IL7R , JAK3 , RAG1 , and RAG2 .
Methods: The SCID-VCEP conducted a rigorous review of variants that involved database analyses, literature review, and expert feedback to derive gene-specific modifications to the ACMG/AMP guidelines. These specifications were validated using a pilot set of 90 variants. Results: Of these 90 variants, 25 were classified as pathogenic, 21 as likely pathogenic, 14 as variants of uncertain significance (VUS), 18 as likely benign, and 12 as benign. Seventeen variants with conflicting classifications in ClinVar were successfully resolved. The criteria included modifications to 20 of the 28 original ACMG/AMP criteria specific to SCID-related genes.
Conclusion: The SCID-specific variant curation guidelines developed by the SCID-VCEP will enhance the precision of SCID genetic diagnosis and provide a robust framework for interpreting variants in SCID-related genes, contributing to appropriate treatment of SCID.
{"title":"The ClinGen Severe Combined Immunodeficiency Disease Variant Curation Expert Panel: Specifications for classification of variants in <i>ADA</i> , <i>DCLRE1C</i> , <i>IL2RG</i> , <i>IL7R</i> , <i>JAK3</i> , <i>RAG1</i> , and <i>RAG2</i>.","authors":"Vanessa C Jacovas, Michelle Zelnick, Shannon McNulty, Justyne E Ross, Namrata Khurana, Xueyang Pan, Alejandro Nieto, Shiloh Martin, Benjamin McLean, Marwa A Elnagheeb, Morton J Cowan, Jennifer M Puck, Mike Hershfield, James Verbsky, Jolan Walter, Eric Allenspach, Alice Y Chan, Nicolai S C van Oers, Rajarshi Ghosh, Megan Piazza, Bo Yuan, Luigi D Notarangelo, Britt A Johnson, Ivan K Chinn","doi":"10.1101/2025.02.11.25322033","DOIUrl":"https://doi.org/10.1101/2025.02.11.25322033","url":null,"abstract":"<p><strong>Purpose: </strong>This collaborative study, led by the Clinical Genome Resource Severe Combined Immunodeficiency Disease Variant Curation Expert Panel (ClinGen SCID-VCEP), implemented and adapted the American College of Medical Genetics and Genomics/Association for Molecular Pathology (ACMG/AMP) guidelines for interpreting germline variants in genes with established relationships to SCID. The effort focused on the 7 most common SCID-related genes identified by SCID newborn screening in North America: <i>ADA</i> , <i>DCLRE1C</i> , <i>IL2RG</i> , <i>IL7R</i> , <i>JAK3</i> , <i>RAG1</i> , and <i>RAG2</i> .</p><p><strong>Methods: </strong>The SCID-VCEP conducted a rigorous review of variants that involved database analyses, literature review, and expert feedback to derive gene-specific modifications to the ACMG/AMP guidelines. These specifications were validated using a pilot set of 90 variants. <b>Results:</b> Of these 90 variants, 25 were classified as pathogenic, 21 as likely pathogenic, 14 as variants of uncertain significance (VUS), 18 as likely benign, and 12 as benign. Seventeen variants with conflicting classifications in ClinVar were successfully resolved. The criteria included modifications to 20 of the 28 original ACMG/AMP criteria specific to SCID-related genes.</p><p><strong>Conclusion: </strong>The SCID-specific variant curation guidelines developed by the SCID-VCEP will enhance the precision of SCID genetic diagnosis and provide a robust framework for interpreting variants in SCID-related genes, contributing to appropriate treatment of SCID.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844601/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143484786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-13DOI: 10.1101/2025.02.11.25322014
Moonil Kang, Ting Fang Alvin Ang, Sherral A Devine, Richard Sherva, Shubhabrata Mukherjee, Emily H Trittschuh, Laura E Gibbons, Phoebe Scollard, Michael Lee, Seo-Eun Choi, Brandon Klinedinst, Connie Nakano, Logan C Dumitrescu, Timothy J Hohman, Michael L Cuccaro, Andrew J Saykin, Walter A Kukull, David A Bennett, Li-San Wang, Richard P Mayeux, Jonathan L Haines, Margaret A Pericak-Vance, Gerard D Schellenberg, Paul K Crane, Rhoda Au, Kathryn L Lunetta, Jesse Mez, Lindsay A Farrer
Background: Genome-wide association studies (GWAS) have identified over 1,000 blood pressure (BP) loci and over 80 loci for Alzheimer's disease (AD). Considering BP is an AD risk factor, identifying pleiotropy in BP and cognitive performance measures may indicate mechanistic links between BP and AD.
Methods: Genome-wide scans for pleiotropy in BP variables-systolic (SBP), diastolic (DBP), mean arterial (MAP), and pulse pressure (PP)-and co-calibrated scores for cognitive domains (executive function, language, and memory) were performed using generalized linear mixed models and 116,075 longitudinal measures from 25,726 participants of clinic-based and prospective cohorts. GWAS was conducted using PLACO to estimate each SNP's main effect and interaction with age, and their joint effect on pleiotropy. Effects of genome-wide significant (GWS) pleiotropic SNPs on cognition as direct or mediated through BP were evaluated using Mendelian randomization. Potential contribution of genes in top-ranked pleiotropic loci to cognitive resilience was assessed by comparing their expression in brain tissue from pathologically confirmed AD cases with and without clinical symptoms.
Results: Pleiotropy GWAS identified GWS associations with APOE and 11 novel loci. In the total sample, pleiotropy was identified for SBP and language with JPH2 ( PJoint =6.09×10 -9 ) and GATA3 ( PG×Age =1.42×10 -8 ), MAP and executive function with PAX2 ( PG×Age =4.22×10 -8 ), MAP and language with LOC105371656 ( PG×Age =1.75×10 -8 ), and DBP and language with SUFU ( PG =2.10×10 -8 ). In prospective cohorts, pleiotropy was found for SBP and language with RTN4 ( PG×Age =1.49×10 -8 ), DBP and executive function with ULK2 ( PJoint =2.85×10 -8 ), PP and memory with SORBS2 ( PG =2.33×10 -8 ), and DBP and memory with LOC100128993 ( PG×Age =2.81×10 -8 ). In clinic-based cohorts, pleiotropy was observed for PP and language with ADAMTS3 ( PG =2.37×10 -8 ) and SBP and memory with LINC02946 ( PG×Age =3.47×10 -8 ). Five GWS pleiotropic loci influence cognition directly, and genes at six pleiotropic loci were differentially expressed between pathologically confirmed AD cases with and without clinical symptoms.
Conclusion: Our results provide insight into the underlying mechanisms of high BP and AD. Ongoing efforts to harmonize BP and cognitive measures across several cohorts will improve the power of discovering, replicating, and generalizing novel associations with pleiotropic loci.
{"title":"Genome-wide pleiotropy analysis of longitudinal blood pressure and harmonized cognitive performance measures.","authors":"Moonil Kang, Ting Fang Alvin Ang, Sherral A Devine, Richard Sherva, Shubhabrata Mukherjee, Emily H Trittschuh, Laura E Gibbons, Phoebe Scollard, Michael Lee, Seo-Eun Choi, Brandon Klinedinst, Connie Nakano, Logan C Dumitrescu, Timothy J Hohman, Michael L Cuccaro, Andrew J Saykin, Walter A Kukull, David A Bennett, Li-San Wang, Richard P Mayeux, Jonathan L Haines, Margaret A Pericak-Vance, Gerard D Schellenberg, Paul K Crane, Rhoda Au, Kathryn L Lunetta, Jesse Mez, Lindsay A Farrer","doi":"10.1101/2025.02.11.25322014","DOIUrl":"https://doi.org/10.1101/2025.02.11.25322014","url":null,"abstract":"<p><strong>Background: </strong>Genome-wide association studies (GWAS) have identified over 1,000 blood pressure (BP) loci and over 80 loci for Alzheimer's disease (AD). Considering BP is an AD risk factor, identifying pleiotropy in BP and cognitive performance measures may indicate mechanistic links between BP and AD.</p><p><strong>Methods: </strong>Genome-wide scans for pleiotropy in BP variables-systolic (SBP), diastolic (DBP), mean arterial (MAP), and pulse pressure (PP)-and co-calibrated scores for cognitive domains (executive function, language, and memory) were performed using generalized linear mixed models and 116,075 longitudinal measures from 25,726 participants of clinic-based and prospective cohorts. GWAS was conducted using PLACO to estimate each SNP's main effect and interaction with age, and their joint effect on pleiotropy. Effects of genome-wide significant (GWS) pleiotropic SNPs on cognition as direct or mediated through BP were evaluated using Mendelian randomization. Potential contribution of genes in top-ranked pleiotropic loci to cognitive resilience was assessed by comparing their expression in brain tissue from pathologically confirmed AD cases with and without clinical symptoms.</p><p><strong>Results: </strong>Pleiotropy GWAS identified GWS associations with <i>APOE</i> and 11 novel loci. In the total sample, pleiotropy was identified for SBP and language with <i>JPH2</i> ( <i>P</i> <sub>Joint</sub> =6.09×10 <sup>-9</sup> ) and <i>GATA3</i> ( <i>P</i> <sub>G×Age</sub> =1.42×10 <sup>-8</sup> ), MAP and executive function with <i>PAX2</i> ( <i>P</i> <sub>G×Age</sub> =4.22×10 <sup>-8</sup> ), MAP and language with <i>LOC105371656</i> ( <i>P</i> <sub>G×Age</sub> =1.75×10 <sup>-8</sup> ), and DBP and language with <i>SUFU</i> ( <i>P</i> <sub>G</sub> =2.10×10 <sup>-8</sup> ). In prospective cohorts, pleiotropy was found for SBP and language with <i>RTN4</i> ( <i>P</i> <sub>G×Age</sub> =1.49×10 <sup>-8</sup> ), DBP and executive function with <i>ULK2</i> ( <i>P</i> <sub>Joint</sub> =2.85×10 <sup>-8</sup> ), PP and memory with <i>SORBS2</i> ( <i>P</i> <sub>G</sub> =2.33×10 <sup>-8</sup> ), and DBP and memory with <i>LOC100128993</i> ( <i>P</i> <sub>G×Age</sub> =2.81×10 <sup>-8</sup> ). In clinic-based cohorts, pleiotropy was observed for PP and language with <i>ADAMTS3</i> ( <i>P</i> <sub>G</sub> =2.37×10 <sup>-8</sup> ) and SBP and memory with <i>LINC02946</i> ( <i>P</i> <sub>G×Age</sub> =3.47×10 <sup>-8</sup> ). Five GWS pleiotropic loci influence cognition directly, and genes at six pleiotropic loci were differentially expressed between pathologically confirmed AD cases with and without clinical symptoms.</p><p><strong>Conclusion: </strong>Our results provide insight into the underlying mechanisms of high BP and AD. Ongoing efforts to harmonize BP and cognitive measures across several cohorts will improve the power of discovering, replicating, and generalizing novel associations with pleiotropic loci.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844603/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143485204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-13DOI: 10.1101/2025.02.12.25322167
Yufeng Song, Frances Mehl, Lyndsey M Muehling, Glenda Canderan, Kyle Enfield, Jie Sun, Michael T Yin, Sarah J Ratcliffe, Jeffrey M Wilson, Alexandra Kadl, Judith Woodfolk, Steven L Zeichner
Many difficult to understand clinical features characterize COVID-19 and Post-Acute Sequelae of COVID-19 (PASC or Long COVID, LC). These can include blood pressure instability, hyperinflammation, coagulopathies, and neuropsychiatric complaints. The pathogenesis of these features remains unclear. The SARS-CoV-2 Spike protein Receptor Binding Domain (RBD) binds Angiotensin Converting Enzyme 2 (ACE2) on the surface of host cells to initiate infection. We hypothesized that some patients may produce anti-RBD antibodies that resemble ACE2 sufficiently to have ACE2-like catalytic activity, that is they are ACE2-like proteolytic abzymes that may help mediate the pathogenesis of COVID-19 and LC. In previous work, we showed that some acute COVID-19 patients had immunoglobulin-associated ACE2-like proteolytic activity, suggesting that some COVID-19 patients indeed produced ACE2-like abzymes. However, it remained unknown whether ACE2-like abzymes were seen only in acute COVID-19 patients or whether ACE2-like abzymes could also be identified in convalescent COVID-19 patients. Here we show that some convalescent COVID-19 patients attending a clinic for patients with persistent pulmonary symptoms also have ACE2-like abzymes and that the presence of ACE2-like catalytic activity correlates with alterations in blood pressure in an exercise test.
{"title":"ACE-2-like Enzymatic Activity in COVID-19 Convalescent Patients with Persistent Pulmonary Symptoms Associated with Immunoglobulin.","authors":"Yufeng Song, Frances Mehl, Lyndsey M Muehling, Glenda Canderan, Kyle Enfield, Jie Sun, Michael T Yin, Sarah J Ratcliffe, Jeffrey M Wilson, Alexandra Kadl, Judith Woodfolk, Steven L Zeichner","doi":"10.1101/2025.02.12.25322167","DOIUrl":"10.1101/2025.02.12.25322167","url":null,"abstract":"<p><p>Many difficult to understand clinical features characterize COVID-19 and Post-Acute Sequelae of COVID-19 (PASC or Long COVID, LC). These can include blood pressure instability, hyperinflammation, coagulopathies, and neuropsychiatric complaints. The pathogenesis of these features remains unclear. The SARS-CoV-2 Spike protein Receptor Binding Domain (RBD) binds Angiotensin Converting Enzyme 2 (ACE2) on the surface of host cells to initiate infection. We hypothesized that some patients may produce anti-RBD antibodies that resemble ACE2 sufficiently to have ACE2-like catalytic activity, that is they are ACE2-like proteolytic abzymes that may help mediate the pathogenesis of COVID-19 and LC. In previous work, we showed that some acute COVID-19 patients had immunoglobulin-associated ACE2-like proteolytic activity, suggesting that some COVID-19 patients indeed produced ACE2-like abzymes. However, it remained unknown whether ACE2-like abzymes were seen only in acute COVID-19 patients or whether ACE2-like abzymes could also be identified in convalescent COVID-19 patients. Here we show that some convalescent COVID-19 patients attending a clinic for patients with persistent pulmonary symptoms also have ACE2-like abzymes and that the presence of ACE2-like catalytic activity correlates with alterations in blood pressure in an exercise test.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844610/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143485135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-13DOI: 10.1101/2025.02.12.25322170
Hui Li, Yihe Yang, Pingjian Ding, Rong Xu
Nearly 7.5% U.S. adults have long COVID. Recent epidemiological studies indicated that long COVID, is significantly associated with subsequent brain structure changes. However, it remains unknown if long COVID is causally associated with brain structure change. Here we applied two Mendelian Randomization (MR) methods - Inverse Variance Weighting MR method (IVW) for correlated instrument variables and Component analysis-based Generalized Method of Moments (PC-GMM) - to examine the potential causal relationships from long COVID to brain structure changes. The MR study was based on an instrumental variable analysis of data from a recent long COVID genome-wide association study (GWAS) (3,018 cases and 994,582 controls), the Enhancing NeuroImaging Genetics through Meta Analysis (ENIGMA) (Global and regional cortical measures, N = 33,709; combined hemispheric subcortical volumes, N = 38,851), and UK Biobank (left/right subcortical volumes, N = 19,629). We found no significant causal relationship between long COVID and brain structure changes. As we gain more insights into long COVID and its long-term health outcomes, future works are necessary to validate our findings and understand the mechanisms underlying the observed associations, though not causal, of long COVID with subsequent brain structure changes.
{"title":"Causal association of long COVID with brain structure changes: Findings from a 2-sample Mendelian randomization study.","authors":"Hui Li, Yihe Yang, Pingjian Ding, Rong Xu","doi":"10.1101/2025.02.12.25322170","DOIUrl":"10.1101/2025.02.12.25322170","url":null,"abstract":"<p><p>Nearly 7.5% U.S. adults have long COVID. Recent epidemiological studies indicated that long COVID, is significantly associated with subsequent brain structure changes. However, it remains unknown if long COVID is causally associated with brain structure change. Here we applied two Mendelian Randomization (MR) methods - Inverse Variance Weighting MR method (IVW) for correlated instrument variables and Component analysis-based Generalized Method of Moments (PC-GMM) - to examine the potential causal relationships from long COVID to brain structure changes. The MR study was based on an instrumental variable analysis of data from a recent long COVID genome-wide association study (GWAS) (3,018 cases and 994,582 controls), the Enhancing NeuroImaging Genetics through Meta Analysis (ENIGMA) (Global and regional cortical measures, N = 33,709; combined hemispheric subcortical volumes, N = 38,851), and UK Biobank (left/right subcortical volumes, N = 19,629). We found no significant causal relationship between long COVID and brain structure changes. As we gain more insights into long COVID and its long-term health outcomes, future works are necessary to validate our findings and understand the mechanisms underlying the observed associations, though not causal, of long COVID with subsequent brain structure changes.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844608/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143485137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}