Pub Date : 2024-12-06DOI: 10.1101/2024.12.04.24318517
Eric T Klopack, Gokul Seshadri, Thalida Em Arpawong, Steve Cole, Bharat Thyagarajan, Eileen M Crimmins
Increasingly, research suggests that aging is a coordinated multi-system decline in functioning that occurs at multiple biological levels. We developed and validated a transcriptomic (RNA-based) aging measure we call Transcriptomic Mortality-risk Age (TraMA) using RNA-seq data from the 2016 Health and Retirement Study using elastic net Cox regression analyses to predict 4-year mortality hazard. In a holdout test sample, TraMA was associated with earlier mortality, more chronic conditions, poorer cognitive functioning, and more limitations in activities of daily living. TraMA was also externally validated in the Long Life Family Study and several publicly available datasets. Results suggest that TraMA is a robust, portable RNAseq-based aging measure that is comparable, but independent from past biological aging measures (e.g., GrimAge). TraMA is likely to be of particular value to researchers interested in understanding the biological processes underlying health and aging, and for social, psychological, epidemiological, and demographic studies of health and aging.
{"title":"Development of a novel transcriptomic measure of aging: Transcriptomic Mortality-risk Age (TraMA).","authors":"Eric T Klopack, Gokul Seshadri, Thalida Em Arpawong, Steve Cole, Bharat Thyagarajan, Eileen M Crimmins","doi":"10.1101/2024.12.04.24318517","DOIUrl":"10.1101/2024.12.04.24318517","url":null,"abstract":"<p><p>Increasingly, research suggests that aging is a coordinated multi-system decline in functioning that occurs at multiple biological levels. We developed and validated a transcriptomic (RNA-based) aging measure we call Transcriptomic Mortality-risk Age (TraMA) using RNA-seq data from the 2016 Health and Retirement Study using elastic net Cox regression analyses to predict 4-year mortality hazard. In a holdout test sample, TraMA was associated with earlier mortality, more chronic conditions, poorer cognitive functioning, and more limitations in activities of daily living. TraMA was also externally validated in the Long Life Family Study and several publicly available datasets. Results suggest that TraMA is a robust, portable RNAseq-based aging measure that is comparable, but independent from past biological aging measures (e.g., GrimAge). TraMA is likely to be of particular value to researchers interested in understanding the biological processes underlying health and aging, and for social, psychological, epidemiological, and demographic studies of health and aging.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643192/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-06DOI: 10.1101/2024.12.03.24317000
Yuk Yee Leung, Wan-Ping Lee, Amanda B Kuzma, Heather Nicaretta, Otto Valladares, Prabhakaran Gangadharan, Liming Qu, Yi Zhao, Youli Ren, Po-Liang Cheng, Pavel P Kuksa, Hui Wang, Heather White, Zivadin Katanic, Lauren Bass, Naveen Saravanan, Emily Greenfest-Allen, Maureen Kirsch, Laura Cantwell, Taha Iqbal, Nicholas R Wheeler, John J Farrell, Congcong Zhu, Shannon L Turner, Tamil I Gunasekaran, Pedro R Mena, Jimmy Jin, Luke Carter, Xiaoling Zhang, Badri N Vardarajan, Arthur Toga, Michael Cuccaro, Timothy J Hohman, William S Bush, Adam C Naj, Eden Martin, Clifton Dalgard, Brian W Kunkle, Lindsay A Farrer, Richard P Mayeux, Jonathan L Haines, Margaret A Pericak-Vance, Gerard D Schellenberg, Li-San Wang
The Alzheimer's Disease Sequencing Project (ADSP) is a national initiative to understand the genetic architecture of Alzheimer's Disease and Related Dementias (AD/ADRD) by sequencing whole genomes of affected participants and age-matched cognitive controls from diverse populations. The Genome Center for Alzheimer's Disease (GCAD) processed whole-genome sequencing data from 36,361 ADSP participants, including 35,014 genetically unique participants of which 45% are from non-European ancestry, across 17 cohorts in 14 countries in this fourth release (R4). This sequencing effort identified 387 million bi-allelic variants, 42 million short insertions/deletions, and 2.2 million structural variants. Annotations and quality control data are available for all variants and samples. Additionally, detailed phenotypes from 15,927 participants across 10 domains are also provided. A linkage disequilibrium panel was created using unrelated AD cases and controls. Researchers can access and analyze the genetic data via NIAGADS Data Sharing Service, the VariXam tool, or NIAGADS GenomicsDB.
{"title":"Alzheimer's Disease Sequencing Project Release 4 Whole Genome Sequencing Dataset.","authors":"Yuk Yee Leung, Wan-Ping Lee, Amanda B Kuzma, Heather Nicaretta, Otto Valladares, Prabhakaran Gangadharan, Liming Qu, Yi Zhao, Youli Ren, Po-Liang Cheng, Pavel P Kuksa, Hui Wang, Heather White, Zivadin Katanic, Lauren Bass, Naveen Saravanan, Emily Greenfest-Allen, Maureen Kirsch, Laura Cantwell, Taha Iqbal, Nicholas R Wheeler, John J Farrell, Congcong Zhu, Shannon L Turner, Tamil I Gunasekaran, Pedro R Mena, Jimmy Jin, Luke Carter, Xiaoling Zhang, Badri N Vardarajan, Arthur Toga, Michael Cuccaro, Timothy J Hohman, William S Bush, Adam C Naj, Eden Martin, Clifton Dalgard, Brian W Kunkle, Lindsay A Farrer, Richard P Mayeux, Jonathan L Haines, Margaret A Pericak-Vance, Gerard D Schellenberg, Li-San Wang","doi":"10.1101/2024.12.03.24317000","DOIUrl":"10.1101/2024.12.03.24317000","url":null,"abstract":"<p><p>The Alzheimer's Disease Sequencing Project (ADSP) is a national initiative to understand the genetic architecture of Alzheimer's Disease and Related Dementias (AD/ADRD) by sequencing whole genomes of affected participants and age-matched cognitive controls from diverse populations. The Genome Center for Alzheimer's Disease (GCAD) processed whole-genome sequencing data from 36,361 ADSP participants, including 35,014 genetically unique participants of which 45% are from non-European ancestry, across 17 cohorts in 14 countries in this fourth release (R4). This sequencing effort identified 387 million bi-allelic variants, 42 million short insertions/deletions, and 2.2 million structural variants. Annotations and quality control data are available for all variants and samples. Additionally, detailed phenotypes from 15,927 participants across 10 domains are also provided. A linkage disequilibrium panel was created using unrelated AD cases and controls. Researchers can access and analyze the genetic data via NIAGADS Data Sharing Service, the VariXam tool, or NIAGADS GenomicsDB.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643159/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-06DOI: 10.1101/2024.12.04.24318429
Cecília Artico Banho, Maisa Carla Pereira Parra, Olivia Borghi Nascimento, Gabriel Pires Magnani, Maria Vitoria Moraes Ferreira, Ana Paula Lemos, Beatriz de Carvalho Marques, Marini Lino Brancini, Livia Sacchetto, Andreia Francesli Negri, Regiane Maria Tironi Menezes, Juliana Telles de Deus, Cassia Fernanda Estofolete, Nikos Vasilakis, Maurício Lacerda Nogueira
Background: Brazil is considered an epicenter for emerging and re-emerging arboviruses that significantly impact public health. The mid-sized city of São José do Rio Preto (SJdRP) in northwestern São Paulo state is considered hyperendemic for arboviral diseases, with case numbers climbing each year. Only 45 cases of chikungunya (CHIKV) were reported in the city from 2015 to 2022, indicating cryptic circulation of this virus, but cases in the state increased notably in 2023. This study investigates the use of active entomological surveillance to detect new arbovirus introductions in specific areas like SJdRP.
Methodology/principal findings: We used molecular testing to investigate the presence of CHIKV in adult culicids collected monthly from various neighborhoods in SJdRP. Positive samples underwent whole-genome sequencing and phylogenetic analysis. Entomological surveillance successfully detected the early spread of CHIKV across SJdRP, revealing an infection rate of 6.67%, with the well-established vectors Aedes aegypti and Ae. albopictus as well as Culex sp. carrying the virus. The vector positivity rate increased from December 2023 to April 2024, which correlates with rising numbers of chikungunya fever cases reported in SJdRP during the same period. The resurgence of CHIKV in this region is attributed to several introduction events, mainly from the Southeast and North of Brazil, which facilitated establishment of the virus within the highly dense vector population and led to extensive spread and, in turn, a major CHIKV epidemic in this geographical area.
Conclusions/significance: Extensive circulation of CHIKV was documented within the human and vector population, marking the onset of the first major CHIKV epidemic in SJdRP and neighboring cities. Because multiple arboviruses co-circulate in several locations in Brazil, entomological surveillance, along with ongoing monitoring of patient samples, is a key to help health authorities to implement more effective measures to interrupt transmission cycles and mitigate new epidemic waves.
{"title":"Entomological surveillance during a major CHIKV outbreak in northwestern São Paulo: insights from São José do Rio Preto.","authors":"Cecília Artico Banho, Maisa Carla Pereira Parra, Olivia Borghi Nascimento, Gabriel Pires Magnani, Maria Vitoria Moraes Ferreira, Ana Paula Lemos, Beatriz de Carvalho Marques, Marini Lino Brancini, Livia Sacchetto, Andreia Francesli Negri, Regiane Maria Tironi Menezes, Juliana Telles de Deus, Cassia Fernanda Estofolete, Nikos Vasilakis, Maurício Lacerda Nogueira","doi":"10.1101/2024.12.04.24318429","DOIUrl":"10.1101/2024.12.04.24318429","url":null,"abstract":"<p><strong>Background: </strong>Brazil is considered an epicenter for emerging and re-emerging arboviruses that significantly impact public health. The mid-sized city of São José do Rio Preto (SJdRP) in northwestern São Paulo state is considered hyperendemic for arboviral diseases, with case numbers climbing each year. Only 45 cases of chikungunya (CHIKV) were reported in the city from 2015 to 2022, indicating cryptic circulation of this virus, but cases in the state increased notably in 2023. This study investigates the use of active entomological surveillance to detect new arbovirus introductions in specific areas like SJdRP.</p><p><strong>Methodology/principal findings: </strong>We used molecular testing to investigate the presence of CHIKV in adult culicids collected monthly from various neighborhoods in SJdRP. Positive samples underwent whole-genome sequencing and phylogenetic analysis. Entomological surveillance successfully detected the early spread of CHIKV across SJdRP, revealing an infection rate of 6.67%, with the well-established vectors <i>Aedes aegypti</i> and <i>Ae. albopictus</i> as well as <i>Culex</i> sp. carrying the virus. The vector positivity rate increased from December 2023 to April 2024, which correlates with rising numbers of chikungunya fever cases reported in SJdRP during the same period. The resurgence of CHIKV in this region is attributed to several introduction events, mainly from the Southeast and North of Brazil, which facilitated establishment of the virus within the highly dense vector population and led to extensive spread and, in turn, a major CHIKV epidemic in this geographical area.</p><p><strong>Conclusions/significance: </strong>Extensive circulation of CHIKV was documented within the human and vector population, marking the onset of the first major CHIKV epidemic in SJdRP and neighboring cities. Because multiple arboviruses co-circulate in several locations in Brazil, entomological surveillance, along with ongoing monitoring of patient samples, is a key to help health authorities to implement more effective measures to interrupt transmission cycles and mitigate new epidemic waves.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643186/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-06DOI: 10.1101/2024.12.04.24318520
Chelsea R Baker, Ivan Barilar, Leonardo S de Araujo, Daniel M Parker, Kimberly Fornace, Patrick K Moonan, John E Oeltmann, James L Tobias, Volodymyr M Minin, Chawangwa Modongo, Nicola M Zetola, Stefan Niemann, Sanghyuk S Shin
Background: The integration of genomic and geospatial data into infectious disease transmission analyses typically includes residential locations and excludes other activity spaces where transmission may occur (e.g. work, school, or social venues). The objective of this analysis was to explore residential as well as other activity spaces of tuberculosis (TB) outbreaks to identify potential geospatial 'hotspots' of transmission.
Methods: We analyzed data that included geospatial coordinates for residence and other activity spaces collected during 2012-2016 for the Kopanyo Study, a population-based study of TB transmission in Botswana. We included participants with results from whole genome sequencing conducted on archived samples from the original study. We used a spatial log-Gaussian Cox process model to detect core areas of increased activity spaces of individuals belonging to TB outbreaks (genotypic groups with ≤5 single-nucleotide polymorphisms), which we compared to ungrouped participants (those not in a genotypic group of any size).
Findings: We analyzed data collected from 636 participants, including 70 participants belonging to six outbreak groups with a combined total of 293 locations, and 566 ungrouped participants with a combined total of 2289 locations. Core areas of activity space for each outbreak group were geographically distinct, and we found evidence of localized transmission in four of six outbreaks. For most of the outbreaks, including activity space data led to the detection of larger areas of higher spatial intensity and more focal points compared to residential location alone.
Interpretation: Geospatial analysis using activity space data (social gathering places as well as residence) may lead to improved understanding of areas of infectious disease transmission compared to using residential data alone.
Funding: This work was supported by funding from the National Institute of Allergy and Infectious Diseases R01AI097045, R01AI147336, and R01AI170204.
背景:将基因组学和地理空间数据整合到传染病传播分析中通常包括居住地,而不包括可能发生传播的其他活动场所(如工作、学校或社交场所)。本分析的目的是探索结核病爆发的居住地及其他活动场所,以确定潜在的地理空间传播 "热点":我们分析了 2012-2016 年期间收集的 Kopanyo 研究数据,其中包括居住地和其他活动场所的地理空间坐标,该研究是一项基于人口的博茨瓦纳结核病传播研究。我们纳入了对原始研究的存档样本进行全基因组测序并得出结果的参与者。我们使用空间对数-高斯考克斯过程模型来检测属于结核病爆发(单核苷酸多态性≤5的基因型群体)的个人活动空间增加的核心区域,并将其与未分组参与者(不属于任何规模的基因型群体)进行比较:我们分析了从 636 名参与者收集到的数据,其中 70 名参与者属于 6 个疫情爆发组,总计 293 个地点;566 名未分组参与者,总计 2289 个地点。每个疫情爆发群体的核心活动区域在地理位置上截然不同,我们在六次疫情爆发中的四次发现了局部传播的证据。在大多数疫情中,与仅检测居住地点相比,包含活动空间数据可检测到更大范围、更高空间强度和更多焦点:解释:与仅使用居住地数据相比,使用活动空间数据(社交聚会场所以及居住地)进行地理空间分析可能会加深对传染病传播区域的了解:这项工作得到了美国国家过敏与传染病研究所(National Institute of Allergy and Infectious Diseases)R01AI097045、R01AI147336 和 R01AI170204 的资助。
{"title":"Using genomic epidemiology and geographic activity spaces to investigate tuberculosis outbreaks in Botswana.","authors":"Chelsea R Baker, Ivan Barilar, Leonardo S de Araujo, Daniel M Parker, Kimberly Fornace, Patrick K Moonan, John E Oeltmann, James L Tobias, Volodymyr M Minin, Chawangwa Modongo, Nicola M Zetola, Stefan Niemann, Sanghyuk S Shin","doi":"10.1101/2024.12.04.24318520","DOIUrl":"10.1101/2024.12.04.24318520","url":null,"abstract":"<p><strong>Background: </strong>The integration of genomic and geospatial data into infectious disease transmission analyses typically includes residential locations and excludes other activity spaces where transmission may occur (<i>e.g.</i> work, school, or social venues). The objective of this analysis was to explore residential as well as other activity spaces of tuberculosis (TB) outbreaks to identify potential geospatial 'hotspots' of transmission.</p><p><strong>Methods: </strong>We analyzed data that included geospatial coordinates for residence and other activity spaces collected during 2012-2016 for the Kopanyo Study, a population-based study of TB transmission in Botswana. We included participants with results from whole genome sequencing conducted on archived samples from the original study. We used a spatial log-Gaussian Cox process model to detect core areas of increased activity spaces of individuals belonging to TB outbreaks (genotypic groups with ≤5 single-nucleotide polymorphisms), which we compared to ungrouped participants (those not in a genotypic group of any size).</p><p><strong>Findings: </strong>We analyzed data collected from 636 participants, including 70 participants belonging to six outbreak groups with a combined total of 293 locations, and 566 ungrouped participants with a combined total of 2289 locations. Core areas of activity space for each outbreak group were geographically distinct, and we found evidence of localized transmission in four of six outbreaks. For most of the outbreaks, including activity space data led to the detection of larger areas of higher spatial intensity and more focal points compared to residential location alone.</p><p><strong>Interpretation: </strong>Geospatial analysis using activity space data (social gathering places as well as residence) may lead to improved understanding of areas of infectious disease transmission compared to using residential data alone.</p><p><strong>Funding: </strong>This work was supported by funding from the National Institute of Allergy and Infectious Diseases R01AI097045, R01AI147336, and R01AI170204.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643207/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-05DOI: 10.1101/2024.12.03.24318372
Genevieve E Romanowicz, Kristin Popp, Ethan Dinh, Isabella R Harker, Kelly Leguineche, Julie M Hughes, Kathryn E Ackerman, Mary L Bouxsein, Robert E Guldberg
Up to 40% of elite athletes experience bone stress injuries (BSIs), with 20-30% facing reinjury. Early identification of runners at high risk of subsequent BSI could improve prevention strategies. However, the complex etiology and multifactorial risk factors of BSIs makes identifying predictive risk factors challenging. In a study of 30 female recreational athletes with tibial BSIs, 10 experienced additional BSIs over a 1-year period, prompting investigation of systemic biomarkers of subsequent BSIs using aptamer-based proteomic technology. We hypothesized that early proteomic signatures could discriminate runners who experienced subsequent BSIs. 1,500 proteins related to metabolic, immune, and bone healing pathways were examined. Using supervised machine learning and genetic programming methods, we analyzed serum protein signatures over the 1-year monitoring period. Models were also created with clinical metrics, including standard-of-care blood analysis, bone density measures, and health histories. Protein signatures collected within three weeks of BSI diagnosis achieved the greatest separation by sparse partial least squares discriminant analysis (sPLS-DA), clustering single and recurrent BSI individuals with a mean accuracy of 96 ± 0.02%. Genetic programming models independently verified the presence of candidate biomarkers, including fumarylacetoacetase, osteopontin, and trypsin-2, which significantly outperformed clinical metrics. Time-course differential expression analysis highlighted 112 differentially expressed proteins in individuals with additional BSIs. Gene set enrichment analysis mapped these proteins to pathways indicating increased fibrin clot formation and decreased immune signaling in recurrent BSI individuals. These findings provide new insights into biomarkers and dysregulated protein pathways associated with recurrent BSI and may lead to new preventative or therapeutic intervention strategies.
One sentence summary: Our study identified candidate serum biomarkers to predict subsequent bone stress injuries in female runners, offering new insights for clinical monitoring and interventions.
{"title":"Deciphering Risk of Recurrent Bone Stress Injury in Female Runners Using Serum Proteomics Analysis and Predictive Models.","authors":"Genevieve E Romanowicz, Kristin Popp, Ethan Dinh, Isabella R Harker, Kelly Leguineche, Julie M Hughes, Kathryn E Ackerman, Mary L Bouxsein, Robert E Guldberg","doi":"10.1101/2024.12.03.24318372","DOIUrl":"10.1101/2024.12.03.24318372","url":null,"abstract":"<p><p>Up to 40% of elite athletes experience bone stress injuries (BSIs), with 20-30% facing reinjury. Early identification of runners at high risk of subsequent BSI could improve prevention strategies. However, the complex etiology and multifactorial risk factors of BSIs makes identifying predictive risk factors challenging. In a study of 30 female recreational athletes with tibial BSIs, 10 experienced additional BSIs over a 1-year period, prompting investigation of systemic biomarkers of subsequent BSIs using aptamer-based proteomic technology. We hypothesized that early proteomic signatures could discriminate runners who experienced subsequent BSIs. 1,500 proteins related to metabolic, immune, and bone healing pathways were examined. Using supervised machine learning and genetic programming methods, we analyzed serum protein signatures over the 1-year monitoring period. Models were also created with clinical metrics, including standard-of-care blood analysis, bone density measures, and health histories. Protein signatures collected within three weeks of BSI diagnosis achieved the greatest separation by sparse partial least squares discriminant analysis (sPLS-DA), clustering single and recurrent BSI individuals with a mean accuracy of 96 ± 0.02%. Genetic programming models independently verified the presence of candidate biomarkers, including fumarylacetoacetase, osteopontin, and trypsin-2, which significantly outperformed clinical metrics. Time-course differential expression analysis highlighted 112 differentially expressed proteins in individuals with additional BSIs. Gene set enrichment analysis mapped these proteins to pathways indicating increased fibrin clot formation and decreased immune signaling in recurrent BSI individuals. These findings provide new insights into biomarkers and dysregulated protein pathways associated with recurrent BSI and may lead to new preventative or therapeutic intervention strategies.</p><p><strong>One sentence summary: </strong>Our study identified candidate serum biomarkers to predict subsequent bone stress injuries in female runners, offering new insights for clinical monitoring and interventions.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643168/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-05DOI: 10.1101/2024.12.04.24318451
Tea Pribić, Jayanta K Das, Lovorka Đerek, Daniel W Belsky, Melissa Orenduff, Kim M Huffman, William E Kraus, Helena Deriš, Jelena Šimunović, Tamara Štambuk, Azra Frkatović Hodžić, Virginia B Kraus, Sai Krupa Das, Susan B Racette, Nirad Banskota, Luigi Ferruci, Carl Pieper, Nathan E Lewis, Gordan Lauc, Sridevi Krishnan
Background/objective: In a subset of participants from the CALERIE™ Phase 2 study we evaluated the effects of 2y of ~25% Calorie Restriction (CR) diet on IgG N-glycosylation (GlycAge), plasma and complement C3 N-glycome as markers of aging and inflammaging.
Methods: Plasma samples from 26 participants in the CR group who completed the CALERIE2 trial and were deemed adherent to the intervention (~>10 % CR at 12 mo) were obtained from the NIA AgingResearchBiobank. Glycomic investigations using UPLC or LC-MS analyses were conducted on samples from baseline (BL), mid-intervention (12 mo) and post-intervention (24 mo), and changes resulting from the 2y CR intervention were examined. In addition, anthropometric, clinical, metabolic, DNA methylation (epigenetic) and skeletal muscle transcriptomic data were analyzed to identify aging-related changes that occurred in tandem with the N-glycome changes.
Results: Following the 2y CR intervention, IgG galactosylation was higher at 24mo compared to BL (p = 0.051), digalactosylation and GlycAge (the IgG-based surrogate for biological age) were not different between BL and 12mo or BL and 24mo, but increased between 12mo and 24mo (p = 0.016, 0.027 respectively). GlycAge was also positively associated with TNF-α and ICAM-1 (p=0.030, p=0.017 respectively). Plasma highly branched glycans were decreased by the 2y intervention (BL vs 24 mo: p=0.013), but both plasma and IgG bisecting GlcNAcs were increased (BL vs 24mo: p<0.001, p = 0.01 respectively). Furthermore, total complement C3 protein concentrations were reduced (BL vs 24mo: p <0.001), as were Man9 glycoforms (BL vs 24mo: p<0.001), and Man10 (which is glucosylated) C3 glycoforms (BL vs 24mo: p = 0.046).
Conclusions: 24-mos of CR was associated with several favorable, anti-aging, anti-inflammatory changes in the glycome: increased galactosylation, reduced branching glycans, and reduced GlycAge. These promising CR effects were accompanied by an increase in bisecting GlcNAc, a known pro-inflammatory biomarker. These intriguing findings linking CR, clinical, and glycomic changes may be anti-aging and inflammatory, and merit additional investigation.
{"title":"A 2-year calorie restriction intervention reduces glycomic biological age biomarkers.","authors":"Tea Pribić, Jayanta K Das, Lovorka Đerek, Daniel W Belsky, Melissa Orenduff, Kim M Huffman, William E Kraus, Helena Deriš, Jelena Šimunović, Tamara Štambuk, Azra Frkatović Hodžić, Virginia B Kraus, Sai Krupa Das, Susan B Racette, Nirad Banskota, Luigi Ferruci, Carl Pieper, Nathan E Lewis, Gordan Lauc, Sridevi Krishnan","doi":"10.1101/2024.12.04.24318451","DOIUrl":"10.1101/2024.12.04.24318451","url":null,"abstract":"<p><strong>Background/objective: </strong>In a subset of participants from the CALERIE<sup>™</sup> Phase 2 study we evaluated the effects of 2y of ~25% Calorie Restriction (CR) diet on IgG N-glycosylation (GlycAge), plasma and complement C3 N-glycome as markers of aging and inflammaging.</p><p><strong>Methods: </strong>Plasma samples from 26 participants in the CR group who completed the CALERIE2 trial and were deemed adherent to the intervention (~>10 % CR at 12 mo) were obtained from the NIA AgingResearchBiobank. Glycomic investigations using UPLC or LC-MS analyses were conducted on samples from baseline (BL), mid-intervention (12 mo) and post-intervention (24 mo), and changes resulting from the 2y CR intervention were examined. In addition, anthropometric, clinical, metabolic, DNA methylation (epigenetic) and skeletal muscle transcriptomic data were analyzed to identify aging-related changes that occurred in tandem with the N-glycome changes.</p><p><strong>Results: </strong>Following the 2y CR intervention, IgG galactosylation was higher at 24mo compared to BL (p = 0.051), digalactosylation and GlycAge (the IgG-based surrogate for biological age) were not different between BL and 12mo or BL and 24mo, but increased between 12mo and 24mo (p = 0.016, 0.027 respectively). GlycAge was also positively associated with TNF-α and ICAM-1 (p=0.030, p=0.017 respectively). Plasma highly branched glycans were decreased by the 2y intervention (BL vs 24 mo: p=0.013), but both plasma and IgG bisecting GlcNAcs were increased (BL vs 24mo: p<0.001, p = 0.01 respectively). Furthermore, total complement C3 protein concentrations were reduced (BL vs 24mo: p <0.001), as were Man9 glycoforms (BL vs 24mo: p<0.001), and Man10 (which is glucosylated) C3 glycoforms (BL vs 24mo: p = 0.046).</p><p><strong>Conclusions: </strong>24-mos of CR was associated with several favorable, anti-aging, anti-inflammatory changes in the glycome: increased galactosylation, reduced branching glycans, and reduced GlycAge. These promising CR effects were accompanied by an increase in bisecting GlcNAc, a known pro-inflammatory biomarker. These intriguing findings linking CR, clinical, and glycomic changes may be anti-aging and inflammatory, and merit additional investigation.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643172/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-05DOI: 10.1101/2024.12.03.24318375
Kexin Huang, Tony Zeng, Soner Koc, Alexandra Pettet, Jingtian Zhou, Mika Jain, Dongbo Sun, Camilo Ruiz, Hongyu Ren, Laurence Howe, Tom G Richardson, Adrian Cortes, Katie Aiello, Kim Branson, Andreas Pfenning, Jesse M Engreitz, Martin Jinye Zhang, Jure Leskovec
Genome-wide association studies (GWASs) have identified tens of thousands of disease associated variants and provided critical insights into developing effective treatments. However, limited sample sizes have hindered the discovery of variants for uncommon and rare diseases. Here, we introduce KGWAS, a novel geometric deep learning method that leverages a massive functional knowledge graph across variants and genes to improve detection power in small-cohort GWASs significantly. KGWAS assesses the strength of a variant's association to disease based on the aggregate GWAS evidence across molecular elements interacting with the variant within the knowledge graph. Comprehensive simulations and replication experiments showed that, for small sample sizes ( N =1-10K), KGWAS identified up to 100% more statistically significant associations than state-of-the-art GWAS methods and achieved the same statistical power with up to 2.67× fewer samples. We applied KGWAS to 554 uncommon UK Biobank diseases ( Ncase <5K) and identified 183 more associations (46.9% improvement) than the original GWAS, where the gain further increases to 79.8% for 141 rare diseases (N case <300). The KGWAS-only discoveries are supported by abundant functional evidence, such as rs2155219 (on 11q13) associated with ulcerative colitis potentially via regulating LRRC32 expression in CD4+ regulatory T cells, and rs7312765 (on 12q12) associated with the rare disease myasthenia gravis potentially via regulating PPHLN1 expression in neuron-related cell types. Furthermore, KGWAS consistently improves downstream analyses such as identifying disease-specific network links for interpreting GWAS variants, identifying disease-associated genes, and identifying disease-relevant cell populations. Overall, KGWAS is a flexible and powerful AI model that integrates growing functional genomics data to discover novel variants, genes, cells, and networks, especially valuable for small cohort diseases.
{"title":"Small-cohort GWAS discovery with AI over massive functional genomics knowledge graph.","authors":"Kexin Huang, Tony Zeng, Soner Koc, Alexandra Pettet, Jingtian Zhou, Mika Jain, Dongbo Sun, Camilo Ruiz, Hongyu Ren, Laurence Howe, Tom G Richardson, Adrian Cortes, Katie Aiello, Kim Branson, Andreas Pfenning, Jesse M Engreitz, Martin Jinye Zhang, Jure Leskovec","doi":"10.1101/2024.12.03.24318375","DOIUrl":"10.1101/2024.12.03.24318375","url":null,"abstract":"<p><p>Genome-wide association studies (GWASs) have identified tens of thousands of disease associated variants and provided critical insights into developing effective treatments. However, limited sample sizes have hindered the discovery of variants for uncommon and rare diseases. Here, we introduce KGWAS, a novel geometric deep learning method that leverages a massive functional knowledge graph across variants and genes to improve detection power in small-cohort GWASs significantly. KGWAS assesses the strength of a variant's association to disease based on the aggregate GWAS evidence across molecular elements interacting with the variant within the knowledge graph. Comprehensive simulations and replication experiments showed that, for small sample sizes ( <i>N</i> =1-10K), KGWAS identified up to 100% more statistically significant associations than state-of-the-art GWAS methods and achieved the same statistical power with up to 2.67× fewer samples. We applied KGWAS to 554 uncommon UK Biobank diseases ( <i>N</i> <sub>case</sub> <5K) and identified 183 more associations (46.9% improvement) than the original GWAS, where the gain further increases to 79.8% for 141 rare diseases (N <sub>case</sub> <300). The KGWAS-only discoveries are supported by abundant functional evidence, such as rs2155219 (on 11q13) associated with ulcerative colitis potentially via regulating <i>LRRC32</i> expression in CD4+ regulatory T cells, and rs7312765 (on 12q12) associated with the rare disease myasthenia gravis potentially via regulating <i>PPHLN1</i> expression in neuron-related cell types. Furthermore, KGWAS consistently improves downstream analyses such as identifying disease-specific network links for interpreting GWAS variants, identifying disease-associated genes, and identifying disease-relevant cell populations. Overall, KGWAS is a flexible and powerful AI model that integrates growing functional genomics data to discover novel variants, genes, cells, and networks, especially valuable for small cohort diseases.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643201/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-05DOI: 10.1101/2024.12.03.24318408
Joni-Leigh Webster, Sahithi Lakamana, Yao Ge, Abeed Sarker
Black women and men outpace other races for stimulant-involved overdose mortality despite lower lifetime use. Growth in mortality from prescription stimulant medications is increasing in tandem with prescribing patterns for these medications. We used Twitter to explore nonmedical prescription stimulant use (NMPSU) among Black women and men using emotion and sentiment analysis, and topic modeling. We applied the NRC Lexicon and VADER dictionary, and LDA topic modeling to examine feelings and themes in conversations about NMPSU by gender. We paid attention to the ability of natural language processing techniques to detect differences in emotion and sentiment among Black Twitter subscribers given increased mortality from stimulants. We found that, although emotion and sentiment outcomes match the directionality of emotions and sentiment observed (i.e., Black Twitter subscribers use more positive language in tweets), this belies limitations of NRC and VADER dictionaries to distinguish feelings for Black people. Even still, LDA topic models showcased the relevance of hip-hop, dependence on NMPSU, and recreational use as consequential to Black Twitter subscribers' discussions. However, gender shaped the relevance of these topics for each group. Greater attention needs to be paid to how Black women and men use social media to discuss important topics like drug use. Natural language processing methods and social media research should include larger proportions of Black, Hispanic/Latinx, and American Indian populations in development of emotion and sentiment lexicons, otherwise outcomes regarding NMPSU will not be generalizable to populations writ large due to cultural differences in communication about drug use online.
{"title":"\"I Been Taking Adderall Mixing it With Lean, Hope I Don't Wake Up Out My Sleep\": Harnessing Twitter to Understand Nonmedical Prescription Stimulant Use among Black Women and Men Subscribers.","authors":"Joni-Leigh Webster, Sahithi Lakamana, Yao Ge, Abeed Sarker","doi":"10.1101/2024.12.03.24318408","DOIUrl":"10.1101/2024.12.03.24318408","url":null,"abstract":"<p><p>Black women and men outpace other races for stimulant-involved overdose mortality despite lower lifetime use. Growth in mortality from prescription stimulant medications is increasing in tandem with prescribing patterns for these medications. We used Twitter to explore nonmedical prescription stimulant use (NMPSU) among Black women and men using emotion and sentiment analysis, and topic modeling. We applied the NRC Lexicon and VADER dictionary, and LDA topic modeling to examine feelings and themes in conversations about NMPSU by gender. We paid attention to the ability of natural language processing techniques to detect differences in emotion and sentiment among Black Twitter subscribers given increased mortality from stimulants. We found that, although emotion and sentiment outcomes match the directionality of emotions and sentiment observed (i.e., Black Twitter subscribers use more positive language in tweets), this belies limitations of NRC and VADER dictionaries to distinguish feelings for Black people. Even still, LDA topic models showcased the relevance of hip-hop, dependence on NMPSU, and recreational use as consequential to Black Twitter subscribers' discussions. However, gender shaped the relevance of these topics for each group. Greater attention needs to be paid to how Black women and men use social media to discuss important topics like drug use. Natural language processing methods and social media research should include larger proportions of Black, Hispanic/Latinx, and American Indian populations in development of emotion and sentiment lexicons, otherwise outcomes regarding NMPSU will not be generalizable to populations writ large due to cultural differences in communication about drug use online.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643189/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-05DOI: 10.1101/2024.12.04.24318493
Lucas Pietan, Elizabeth Phillippi, Marcelo Melo, Hatem El-Shanti, Brian J Smith, Benjamin Darbro, Terry Braun, Thomas Casavant
The COVID-19 pandemic has caused substantial worldwide disruptions in health, economy, and society, manifesting symptoms such as loss of smell (anosmia) and loss of taste (ageusia), that can result in prolonged sensory impairment. Establishing the host genetic etiology of anosmia and ageusia in COVID-19 will aid in the overall understanding of the sensorineural aspect of the disease and contribute to possible treatments or cures. By using human genome sequencing data from the University of Iowa (UI) COVID-19 cohort (N=187) and the National Institute of Health All of Us (AoU) Research Program COVID-19 cohort (N=947), we investigated the genetics of anosmia and/or ageusia by employing feature selection techniques to construct a novel variant and gene prioritization pipeline, utilizing machine learning methods for the classification of patients. Models were assessed using a permutation-based variable importance (PVI) strategy for final prioritization of candidate variants and genes. The highest held-out test set area under the receiver operating characteristic (AUROC) curve for models and datasets from the UI cohort was 0.735 and 0.798 for the variant and gene analysis respectively and for the AoU cohort was 0.687 for the variant analysis. Our analysis prioritized several novel and known candidate host genetic factors involved in immune response, neuronal signaling, and calcium signaling supporting previously proposed hypotheses for anosmia/ageusia in COVID-19.
COVID-19 大流行在全球范围内对健康、经济和社会造成了严重破坏,表现出嗅觉丧失(anosmia)和味觉丧失(ageusia)等症状,可导致长时间的感觉障碍。确定 COVID-19 中嗅觉缺失和味觉缺失的宿主遗传病因将有助于全面了解该疾病的感音神经方面,并为可能的治疗或治愈方法做出贡献。通过利用爱荷华大学(UI)COVID-19队列(N=187)和美国国立卫生研究院(National Institute of Health All of Us,AoU)研究计划COVID-19队列(N=947)的人类基因组测序数据,我们采用特征选择技术构建了一个新型变体和基因优先级管道,并利用机器学习方法对患者进行分类,从而研究了无嗅症和/或老年性无嗅症的遗传学。使用基于置换的变量重要性(PVI)策略对模型进行评估,以最终确定候选变体和基因的优先级。对于 UI 队列中的模型和数据集,变异分析和基因分析的最高保持测试集接收器操作特征曲线下面积(AUROC)分别为 0.735 和 0.798,而对于 AoU 队列,变异分析的最高保持测试集接收器操作特征曲线下面积为 0.687。我们的分析优先考虑了涉及免疫反应、神经元信号传导和钙信号传导的几个新的和已知的候选宿主遗传因子,支持之前提出的 COVID-19 中无精/老年痴呆症的假说。
{"title":"Genome-wide Machine Learning Analysis of Anosmia and Ageusia with COVID-19.","authors":"Lucas Pietan, Elizabeth Phillippi, Marcelo Melo, Hatem El-Shanti, Brian J Smith, Benjamin Darbro, Terry Braun, Thomas Casavant","doi":"10.1101/2024.12.04.24318493","DOIUrl":"10.1101/2024.12.04.24318493","url":null,"abstract":"<p><p>The COVID-19 pandemic has caused substantial worldwide disruptions in health, economy, and society, manifesting symptoms such as loss of smell (anosmia) and loss of taste (ageusia), that can result in prolonged sensory impairment. Establishing the host genetic etiology of anosmia and ageusia in COVID-19 will aid in the overall understanding of the sensorineural aspect of the disease and contribute to possible treatments or cures. By using human genome sequencing data from the University of Iowa (UI) COVID-19 cohort (N=187) and the National Institute of Health All of Us (AoU) Research Program COVID-19 cohort (N=947), we investigated the genetics of anosmia and/or ageusia by employing feature selection techniques to construct a novel variant and gene prioritization pipeline, utilizing machine learning methods for the classification of patients. Models were assessed using a permutation-based variable importance (PVI) strategy for final prioritization of candidate variants and genes. The highest held-out test set area under the receiver operating characteristic (AUROC) curve for models and datasets from the UI cohort was 0.735 and 0.798 for the variant and gene analysis respectively and for the AoU cohort was 0.687 for the variant analysis. Our analysis prioritized several novel and known candidate host genetic factors involved in immune response, neuronal signaling, and calcium signaling supporting previously proposed hypotheses for anosmia/ageusia in COVID-19.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643161/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-05DOI: 10.1101/2024.12.04.24318500
Lu Zeng, Khan Atlas, Tsering Lama, Tanuja Chitnis, Howard Weiner, Gao Wang, Masashi Fujita, Frauke Zipp, Mariko Taga, Krzysztof Kiryluk, Philip L De Jager
Multiple Sclerosis (MS) is a chronic inflammatory and neurodegenerative disease affecting the brain and spinal cord. Genetic studies have identified many risk loci, that were thought to primarily impact immune cells and microglia. Here, we performed a multi-ancestry genome-wide association study with 20,831 MS and 729,220 control participants, identifying 236 susceptibility variants outside the Major Histocompatibility Complex, including four novel loci. We derived a polygenic score for MS and, optimized for European ancestry, it is informative for African-American and Latino participants. Integrating single-cell data from blood and brain tissue, we identified 76 genes affected by MS risk variants. Notably, while T cells showed the strongest enrichment, inhibitory neurons emerged as a key cell type, highlighting the importance of neuronal and glial dysfunction in MS susceptibility.
{"title":"GWAS highlights the neuronal contribution to multiple sclerosis susceptibility.","authors":"Lu Zeng, Khan Atlas, Tsering Lama, Tanuja Chitnis, Howard Weiner, Gao Wang, Masashi Fujita, Frauke Zipp, Mariko Taga, Krzysztof Kiryluk, Philip L De Jager","doi":"10.1101/2024.12.04.24318500","DOIUrl":"10.1101/2024.12.04.24318500","url":null,"abstract":"<p><p>Multiple Sclerosis (MS) is a chronic inflammatory and neurodegenerative disease affecting the brain and spinal cord. Genetic studies have identified many risk loci, that were thought to primarily impact immune cells and microglia. Here, we performed a multi-ancestry genome-wide association study with 20,831 MS and 729,220 control participants, identifying 236 susceptibility variants outside the Major Histocompatibility Complex, including four novel loci. We derived a polygenic score for MS and, optimized for European ancestry, it is informative for African-American and Latino participants. Integrating single-cell data from blood and brain tissue, we identified 76 genes affected by MS risk variants. Notably, while T cells showed the strongest enrichment, inhibitory neurons emerged as a key cell type, highlighting the importance of neuronal and glial dysfunction in MS susceptibility.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643295/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}