首页 > 最新文献

medRxiv : the preprint server for health sciences最新文献

英文 中文
Peer Education Intervention Reduced Sexually Transmitted Infections Among Male Tajik Labor Migrants Who Inject Drugs: Results of a Cluster-randomized Controlled Trial. MASLIHAT 艾滋病预防干预措施减少了莫斯科注射毒品的塔吉克男性移民的性传播感染。
Pub Date : 2024-12-13 DOI: 10.1101/2024.08.15.24312070
Mary Ellen Mackesy-Amiti, Judith A Levy, Casey M Luc, Jonbek Jonbekov

Background: In a cluster-randomized controlled trial, the "Migrants' Approached Self-Learning Intervention in HIV/AIDS for Tajiks" (MASLIHAT) reduced intervention participants' sexual risk behaviour including any condomless sex, condomless sex with female sex workers, and multiple sexual partners. This analysis investigates if observed changes in sexual risk behaviors translated into fewer reported STIs among participants over 12-month follow-up.

Methods: The MASLIHAT intervention was tested in a cluster-randomized controlled trial with sites assigned to either the MASLIHAT intervention or comparison health education training (TANSIHAT). Participants and network members (n=420) were interviewed at baseline and 3-month intervals for one year to assess HIV/STI sex and drug risk behaviour. We conducted mixed effects robust Poisson regression analyses to test for differences between conditions in self-reported STIs during 12 months of follow-up, and to test the contribution of sexual risk behaviours to STI acquisition. We then tested the mediating effects of sexual behaviours during the first six months following the intervention on STIs reported at the 9 and 12-month follow-up interviews.

Results: Participants in the MASLIHAT condition were significantly less likely to report an STI during follow-up (IRR=0.27, 95% CI 0.13-0.58). Condomless sex with a non-main (casual or commercial) partner was significantly associated with STI acquisition (IRR=2.30, 95% CI 1.26-4.21). Adjusting for condomless sex with a non-main partner, the effect of MASLIHAT intervention participation was reduced (IRR=0.36, 95% CI 0.16-0.80), signalling possible mediation. Causal mediation analysis indicated that the intervention's effect on reported STI was partially mediated by reductions among MASLIHAT participants in condomless sex with a non-main partner.

Conclusions: The MASLIHAT peer-education intervention reduced reported STIs among Tajik labour migrants partly through reduced condomless sex with casual and commercial partners.

Clinical trial registration: ClinicalTrials.gov , 2021-04-16, NCT04853394 .

目的:在莫斯科工作期间注射毒品的塔吉克男性劳工移民感染艾滋病毒和性传播疾病(STI)的风险很高,这会损害他们及其性伴侣的健康。在一项分组随机对照试验中,"塔吉克斯坦移民艾滋病自学干预方法"(MASLIHAT)减少了干预参与者的性风险行为,包括无套性行为、与女性性工作者(CS/FSW)的无套性行为以及多个性伴侣。本分析调查了干预措施对性风险行为的影响是否会在 12 个月的随访中转化为参与者性传播感染发病率的降低:MASLIHAT干预措施在一项分组随机对照试验中进行了测试,试验地点被分配给MASLIHAT干预措施或对比健康教育培训(TANSIHAT)。对参与者和网络成员(420 人)进行了为期一年的基线访谈和每隔三个月的访谈,以评估 HIV/STI性行为和毒品风险行为。在目前的分析中,我们仅关注性传播感染,并进行了混合效应稳健泊松回归分析,以检验 12 个月随访期间不同情况下自我报告的性传播感染的差异,并检验性风险行为对性传播感染的影响。结构方程模型研究了性行为可能对两种情况下观察到的性传播感染感染率差异的中介作用:结果:MASLIHAT条件下的参与者在随访期间报告性传播感染的可能性明显较低(IRR=0.27,95% CI 0.13-0.58)。在 3 种相关的性风险行为中,只有 CS/FSW 与性传播感染的发生有显著相关性(IRR=3.30,95% CI 1.57-3.93)。对 CS/FSW 进行调整后,MASLIHAT 干预参与的影响降低了(IRR=0.37,95% CI 0.17-0.84),表明可能存在中介作用。结构方程模型表明,干预对性传播感染发病率的影响是通过 MASLIHAT 参与者 CS/FSW 的减少而产生的:MASLIHAT 同伴教育干预措施通过减少 CS/FSW 降低了塔吉克移民劳工的性传播感染率。
{"title":"Peer Education Intervention Reduced Sexually Transmitted Infections Among Male Tajik Labor Migrants Who Inject Drugs: Results of a Cluster-randomized Controlled Trial.","authors":"Mary Ellen Mackesy-Amiti, Judith A Levy, Casey M Luc, Jonbek Jonbekov","doi":"10.1101/2024.08.15.24312070","DOIUrl":"10.1101/2024.08.15.24312070","url":null,"abstract":"<p><strong>Background: </strong>In a cluster-randomized controlled trial, the \"Migrants' Approached Self-Learning Intervention in HIV/AIDS for Tajiks\" (MASLIHAT) reduced intervention participants' sexual risk behaviour including any condomless sex, condomless sex with female sex workers, and multiple sexual partners. This analysis investigates if observed changes in sexual risk behaviors translated into fewer reported STIs among participants over 12-month follow-up.</p><p><strong>Methods: </strong>The MASLIHAT intervention was tested in a cluster-randomized controlled trial with sites assigned to either the MASLIHAT intervention or comparison health education training (TANSIHAT). Participants and network members (n=420) were interviewed at baseline and 3-month intervals for one year to assess HIV/STI sex and drug risk behaviour. We conducted mixed effects robust Poisson regression analyses to test for differences between conditions in self-reported STIs during 12 months of follow-up, and to test the contribution of sexual risk behaviours to STI acquisition. We then tested the mediating effects of sexual behaviours during the first six months following the intervention on STIs reported at the 9 and 12-month follow-up interviews.</p><p><strong>Results: </strong>Participants in the MASLIHAT condition were significantly less likely to report an STI during follow-up (IRR=0.27, 95% CI 0.13-0.58). Condomless sex with a non-main (casual or commercial) partner was significantly associated with STI acquisition (IRR=2.30, 95% CI 1.26-4.21). Adjusting for condomless sex with a non-main partner, the effect of MASLIHAT intervention participation was reduced (IRR=0.36, 95% CI 0.16-0.80), signalling possible mediation. Causal mediation analysis indicated that the intervention's effect on reported STI was partially mediated by reductions among MASLIHAT participants in condomless sex with a non-main partner.</p><p><strong>Conclusions: </strong>The MASLIHAT peer-education intervention reduced reported STIs among Tajik labour migrants partly through reduced condomless sex with casual and commercial partners.</p><p><strong>Clinical trial registration: </strong>ClinicalTrials.gov , 2021-04-16, NCT04853394 .</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11343258/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142057742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LD-informed deep learning for Alzheimer's gene loci detection using WGS data. 利用全基因组测序数据识别阿尔茨海默病遗传位点的关联失衡深度学习框架。
Pub Date : 2024-12-12 DOI: 10.1101/2024.09.19.24313993
Taeho Jo, Paula J Bice, Kwangsik Nho, Andrew J Saykin

The exponential growth of genomic datasets necessitates advanced analytical tools to effectively identify genetic loci from large-scale high throughput sequencing data. This study presents Deep-Block, a multi-stage deep learning framework that incorporates biological knowledge into its AI architecture to identify genetic regions as significantly associated with Alzheimer's disease (AD). The framework employs a three-stage approach: (1) genome segmentation based on linkage disequilibrium (LD) patterns, (2) selection of relevant LD blocks using sparse attention mechanisms, and (3) application of TabNet and Random Forest algorithms to quantify single nucleotide polymorphism (SNP) feature importance, thereby identifying genetic factors contributing to AD risk. The Deep-Block was applied to a large-scale whole genome sequencing (WGS) dataset from the Alzheimer's Disease Sequencing Project (ADSP), comprising 7,416 non-Hispanic white participants (3,150 cognitively normal older adults (CN), 4,266 AD). 30,218 LD blocks were identified and then ranked based on their relevance with Alzheimer's disease. Subsequently, the Deep-Block identified novel SNPs within the top 1,500 LD blocks and confirmed previously known variants, including APOE rs429358 and rs769449. Expression Quantitative Trait Loci (eQTL) analysis across 13 brain regions provided functional evidence for the identified variants. The results were cross-validated against established AD-associated loci from the European Alzheimer's and Dementia Biobank (EADB) and the GWAS catalog. The Deep-Block framework effectively processes large-scale high throughput sequencing data while preserving SNP interactions during dimensionality reduction, minimizing bias and information loss. The framework's findings are supported by tissue-specific eQTL evidence across brain regions, indicating the functional relevance of the identified variants. Additionally, the Deep-Block approach has identified both known and novel genetic variants, enhancing our understanding of the genetic architecture and demonstrating its potential for application in large-scale sequencing studies. Keywords: Alzheimer's disease, Whole-Genome Sequencing, Linkage Disequilibrium, Deep Learning, Genetic Loci, Imputation Methods.

基因组数据集的指数级增长需要先进的分析工具,以便从大规模高通量测序数据中有效识别基因位点。本研究介绍了一种多阶段深度学习框架 Deep-Block,该框架将生物知识融入其人工智能架构,以识别与阿尔茨海默病(AD)显著相关的基因区域。该框架采用了三阶段方法:(1)基于连锁不平衡(LD)模式的基因组分割;(2)使用稀疏注意机制选择相关的 LD 块;(3)应用 TabNet 和随机森林算法量化单核苷酸多态性(SNP)特征的重要性,从而确定导致 AD 风险的遗传因素。Deep-Block应用于阿尔茨海默病测序项目(ADSP)的大规模全基因组测序(WGS)数据集,其中包括7416名非西班牙裔白人参与者(3150名认知正常的老年人(CN),4266名注意力缺失症患者)。首先,确定了 30,218 个 LD 块,然后根据它们与阿尔茨海默病的相关性进行排序。随后,Deep-Block 在前 1,500 个 LD 块中鉴定出了新的 SNPs,并确认了之前已知的变异,包括 APOE rs429358 和 rs769449。研究结果与欧洲阿尔茨海默氏症和痴呆症生物库(EADB)和GWAS目录中已确定的AD相关位点进行了交叉验证。Deep-Block框架能有效处理大规模高通量测序数据,同时在进行降维时保留SNP之间的相互作用,因为降维有可能带来偏差或导致信息丢失。Deep-Block 方法识别了已知和新的遗传变异,增强了我们对遗传结构的理解,并证明了该框架在大规模测序研究中的应用潜力。
{"title":"LD-informed deep learning for Alzheimer's gene loci detection using WGS data.","authors":"Taeho Jo, Paula J Bice, Kwangsik Nho, Andrew J Saykin","doi":"10.1101/2024.09.19.24313993","DOIUrl":"10.1101/2024.09.19.24313993","url":null,"abstract":"<p><p>The exponential growth of genomic datasets necessitates advanced analytical tools to effectively identify genetic loci from large-scale high throughput sequencing data. This study presents Deep-Block, a multi-stage deep learning framework that incorporates biological knowledge into its AI architecture to identify genetic regions as significantly associated with Alzheimer's disease (AD). The framework employs a three-stage approach: (1) genome segmentation based on linkage disequilibrium (LD) patterns, (2) selection of relevant LD blocks using sparse attention mechanisms, and (3) application of TabNet and Random Forest algorithms to quantify single nucleotide polymorphism (SNP) feature importance, thereby identifying genetic factors contributing to AD risk. The Deep-Block was applied to a large-scale whole genome sequencing (WGS) dataset from the Alzheimer's Disease Sequencing Project (ADSP), comprising 7,416 non-Hispanic white participants (3,150 cognitively normal older adults (CN), 4,266 AD). 30,218 LD blocks were identified and then ranked based on their relevance with Alzheimer's disease. Subsequently, the Deep-Block identified novel SNPs within the top 1,500 LD blocks and confirmed previously known variants, including APOE rs429358 and rs769449. Expression Quantitative Trait Loci (eQTL) analysis across 13 brain regions provided functional evidence for the identified variants. The results were cross-validated against established AD-associated loci from the European Alzheimer's and Dementia Biobank (EADB) and the GWAS catalog. The Deep-Block framework effectively processes large-scale high throughput sequencing data while preserving SNP interactions during dimensionality reduction, minimizing bias and information loss. The framework's findings are supported by tissue-specific eQTL evidence across brain regions, indicating the functional relevance of the identified variants. Additionally, the Deep-Block approach has identified both known and novel genetic variants, enhancing our understanding of the genetic architecture and demonstrating its potential for application in large-scale sequencing studies. Keywords: Alzheimer's disease, Whole-Genome Sequencing, Linkage Disequilibrium, Deep Learning, Genetic Loci, Imputation Methods.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11451815/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142383161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unified Clinical Vocabulary Embeddings for Advancing Precision Medicine. 统一临床词汇嵌入,提高精准度。
Pub Date : 2024-12-10 DOI: 10.1101/2024.12.03.24318322
Ruth Johnson, Uri Gottlieb, Galit Shaham, Lihi Eisen, Jacob Waxman, Stav Devons-Sberro, Curtis R Ginder, Peter Hong, Raheel Sayeed, Ben Y Reis, Ran D Balicer, Noa Dagan, Marinka Zitnik

Integrating clinical knowledge into AI remains challenging despite numerous medical guidelines and vocabularies. Medical codes, central to healthcare systems, often reflect operational patterns shaped by geographic factors, national policies, insurance frameworks, and physician practices rather than the precise representation of clinical knowledge. This disconnect hampers AI in representing clinical relationships, raising concerns about bias, transparency, and generalizability. Here, we developed a resource of 67,124 clinical vocabulary embeddings derived from a clinical knowledge graph tailored to electronic health record vocabularies, spanning over 1.3 million edges. Using graph transformer neural networks, we generated clinical vocabulary embeddings that provide a new representation of clinical knowledge by unifying seven medical vocabularies. These embeddings were validated through a phenotype risk score analysis involving 4.57 million patients from Clalit Healthcare Services, effectively stratifying individuals based on survival outcomes. Inter-institutional panels of clinicians evaluated the embeddings for alignment with clinical knowledge across 90 diseases and 3,000 clinical codes, confirming their robustness and transferability. This resource addresses gaps in integrating clinical vocabularies into AI models and training datasets, paving the way for knowledge-grounded population and patient-level models.

尽管有众多医疗指南和词汇表,但将临床知识融入人工智能仍是一项挑战。医疗代码是医疗保健系统的核心,通常反映的是由地理因素、国家政策、保险框架和医生实践所形成的操作模式,而不是临床知识的精确表述。这种脱节阻碍了人工智能对临床关系的表述,引发了对偏差、透明度和可推广性的担忧。在这里,我们开发了一个由 67124 个临床词汇嵌入组成的资源,这些词汇嵌入来自一个为电子健康记录词汇定制的临床知识图谱,跨越 130 多万条边。利用图转换器神经网络,我们生成了临床词汇嵌入,通过统一七个医学词汇,为临床知识提供了一种新的表示方法。通过对来自 Clalit 医疗保健服务公司的 457 万名患者进行表型风险评分分析,对这些嵌入进行了验证,从而有效地根据生存结果对个人进行分层。由临床医生组成的机构间小组评估了嵌入与 90 种疾病和 3,000 个临床代码的临床知识的一致性,确认了其稳健性和可移植性。这一资源填补了将临床词汇表整合到人工智能模型和训练数据集中的空白,为建立以知识为基础的人群和患者级模型铺平了道路。
{"title":"Unified Clinical Vocabulary Embeddings for Advancing Precision Medicine.","authors":"Ruth Johnson, Uri Gottlieb, Galit Shaham, Lihi Eisen, Jacob Waxman, Stav Devons-Sberro, Curtis R Ginder, Peter Hong, Raheel Sayeed, Ben Y Reis, Ran D Balicer, Noa Dagan, Marinka Zitnik","doi":"10.1101/2024.12.03.24318322","DOIUrl":"10.1101/2024.12.03.24318322","url":null,"abstract":"<p><p>Integrating clinical knowledge into AI remains challenging despite numerous medical guidelines and vocabularies. Medical codes, central to healthcare systems, often reflect operational patterns shaped by geographic factors, national policies, insurance frameworks, and physician practices rather than the precise representation of clinical knowledge. This disconnect hampers AI in representing clinical relationships, raising concerns about bias, transparency, and generalizability. Here, we developed a resource of 67,124 clinical vocabulary embeddings derived from a clinical knowledge graph tailored to electronic health record vocabularies, spanning over 1.3 million edges. Using graph transformer neural networks, we generated clinical vocabulary embeddings that provide a new representation of clinical knowledge by unifying seven medical vocabularies. These embeddings were validated through a phenotype risk score analysis involving 4.57 million patients from Clalit Healthcare Services, effectively stratifying individuals based on survival outcomes. Inter-institutional panels of clinicians evaluated the embeddings for alignment with clinical knowledge across 90 diseases and 3,000 clinical codes, confirming their robustness and transferability. This resource addresses gaps in integrating clinical vocabularies into AI models and training datasets, paving the way for knowledge-grounded population and patient-level models.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643188/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncovering causal gene-tissue pairs and variants: A multivariable TWAS method controlling for infinitesimal effects. 揭示因果基因-组织对和变异:控制无穷小效应的多变量 TWAS 方法。
Pub Date : 2024-12-10 DOI: 10.1101/2024.11.13.24317250
Yihe Yang, Noah Lorincz-Comi, Xiaofeng Zhu

Transcriptome-wide association studies (TWAS) are commonly used to prioritize causal genes underlying associations found in genome-wide association studies (GWAS) and have been extended to identify causal genes through multivariable TWAS methods. However, recent studies have shown that widespread infinitesimal effects due to polygenicity can impair the performance of these methods. In this report, we introduce a multivariable TWAS method named Tissue-Gene pairs, direct causal Variants, and Infinitesimal effects selector (TGVIS) to identify tissue-specific causal genes and direct causal variants while accounting for infinitesimal effects. In simulations, TGVIS maintains an accurate prioritization of causal gene-tissue pairs and variants and demonstrates comparable or superior power to existing approaches, regardless of the presence of infinitesimal effects. In the real data analysis of GWAS summary data of 45 cardiometabolic traits and expression/splicing quantitative trait loci (eQTL/sQTL) from 31 tissues, TGVIS is able to improve causal gene prioritization and identifies novel genes that were missed by conventional TWAS.

全转录组关联研究(TWAS)通常用于优先确定全基因组关联研究(GWAS)中发现的关联的因果基因,并通过多变量 TWAS 方法扩展到确定因果基因。然而,最近的研究表明,多基因性导致的广泛的无限小效应会损害这些方法的性能。在本报告中,我们介绍了一种名为 "组织-基因对、直接因果变异和无穷小效应选择器(TGVIS)"的多变量 TWAS 方法,用于识别组织特异性因果基因和直接因果变异,同时考虑无穷小效应。在模拟实验中,TGVIS 保持了因果基因-组织对和变异的准确优先级,并显示出与现有方法相当或更高的能力,而不管是否存在无限小效应。在对来自 31 个组织的 45 个心脏代谢性状和表达/拼接定量性状位点(eQTL/sQTL)的 GWAS 摘要数据进行真实数据分析时,TGVIS 能够改进因果基因的优先排序,并识别出传统 TWAS 遗漏的新基因。
{"title":"Uncovering causal gene-tissue pairs and variants: A multivariable TWAS method controlling for infinitesimal effects.","authors":"Yihe Yang, Noah Lorincz-Comi, Xiaofeng Zhu","doi":"10.1101/2024.11.13.24317250","DOIUrl":"10.1101/2024.11.13.24317250","url":null,"abstract":"<p><p>Transcriptome-wide association studies (TWAS) are commonly used to prioritize causal genes underlying associations found in genome-wide association studies (GWAS) and have been extended to identify causal genes through multivariable TWAS methods. However, recent studies have shown that widespread infinitesimal effects due to polygenicity can impair the performance of these methods. In this report, we introduce a multivariable TWAS method named Tissue-Gene pairs, direct causal Variants, and Infinitesimal effects selector (TGVIS) to identify tissue-specific causal genes and direct causal variants while accounting for infinitesimal effects. In simulations, TGVIS maintains an accurate prioritization of causal gene-tissue pairs and variants and demonstrates comparable or superior power to existing approaches, regardless of the presence of infinitesimal effects. In the real data analysis of GWAS summary data of 45 cardiometabolic traits and expression/splicing quantitative trait loci (eQTL/sQTL) from 31 tissues, TGVIS is able to improve causal gene prioritization and identifies novel genes that were missed by conventional TWAS.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11601775/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142741941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep phenotyping obesity using EHR data: Promise, Challenges, and Future Directions. 利用电子病历数据进行肥胖症深度表型分析:前景、挑战和未来方向。
Pub Date : 2024-12-08 DOI: 10.1101/2024.12.06.24318608
Xiaoyang Ruan, Shuyu Lu, Liwei Wang, Andrew Wen, Murali Sameer, Hongfang Liu
<p><p>Obesity affects approximately 34% of adults and 15-20% of children and adolescents in the U.S, and poses significant economic and psychosocial burdens. Due to the multifaceted nature of obesity, currently patient responses to any single anti-obesity medication (AOM) vary significantly, highlighting the need for developing approaches to obesity deep phenotyping and associated precision medicine. While recent advancement in classical phenotyping-guided pharmacotherapies have shown clinical value, they are less embraced by healthcare providers within the precision medicine framework, primarily due to their operational complexity and lack of granularity. From this perspective, several recent review articles highlighted the importance of obesity deep phenotyping for personalized precision medicine. In view of the established role of electronic health record (EHR) as an important data source for clinical phenotypings, we offer an in-depth analysis of the commonly available data elements from obesity patients prior to pharmacotherapy. We also experimented with a multi-modal longitudinal deep autoencoder to explore the feasibility, data requirements, clustering patterns, and challenges associated with EHR-based obesity deep phenotyping. Our analysis indicates at least nine clusters, among which five have distinct explainable clinical relevance. Further research within larger independent cohorts to validate the reproducibility, uncover more detailed substructures and corresponding treatment response is warranted.</p><p><strong>Background: </strong>Obesity affects approximately 40% of adults and 15-20% of children and adolescents in the U.S, and poses significant economic and psychosocial burdens. Currently, patient responses to any single anti-obesity medication (AOM) vary significantly, making obesity deep phenotyping and associated precision medicine important targets of investigation.</p><p><strong>Objective: </strong>To evaluate the potential of EHR as a primary data source for obesity deep phenotyping, we conduct an in-depth analysis of the data elements and quality available from obesity patients prior to pharmacotherapy, and apply a multi-modal longitudinal deep autoencoder to investigate the feasibility, data requirements, clustering patterns, and challenges associated with EHR-based obesity deep phenotyping.</p><p><strong>Methods: </strong>We analyzed 53,688 pre-AOM periods from 32,969 patients with obesity or overweight who underwent medium- to long-term AOM treatment. A total of 92 lab and vital measurements, along with 79 ICD-derived clinical classifications software (CCS) codes recorded within one year prior to AOM treatment, were used to train a gated recurrent unit with decay based longitudinal autoencoder (GRU-D-AE) to generate dense embeddings for each pre-AOM record. principal component analysis (PCA) and gaussian mixture modeling (GMM) were applied to identify clusters.</p><p><strong>Results: </strong>Our analysis identified at le
{"title":"Deep phenotyping obesity using EHR data: Promise, Challenges, and Future Directions.","authors":"Xiaoyang Ruan, Shuyu Lu, Liwei Wang, Andrew Wen, Murali Sameer, Hongfang Liu","doi":"10.1101/2024.12.06.24318608","DOIUrl":"10.1101/2024.12.06.24318608","url":null,"abstract":"&lt;p&gt;&lt;p&gt;Obesity affects approximately 34% of adults and 15-20% of children and adolescents in the U.S, and poses significant economic and psychosocial burdens. Due to the multifaceted nature of obesity, currently patient responses to any single anti-obesity medication (AOM) vary significantly, highlighting the need for developing approaches to obesity deep phenotyping and associated precision medicine. While recent advancement in classical phenotyping-guided pharmacotherapies have shown clinical value, they are less embraced by healthcare providers within the precision medicine framework, primarily due to their operational complexity and lack of granularity. From this perspective, several recent review articles highlighted the importance of obesity deep phenotyping for personalized precision medicine. In view of the established role of electronic health record (EHR) as an important data source for clinical phenotypings, we offer an in-depth analysis of the commonly available data elements from obesity patients prior to pharmacotherapy. We also experimented with a multi-modal longitudinal deep autoencoder to explore the feasibility, data requirements, clustering patterns, and challenges associated with EHR-based obesity deep phenotyping. Our analysis indicates at least nine clusters, among which five have distinct explainable clinical relevance. Further research within larger independent cohorts to validate the reproducibility, uncover more detailed substructures and corresponding treatment response is warranted.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Obesity affects approximately 40% of adults and 15-20% of children and adolescents in the U.S, and poses significant economic and psychosocial burdens. Currently, patient responses to any single anti-obesity medication (AOM) vary significantly, making obesity deep phenotyping and associated precision medicine important targets of investigation.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;To evaluate the potential of EHR as a primary data source for obesity deep phenotyping, we conduct an in-depth analysis of the data elements and quality available from obesity patients prior to pharmacotherapy, and apply a multi-modal longitudinal deep autoencoder to investigate the feasibility, data requirements, clustering patterns, and challenges associated with EHR-based obesity deep phenotyping.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We analyzed 53,688 pre-AOM periods from 32,969 patients with obesity or overweight who underwent medium- to long-term AOM treatment. A total of 92 lab and vital measurements, along with 79 ICD-derived clinical classifications software (CCS) codes recorded within one year prior to AOM treatment, were used to train a gated recurrent unit with decay based longitudinal autoencoder (GRU-D-AE) to generate dense embeddings for each pre-AOM record. principal component analysis (PCA) and gaussian mixture modeling (GMM) were applied to identify clusters.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Our analysis identified at le","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643233/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling early-onset cancer kinetics to study changes in underlying risk, detection, and impact of population screening. 建立早发性癌症动力学模型,研究潜在风险、检测和人群筛查影响的变化。
Pub Date : 2024-12-08 DOI: 10.1101/2024.12.05.24318584
Navid Mohammad Mirzaei, Chin Hur, Mary Beth Terry, Piero Dalerba, Wan Yang

Recent studies have reported increases in early-onset cancer cases (diagnosed under age 50) and call into question whether the increase is related to earlier diagnosis from other medical tests and reflected by decreasing tumor-size-at-diagnosis (apparent effects) or actual increases in underlying cancer risk (true effects), or both. The classic Multi-Stage Clonal Expansion (MSCE) model assumes cancer detection at the emergence of the first malignant cell, although later modifications have included lag-times or stochasticity in detection to more realistically represent tumor detection requiring a certain size threshold. Here, we introduce an approach to explicitly incorporate tumor-size-at-diagnosis in the MSCE framework and account for improvements in cancer detection over time to distinguish between apparent and true increases in early-onset cancer incidence. We demonstrate that our model is structurally identifiable and provides better parameter estimation than the classic model. Applying this model to colorectal, female breast, and thyroid cancers, we examine changes in cancer risk while accounting for detection improvements over time in three representative birth cohorts (1950-1954, 1965-1969, and 1980-1984). Our analyses suggest accelerated carcinogenic events and shorter mean sojourn times in more recent cohorts. We further use this model to examine the screening impact on the incidence of breast and colorectal cancers, both having established screening protocols. Our results align with well-documented differences in screening effects between these two cancers. These findings underscore the importance of accounting for tumor-size-at-diagnosis in cancer modeling and support true increases in early-onset cancer risk in recent years for breast, colorectal, and thyroid cancer.

Significance: This study models recent increases in early-onset cancers, accounting for both true factors contributing to cancer risk and those caused by improved detection. We show that while advancement in detection has led to earlier detection, our model estimates shorter sojourn times and more aggressive carcinogenic events for recent cohorts, suggesting faster tumor progression. Further, a counterfactual analysis using this model reveals the known statistically significant reduction in colorectal cancer incidence (supporting a robust modeling approach), likely due to screening and timely removal of precancerous polyps. Overall, we introduce an enhanced model to detect subtle trends in cancer risk and demonstrate its ability to provide valuable insights into cancer progression and highlight areas for future refinement and application.

最近有研究报告称,早发癌症病例(诊断年龄在 50 岁以下)有所增加,并质疑这种增加是否与其他医学检查的早期诊断有关,以及是否反映了诊断时肿瘤大小的减少(表面效应)或潜在癌症风险的实际增加(真实效应),或两者兼而有之。经典的多阶段克隆扩增(MSCE)模型假定在第一个恶性细胞出现时就能检测到癌症,尽管后来的修改包含了检测中的滞后时间或随机性,以更真实地反映需要一定大小阈值的肿瘤检测。在此,我们介绍一种方法,在 MSCE 框架中明确纳入诊断时的肿瘤大小,并考虑到癌症检测随时间推移的改进,以区分早发癌症发病率的表面增长和实际增长。我们证明了我们的模型在结构上是可识别的,并提供了比经典模型更好的参数估计。我们将该模型应用于结直肠癌、女性乳腺癌和甲状腺癌,研究了癌症风险的变化,同时考虑了三个具有代表性的出生队列(1950-1954 年、1965-1969 年和 1980-1984 年)中随着时间推移检测能力的提高。我们的分析表明,在较新的队列中,致癌事件发生的速度加快,平均停留时间缩短。我们进一步利用这一模型来研究筛查对乳腺癌和结直肠癌发病率的影响,这两种癌症都有既定的筛查方案。我们的研究结果与这两种癌症筛查效果的差异一致。这些发现强调了在癌症建模中考虑诊断时肿瘤大小的重要性,并支持近年来乳腺癌、结直肠癌和甲状腺癌的早发癌症风险确实有所增加:本研究对近年来早发癌症的增加情况进行了建模,既考虑了导致癌症风险的真实因素,也考虑了因检测水平提高而导致的因素。我们的研究表明,虽然检测技术的进步导致了更早的检测,但我们的模型估计最近的队列中存在更短的存活时间和更具侵袭性的致癌事件,这表明肿瘤进展更快。此外,使用该模型进行的反事实分析表明,已知的结直肠癌发病率在统计学上显著下降(支持稳健的建模方法),这可能是由于筛查和及时切除癌前息肉所致。总之,我们引入了一个增强型模型来检测癌症风险的微妙趋势,并证明了该模型有能力为癌症进展提供有价值的见解,同时也强调了未来需要改进和应用的领域。
{"title":"Modeling early-onset cancer kinetics to study changes in underlying risk, detection, and impact of population screening.","authors":"Navid Mohammad Mirzaei, Chin Hur, Mary Beth Terry, Piero Dalerba, Wan Yang","doi":"10.1101/2024.12.05.24318584","DOIUrl":"10.1101/2024.12.05.24318584","url":null,"abstract":"<p><p>Recent studies have reported increases in early-onset cancer cases (diagnosed under age 50) and call into question whether the increase is related to earlier diagnosis from other medical tests and reflected by decreasing tumor-size-at-diagnosis (apparent effects) or actual increases in underlying cancer risk (true effects), or both. The classic Multi-Stage Clonal Expansion (MSCE) model assumes cancer detection at the emergence of the first malignant cell, although later modifications have included lag-times or stochasticity in detection to more realistically represent tumor detection requiring a certain size threshold. Here, we introduce an approach to explicitly incorporate tumor-size-at-diagnosis in the MSCE framework and account for improvements in cancer detection over time to distinguish between apparent and true increases in early-onset cancer incidence. We demonstrate that our model is structurally identifiable and provides better parameter estimation than the classic model. Applying this model to colorectal, female breast, and thyroid cancers, we examine changes in cancer risk while accounting for detection improvements over time in three representative birth cohorts (1950-1954, 1965-1969, and 1980-1984). Our analyses suggest accelerated carcinogenic events and shorter mean sojourn times in more recent cohorts. We further use this model to examine the screening impact on the incidence of breast and colorectal cancers, both having established screening protocols. Our results align with well-documented differences in screening effects between these two cancers. These findings underscore the importance of accounting for tumor-size-at-diagnosis in cancer modeling and support true increases in early-onset cancer risk in recent years for breast, colorectal, and thyroid cancer.</p><p><strong>Significance: </strong>This study models recent increases in early-onset cancers, accounting for both true factors contributing to cancer risk and those caused by improved detection. We show that while advancement in detection has led to earlier detection, our model estimates shorter sojourn times and more aggressive carcinogenic events for recent cohorts, suggesting faster tumor progression. Further, a counterfactual analysis using this model reveals the known statistically significant reduction in colorectal cancer incidence (supporting a robust modeling approach), likely due to screening and timely removal of precancerous polyps. Overall, we introduce an enhanced model to detect subtle trends in cancer risk and demonstrate its ability to provide valuable insights into cancer progression and highlight areas for future refinement and application.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643252/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intraoperative superb microvascular ultrasound imaging in glioma: novel quantitative analysis correlates with tumour grade. 胶质瘤术中超级微血管超声成像:与肿瘤分级相关的新型定量分析。
Pub Date : 2024-12-08 DOI: 10.1101/2024.12.07.24318636
Luke Dixon, Alistair Weld, Dolin Bhagawati, Neekhil Patel, Stamatia Giannarou, Matthew Grech-Sollars, Adrian Lim, Sophie Camp

Accurate grading of gliomas is critical to guide therapy and predict prognosis. The presence of microvascular proliferation is a hallmark feature of high grade gliomas which traditionally requires targeted surgical biopsy of representative tissue. Superb microvascular imaging (SMI) is a novel high resolution Doppler ultrasound technique which can uniquely define the microvascular architecture of whole tumours. We examined both qualitative and quantitative vascular features of gliomas captured with SMI, analysing flow signal density, vessel number, branching points, curvature, vessel angle deviation, fractal dimension, and entropy. Results indicate that high-grade gliomas exhibit significantly greater vascular complexity and disorganisation, with increased fractal dimension and entropy, correlating with known histopathological markers of aggressive angiogenesis. The integrated ROC model achieved high accuracy (AUC = 0.95), highlighting SMI's potential as a non-invasive diagnostic and prognostic tool. While further validation with larger datasets is required, this study opens avenues for SMI in glioma management, supporting intraoperative decision-making and informing future prognosis.

胶质瘤的准确分级对于指导治疗和预测预后至关重要。微血管增生是高级别胶质瘤的标志性特征,传统上需要对代表性组织进行有针对性的手术活检。超微血管成像(SMI)是一种新型的高分辨率多普勒超声技术,能独特地确定整个肿瘤的微血管结构。我们研究了用 SMI 捕获的胶质瘤的定性和定量血管特征,分析了血流信号密度、血管数量、分支点、曲率、血管角度偏差、分形维度和熵。结果表明,高级别胶质瘤的血管复杂性和无序性明显增加,分形维度和熵增加,与侵袭性血管生成的已知组织病理学标志物相关。综合 ROC 模型达到了很高的准确度(AUC = 0.95),凸显了 SMI 作为无创诊断和预后工具的潜力。虽然还需要用更大的数据集进行进一步验证,但这项研究为神经胶质瘤管理中的SMI开辟了道路,为术中决策提供了支持,并为未来预后提供了信息。
{"title":"Intraoperative superb microvascular ultrasound imaging in glioma: novel quantitative analysis correlates with tumour grade.","authors":"Luke Dixon, Alistair Weld, Dolin Bhagawati, Neekhil Patel, Stamatia Giannarou, Matthew Grech-Sollars, Adrian Lim, Sophie Camp","doi":"10.1101/2024.12.07.24318636","DOIUrl":"10.1101/2024.12.07.24318636","url":null,"abstract":"<p><p>Accurate grading of gliomas is critical to guide therapy and predict prognosis. The presence of microvascular proliferation is a hallmark feature of high grade gliomas which traditionally requires targeted surgical biopsy of representative tissue. Superb microvascular imaging (SMI) is a novel high resolution Doppler ultrasound technique which can uniquely define the microvascular architecture of whole tumours. We examined both qualitative and quantitative vascular features of gliomas captured with SMI, analysing flow signal density, vessel number, branching points, curvature, vessel angle deviation, fractal dimension, and entropy. Results indicate that high-grade gliomas exhibit significantly greater vascular complexity and disorganisation, with increased fractal dimension and entropy, correlating with known histopathological markers of aggressive angiogenesis. The integrated ROC model achieved high accuracy (AUC = 0.95), highlighting SMI's potential as a non-invasive diagnostic and prognostic tool. While further validation with larger datasets is required, this study opens avenues for SMI in glioma management, supporting intraoperative decision-making and informing future prognosis.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643247/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Epigenome-wide Association Analysis of Mitochondrial Heteroplasmy Provides Insight into Molecular Mechanisms of Disease. 线粒体异形的全表观基因组关联分析为疾病的分子机制提供了洞察力。
Pub Date : 2024-12-08 DOI: 10.1101/2024.12.05.24318557
Meng Lai, Kyeezu Kim, Yinan Zheng, Christina A Castellani, Scott M Ratliff, Mengyao Wang, Xue Liu, Jeffrey Haessler, Tianxiao Huan, Lawrence F Bielak, Wei Zhao, Roby Joehanes, Jiantao Ma, Xiuqing Guo, JoAnn E Manson, Megan L Grove, Jan Bressler, Kent D Taylor, Tuuli Lappalainen, Silva Kasela, Thomas W Blackwell, Nicole J Lake, Jessica D Faul, Kendra R Ferrier, Lifang Hou, Charles Kooperberg, Alexander P Reiner, Kai Zhang, Patricia A Peyser, Myriam Fornage, Eric Boerwinkle, Laura M Raffield, April P Carson, Stephen S Rich, Yongmei Liu, Daniel Levy, Jerome I Rotter, Jennifer A Smith, Dan E Arking, Chunyu Liu

The relationship between mitochondrial DNA (mtDNA) heteroplasmy and nuclear DNA (nDNA) methylation (CpGs) remains to be studied. We conducted an epigenome-wide association analysis of heteroplasmy burden scores across 10,986 participants (mean age 77, 63% women, and 54% non-White races/ethnicities) from seven population-based observational cohorts. We identified 412 CpGs (FDR p < 0.05) associated with mtDNA heteroplasmy. Higher levels of heteroplasmy burden were associated with lower nDNA methylation levels at most significant CpGs. Functional inference analyses of genes annotated to heteroplasmy-associated CpGs emphasized mitochondrial functions and showed enrichment in cardiometabolic conditions and traits. We developed CpG-scores based on heteroplasmy-count associated CpGs (MHC-CpG scores) using elastic net Cox regression in a training cohort. A one-unit higher level of the standardized MHC-CpG scores were associated with 1.26-fold higher hazard of all-cause mortality (95% CI: 1.14, 1.39) and 1.09-fold higher hazard of CVD (95% CI: 1.01-1.17) in the meta-analysis of testing cohorts, adjusting for age, sex, and smoking. These findings shed light on the relationship between mtDNA heteroplasmy and DNA methylation, and the role of heteroplasmy-associated CpGs as biomarkers in predicting all-cause mortality and cardiovascular disease.

线粒体 DNA(mtDNA)异质性与核 DNA(nDNA)甲基化(CpGs)之间的关系仍有待研究。我们对来自七个人群观察队列的 10,986 名参与者(平均年龄 77 岁,63% 为女性,54% 为非白人种族/族裔)进行了表观基因组范围的异质性负担评分关联分析。我们发现了 412 个与 mtDNA 异质性相关的 CpGs(FDR p < 0.05)。在大多数重要的 CpGs 上,较高水平的异源蛋白负担与较低的 nDNA 甲基化水平相关。对注释到与异源基因相关的 CpGs 的基因进行的功能推断分析强调了线粒体功能,并显示了在心脏代谢状况和性状中的富集。我们利用训练队列中的弹性网 Cox 回归,根据与异质体相关的 CpG(MHC-CpG 分数)制定了 CpG 分数。在对测试队列进行的荟萃分析中,标准化 MHC-CpG 分数每增加一个单位,全因死亡率的危险性就会增加 1.26 倍(95% CI:1.14, 1.39),心血管疾病的危险性增加 1.09 倍(95% CI:1.01-1.17),并对年龄、性别和吸烟进行了调整。这些发现揭示了mtDNA异质性与DNA甲基化之间的关系,以及异质性相关CpGs作为生物标志物在预测全因死亡率和心血管疾病中的作用。
{"title":"Epigenome-wide Association Analysis of Mitochondrial Heteroplasmy Provides Insight into Molecular Mechanisms of Disease.","authors":"Meng Lai, Kyeezu Kim, Yinan Zheng, Christina A Castellani, Scott M Ratliff, Mengyao Wang, Xue Liu, Jeffrey Haessler, Tianxiao Huan, Lawrence F Bielak, Wei Zhao, Roby Joehanes, Jiantao Ma, Xiuqing Guo, JoAnn E Manson, Megan L Grove, Jan Bressler, Kent D Taylor, Tuuli Lappalainen, Silva Kasela, Thomas W Blackwell, Nicole J Lake, Jessica D Faul, Kendra R Ferrier, Lifang Hou, Charles Kooperberg, Alexander P Reiner, Kai Zhang, Patricia A Peyser, Myriam Fornage, Eric Boerwinkle, Laura M Raffield, April P Carson, Stephen S Rich, Yongmei Liu, Daniel Levy, Jerome I Rotter, Jennifer A Smith, Dan E Arking, Chunyu Liu","doi":"10.1101/2024.12.05.24318557","DOIUrl":"10.1101/2024.12.05.24318557","url":null,"abstract":"<p><p>The relationship between mitochondrial DNA (mtDNA) heteroplasmy and nuclear DNA (nDNA) methylation (CpGs) remains to be studied. We conducted an epigenome-wide association analysis of heteroplasmy burden scores across 10,986 participants (mean age 77, 63% women, and 54% non-White races/ethnicities) from seven population-based observational cohorts. We identified 412 CpGs (FDR p < 0.05) associated with mtDNA heteroplasmy. Higher levels of heteroplasmy burden were associated with lower nDNA methylation levels at most significant CpGs. Functional inference analyses of genes annotated to heteroplasmy-associated CpGs emphasized mitochondrial functions and showed enrichment in cardiometabolic conditions and traits. We developed CpG-scores based on heteroplasmy-count associated CpGs (MHC-CpG scores) using elastic net Cox regression in a training cohort. A one-unit higher level of the standardized MHC-CpG scores were associated with 1.26-fold higher hazard of all-cause mortality (95% CI: 1.14, 1.39) and 1.09-fold higher hazard of CVD (95% CI: 1.01-1.17) in the meta-analysis of testing cohorts, adjusting for age, sex, and smoking. These findings shed light on the relationship between mtDNA heteroplasmy and DNA methylation, and the role of heteroplasmy-associated CpGs as biomarkers in predicting all-cause mortality and cardiovascular disease.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643249/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging Stacked Classifiers for Multi-task Executive Function in Schizophrenia Yields Diagnostic and Prognostic Insights. 利用堆叠分类器对精神分裂症患者的多任务执行功能进行诊断和预后分析。
Pub Date : 2024-12-08 DOI: 10.1101/2024.12.05.24318587
Tongyi Zhang, Xin Zhao, B T Thomas Yeo, Xiaoning Huo, Simon B Eickhoff, Ji Chen

Cognitive impairment is a central characteristic of schizophrenia. Executive functioning (EF) impairments are often seen in mental disorders, particularly schizophrenia, where they relate to adverse outcomes. As a heterogeneous construct, how specifically each dimension of EF to characterize the diagnostic and prognostic aspects of schizophrenia remains opaque. We used classification models with a stacking approach on systematically measured EFs to discriminate 195 patients with schizophrenia from healthy individuals. Baseline EF measurements were moreover employed to predict symptomatically remitted or non-remitted prognostic subgroups. EF feature importance was determined at the group-level and the ensuing individual importance scores were associated with four symptom dimensions. EF assessments of inhibitory control (interference and response inhibitions), followed by working memory, evidently predicted schizophrenia diagnosis (area under the curve [AUC]=0.87) and remission status (AUC=0.81). The models highlighted the importance of interference inhibition or working memory updating in accurately identifying individuals with schizophrenia or those in remission. These identified patients had high-level negative symptoms at baseline and those who remitted showed milder cognitive symptoms at follow-up, without differences in baseline EF or symptom severity compared to non-remitted patients. Our work indicates that impairments in specific EF dimensions in schizophrenia are differentially linked to individual symptom-load and prognostic outcomes. Thus, assessments and models based on EF may be a promising tool that can aid in the clinical evaluation of this disorder.

认知障碍是精神分裂症的一个主要特征。执行功能(EF)障碍经常见于精神障碍,尤其是精神分裂症,它们与不良后果有关。作为一种异质性结构,执行功能的每个维度如何具体描述精神分裂症的诊断和预后方面仍不清楚。我们在系统测量的 EFs 基础上,采用堆叠法建立分类模型,将 195 名精神分裂症患者与健康人区分开来。此外,我们还利用基线心率测量结果来预测症状缓解或未缓解的预后亚组。EF特征的重要性是在群体水平上确定的,随后的个体重要性得分与四个症状维度相关联。对抑制控制(干扰抑制和反应抑制)和工作记忆的 EF 评估明显可以预测精神分裂症的诊断(曲线下面积 [AUC]= 0.87)和缓解状态(AUC=0.81)。这些模型强调了干扰抑制或工作记忆更新在准确识别精神分裂症患者或缓解期患者方面的重要性。这些被识别出的患者在基线时有较高程度的阴性症状,而缓解期患者在随访时表现出较轻的认知症状,但与未缓解期患者相比,基线EF或症状严重程度并无差异。我们的研究表明,精神分裂症患者特定EF维度的损伤与个体症状负荷和预后结果有着不同的联系。因此,基于EF的评估和模型可能是一种很有前途的工具,有助于对这种疾病进行临床评估。
{"title":"Leveraging Stacked Classifiers for Multi-task Executive Function in Schizophrenia Yields Diagnostic and Prognostic Insights.","authors":"Tongyi Zhang, Xin Zhao, B T Thomas Yeo, Xiaoning Huo, Simon B Eickhoff, Ji Chen","doi":"10.1101/2024.12.05.24318587","DOIUrl":"10.1101/2024.12.05.24318587","url":null,"abstract":"<p><p>Cognitive impairment is a central characteristic of schizophrenia. Executive functioning (EF) impairments are often seen in mental disorders, particularly schizophrenia, where they relate to adverse outcomes. As a heterogeneous construct, how specifically each dimension of EF to characterize the diagnostic and prognostic aspects of schizophrenia remains opaque. We used classification models with a stacking approach on systematically measured EFs to discriminate 195 patients with schizophrenia from healthy individuals. Baseline EF measurements were moreover employed to predict symptomatically remitted or non-remitted prognostic subgroups. EF feature importance was determined at the group-level and the ensuing individual importance scores were associated with four symptom dimensions. EF assessments of inhibitory control (interference and response inhibitions), followed by working memory, evidently predicted schizophrenia diagnosis (area under the curve [AUC]=0.87) and remission status (AUC=0.81). The models highlighted the importance of interference inhibition or working memory updating in accurately identifying individuals with schizophrenia or those in remission. These identified patients had high-level negative symptoms at baseline and those who remitted showed milder cognitive symptoms at follow-up, without differences in baseline EF or symptom severity compared to non-remitted patients. Our work indicates that impairments in specific EF dimensions in schizophrenia are differentially linked to individual symptom-load and prognostic outcomes. Thus, assessments and models based on EF may be a promising tool that can aid in the clinical evaluation of this disorder.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643294/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncovering Phenotypic Expansion in AXIN2-Related Disorders through Precision Animal Modeling. 通过精准动物建模揭示 AXIN2 相关疾病的表型扩展
Pub Date : 2024-12-08 DOI: 10.1101/2024.12.05.24318524
Nathalie M Aceves-Ewing, Denise G Lanza, Paul C Marcogliese, Di Lu, Chih-Wei Hsu, Matthew Gonzalez, Audrey E Christiansen, Tara L Rasmussen, Alex J Ho, Angelina Gaspero, John Seavitt, Mary E Dickinson, Bo Yuan, Brian J Shayota, Stephanie Pachter, Xiaolin Hu, Debra Lynn Day-Salvatore, Laura Mackay, Oguz Kanca, Michael F Wangler, Lorraine Potocki, Jill A Rosenfeld, Richard Alan Lewis, Hsiao-Tuan Chao, Brendan Lee, Sukyeong Lee, Shinya Yamamoto, Hugo J Bellen, Lindsay C Burrage, Jason D Heaney

Heterozygous pathogenic variants in AXIN2 are associated with oligodontia-colorectal cancer syndrome (ODCRCS), a disorder characterized by oligodontia, colorectal cancer, and in some cases, sparse hair and eyebrows. We have identified four individuals with one of two de novo , heterozygous variants (NM_004655.4:c.196G>A, p.(Glu66Lys) and c.199G>T, p.(Gly67Arg)) in AXIN2 whose presentations expand the phenotype of AXIN2-related disorders. In addition to ODCRCS features, these individuals have global developmental delay, microcephaly, and limb, ophthalmologic, and renal abnormalities. Structural modeling of these variants suggests that they disrupt AXIN2 binding to tankyrase, which regulates AXIN2 levels through PARsylation and subsequent proteasomal degradation. To test whether these variants produce a phenotype in vivo , we utilized an innovative prime editing N1 screen to phenotype heterozygous (p.E66K) mouse embryos, which were perinatal lethal with short palate and skeletal abnormalities, contrary to published viable Axin2 null mouse models. Modeling of the p.E66K variant in the Drosophila wing revealed gain-of-function activity compared to reference AXIN2. However, the variant showed loss-of-function activity in the fly eye compared to reference AXIN2, suggesting that the mechanism by which p.E66K affects AXIN2 function is cell context-dependent. Together, our studies in humans, mice, and flies demonstrate that specific variants in the tankyrase-binding domain of AXIN2 are pathogenic, leading to phenotypic expansion with context-dependent effects on AXIN2 function and WNT signaling. Moreover, the modeling strategies used to demonstrate variant pathogenicity may be beneficial for the resolution of other de novo heterozygous variants of uncertain significance associated with congenital anomalies in humans.

AXIN2 中的杂合致病变体与少突-结直肠癌综合征(ODCRCS)有关,该综合征是一种以少突、结直肠癌为特征的疾病,在某些病例中还伴有头发和眉毛稀疏。我们发现了四名患有 AXIN2 两个新发杂合变体(NM_004655.4:c.196G>A, p.(Glu66Lys) 和 c.199G>T, p.(Gly67Arg))之一的患者,他们的表现扩展了 AXIN2 相关疾病的表型。除了 ODCRCS 特征外,这些患者还伴有全身发育迟缓、小头畸形以及肢体、眼科和肾脏异常。对这些变体的结构建模表明,它们会破坏 AXIN2 与 tankyrase 的结合,而 tankyrase 可通过 PARsylation 和随后的蛋白酶体降解调节 AXIN2 的水平。为了检测这些变体是否会在体内产生表型,我们利用创新性的质粒编辑 N1 筛选技术对杂合型(p.E66K)小鼠胚胎进行了表型鉴定,结果与已发表的可存活的 Axin2 基因无效小鼠模型相反,这些小鼠胚胎在围产期致死,并伴有短腭和骨骼异常。与参考 AXIN2 相比,p.E66K 变体在果蝇翅膀中的建模显示了功能增益活性。然而,与参考 AXIN2 相比,该变异体在果蝇眼睛中显示出功能丧失活性,这表明 p.E66K 影响 AXIN2 功能的机制取决于细胞环境。我们在人类、小鼠和蝇类中的研究共同证明,AXIN2的tankyrase结合域中的特定变异具有致病性,会导致表型扩展,并对AXIN2功能和WNT信号转导产生依赖性影响。此外,用于证明变异致病性的建模策略可能有利于解决与人类先天性畸形相关的其他意义不确定的新发杂合变异。
{"title":"Uncovering Phenotypic Expansion in AXIN2-Related Disorders through Precision Animal Modeling.","authors":"Nathalie M Aceves-Ewing, Denise G Lanza, Paul C Marcogliese, Di Lu, Chih-Wei Hsu, Matthew Gonzalez, Audrey E Christiansen, Tara L Rasmussen, Alex J Ho, Angelina Gaspero, John Seavitt, Mary E Dickinson, Bo Yuan, Brian J Shayota, Stephanie Pachter, Xiaolin Hu, Debra Lynn Day-Salvatore, Laura Mackay, Oguz Kanca, Michael F Wangler, Lorraine Potocki, Jill A Rosenfeld, Richard Alan Lewis, Hsiao-Tuan Chao, Brendan Lee, Sukyeong Lee, Shinya Yamamoto, Hugo J Bellen, Lindsay C Burrage, Jason D Heaney","doi":"10.1101/2024.12.05.24318524","DOIUrl":"10.1101/2024.12.05.24318524","url":null,"abstract":"<p><p>Heterozygous pathogenic variants in <i>AXIN2</i> are associated with oligodontia-colorectal cancer syndrome (ODCRCS), a disorder characterized by oligodontia, colorectal cancer, and in some cases, sparse hair and eyebrows. We have identified four individuals with one of two <i>de novo</i> , heterozygous variants (NM_004655.4:c.196G>A, p.(Glu66Lys) and c.199G>T, p.(Gly67Arg)) in <i>AXIN2</i> whose presentations expand the phenotype of AXIN2-related disorders. In addition to ODCRCS features, these individuals have global developmental delay, microcephaly, and limb, ophthalmologic, and renal abnormalities. Structural modeling of these variants suggests that they disrupt AXIN2 binding to tankyrase, which regulates AXIN2 levels through PARsylation and subsequent proteasomal degradation. To test whether these variants produce a phenotype <i>in vivo</i> , we utilized an innovative prime editing N1 screen to phenotype heterozygous (p.E66K) mouse embryos, which were perinatal lethal with short palate and skeletal abnormalities, contrary to published viable <i>Axin2</i> null mouse models. Modeling of the p.E66K variant in the <i>Drosophila</i> wing revealed gain-of-function activity compared to reference AXIN2. However, the variant showed loss-of-function activity in the fly eye compared to reference AXIN2, suggesting that the mechanism by which p.E66K affects AXIN2 function is cell context-dependent. Together, our studies in humans, mice, and flies demonstrate that specific variants in the tankyrase-binding domain of AXIN2 are pathogenic, leading to phenotypic expansion with context-dependent effects on AXIN2 function and WNT signaling. Moreover, the modeling strategies used to demonstrate variant pathogenicity may be beneficial for the resolution of other <i>de novo</i> heterozygous variants of uncertain significance associated with congenital anomalies in humans.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643287/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
medRxiv : the preprint server for health sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1