Pub Date : 2025-10-28eCollection Date: 2025-01-01DOI: 10.34133/hds.0339
Jianrong Zhang
{"title":"Anticancer Drug Approval in China: From Acceleration of Access to Certainty of Benefits.","authors":"Jianrong Zhang","doi":"10.34133/hds.0339","DOIUrl":"10.34133/hds.0339","url":null,"abstract":"","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0339"},"PeriodicalIF":0.0,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12560837/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145402929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-02eCollection Date: 2025-01-01DOI: 10.34133/hds.0340
Yao Sui, Onur Afacan, Camilo Jaimes, Ali Gholipour, Simon K Warfield
Background: Magnetic resonance imaging (MRI) is of considerable importance due to its wide range of applications in both scientific research and clinical diagnostics. Acquiring high-quality MRI data is of paramount importance. Super-resolution reconstruction serves as a post-acquisition method capable of improving MRI data quality. Current methods predominantly utilize convolutional neural networks in super-resolution reconstruction. However, convolutional layers have inherent limitations in capturing extensive spatial dependencies due to their localized nature. Methods: We developed a new methodology that enables rapid and high-quality MRI data acquisition through a novel super-resolution approach. We proposed an innovative architecture using transformers to exploit long-range spatial dependencies present in images, allowing for an unsupervised learning framework specifically designed for super-resolution tasks tailored to individual subject. We validated our approach using both simulated data and clinical data comprising 40 scans acquired with a 3-T MRI system. Results: We obtained images with T2 contrast at an isotropic spatial resolution of 500 μm in just 4 min of imaging time, and simultaneously, the signal-to-noise ratio and contrast-to-noise ratio were improved by 13.23% and 18.45%, respectively, in comparison to current leading super-resolution techniques. Conclusions: The results demonstrated that incorporating long-range spatial dependencies substantially improved super-resolution reconstruction, thereby allowing for the acquisition of high-quality MRI data with reduced imaging time.
{"title":"Unsupervised Transformer Learning for Rapid and High-Quality MRI Data Acquisition.","authors":"Yao Sui, Onur Afacan, Camilo Jaimes, Ali Gholipour, Simon K Warfield","doi":"10.34133/hds.0340","DOIUrl":"10.34133/hds.0340","url":null,"abstract":"<p><p><b>Background:</b> Magnetic resonance imaging (MRI) is of considerable importance due to its wide range of applications in both scientific research and clinical diagnostics. Acquiring high-quality MRI data is of paramount importance. Super-resolution reconstruction serves as a post-acquisition method capable of improving MRI data quality. Current methods predominantly utilize convolutional neural networks in super-resolution reconstruction. However, convolutional layers have inherent limitations in capturing extensive spatial dependencies due to their localized nature. <b>Methods:</b> We developed a new methodology that enables rapid and high-quality MRI data acquisition through a novel super-resolution approach. We proposed an innovative architecture using transformers to exploit long-range spatial dependencies present in images, allowing for an unsupervised learning framework specifically designed for super-resolution tasks tailored to individual subject. We validated our approach using both simulated data and clinical data comprising 40 scans acquired with a 3-T MRI system. <b>Results:</b> We obtained images with T2 contrast at an isotropic spatial resolution of 500 μm in just 4 min of imaging time, and simultaneously, the signal-to-noise ratio and contrast-to-noise ratio were improved by 13.23% and 18.45%, respectively, in comparison to current leading super-resolution techniques. <b>Conclusions:</b> The results demonstrated that incorporating long-range spatial dependencies substantially improved super-resolution reconstruction, thereby allowing for the acquisition of high-quality MRI data with reduced imaging time.</p>","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0340"},"PeriodicalIF":0.0,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12489180/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145234210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-09-03eCollection Date: 2025-01-01DOI: 10.34133/hds.0321
Siqi Li, Xin Li, Kunyu Yu, Qiming Wu, Di Miao, Mingcheng Zhu, Mengying Yan, Yuhe Ke, Danny D'Agostino, Yilin Ning, Ziwen Wang, Yuqing Shang, Molei Liu, Chuan Hong, Nan Liu
Background: Clinical and biomedical research in low-resource settings often faces substantial challenges due to the need for high-quality data with sufficient sample sizes to construct effective models. These constraints hinder robust model training and prompt researchers to seek methods for leveraging existing knowledge from related studies to support new research efforts. Transfer learning (TL), a machine learning technique, emerges as a powerful solution by utilizing knowledge from pretrained models to enhance the performance of new models, offering promise across various healthcare domains. Despite its conceptual origins in the 1990s, the application of TL in medical research has remained limited, especially beyond image analysis. This review aims to analyze TL applications, highlight overlooked techniques, and suggest improvements for future healthcare research. Methods: Following the PRISMA-ScR guidelines, we conducted a search for published articles that employed TL with structured clinical or biomedical data by searching the SCOPUS, MEDLINE, Web of Science, Embase, and CINAHL databases. Results: We screened 5,080 papers, with 86 meeting the inclusion criteria. Among these, only 2% (2 of 86) utilized external studies, and 5% (4 of 86) addressed scenarios involving multi-site collaborations with privacy constraints. Conclusions: To achieve actionable TL with structured medical data while addressing regional disparities, inequality, and privacy constraints in healthcare research, we advocate for the careful identification of appropriate source data and models, the selection of suitable TL frameworks, and the validation of TL models with proper baselines.
背景:低资源环境下的临床和生物医学研究往往面临着巨大的挑战,因为需要有足够样本量的高质量数据来构建有效的模型。这些限制阻碍了稳健的模型训练,并促使研究人员寻求利用相关研究中的现有知识来支持新的研究工作的方法。迁移学习(TL)是一种机器学习技术,作为一种强大的解决方案,它利用来自预训练模型的知识来增强新模型的性能,为各种医疗保健领域提供了希望。尽管其概念起源于20世纪90年代,但在医学研究中的应用仍然有限,特别是在图像分析之外。本文旨在分析TL的应用,强调被忽视的技术,并为未来的医疗保健研究提出改进建议。方法:根据PRISMA-ScR指南,我们通过检索SCOPUS、MEDLINE、Web of Science、Embase和CINAHL数据库,对已发表的使用TL的结构化临床或生物医学数据的文章进行检索。结果:共筛选论文5080篇,其中86篇符合纳入标准。其中,只有2%(86人中2人)利用了外部研究,5%(86人中4人)处理了涉及隐私限制的多站点协作的场景。结论:为了在解决医疗保健研究中的区域差异、不平等和隐私限制的同时,利用结构化医疗数据实现可操作的TL,我们提倡仔细识别合适的源数据和模型,选择合适的TL框架,并使用适当的基线验证TL模型。
{"title":"Bridging Data Gaps in Healthcare: A Scoping Review of Transfer Learning in Structured Data Analysis.","authors":"Siqi Li, Xin Li, Kunyu Yu, Qiming Wu, Di Miao, Mingcheng Zhu, Mengying Yan, Yuhe Ke, Danny D'Agostino, Yilin Ning, Ziwen Wang, Yuqing Shang, Molei Liu, Chuan Hong, Nan Liu","doi":"10.34133/hds.0321","DOIUrl":"10.34133/hds.0321","url":null,"abstract":"<p><p><b>Background:</b> Clinical and biomedical research in low-resource settings often faces substantial challenges due to the need for high-quality data with sufficient sample sizes to construct effective models. These constraints hinder robust model training and prompt researchers to seek methods for leveraging existing knowledge from related studies to support new research efforts. Transfer learning (TL), a machine learning technique, emerges as a powerful solution by utilizing knowledge from pretrained models to enhance the performance of new models, offering promise across various healthcare domains. Despite its conceptual origins in the 1990s, the application of TL in medical research has remained limited, especially beyond image analysis. This review aims to analyze TL applications, highlight overlooked techniques, and suggest improvements for future healthcare research. <b>Methods:</b> Following the PRISMA-ScR guidelines, we conducted a search for published articles that employed TL with structured clinical or biomedical data by searching the SCOPUS, MEDLINE, Web of Science, Embase, and CINAHL databases. <b>Results:</b> We screened 5,080 papers, with 86 meeting the inclusion criteria. Among these, only 2% (2 of 86) utilized external studies, and 5% (4 of 86) addressed scenarios involving multi-site collaborations with privacy constraints. <b>Conclusions:</b> To achieve actionable TL with structured medical data while addressing regional disparities, inequality, and privacy constraints in healthcare research, we advocate for the careful identification of appropriate source data and models, the selection of suitable TL frameworks, and the validation of TL models with proper baselines.</p>","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0321"},"PeriodicalIF":0.0,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12408193/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145016720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: The traditional manual literature screening approach is limited by its time-consuming nature and high labor costs. A pressing issue is how to leverage large language models to enhance the efficiency and quality of evidence-based evaluations of drug efficacy and safety. Methods: This study utilized a manually curated reference literature database-comprising vaccine, hypoglycemic agent, and antidepressant evaluation studies-previously developed by our team through conventional systematic review methods. This validated database served as the gold standard for the development and optimization of LitAutoScreener. Following the PICOS (Population, Intervention, Comparison, Outcomes, Study Design) principles, a chain-of-thought reasoning approach with few-shot learning prompts was implemented to develop the screening algorithm. We subsequently evaluated the performance of LitAutoScreener using 2 independent validation cohorts, assessing both classification accuracy and processing efficiency. Results: For respiratory syncytial virus vaccine safety validation title-abstract screening, our tools based on GPT (GPT-4o), Kimi (moonshot-v1-128k), and DeepSeek (deepseek-chat 2.5) demonstrated high accuracy in inclusion/exclusion decisions (99.38%, 98.94%, and 98.85%, respectively). Recall rates were 100.00%, 99.13%, and 98.26%, with statistically significant performance differences (χ2 = 5.99, P = 0.048), where GPT outperformed the other models. Exclusion reason concordance rates were 98.85%, 94.79%, and 96.47% (χ2 = 30.22, P < 0.001). In full-text screening, all models maintained perfect recall (100.00%), with accuracies of 100.00% (GPT), 100.00% (Kimi), and 99.45% (DeepSeek). Processing times averaged 1 to 5 s per article for title-abstract screening and 60 s for full-text processing (including PDF preprocessing). Conclusions: LitAutoScreener offers a new approach for efficient literature screening in drug intervention studies, achieving high accuracy and significantly improving screening efficiency.
{"title":"LitAutoScreener: Development and Validation of an Automated Literature Screening Tool in Evidence-Based Medicine Driven by Large Language Models.","authors":"Yiming Tao, Xuehu Li, Zuhar Yisha, Sihan Yang, Siyan Zhan, Feng Sun","doi":"10.34133/hds.0322","DOIUrl":"10.34133/hds.0322","url":null,"abstract":"<p><p><b>Background:</b> The traditional manual literature screening approach is limited by its time-consuming nature and high labor costs. A pressing issue is how to leverage large language models to enhance the efficiency and quality of evidence-based evaluations of drug efficacy and safety. <b>Methods:</b> This study utilized a manually curated reference literature database-comprising vaccine, hypoglycemic agent, and antidepressant evaluation studies-previously developed by our team through conventional systematic review methods. This validated database served as the gold standard for the development and optimization of LitAutoScreener. Following the PICOS (Population, Intervention, Comparison, Outcomes, Study Design) principles, a chain-of-thought reasoning approach with few-shot learning prompts was implemented to develop the screening algorithm. We subsequently evaluated the performance of LitAutoScreener using 2 independent validation cohorts, assessing both classification accuracy and processing efficiency. <b>Results:</b> For respiratory syncytial virus vaccine safety validation title-abstract screening, our tools based on GPT (GPT-4o), Kimi (moonshot-v1-128k), and DeepSeek (deepseek-chat 2.5) demonstrated high accuracy in inclusion/exclusion decisions (99.38%, 98.94%, and 98.85%, respectively). Recall rates were 100.00%, 99.13%, and 98.26%, with statistically significant performance differences (<i>χ</i> <sup>2</sup> = 5.99, <i>P</i> = 0.048), where GPT outperformed the other models. Exclusion reason concordance rates were 98.85%, 94.79%, and 96.47% (<i>χ</i> <sup>2</sup> = 30.22, <i>P</i> < 0.001). In full-text screening, all models maintained perfect recall (100.00%), with accuracies of 100.00% (GPT), 100.00% (Kimi), and 99.45% (DeepSeek). Processing times averaged 1 to 5 s per article for title-abstract screening and 60 s for full-text processing (including PDF preprocessing). <b>Conclusions:</b> LitAutoScreener offers a new approach for efficient literature screening in drug intervention studies, achieving high accuracy and significantly improving screening efficiency.</p>","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0322"},"PeriodicalIF":0.0,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12404845/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144994177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-09-02eCollection Date: 2025-01-01DOI: 10.34133/hds.0325
Jingyu Wang, Tong Xie, Jiawen Peng, Yuemiao Zhang, Hong Zhang
Background: Recently, several cutting-edge experimental studies have directed chimeric antigen receptor (CAR)-T therapies toward specific renal diseases, revealing substantial renal benefits. Prior to widespread implementation of these animal experiments and potentially clinical trials, it is crucial to assess the renal safety of CAR-T therapies using real-world safety evidence. Methods: Our focus was on utilizing 4 algorithms, including disproportionality analysis, based on the US Food and Drug Administration Adverse Event Reporting System database, to filter positive signals of acute and chronic renal injury associated with 6 CAR-T therapies. Further determination of causality was achieved through Mendelian randomization (MR) for drugs associated with renal injury events showing a correlation. Results: Six therapies were evaluated involving a total of 9,770 patients, with only acute kidney injury (AKI) identified as associated with idecabtagene vicleucel treatment using 4 algorithmic thresholds, including disproportionality analysis. Subsequently, MR revealed no causal relationship between the idecabtagene vicleucel target B cell maturation antigen and the risk of AKI (P = 0.576), a finding validated in another independent dataset (P = 0.734). Conclusion: CAR-T therapies do not directly cause renal damage and necessitate controlling adverse renal risks during or after treatment, such as cytokine release syndrome. Future research efforts should rigorously optimize these aspects to better cater to nephrologists' requirements.
{"title":"Characterizing the Real-World Risks of Kidney Injuries Associated with Chimeric Antigen Receptor T Cell Therapies-Evidence and Safety.","authors":"Jingyu Wang, Tong Xie, Jiawen Peng, Yuemiao Zhang, Hong Zhang","doi":"10.34133/hds.0325","DOIUrl":"10.34133/hds.0325","url":null,"abstract":"<p><p><b>Background:</b> Recently, several cutting-edge experimental studies have directed chimeric antigen receptor (CAR)-T therapies toward specific renal diseases, revealing substantial renal benefits. Prior to widespread implementation of these animal experiments and potentially clinical trials, it is crucial to assess the renal safety of CAR-T therapies using real-world safety evidence. <b>Methods:</b> Our focus was on utilizing 4 algorithms, including disproportionality analysis, based on the US Food and Drug Administration Adverse Event Reporting System database, to filter positive signals of acute and chronic renal injury associated with 6 CAR-T therapies. Further determination of causality was achieved through Mendelian randomization (MR) for drugs associated with renal injury events showing a correlation. <b>Results:</b> Six therapies were evaluated involving a total of 9,770 patients, with only acute kidney injury (AKI) identified as associated with idecabtagene vicleucel treatment using 4 algorithmic thresholds, including disproportionality analysis. Subsequently, MR revealed no causal relationship between the idecabtagene vicleucel target B cell maturation antigen and the risk of AKI (<i>P</i> = 0.576), a finding validated in another independent dataset (<i>P</i> = 0.734). <b>Conclusion:</b> CAR-T therapies do not directly cause renal damage and necessitate controlling adverse renal risks during or after treatment, such as cytokine release syndrome. Future research efforts should rigorously optimize these aspects to better cater to nephrologists' requirements.</p>","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0325"},"PeriodicalIF":0.0,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12404846/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144994080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-08-05eCollection Date: 2025-01-01DOI: 10.34133/hds.0224
Shusheng Li, Yang Bo, Yuchu Chen, Jianfeng Cao, Bo Bi, Ting Ma, Chenfei Ye
Background: Major depressive disorder (MDD) and autism spectrum disorder (ASD) are complex and heterogeneous neuropsychiatric disorders with overlapping symptoms, presenting remarkable challenges for accurate diagnosis. Leveraging functional neuroimaging data offers an opportunity to develop more robust, data-driven approach for psychiatric disorder detection. However, existing methods often struggle to capture the long-term dependencies and dynamic patterns inherent in such data, particularly across diverse imaging sites. Methods: We propose Multiscale Contextual Mamba (MSC-Mamba), a Mamba-based model designed for capturing long-term dependencies in multivariate time-series data while maintaining linear scalability, allowing us to account for long-range interactions and subtle dynamic patterns within the brain's functional networks. One of the main advantages of MSC-Mamba is its ability to leverage the distinct characteristics of time-series data, allowing it to generate meaningful contextual information across various scales. This method effectively addresses both channel-mixing and channel-independence scenarios, facilitating the selection of relevant features for prediction by considering both global and local contexts at multiple scales. Results: Two large-scale multisite functional magnetic resonance imaging datasets, including REST-meta-MDD (n = 1,642) and Autism Brain Imaging Data Exchange (ABIDE) (n = 1,022), were used to validate the performance of our proposed approach. MSC-Mamba has achieved state-of-the-art performance, with an accuracy of 69.91% for MDD detection and 73.08% for ASD detection. The results demonstrate the model's robust generalization across imaging sites and its sensitivity to intricate brain network dynamics. Conclusions: This paper demonstrates the potential of state-space models in advancing psychiatric neuroimaging research. The findings suggest that such models can significantly enhance detection accuracy for MDD and ASD, pointing toward more reliable, data-driven diagnostic tools in psychiatric disorder detection.
{"title":"Multiscale Contextual Mamba: Advancing Psychiatric Disorder Detection across Multisite Functional Magnetic Resonance Imaging Datasets via State Space Modeling.","authors":"Shusheng Li, Yang Bo, Yuchu Chen, Jianfeng Cao, Bo Bi, Ting Ma, Chenfei Ye","doi":"10.34133/hds.0224","DOIUrl":"10.34133/hds.0224","url":null,"abstract":"<p><p><b>Background:</b> Major depressive disorder (MDD) and autism spectrum disorder (ASD) are complex and heterogeneous neuropsychiatric disorders with overlapping symptoms, presenting remarkable challenges for accurate diagnosis. Leveraging functional neuroimaging data offers an opportunity to develop more robust, data-driven approach for psychiatric disorder detection. However, existing methods often struggle to capture the long-term dependencies and dynamic patterns inherent in such data, particularly across diverse imaging sites. <b>Methods:</b> We propose Multiscale Contextual Mamba (MSC-Mamba), a Mamba-based model designed for capturing long-term dependencies in multivariate time-series data while maintaining linear scalability, allowing us to account for long-range interactions and subtle dynamic patterns within the brain's functional networks. One of the main advantages of MSC-Mamba is its ability to leverage the distinct characteristics of time-series data, allowing it to generate meaningful contextual information across various scales. This method effectively addresses both channel-mixing and channel-independence scenarios, facilitating the selection of relevant features for prediction by considering both global and local contexts at multiple scales. <b>Results:</b> Two large-scale multisite functional magnetic resonance imaging datasets, including REST-meta-MDD (<i>n</i> = 1,642) and Autism Brain Imaging Data Exchange (ABIDE) (<i>n</i> = 1,022), were used to validate the performance of our proposed approach. MSC-Mamba has achieved state-of-the-art performance, with an accuracy of 69.91% for MDD detection and 73.08% for ASD detection. The results demonstrate the model's robust generalization across imaging sites and its sensitivity to intricate brain network dynamics. <b>Conclusions:</b> This paper demonstrates the potential of state-space models in advancing psychiatric neuroimaging research. The findings suggest that such models can significantly enhance detection accuracy for MDD and ASD, pointing toward more reliable, data-driven diagnostic tools in psychiatric disorder detection.</p>","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0224"},"PeriodicalIF":0.0,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12324564/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144790891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-08-05eCollection Date: 2025-01-01DOI: 10.34133/hds.0283
Zhipeng He, Shishi Tang, Yurong Hu, Yuxuan Li, Junhao Liang, Li Fang, Miaoxin Li, Ziyi Chen, Yi Zhou
Background: Alterations of brain connectivity within resting-state networks (RSNs) have been widely reported in observational studies on epilepsy. However, the causal relationship between epilepsy and structural connectivity (SC)/functional connectivity (FC) within RSNs remain unclear. We conducted a bidirectional two-sample Mendelian randomization (MR) to explore the causal relationship between epilepsy subtypes and brain connectivity properties within RSNs. Methods: Genetic instruments were obtained from the latest genome-wide association studies (GWAS) of 69,995 individuals (Ncases = 27,559, Ncontrols = 42,436) issued by the International League Against Epilepsy. The GWAS summary SC/FC data within RSNs (NSC = 23,985, NFC = 24,336) were sourced from the Center for Neurogenomics and Cognitive Research. We investigate the causal relationship between epilepsy subtypes and brain connectivity within RSNs through a bidirectional two-sample MR analysis. Results: We found that the increased risk of generalized genetic epilepsy is consistent with a causal effect on dorsal attention and somatomotor FC. In the reverse MR analysis, there was no suggestive causal effect of FC/SC connectivity on epilepsy subtypes. Conclusions: This study shed light on the associations of FC/SC levels within the RSNs and epilepsy along with its subtypes. This insight could yield crucial intervention strategies to different subtypes of epilepsy at the level of brain structure and functional networks.
背景:在癫痫的观察性研究中,静息状态网络(RSNs)内脑连通性的改变已被广泛报道。然而,癫痫与rsn内结构连接(SC)/功能连接(FC)之间的因果关系尚不清楚。我们进行了双向双样本孟德尔随机化(MR)来探索癫痫亚型与rsn内大脑连接特性之间的因果关系。方法:从国际抗癫痫联盟发布的69,995例(N例= 27,559例,N对照= 42,436例)的最新全基因组关联研究(GWAS)中获得遗传仪器。rsn内的GWAS SC/FC汇总数据(N SC = 23,985, N FC = 24,336)来自神经基因组学和认知研究中心。我们通过双向双样本MR分析来研究癫痫亚型与rsn内大脑连接之间的因果关系。结果:我们发现全身性遗传性癫痫的风险增加与背侧注意力和躯体运动FC的因果效应一致。在反向MR分析中,没有提示FC/SC连接对癫痫亚型的因果影响。结论:本研究揭示了rsn内FC/SC水平与癫痫及其亚型的关系。这一见解可以在大脑结构和功能网络水平上为不同亚型癫痫提供关键的干预策略。
{"title":"Brain Connectivity Yields Insights into the Pathogenesis of Epilepsy and Subtypes: Evidence from Mendelian Randomization Analysis.","authors":"Zhipeng He, Shishi Tang, Yurong Hu, Yuxuan Li, Junhao Liang, Li Fang, Miaoxin Li, Ziyi Chen, Yi Zhou","doi":"10.34133/hds.0283","DOIUrl":"10.34133/hds.0283","url":null,"abstract":"<p><p><b>Background:</b> Alterations of brain connectivity within resting-state networks (RSNs) have been widely reported in observational studies on epilepsy. However, the causal relationship between epilepsy and structural connectivity (SC)/functional connectivity (FC) within RSNs remain unclear. We conducted a bidirectional two-sample Mendelian randomization (MR) to explore the causal relationship between epilepsy subtypes and brain connectivity properties within RSNs. <b>Methods:</b> Genetic instruments were obtained from the latest genome-wide association studies (GWAS) of 69,995 individuals (<i>N</i> <sub>cases</sub> = 27,559, <i>N</i> <sub>controls</sub> = 42,436) issued by the International League Against Epilepsy. The GWAS summary SC/FC data within RSNs (<i>N</i> <sub>SC</sub> = 23,985, <i>N</i> <sub>FC</sub> = 24,336) were sourced from the Center for Neurogenomics and Cognitive Research. We investigate the causal relationship between epilepsy subtypes and brain connectivity within RSNs through a bidirectional two-sample MR analysis. <b>Results:</b> We found that the increased risk of generalized genetic epilepsy is consistent with a causal effect on dorsal attention and somatomotor FC. In the reverse MR analysis, there was no suggestive causal effect of FC/SC connectivity on epilepsy subtypes. <b>Conclusions:</b> This study shed light on the associations of FC/SC levels within the RSNs and epilepsy along with its subtypes. This insight could yield crucial intervention strategies to different subtypes of epilepsy at the level of brain structure and functional networks.</p>","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0283"},"PeriodicalIF":0.0,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12324163/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144790890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-08-05eCollection Date: 2025-01-01DOI: 10.34133/hds.0282
Ying Li, Yanwu Yang, Yuchu Chen, Chenfei Ye, Ting Ma
Importance: Precisely decoding brain dysfunction from high-dimensional functional recordings is crucial for advancing our understanding of brain dysfunction in brain disorders. Self-supervised learning (SSL) models offer a transformative approach for mapping dependencies in functional neuroimaging data. Leveraging the intrinsic organization of brain signals for comprehensive feature extraction, these models enable the analysis of critical neurofunctional features within a clinically relevant framework, overcoming challenges related to data heterogeneity and the scarcity of labeled data. Highlight: This paper provides a comprehensive overview of SSL techniques applied to functional neuroimaging data, such as functional magnetic resonance imaging and electroencephalography, with a specific focus on their applications in various neuropsychiatric disorders. We discuss 3 main categories of SSL methods: contrastive learning, generative learning, and generative-contrastive methods, outlining their basic principles and representative methods. Critically, we highlight the potential of SSL in addressing data scarcity, multimodal integration, and dynamic network modeling for disease detection and prediction. We showcase successful applications of these techniques in understanding and classifying conditions such as Alzheimer's disease, Parkinson's disease, and epilepsy, demonstrating their potential in downstream neuropsychological applications. Conclusion: SSL models provide a scalable and effective methodology for individual detection and prediction in brain disorders. Despite current limitations in interpretability and data heterogeneity, the potential of SSL for future clinical applications, particularly in the areas of transdiagnostic psychosis subtyping and decoding task-based brain functional recordings, is substantial.
{"title":"Self-Supervised Learning to Unveil Brain Dysfunctional Signatures in Brain Disorders: Methods and Applications.","authors":"Ying Li, Yanwu Yang, Yuchu Chen, Chenfei Ye, Ting Ma","doi":"10.34133/hds.0282","DOIUrl":"10.34133/hds.0282","url":null,"abstract":"<p><p><b>Importance:</b> Precisely decoding brain dysfunction from high-dimensional functional recordings is crucial for advancing our understanding of brain dysfunction in brain disorders. Self-supervised learning (SSL) models offer a transformative approach for mapping dependencies in functional neuroimaging data. Leveraging the intrinsic organization of brain signals for comprehensive feature extraction, these models enable the analysis of critical neurofunctional features within a clinically relevant framework, overcoming challenges related to data heterogeneity and the scarcity of labeled data. <b>Highlight:</b> This paper provides a comprehensive overview of SSL techniques applied to functional neuroimaging data, such as functional magnetic resonance imaging and electroencephalography, with a specific focus on their applications in various neuropsychiatric disorders. We discuss 3 main categories of SSL methods: contrastive learning, generative learning, and generative-contrastive methods, outlining their basic principles and representative methods. Critically, we highlight the potential of SSL in addressing data scarcity, multimodal integration, and dynamic network modeling for disease detection and prediction. We showcase successful applications of these techniques in understanding and classifying conditions such as Alzheimer's disease, Parkinson's disease, and epilepsy, demonstrating their potential in downstream neuropsychological applications. <b>Conclusion:</b> SSL models provide a scalable and effective methodology for individual detection and prediction in brain disorders. Despite current limitations in interpretability and data heterogeneity, the potential of SSL for future clinical applications, particularly in the areas of transdiagnostic psychosis subtyping and decoding task-based brain functional recordings, is substantial.</p>","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0282"},"PeriodicalIF":0.0,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12324563/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144790892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-02eCollection Date: 2025-01-01DOI: 10.34133/hds.0284
Gregorio Ferreira, Jacopo Amidei, Rubén Nieto, Andreas Kaltenbrunner
Background: Advances in artificial intelligence have enabled the simulation of human-like behaviors, raising the possibility of using large language models (LLMs) to generate synthetic population samples for research purposes, which may be particularly useful in health and social sciences. Methods: This paper explores the potential of LLMs to simulate population samples mirroring real ones, as well as the feasibility of using personality questionnaires to assess the personality of LLMs. To advance in that direction, 2 experiments were conducted with GPT-4o using the Eysenck Personality Questionnaire Revised-Abbreviated (EPQR-A) in 6 languages: Spanish, English, Slovak, Hebrew, Portuguese, and Turkish. Results: We find that GPT-4o exhibits distinct personality traits, which vary based on parameter settings and the language of the questionnaire. While the model shows promising trends in reflecting certain personality traits and differences across gender and academic fields, discrepancies between the synthetic populations' responses and those from real populations remain. Conclusions: These inconsistencies suggest that creating fully reliable synthetic population samples for questionnaire testing is still an open challenge. Further research is required to better align synthetic and real population behaviors.
{"title":"How Well Do Simulated Population Samples with GPT-4 Align with Real Ones? The Case of the Eysenck Personality Questionnaire Revised-Abbreviated Personality Test.","authors":"Gregorio Ferreira, Jacopo Amidei, Rubén Nieto, Andreas Kaltenbrunner","doi":"10.34133/hds.0284","DOIUrl":"10.34133/hds.0284","url":null,"abstract":"<p><p><b>Background:</b> Advances in artificial intelligence have enabled the simulation of human-like behaviors, raising the possibility of using large language models (LLMs) to generate synthetic population samples for research purposes, which may be particularly useful in health and social sciences. <b>Methods:</b> This paper explores the potential of LLMs to simulate population samples mirroring real ones, as well as the feasibility of using personality questionnaires to assess the personality of LLMs. To advance in that direction, 2 experiments were conducted with GPT-4o using the Eysenck Personality Questionnaire Revised-Abbreviated (EPQR-A) in 6 languages: Spanish, English, Slovak, Hebrew, Portuguese, and Turkish. <b>Results:</b> We find that GPT-4o exhibits distinct personality traits, which vary based on parameter settings and the language of the questionnaire. While the model shows promising trends in reflecting certain personality traits and differences across gender and academic fields, discrepancies between the synthetic populations' responses and those from real populations remain. <b>Conclusions:</b> These inconsistencies suggest that creating fully reliable synthetic population samples for questionnaire testing is still an open challenge. Further research is required to better align synthetic and real population behaviors.</p>","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0284"},"PeriodicalIF":0.0,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12217932/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144556087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-03eCollection Date: 2025-01-01DOI: 10.34133/hds.0209
Lei Feng, Weiwei Wang, Can Yin, Jing Li, Xinwei Zhang, Xiaotian Chang, Zizhao Feng, Mui Van Zandt, Seng Chan You, Sarah Seager, Christian Reich, Siyan Zhan, Feng Sun, Gang Wang
Background: The use of antidepressants in the treatment of bipolar depression remains controversial due to concerns about their potential to induce mood polarity switches. This multinational observational study aims to examine the association between the use of antidepressants and the risk of hypomanic/manic switch among bipolar depressive patients. Methods: Four electronic health record databases (IQVIA Disease Analyzer Germany, IQVIA Disease Analyzer France, IQVIA US Hospital Charge Data Master, and Beijing Anding Hospital) and one administrative claims database (IQVIA US Open Claims) were analyzed, and the study period covered from January 2013 until December 2017. Treatment patterns of patients with bipolar depression were collected. The hazard ratio (HR) was calculated by comparing the incidence of hypomanic/manic switch in patients who received antidepressants (AD group) with that in those who did not receive any antidepressant (non-AD group) in 730 days after the date of the first diagnosis of bipolar depression. Results: The analysis included a total of 122,843 patients from the 5 databases; 60.6% of them received antidepressants for bipolar depression. Across the 5 data sources, the mean age at index date ranged from 37.50 (15.72) to 52.10 (16.22) years. After controlling potential confounders by propensity score matching, the AD group's manic switch risk was not significantly higher than the non-AD group's (HR 1.04 [95% CI, 0.96 to 1.13]; P = 0.989). Additionally, no statistically significant difference was observed between patients prescribed antimanic drugs and those who were not (HR 0.69 [95% CI, 0.38 to 1.25]; P = 0.535). Conclusions: This study indicated that antidepressants were widely used in clinical settings for managing bipolar depression. The use of antidepressants was not associated with the risk of mania/hypomania switch when compared to non-antidepressants treatment. Therefore, antidepressants could be considered a treatment option for bipolar depression.
{"title":"Risk of Switch to Mania/Hypomania in Bipolar Depressive Patients Treated with Antidepressants: A Real-World Study.","authors":"Lei Feng, Weiwei Wang, Can Yin, Jing Li, Xinwei Zhang, Xiaotian Chang, Zizhao Feng, Mui Van Zandt, Seng Chan You, Sarah Seager, Christian Reich, Siyan Zhan, Feng Sun, Gang Wang","doi":"10.34133/hds.0209","DOIUrl":"10.34133/hds.0209","url":null,"abstract":"<p><p><b>Background:</b> The use of antidepressants in the treatment of bipolar depression remains controversial due to concerns about their potential to induce mood polarity switches. This multinational observational study aims to examine the association between the use of antidepressants and the risk of hypomanic/manic switch among bipolar depressive patients. <b>Methods:</b> Four electronic health record databases (IQVIA Disease Analyzer Germany, IQVIA Disease Analyzer France, IQVIA US Hospital Charge Data Master, and Beijing Anding Hospital) and one administrative claims database (IQVIA US Open Claims) were analyzed, and the study period covered from January 2013 until December 2017. Treatment patterns of patients with bipolar depression were collected. The hazard ratio (HR) was calculated by comparing the incidence of hypomanic/manic switch in patients who received antidepressants (AD group) with that in those who did not receive any antidepressant (non-AD group) in 730 days after the date of the first diagnosis of bipolar depression. <b>Results:</b> The analysis included a total of 122,843 patients from the 5 databases; 60.6% of them received antidepressants for bipolar depression. Across the 5 data sources, the mean age at index date ranged from 37.50 (15.72) to 52.10 (16.22) years. After controlling potential confounders by propensity score matching, the AD group's manic switch risk was not significantly higher than the non-AD group's (HR 1.04 [95% CI, 0.96 to 1.13]; <i>P</i> = 0.989). Additionally, no statistically significant difference was observed between patients prescribed antimanic drugs and those who were not (HR 0.69 [95% CI, 0.38 to 1.25]; <i>P</i> = 0.535). <b>Conclusions:</b> This study indicated that antidepressants were widely used in clinical settings for managing bipolar depression. The use of antidepressants was not associated with the risk of mania/hypomania switch when compared to non-antidepressants treatment. Therefore, antidepressants could be considered a treatment option for bipolar depression.</p>","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0209"},"PeriodicalIF":0.0,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12130621/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144217807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}