首页 > 最新文献

BMC Medical Research Methodology最新文献

英文 中文
AI-powered classification and network analysis for knowledge mapping in medicine: a century of neurosyphilis research. 医学知识图谱的人工智能分类和网络分析:一个世纪的神经梅毒研究。
IF 3.4 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-26 DOI: 10.1186/s12874-025-02750-8
Justine Falciola, Myriam Lamrayah, François R Herrmann, Alexandre Wenger, Laurence Toutous Trellu
{"title":"AI-powered classification and network analysis for knowledge mapping in medicine: a century of neurosyphilis research.","authors":"Justine Falciola, Myriam Lamrayah, François R Herrmann, Alexandre Wenger, Laurence Toutous Trellu","doi":"10.1186/s12874-025-02750-8","DOIUrl":"10.1186/s12874-025-02750-8","url":null,"abstract":"","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":" ","pages":"12"},"PeriodicalIF":3.4,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12849313/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145843538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recruitment of a probability-based general population health panel for public health research in Germany: the panel 'Health in Germany'. 为德国公共卫生研究招募基于概率的一般人口健康小组:“德国的健康”小组。
IF 3.4 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-23 DOI: 10.1186/s12874-025-02746-4
Johannes Lemcke, Ilter Öztürk, Stefan Damerow, Tobias Heller, Sabine Born, Matthias Wetzstein, Jennifer Allen, Patrick Schmich

Background: This report presents the study design and recruitment outcomes for the 'Health in Germany' panel, a long-term population-based health survey infrastructure developed by the Robert Koch Institute. The initial recruitment was conducted using a stratified random sample of the German population and a mixed-mode approach combining web-based and paper questionnaires. We examine participation rates across demographic subgroups, assess sample composition, and analyze potential selection effects.

Method: The panel recruitment survey for the 'Health in Germany' panel utilized the residents' registration offices as the sampling frame. A two-stage stratified (cluster) sample was drawn from 359 primary sampling units across Germany. A mixed-mode approach was employed, offering both online and paper survey-mode on the basis of age groups. The sequence of survey modes was differentiated by age groups based on information from the residents' registration offices. For respondents aged 16-69 years, a sequential mixed-mode design, offering the online mode first and only with the second reminder the paper survey-mode (also called push-to-web strategy), was applied. For respondents aged 70 years and older a simultaneous mixed-mode design was used, offering the online and paper survey-mode with the invitation. Participants completed a first panel recruitment survey in which socio-demographic and health data were collected. Selection effects and sample composition were analyzed via logistic regression models and compared with official population data.

Results: A total of 62,556 interviews were conducted, with 49,766 participants consenting to join the panel. After double opt-in registration process (for online participants), 47,863 remained. Response rates were higher among women and younger participants. Online participation predominated in the sequential design, whereas offline participation was more common in the simultaneous design. Logistic regression indicated higher participation among women and residents of smaller municipalities. Overall, the sample composition aligned broadly with population benchmarks, except for education and citizenship.

Conclusions: The first recruitment study for the 'Health in Germany' panel established one of Europe's largest population-based health panels through a mixed-mode design. Future expansions include regular health surveys, biometric measurements, complementary self-recruitment alongside the probability-based sample, and the development of a mobile survey app.

背景:本报告介绍了“德国健康”小组的研究设计和招募结果,这是由罗伯特·科赫研究所开发的一项长期基于人群的健康调查基础设施。最初的招募使用德国人口分层随机样本和混合模式方法,结合网络和纸质问卷调查。我们检查了人口统计亚组的参与率,评估了样本组成,并分析了潜在的选择效应。方法:“健康在德国”小组招募调查以居民登记办公室为抽样框架。从德国359个主要抽样单位中抽取两阶段分层(整群)样本。采用混合模式方法,根据年龄组提供在线和纸质调查模式。根据居民登记办公室提供的信息,按年龄组划分调查方式顺序。对于16-69岁的受访者,采用顺序混合模式设计,首先提供在线模式,第二次提醒纸张调查模式(也称为推送到网络策略)。对于70岁及以上的被调查者,采用同时进行的混合模式设计,在邀请的同时提供在线和纸质调查模式。参与者完成了第一次小组招聘调查,其中收集了社会人口和健康数据。通过logistic回归模型分析选择效应和样本构成,并与官方人口数据进行比较。结果:共进行了62,556次访谈,其中49,766名参与者同意加入小组。经过两次选择注册程序(针对在线参与者),仍有47,863人。女性和年轻参与者的回应率更高。在线参与在顺序设计中占主导地位,而离线参与在同时设计中更为常见。逻辑回归表明,妇女和小城市居民的参与率较高。总体而言,除了教育和公民身份之外,样本构成与人口基准基本一致。结论:“德国健康”小组的首次招募研究通过混合模式设计建立了欧洲最大的基于人口的健康小组之一。未来的扩展包括定期健康调查、生物特征测量、基于概率的样本补充自我招募,以及移动调查应用程序的开发。
{"title":"Recruitment of a probability-based general population health panel for public health research in Germany: the panel 'Health in Germany'.","authors":"Johannes Lemcke, Ilter Öztürk, Stefan Damerow, Tobias Heller, Sabine Born, Matthias Wetzstein, Jennifer Allen, Patrick Schmich","doi":"10.1186/s12874-025-02746-4","DOIUrl":"10.1186/s12874-025-02746-4","url":null,"abstract":"<p><strong>Background: </strong>This report presents the study design and recruitment outcomes for the 'Health in Germany' panel, a long-term population-based health survey infrastructure developed by the Robert Koch Institute. The initial recruitment was conducted using a stratified random sample of the German population and a mixed-mode approach combining web-based and paper questionnaires. We examine participation rates across demographic subgroups, assess sample composition, and analyze potential selection effects.</p><p><strong>Method: </strong>The panel recruitment survey for the 'Health in Germany' panel utilized the residents' registration offices as the sampling frame. A two-stage stratified (cluster) sample was drawn from 359 primary sampling units across Germany. A mixed-mode approach was employed, offering both online and paper survey-mode on the basis of age groups. The sequence of survey modes was differentiated by age groups based on information from the residents' registration offices. For respondents aged 16-69 years, a sequential mixed-mode design, offering the online mode first and only with the second reminder the paper survey-mode (also called push-to-web strategy), was applied. For respondents aged 70 years and older a simultaneous mixed-mode design was used, offering the online and paper survey-mode with the invitation. Participants completed a first panel recruitment survey in which socio-demographic and health data were collected. Selection effects and sample composition were analyzed via logistic regression models and compared with official population data.</p><p><strong>Results: </strong>A total of 62,556 interviews were conducted, with 49,766 participants consenting to join the panel. After double opt-in registration process (for online participants), 47,863 remained. Response rates were higher among women and younger participants. Online participation predominated in the sequential design, whereas offline participation was more common in the simultaneous design. Logistic regression indicated higher participation among women and residents of smaller municipalities. Overall, the sample composition aligned broadly with population benchmarks, except for education and citizenship.</p><p><strong>Conclusions: </strong>The first recruitment study for the 'Health in Germany' panel established one of Europe's largest population-based health panels through a mixed-mode design. Future expansions include regular health surveys, biometric measurements, complementary self-recruitment alongside the probability-based sample, and the development of a mobile survey app.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":" ","pages":"15"},"PeriodicalIF":3.4,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12853828/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145817635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-constant absolute effect measures for time-to-event outcomes. 时间到事件结果的时间常数绝对效应测量。
IF 3.4 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-17 DOI: 10.1186/s12874-025-02743-7
Oliver Kuss, Annika Hoyer

Background: Reporting treatment effects from clinical trials on both relative and absolute scales is crucial. While absolute measures like the Number Needed to Treat (NNT) are well-established for binary outcomes, their calculation for time-to-event outcomes remains challenging due to time-dependence, which hinders interpretation and communication. Traditional additive hazard models, while addressing time-dependence, have been limited by restrictive assumptions regarding outcome distributions.

Methods: This paper proposes to use a recently introduced class of parametric additive hazard models to compute time-constant absolute effect measures for time-to-event outcomes. These models allow for a wide range of parametric distributions, overcoming the limitations of previous approaches. The approach provides a single, absolute effect size (e.g., hazard difference or NNT) summarizing the effect over the entire study duration. We illustrate this method using digitized Kaplan-Meier data from the EMPA-REG OUTCOME trial, focusing on all-cause mortality, and fit six different parametric distributions (exponential, linear hazard rate, Weibull, log-logistic, Gompertz, and Gamma-Gompertz).

Results: Despite notable differences in model fit across the six distributions, the estimated rate differences, corresponding NNTs, and their confidence intervals were remarkably similar. The linear hazard rate and Gompertz models, which provided the best fit according to the BIC, yielded a rate difference of -8.8 per 1,000 person-years, with an NNT of 114. These models also demonstrated increasing hazards, aligning with expectations for all-cause mortality. The estimated modes of the distributions from the best-fitting models (10.4 and 13.0 years) were more plausible than those from simpler models.

Conclusions: The class of parametric additive hazard models offers a valuable tool for calculating time-constant absolute effect measures for time-to-event outcomes. This approach effectively addresses the issues of time-dependence and limited distribution flexibility, providing a single, interpretable absolute effect size. Future work could explore more general distributions and further derivation of absolute effect measures on the time scale.

背景:从临床试验中报告治疗效果的相对和绝对尺度是至关重要的。虽然像需要治疗的数量(NNT)这样的绝对测量方法对于二元结果已经建立,但由于时间依赖性,它们对时间到事件结果的计算仍然具有挑战性,这阻碍了解释和沟通。传统的加性风险模型虽然解决了时间依赖性问题,但由于对结果分布的限制性假设而受到限制。方法:本文建议使用最近引入的一类参数加性风险模型来计算时间-事件结果的时间常数绝对效应度量。这些模型允许大范围的参数分布,克服了以前方法的局限性。该方法提供了一个单一的绝对效应值(例如,危险差异或NNT),总结了整个研究期间的效应。我们使用来自EMPA-REG OUTCOME试验的数字化Kaplan-Meier数据来说明这种方法,重点关注全因死亡率,并拟合六种不同的参数分布(指数、线性风险率、Weibull、log-logistic、Gompertz和Gamma-Gompertz)。结果:尽管六个分布之间的模型拟合存在显著差异,但估计的比率差异、相应的nnt及其置信区间非常相似。根据BIC提供最佳拟合的线性危险率和Gompertz模型得出的危险率差异为-8.8 / 1000人年,NNT为114。这些模型也显示出风险增加,与全因死亡率的预期一致。最佳拟合模型(10.4年和13.0年)的分布估计模式比简单模型的分布估计模式更可信。结论:参数加性风险模型为计算时间到事件结果的时间常数绝对效应度量提供了一个有价值的工具。这种方法有效地解决了时间依赖性和有限分布灵活性的问题,提供了单一的、可解释的绝对效应大小。未来的工作可以探索更一般的分布,并进一步推导绝对效应在时间尺度上的度量。
{"title":"Time-constant absolute effect measures for time-to-event outcomes.","authors":"Oliver Kuss, Annika Hoyer","doi":"10.1186/s12874-025-02743-7","DOIUrl":"10.1186/s12874-025-02743-7","url":null,"abstract":"<p><strong>Background: </strong>Reporting treatment effects from clinical trials on both relative and absolute scales is crucial. While absolute measures like the Number Needed to Treat (NNT) are well-established for binary outcomes, their calculation for time-to-event outcomes remains challenging due to time-dependence, which hinders interpretation and communication. Traditional additive hazard models, while addressing time-dependence, have been limited by restrictive assumptions regarding outcome distributions.</p><p><strong>Methods: </strong>This paper proposes to use a recently introduced class of parametric additive hazard models to compute time-constant absolute effect measures for time-to-event outcomes. These models allow for a wide range of parametric distributions, overcoming the limitations of previous approaches. The approach provides a single, absolute effect size (e.g., hazard difference or NNT) summarizing the effect over the entire study duration. We illustrate this method using digitized Kaplan-Meier data from the EMPA-REG OUTCOME trial, focusing on all-cause mortality, and fit six different parametric distributions (exponential, linear hazard rate, Weibull, log-logistic, Gompertz, and Gamma-Gompertz).</p><p><strong>Results: </strong>Despite notable differences in model fit across the six distributions, the estimated rate differences, corresponding NNTs, and their confidence intervals were remarkably similar. The linear hazard rate and Gompertz models, which provided the best fit according to the BIC, yielded a rate difference of -8.8 per 1,000 person-years, with an NNT of 114. These models also demonstrated increasing hazards, aligning with expectations for all-cause mortality. The estimated modes of the distributions from the best-fitting models (10.4 and 13.0 years) were more plausible than those from simpler models.</p><p><strong>Conclusions: </strong>The class of parametric additive hazard models offers a valuable tool for calculating time-constant absolute effect measures for time-to-event outcomes. This approach effectively addresses the issues of time-dependence and limited distribution flexibility, providing a single, interpretable absolute effect size. Future work could explore more general distributions and further derivation of absolute effect measures on the time scale.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":" ","pages":"10"},"PeriodicalIF":3.4,"publicationDate":"2025-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12821884/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145773489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unmeasured confounding and misclassification in studies estimating vaccine effectiveness against hospitalisation and death using electronic health records (EHRs): an evaluation of a multi-country European retrospective cohort study. 使用电子健康记录(EHRs)估计疫苗对住院和死亡有效性的研究中未测量的混淆和错误分类:对欧洲多国回顾性队列研究的评价。
IF 3.4 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-17 DOI: 10.1186/s12874-025-02742-8
James Humphreys, Nathalie Nicolay, Toon Braeye, Izaak Van Evercooren, Christian Holm Hansen, Ida Rask Moustsen-Helms, Chiara Sacco, Alberto Mateo-Urdiales, Jesús Castilla, Iván Martínez-Baz, Ausenda Machado, Patricia Soares, Brechje de Gier, Hinta Meijerink, Susana Monge, Sabrina Bacci, Baltazar Nunes

Background: Electronic health record (EHR)-based observational studies can rapidly provide real-world data on vaccine effectiveness (VE), though EHR data may be prone to misclassification and unmeasured confounding.

Methods: In VEBIS-EHR, a retrospective multi-country COVID-19 VE cohort study, we examined unmeasured confounding using a negative control outcome (death not related to COVID-19) and misclassification due to timing of data extraction. The evaluation spanned two periods (November-December 2023, January-February 2024), encompassing up to 18.7 million individuals across six EU/EEA countries. Vaccine confounding-adjusted hazard ratios (aHRs) were pooled using random-effects meta-analysis.

Results: aHRs against non-COVID-19 mortality ranged from 0.35 (95% CI: 0.28-0.44) to 0.70 (0.66-0.73) when comparing vaccinated versus unvaccinated. Delaying EHR data extraction modestly increased the capture of outcome and exposure events, with some variation by vaccination status. Site-level fluctuations in aHRs did not meaningfully alter the overall pooled VE, suggesting stable estimates despite misclassification related to extraction timing.

Conclusions: We observed some evidence of unmeasured confounding when using non-COVID-19 deaths as a negative outcome, though the specificity of our negative control must be considered. This result may suggest overestimation of VE, but also the need for further analysis with more specific negative control outcomes and confounding-adjustment techniques. Addressing such confounding using richer data sources and more refined approaches remains critical to ensure accurate, timely VE estimates based on retrospective cohorts constructed using registry data. Extending the delay between the end of observation and data extraction modestly improves the completeness of exposure and outcome data, with limited effect on pooled VE estimates.

背景:基于电子健康记录(EHR)的观察性研究可以快速提供关于疫苗有效性(VE)的真实数据,尽管电子健康记录数据可能容易出现错误分类和无法测量的混淆。方法:在VEBIS-EHR(一项回顾性多国COVID-19 VE队列研究)中,我们使用阴性对照结果(与COVID-19无关的死亡)和由于数据提取时间导致的错误分类来检查未测量的混杂因素。评估跨越两个时期(2023年11月至12月,2024年1月至2月),涵盖了六个欧盟/欧洲经济区国家的1870万人。采用随机效应荟萃分析对疫苗混杂校正风险比(aHRs)进行汇总。结果:当比较接种疫苗和未接种疫苗时,非covid -19死亡率的ahr范围为0.35 (95% CI: 0.28-0.44)至0.70(0.66-0.73)。延迟EHR数据提取适度地增加了结果和暴露事件的捕获,并因疫苗接种状况而有所变化。ahr的位点水平波动并没有改变总体汇总的VE,这表明尽管与提取时间相关的错误分类,但估计是稳定的。结论:当使用非covid -19死亡作为阴性结果时,我们观察到一些无法测量的混杂证据,尽管必须考虑阴性对照的特异性。这一结果可能表明对VE的高估,但也需要进一步分析更具体的阴性对照结果和混杂调整技术。使用更丰富的数据源和更精细的方法来解决这些混淆问题,对于确保基于使用注册表数据构建的回顾性队列的准确、及时的VE估计仍然至关重要。延长观察结束和数据提取之间的延迟适度提高了暴露和结果数据的完整性,但对汇总VE估计的影响有限。
{"title":"Unmeasured confounding and misclassification in studies estimating vaccine effectiveness against hospitalisation and death using electronic health records (EHRs): an evaluation of a multi-country European retrospective cohort study.","authors":"James Humphreys, Nathalie Nicolay, Toon Braeye, Izaak Van Evercooren, Christian Holm Hansen, Ida Rask Moustsen-Helms, Chiara Sacco, Alberto Mateo-Urdiales, Jesús Castilla, Iván Martínez-Baz, Ausenda Machado, Patricia Soares, Brechje de Gier, Hinta Meijerink, Susana Monge, Sabrina Bacci, Baltazar Nunes","doi":"10.1186/s12874-025-02742-8","DOIUrl":"10.1186/s12874-025-02742-8","url":null,"abstract":"<p><strong>Background: </strong>Electronic health record (EHR)-based observational studies can rapidly provide real-world data on vaccine effectiveness (VE), though EHR data may be prone to misclassification and unmeasured confounding.</p><p><strong>Methods: </strong>In VEBIS-EHR, a retrospective multi-country COVID-19 VE cohort study, we examined unmeasured confounding using a negative control outcome (death not related to COVID-19) and misclassification due to timing of data extraction. The evaluation spanned two periods (November-December 2023, January-February 2024), encompassing up to 18.7 million individuals across six EU/EEA countries. Vaccine confounding-adjusted hazard ratios (aHRs) were pooled using random-effects meta-analysis.</p><p><strong>Results: </strong>aHRs against non-COVID-19 mortality ranged from 0.35 (95% CI: 0.28-0.44) to 0.70 (0.66-0.73) when comparing vaccinated versus unvaccinated. Delaying EHR data extraction modestly increased the capture of outcome and exposure events, with some variation by vaccination status. Site-level fluctuations in aHRs did not meaningfully alter the overall pooled VE, suggesting stable estimates despite misclassification related to extraction timing.</p><p><strong>Conclusions: </strong>We observed some evidence of unmeasured confounding when using non-COVID-19 deaths as a negative outcome, though the specificity of our negative control must be considered. This result may suggest overestimation of VE, but also the need for further analysis with more specific negative control outcomes and confounding-adjustment techniques. Addressing such confounding using richer data sources and more refined approaches remains critical to ensure accurate, timely VE estimates based on retrospective cohorts constructed using registry data. Extending the delay between the end of observation and data extraction modestly improves the completeness of exposure and outcome data, with limited effect on pooled VE estimates.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":" ","pages":"9"},"PeriodicalIF":3.4,"publicationDate":"2025-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12822061/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145773494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel statistical feature selection framework for biomarker discovery and cancer classification via multiomics integration. 基于多组学整合的生物标志物发现和癌症分类的新统计特征选择框架。
IF 3.4 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-17 DOI: 10.1186/s12874-025-02713-z
Moshira S Ghaleb, Maryam N Al-Berry, Hala M Ebied, Mohamed F Tolba

Background: Early cancer diagnosis is essential for improving prognosis and guiding treatment. However, the high dimensionality and complexity of omics data present major challenges. Computational approaches that extract stable biomarkers and enable reliable classification across cancer types and stages are needed.

Methods: A novel feature selection method, sDCFE (synergistic Discriminative Cluster-based Feature Extraction), was developed by extending Fisher-like variance analysis with a median absolute deviation (MAD) regularization term and a cluster separation component to enhance robustness and interpretability. Features selected by sDCFE were compared with those obtained from XGBoost, and the intersected set of 82 genes was evaluated through functional enrichment (KEGG, Reactome, GO BP), survival analysis (Kaplan-Meier, Cox regression), and biomarker novelty assessment against six external resources. Hybrid classification models integrating XGBoost, sDCFE, and deep learning were applied to pancancer classification, and the framework was further extended to lung squamous cell carcinoma (LUSC) staging using RNA-seq and methylation data.

Results: The overlap between sDCFE and XGBoost yielded 82 candidate biomarkers enriched in cancer-related pathways, including cell cycle regulation, immune signalling, and DNA repair. Novelty assessment stratified these genes into established, emerging, and novel categories. Six genes-HFE2, LOC339674, SERINC2, SFTA3, SOX2OT, and ACPP-emerged as the most promising candidates, supported by enrichment and survival associations across multiple cancers. The hybrid model achieved near-perfect pancancer classification on TCGA (accuracy = 99.3%, MCC = 0.992, AUC = 1.0) and demonstrated strong generalizability on PCAWG (accuracy = 94%, MCC = 0.929, AUC = 0.997). In the LUSC staging task, multiomics integration improved classification performance: the CNN-based model reached 84% accuracy, while logistic regression applied to sDCFE-ranked features achieved 88.5% accuracy with superior calibration, highlighting the robustness of the selected features.

Conclusion: sDCFE provides a principled extension of Fisher-like methods, enabling stable and interpretable biomarker selection. When combined with XGBoost and deep learning, the framework achieves highly accurate and biologically grounded cancer classification across both cancer types and stages. The identification of novel and prognostic biomarkers, including HFE2, LOC339674, SERINC2, SFTA3, SOX2OT, and ACPP, underscores its translational potential. These results position the framework as a promising precision oncology tool to support early diagnosis, risk stratification, and treatment decision-making.

背景:肿瘤早期诊断对改善预后和指导治疗至关重要。然而,组学数据的高维度和复杂性提出了主要挑战。需要能够提取稳定的生物标志物并实现跨癌症类型和阶段的可靠分类的计算方法。方法:提出了一种新的特征选择方法sDCFE (synergistic Discriminative clustering based feature Extraction,基于协同判别聚类的特征提取),该方法将Fisher-like方差分析扩展为中位数绝对偏差(MAD)正则化项和聚类分离分量,以增强鲁棒性和可解释性。将sDCFE选择的特征与XGBoost获得的特征进行比较,并通过功能富集(KEGG, Reactome, GO BP),生存分析(Kaplan-Meier, Cox回归)和针对六个外部资源的生物标志物新颖性评估来评估82个交叉基因集。结合XGBoost、sDCFE和深度学习的混合分类模型应用于癌症分类,并利用RNA-seq和甲基化数据将该框架进一步扩展到肺鳞状细胞癌(LUSC)分期。结果:sDCFE和XGBoost之间的重叠产生了82个富集于癌症相关途径的候选生物标志物,包括细胞周期调节、免疫信号传导和DNA修复。新颖性评估将这些基因分为已建立的、新兴的和新颖的三类。六个基因——hfe2、LOC339674、SERINC2、SFTA3、SOX2OT和acpp——成为最有希望的候选基因,它们在多种癌症中的富集和生存相关性得到了支持。混合模型在TCGA上实现了近乎完美的胰腺癌分类(准确率为99.3%,MCC = 0.992, AUC = 1.0),在PCAWG上具有较强的泛化性(准确率为94%,MCC = 0.929, AUC = 0.997)。在LUSC分期任务中,多组学集成提高了分类性能:基于cnn的模型准确率达到84%,而逻辑回归应用于sdcfe排序特征的准确率达到88.5%,校正效果较好,突出了所选特征的鲁棒性。结论:sDCFE提供了Fisher-like方法的原则性扩展,实现了稳定和可解释的生物标志物选择。当与XGBoost和深度学习相结合时,该框架可以实现跨癌症类型和阶段的高度准确和基于生物学的癌症分类。新的预后生物标志物的鉴定,包括HFE2、LOC339674、SERINC2、SFTA3、SOX2OT和ACPP,强调了其转化潜力。这些结果将该框架定位为一种有前景的精确肿瘤学工具,以支持早期诊断、风险分层和治疗决策。
{"title":"A novel statistical feature selection framework for biomarker discovery and cancer classification via multiomics integration.","authors":"Moshira S Ghaleb, Maryam N Al-Berry, Hala M Ebied, Mohamed F Tolba","doi":"10.1186/s12874-025-02713-z","DOIUrl":"10.1186/s12874-025-02713-z","url":null,"abstract":"<p><strong>Background: </strong>Early cancer diagnosis is essential for improving prognosis and guiding treatment. However, the high dimensionality and complexity of omics data present major challenges. Computational approaches that extract stable biomarkers and enable reliable classification across cancer types and stages are needed.</p><p><strong>Methods: </strong>A novel feature selection method, sDCFE (synergistic Discriminative Cluster-based Feature Extraction), was developed by extending Fisher-like variance analysis with a median absolute deviation (MAD) regularization term and a cluster separation component to enhance robustness and interpretability. Features selected by sDCFE were compared with those obtained from XGBoost, and the intersected set of 82 genes was evaluated through functional enrichment (KEGG, Reactome, GO BP), survival analysis (Kaplan-Meier, Cox regression), and biomarker novelty assessment against six external resources. Hybrid classification models integrating XGBoost, sDCFE, and deep learning were applied to pancancer classification, and the framework was further extended to lung squamous cell carcinoma (LUSC) staging using RNA-seq and methylation data.</p><p><strong>Results: </strong>The overlap between sDCFE and XGBoost yielded 82 candidate biomarkers enriched in cancer-related pathways, including cell cycle regulation, immune signalling, and DNA repair. Novelty assessment stratified these genes into established, emerging, and novel categories. Six genes-HFE2, LOC339674, SERINC2, SFTA3, SOX2OT, and ACPP-emerged as the most promising candidates, supported by enrichment and survival associations across multiple cancers. The hybrid model achieved near-perfect pancancer classification on TCGA (accuracy = 99.3%, MCC = 0.992, AUC = 1.0) and demonstrated strong generalizability on PCAWG (accuracy = 94%, MCC = 0.929, AUC = 0.997). In the LUSC staging task, multiomics integration improved classification performance: the CNN-based model reached 84% accuracy, while logistic regression applied to sDCFE-ranked features achieved 88.5% accuracy with superior calibration, highlighting the robustness of the selected features.</p><p><strong>Conclusion: </strong>sDCFE provides a principled extension of Fisher-like methods, enabling stable and interpretable biomarker selection. When combined with XGBoost and deep learning, the framework achieves highly accurate and biologically grounded cancer classification across both cancer types and stages. The identification of novel and prognostic biomarkers, including HFE2, LOC339674, SERINC2, SFTA3, SOX2OT, and ACPP, underscores its translational potential. These results position the framework as a promising precision oncology tool to support early diagnosis, risk stratification, and treatment decision-making.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":" ","pages":"11"},"PeriodicalIF":3.4,"publicationDate":"2025-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12822226/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145773529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing interpretability for Bayesian basket trial designs by effective sample size. 通过有效样本量提高贝叶斯篮子试验设计的可解释性。
IF 3.4 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-16 DOI: 10.1186/s12874-025-02715-x
Xin Chen, Jingyi Zhang, Wenyun Yang, Liyun Jiang, Bosheng Li, Fangrong Yan

Background: There is growing interest in utilizing Bayesian approaches to borrow information across tumor types in basket trials. Several innovative designs, primarily extensions of the Bayesian hierarchical model (BHM), have been proposed to dynamically borrow information based on observed data. However, there is no recognized solution to quantify the degree of information borrowing in such a context, posing a great challenge for non-statisticians to understand these complex designs.

Methods: The tool of effective sample size (ESS) is leveraged to Bayesian basket trials and several ESS-based borrowing strategies are proposed. The mean squared error (MSE), which explicitly accounts for the trade-off between estimation bias and variance reduction, is selected as the target measure for deriving ESS. Through a reanalysis of the RAGNAR study as well as simulation studies, the interpretability of ESS is demonstrated at both the analysis and design stages of basket trials.

Results: ESS reflects the impact of information borrowing on MSE and intuitively characterizes the degree of borrowing. It aligns with the type I error rate and power, showing potential as a valuable complement in statistical analyses and simulation studies.

Conclusions: Quantifying the degree of information borrowing by ESS can greatly help trialists design Bayesian basket trials, reasonably evaluate and interpret the results of Bayesian analyses, conduct sensitivity analyses, and ultimately borrow proper amount of information in basket trials.

背景:在篮子试验中利用贝叶斯方法来获取肿瘤类型信息的兴趣越来越大。一些创新的设计,主要是贝叶斯层次模型(BHM)的扩展,已经提出了基于观测数据的动态借用信息。然而,在这种情况下,没有公认的解决方案来量化信息借用的程度,这对非统计学家理解这些复杂的设计构成了巨大的挑战。方法:将有效样本量(ESS)工具运用到贝叶斯篮子试验中,并提出几种基于有效样本量的借鉴策略。选择均方误差(MSE)作为推导ESS的目标度量,它明确地说明了估计偏差和方差减少之间的权衡。通过对RAGNAR研究和模拟研究的再分析,在篮子试验的分析和设计阶段都证明了ESS的可解释性。结果:ESS反映了信息借用对MSE的影响,直观地表征了信息借用的程度。它与I型错误率和功率一致,显示出在统计分析和模拟研究中作为有价值的补充的潜力。结论:通过ESS量化信息借用程度,可以极大地帮助试验人员设计贝叶斯篮子试验,合理评价和解释贝叶斯分析结果,进行敏感性分析,最终在篮子试验中借用适量的信息。
{"title":"Enhancing interpretability for Bayesian basket trial designs by effective sample size.","authors":"Xin Chen, Jingyi Zhang, Wenyun Yang, Liyun Jiang, Bosheng Li, Fangrong Yan","doi":"10.1186/s12874-025-02715-x","DOIUrl":"10.1186/s12874-025-02715-x","url":null,"abstract":"<p><strong>Background: </strong>There is growing interest in utilizing Bayesian approaches to borrow information across tumor types in basket trials. Several innovative designs, primarily extensions of the Bayesian hierarchical model (BHM), have been proposed to dynamically borrow information based on observed data. However, there is no recognized solution to quantify the degree of information borrowing in such a context, posing a great challenge for non-statisticians to understand these complex designs.</p><p><strong>Methods: </strong>The tool of effective sample size (ESS) is leveraged to Bayesian basket trials and several ESS-based borrowing strategies are proposed. The mean squared error (MSE), which explicitly accounts for the trade-off between estimation bias and variance reduction, is selected as the target measure for deriving ESS. Through a reanalysis of the RAGNAR study as well as simulation studies, the interpretability of ESS is demonstrated at both the analysis and design stages of basket trials.</p><p><strong>Results: </strong>ESS reflects the impact of information borrowing on MSE and intuitively characterizes the degree of borrowing. It aligns with the type I error rate and power, showing potential as a valuable complement in statistical analyses and simulation studies.</p><p><strong>Conclusions: </strong>Quantifying the degree of information borrowing by ESS can greatly help trialists design Bayesian basket trials, reasonably evaluate and interpret the results of Bayesian analyses, conduct sensitivity analyses, and ultimately borrow proper amount of information in basket trials.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"25 1","pages":"278"},"PeriodicalIF":3.4,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12709804/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145767078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A clustering-stratified cross-validation framework for validating omics survival models: application to head and neck cancer. 用于验证组学生存模型的聚类分层交叉验证框架:头颈癌的应用。
IF 3.4 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-16 DOI: 10.1186/s12874-025-02709-9
Antoine Dubray-Vautrin, Olivier Choussy, Constance Lamy, Grégoire Marret, Joey Martin, Jerzy Klijanienko, Sophie Vacher, Ladidi Ahmanache, Ivan Bieche, Célia Dupain, Christophe Le Tourneau, Jimmy Mullaert
{"title":"A clustering-stratified cross-validation framework for validating omics survival models: application to head and neck cancer.","authors":"Antoine Dubray-Vautrin, Olivier Choussy, Constance Lamy, Grégoire Marret, Joey Martin, Jerzy Klijanienko, Sophie Vacher, Ladidi Ahmanache, Ivan Bieche, Célia Dupain, Christophe Le Tourneau, Jimmy Mullaert","doi":"10.1186/s12874-025-02709-9","DOIUrl":"10.1186/s12874-025-02709-9","url":null,"abstract":"","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"25 1","pages":"277"},"PeriodicalIF":3.4,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12709853/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145767041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weighting strategy and selection analysis in the panel 'Health in Germany': methods and results for the 2024 annual survey. “德国健康”小组中的加权策略和选择分析:2024年年度调查的方法和结果。
IF 3.4 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-15 DOI: 10.1186/s12874-025-02740-w
Stefan Damerow, Ronny Kuhnert, Angelika Schaffrath Rosario, Johannes Lemcke

Background: The panel `Health in Germany` has been established to gather nationwide health-related information, replacing cross-sectional surveys as primary data sources. However, panel designs involve multiple selection stages, potentially introducing additional nonresponse bias. This study aims to describe this drop-out bias and the weighting strategy used to improve representativeness.

Methods: Panelists were recruited through a recruitment study. At completion of the recruitment questionnaire, participants were invited to register for the panel. Registered panelists were subsequently invited to the first annual survey in 2024, which was divided into three sub-waves. Each included one of four questionnaires with different topics. Logistic regression models are used to estimate the probability for panel registration and participation in panel 2024 questionnaires, using sociodemographic and health-related variables from the recruitment study to illustrate drop-out bias. The weighting scheme and techniques are described. To assess the potential for drop-out bias and how it is reduced through weighting, unweighted and weighted estimates are compared to internal and external reference distributions using standardized differences.

Results: Drop-out analysis showed that sociodemographic characteristics (age, education, German citizenship) had a stronger association with panel registration than health-related parameters (e.g., self-rated health, smoking status, sport activity, chronic diseases) among the participants of the recruitment study. Similar patterns as for registration were found for participation in the 2024 questionnaires, except for age, which showed a reverse effect. No differences were observed across questionnaire types. Standardized differences confirmed the findings: sociodemographic characteristics-particularly education and German citizenship-showed larger deviations than health-related parameters. The largest deviations occurred in the recruitment study. The weighting procedure reduced most standardized differences to below 0.5% points. An exception is German citizenship, which showed only slight improvement.

Conclusions: Drop-out within the first year of a newly established panel is mainly affected by sociodemographic variables, with minor effects due to health-related parameters. The additional recruitment steps did not lead to concerning deviations in sample composition compared to the recruitment study. Remaining differences were addressed through drop-out and calibration weighting, so the weighted panel 2024 sample does not substantially differ from what would be expected in a cross-sectional design such as the recruitment study. However, continued analyses are needed, as sample composition may change due to future panel attrition.

背景:设立了“德国健康状况”小组,以收集全国卫生相关信息,取代横断面调查作为主要数据来源。然而,面板设计涉及多个选择阶段,可能会引入额外的非反应偏差。本研究旨在描述这种退出偏差和加权策略,用于提高代表性。方法:通过招募研究招募小组成员。参加者填写招聘问卷后,便获邀报名参加小组讨论。注册小组成员随后被邀请参加2024年的第一次年度调查,该调查分为三次。每个都包括四份不同主题的问卷中的一份。使用逻辑回归模型来估计小组注册和参与小组2024问卷的概率,使用招募研究中的社会人口统计学和健康相关变量来说明退出偏倚。介绍了加权方案和加权技术。为了评估退出偏倚的可能性以及如何通过加权来减少它,使用标准化差异将未加权和加权估计与内部和外部参考分布进行比较。结果:退出分析表明,在招募研究的参与者中,社会人口学特征(年龄、教育程度、德国国籍)与小组登记的关联强于与健康相关参数(例如,自评健康、吸烟状况、体育活动、慢性病)的关联。2024份问卷的参与情况与登记情况相似,但年龄表现出相反的效果。问卷类型之间没有差异。标准化差异证实了研究结果:社会人口学特征——尤其是教育和德国国籍——比健康相关参数显示出更大的偏差。最大的偏差发生在招募研究中。加权程序将大多数标准化差异降低到0.5%以下。德国国籍是个例外,仅略有改善。结论:新建立的小组第一年的退学主要受社会人口变量的影响,健康相关参数的影响较小。与招募研究相比,额外的招募步骤没有导致样本组成的相关偏差。剩余的差异通过退出和校准加权来解决,因此加权面板2024样本与招聘研究等横截面设计中的预期没有实质性差异。然而,需要继续分析,因为样品成分可能会因未来面板磨损而改变。
{"title":"Weighting strategy and selection analysis in the panel 'Health in Germany': methods and results for the 2024 annual survey.","authors":"Stefan Damerow, Ronny Kuhnert, Angelika Schaffrath Rosario, Johannes Lemcke","doi":"10.1186/s12874-025-02740-w","DOIUrl":"10.1186/s12874-025-02740-w","url":null,"abstract":"<p><strong>Background: </strong>The panel `Health in Germany` has been established to gather nationwide health-related information, replacing cross-sectional surveys as primary data sources. However, panel designs involve multiple selection stages, potentially introducing additional nonresponse bias. This study aims to describe this drop-out bias and the weighting strategy used to improve representativeness.</p><p><strong>Methods: </strong>Panelists were recruited through a recruitment study. At completion of the recruitment questionnaire, participants were invited to register for the panel. Registered panelists were subsequently invited to the first annual survey in 2024, which was divided into three sub-waves. Each included one of four questionnaires with different topics. Logistic regression models are used to estimate the probability for panel registration and participation in panel 2024 questionnaires, using sociodemographic and health-related variables from the recruitment study to illustrate drop-out bias. The weighting scheme and techniques are described. To assess the potential for drop-out bias and how it is reduced through weighting, unweighted and weighted estimates are compared to internal and external reference distributions using standardized differences.</p><p><strong>Results: </strong>Drop-out analysis showed that sociodemographic characteristics (age, education, German citizenship) had a stronger association with panel registration than health-related parameters (e.g., self-rated health, smoking status, sport activity, chronic diseases) among the participants of the recruitment study. Similar patterns as for registration were found for participation in the 2024 questionnaires, except for age, which showed a reverse effect. No differences were observed across questionnaire types. Standardized differences confirmed the findings: sociodemographic characteristics-particularly education and German citizenship-showed larger deviations than health-related parameters. The largest deviations occurred in the recruitment study. The weighting procedure reduced most standardized differences to below 0.5% points. An exception is German citizenship, which showed only slight improvement.</p><p><strong>Conclusions: </strong>Drop-out within the first year of a newly established panel is mainly affected by sociodemographic variables, with minor effects due to health-related parameters. The additional recruitment steps did not lead to concerning deviations in sample composition compared to the recruitment study. Remaining differences were addressed through drop-out and calibration weighting, so the weighted panel 2024 sample does not substantially differ from what would be expected in a cross-sectional design such as the recruitment study. However, continued analyses are needed, as sample composition may change due to future panel attrition.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":" ","pages":"8"},"PeriodicalIF":3.4,"publicationDate":"2025-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12822093/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145762275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Demystifying inconsistent two-sample mendelian randomization estimations using selection diagram. 用选择图揭开不一致的两样本孟德尔随机化估计的神秘面纱。
IF 3.4 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-12 DOI: 10.1186/s12874-025-02707-x
Lei Hou, Yuanyuan Yu, Zhi Geng, Fuzhong Xue, Hongkai Li

Two-Sample Mendelian Randomization (TSMR) analysis is a widely used method for inferring causal effect in the presence of unmeasured confounding. However, causal inferences may be biased if the distributions of key variables (e.g., exposures, outcomes, and confounders) differ across populations. Such discrepancies in the distributions of key variables between the two populations are referred to as different local mechanisms. This paper aims to clarify the impact of different local mechanisms on the estimation of the Local Average Treatment Effect (LATE) in TSMR analyses using selection diagrams. We first uncover and formally define the Complete and Partial Inconsistent TSMR Estimations (InTSMRE). Subsequently, we propose a criterion of No InTSMRE in the context of continuous and binary outcomes. Following this, we introduce the LATE Ratio to evaluate the deviation of the LATE estimate from the true causal effect. Finally, we demonstrate that the violation of the Monotonicity condition exacerbates the occurrences of the Complete InTSMRE; otherwise only the Partial InTSMRE occurs. Additionally, through simulation studies, we illustrate the specific conditions under which these InTSMRE arise. We explore the LATEs of Waist-to-hip ratio on Type 2 diabetes in European and mixed populations, demonstrating the phenomenon of the InTSMRE.

双样本孟德尔随机化(TSMR)分析是一种广泛使用的方法,用于推断存在未测量混杂的因果效应。然而,如果关键变量(例如,暴露、结果和混杂因素)的分布在人群中不同,则因果推断可能存在偏差。两个种群之间关键变量分布的这种差异被称为不同的局部机制。本文旨在利用选择图阐明不同的局部机制对TSMR分析中局部平均处理效应(LATE)估计的影响。我们首先揭示并正式定义了完全和部分不一致TSMR估计(InTSMRE)。随后,我们提出了在连续和二元结果背景下的无InTSMRE准则。接下来,我们引入LATE比率来评估LATE估计与真实因果效应的偏差。最后,我们证明了单调性条件的违反加剧了完全InTSMRE的发生;否则只有局部InTSMRE发生。此外,通过模拟研究,我们说明了这些InTSMRE产生的具体条件。我们研究了欧洲和混合人群中腰臀比对2型糖尿病的影响,证明了InTSMRE现象。
{"title":"Demystifying inconsistent two-sample mendelian randomization estimations using selection diagram.","authors":"Lei Hou, Yuanyuan Yu, Zhi Geng, Fuzhong Xue, Hongkai Li","doi":"10.1186/s12874-025-02707-x","DOIUrl":"10.1186/s12874-025-02707-x","url":null,"abstract":"<p><p>Two-Sample Mendelian Randomization (TSMR) analysis is a widely used method for inferring causal effect in the presence of unmeasured confounding. However, causal inferences may be biased if the distributions of key variables (e.g., exposures, outcomes, and confounders) differ across populations. Such discrepancies in the distributions of key variables between the two populations are referred to as different local mechanisms. This paper aims to clarify the impact of different local mechanisms on the estimation of the Local Average Treatment Effect (LATE) in TSMR analyses using selection diagrams. We first uncover and formally define the Complete and Partial Inconsistent TSMR Estimations (InTSMRE). Subsequently, we propose a criterion of No InTSMRE in the context of continuous and binary outcomes. Following this, we introduce the LATE Ratio to evaluate the deviation of the LATE estimate from the true causal effect. Finally, we demonstrate that the violation of the Monotonicity condition exacerbates the occurrences of the Complete InTSMRE; otherwise only the Partial InTSMRE occurs. Additionally, through simulation studies, we illustrate the specific conditions under which these InTSMRE arise. We explore the LATEs of Waist-to-hip ratio on Type 2 diabetes in European and mixed populations, demonstrating the phenomenon of the InTSMRE.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":" ","pages":"22"},"PeriodicalIF":3.4,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12859973/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145740937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Qualitative dyadic analysis in care partnership research: a scoping review. 护理伙伴关系研究中的定性二元分析:范围综述。
IF 3.4 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-11 DOI: 10.1186/s12874-025-02722-y
Andrea S E Parks, Lesley Gotlib Conn, Bahar Aria, Manisha Reza Paul, Allan Li, Agessandro Abrahao, Lorne Zinman, Joanna E M Sale

Background: Chronic illness impacts not only individuals affected by it, but also those who care for them. Care partnerships recognize that health conditions are often shared, dyadic experiences. Qualitative dyadic analysis, which foregrounds the dyad as the unit of analysis, is a method that can enhance understanding of illness as a joint experience. However, when perspectives of dyad members are collected separately, their subsequent analysis as a unit can be challenging.

Objective: To review and summarize qualitative literature where data have been collected through separate individual interviews with patient and care partner dyads and analyzed at the dyadic level.

Methods: A scoping review guided by Joanna Briggs Institute methodology was undertaken. Databases (Ovid's Medline, Embase, and PsycINFO; EBSCO CINAHL; and ProQuest Sociological Abstracts) were searched in February 2024. Eligible articles included peer-reviewed literature published in English from 2010 onwards documenting qualitative dyadic analysis of individual interviews collected from patient and care partner dyads. Title and abstracts were screened and the full text of all potentially eligible articles was reviewed by two independent reviewers. Data were extracted using a table and results were summarized using frequency counts and qualitative content analysis.

Results: 7,494 records were identified and screened. 113 reports of 112 unique studies fulfilled eligibility criteria and were included. Numerous methodologies and analytic methods were reported, many of which incorporated methods from different qualitative traditions, often with variable sequencing of analytic steps that were infrequently well described. Studies were not routinely conceptualized at the dyadic level and underlying epistemological assumptions were rarely discussed despite their essential role in grounding dyadic analysis.

Conclusions: When conducting qualitative dyadic analysis, researchers should consider dyadic study conceptualization from study outset. The purpose of the analysis, the analytic steps taken, and their alignment with underlying epistemology and other incorporated methodologies should be clearly documented and reported.

背景:慢性疾病不仅影响受其影响的个人,也影响那些照顾他们的人。保健伙伴关系认识到,健康状况往往是共同的、双重的经历。定性的二元分析,将二元作为分析的单位,是一种可以增强对疾病作为一种共同经验的理解的方法。然而,当两组成员的观点被单独收集时,他们作为一个整体的后续分析可能具有挑战性。目的:回顾和总结定性文献中收集的数据,这些数据是通过对患者和护理伙伴的单独访谈收集的,并在二元水平上进行分析。方法:采用乔安娜布里格斯研究所的方法进行范围审查。数据库(Ovid's Medline, Embase和PsycINFO; EBSCO CINAHL;和ProQuest社会学摘要)于2024年2月检索。符合条件的文章包括2010年以来发表的同行评议的英文文献,记录了从患者和护理伙伴中收集的个体访谈的定性二元分析。对标题和摘要进行筛选,并由两名独立审稿人对所有可能符合条件的文章的全文进行审查。使用表格提取数据,并使用频率计数和定性内容分析对结果进行总结。结果:共筛选出7494条记录。112项独特研究的113份报告符合入选标准。报告了许多方法论和分析方法,其中许多结合了来自不同定性传统的方法,通常具有不同的分析步骤顺序,这些步骤很少被很好地描述。研究通常不会在二元水平上概念化,并且很少讨论潜在的认识论假设,尽管它们在二元分析的基础中起着重要作用。结论:在进行定性二元分析时,研究者应从研究一开始就考虑二元研究的概念化。分析的目的,所采取的分析步骤,以及它们与基本认识论和其他综合方法的一致性,应该清楚地记录和报告。
{"title":"Qualitative dyadic analysis in care partnership research: a scoping review.","authors":"Andrea S E Parks, Lesley Gotlib Conn, Bahar Aria, Manisha Reza Paul, Allan Li, Agessandro Abrahao, Lorne Zinman, Joanna E M Sale","doi":"10.1186/s12874-025-02722-y","DOIUrl":"10.1186/s12874-025-02722-y","url":null,"abstract":"<p><strong>Background: </strong>Chronic illness impacts not only individuals affected by it, but also those who care for them. Care partnerships recognize that health conditions are often shared, dyadic experiences. Qualitative dyadic analysis, which foregrounds the dyad as the unit of analysis, is a method that can enhance understanding of illness as a joint experience. However, when perspectives of dyad members are collected separately, their subsequent analysis as a unit can be challenging.</p><p><strong>Objective: </strong>To review and summarize qualitative literature where data have been collected through separate individual interviews with patient and care partner dyads and analyzed at the dyadic level.</p><p><strong>Methods: </strong>A scoping review guided by Joanna Briggs Institute methodology was undertaken. Databases (Ovid's Medline, Embase, and PsycINFO; EBSCO CINAHL; and ProQuest Sociological Abstracts) were searched in February 2024. Eligible articles included peer-reviewed literature published in English from 2010 onwards documenting qualitative dyadic analysis of individual interviews collected from patient and care partner dyads. Title and abstracts were screened and the full text of all potentially eligible articles was reviewed by two independent reviewers. Data were extracted using a table and results were summarized using frequency counts and qualitative content analysis.</p><p><strong>Results: </strong>7,494 records were identified and screened. 113 reports of 112 unique studies fulfilled eligibility criteria and were included. Numerous methodologies and analytic methods were reported, many of which incorporated methods from different qualitative traditions, often with variable sequencing of analytic steps that were infrequently well described. Studies were not routinely conceptualized at the dyadic level and underlying epistemological assumptions were rarely discussed despite their essential role in grounding dyadic analysis.</p><p><strong>Conclusions: </strong>When conducting qualitative dyadic analysis, researchers should consider dyadic study conceptualization from study outset. The purpose of the analysis, the analytic steps taken, and their alignment with underlying epistemology and other incorporated methodologies should be clearly documented and reported.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":" ","pages":"7"},"PeriodicalIF":3.4,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12801508/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145721044","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
BMC Medical Research Methodology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1