首页 > 最新文献

BMC Medical Research Methodology最新文献

英文 中文
CRIME-Q-a unifying tool for critical appraisal of methodological (technical) quality, quality of reporting and risk of bias in animal research. crime - q是一个统一的工具,用于对动物研究中的方法(技术)质量、报告质量和偏倚风险进行批判性评估。
IF 3.9 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-12-19 DOI: 10.1186/s12874-024-02413-0
Mikkel Schou Andersen, Mikkel Seremet Kofoed, Asger Sand Paludan-Müller, Christian Bonde Pedersen, Tiit Mathiesen, Christian Mawrin, Birgitte Brinkmann Olsen, Bo Halle, Frantz Rom Poulsen

Background: Systematic reviews within the field of animal research are becoming more common. However, in animal translational research, issues related to methodological quality and quality of reporting continue to arise, potentially leading to underestimation or overestimation of the effects of interventions or prevent studies from being replicated. The various tools and checklists available to ensure good-quality studies and proper reporting include both unique and/or overlapping items and/or simply lack necessary elements or are too situational to certain conditions or diseases. Currently, there is no tool available, which covers all aspects of animal models, from bench-top activities to animal facilities, hence a new tool is needed. This tool should be designed to be able to assess all kinds of animal studies such as old, new, low quality, high quality, interventional and noninterventional on. It should do this on multiple levels through items on quality of reporting, methodological (technical) quality, and risk of bias, for use in assessing the overall quality of studies involving animal research.

Methods: During a systematic review of meningioma models in animals, we developed a novel unifying tool that can assess all types of animal studies from multiple perspectives. The tool was inspired by the Collaborative Approach to Meta Analysis and Review of Animal Data from Experimental Studies (CAMARADES) checklist, the ARRIVE 2.0 guidelines, and SYRCLE's risk of bias tool, while also incorporating unique items. We used the interrater agreement percentage and Cohen's kappa index to test the interrater agreement between two independent reviewers for the items in the tool.

Results: There was high interrater agreement across all items (92.9%, 95% CI 91.0-94.8). Cohen's kappa index showed quality of reporting had the best mean index of 0.86 (95%-CI 0.78-0.94), methodological quality had a mean index of 0.83 (95%-CI 0.78-0.94) and finally the items from SYRCLE's risk of bias had a mean kappa index of 0.68 (95%-CI 0.57-0.79).

Conclusions: The Critical Appraisal of Methodological (technical) Quality, Quality of Reporting and Risk of Bias in Animal Research (CRIME-Q) tool unifies a broad spectrum of information (both unique items and items inspired by other methods) about the quality of reporting and methodological (technical) quality, and contains items from SYRCLE's risk of bias. The tool is intended for use in assessing overall study quality across multiple domains and items and is not, unlike other tools, restricted to any particular model or study design (whether interventional or noninterventional). It is also easy to apply when designing and conducting animal experiments to ensure proper reporting and design in terms of replicability, transparency, and validity.

背景:动物研究领域的系统综述正变得越来越普遍。然而,在动物转化研究中,与方法质量和报告质量有关的问题继续出现,可能导致对干预措施效果的低估或高估,或阻止研究被复制。可用于确保高质量研究和适当报告的各种工具和检查表包括独特和/或重叠的项目和/或根本缺乏必要的要素,或对某些条件或疾病过于因时制宜。目前,还没有一种工具可以覆盖动物模型的各个方面,从工作台活动到动物设施,因此需要一种新的工具。该工具应设计成能够评估各种动物研究,如旧的、新的、低质量的、高质量的、介入性的和非介入性的。它应该通过报告质量、方法学(技术)质量和偏倚风险等项目在多个层面上进行评估,以用于评估涉及动物研究的总体质量。方法:在对动物脑膜瘤模型的系统回顾中,我们开发了一种新的统一工具,可以从多个角度评估所有类型的动物研究。该工具的灵感来自于《实验研究动物数据荟萃分析和回顾协作方法》(CAMARADES)清单、ARRIVE 2.0指南和sycle的偏倚风险工具,同时也纳入了独特的项目。我们使用仲裁者协议百分比和Cohen的kappa指数来测试两个独立的评论者对工具中项目的仲裁者协议。结果:所有条目的解释一致性很高(92.9%,95% CI 91.0-94.8)。Cohen的kappa指数显示,报告质量的平均指数为0.86 (95%-CI 0.78-0.94),方法学质量的平均指数为0.83 (95%-CI 0.78-0.94),最后来自sycle的偏倚风险项目的平均kappa指数为0.68 (95%-CI 0.57-0.79)。结论:动物研究中方法(技术)质量、报告质量和偏倚风险的关键评估(CRIME-Q)工具统一了关于报告质量和方法(技术)质量的广泛信息(包括独特的项目和受其他方法启发的项目),并包含来自sycle偏倚风险的项目。该工具旨在用于评估跨多个领域和项目的总体研究质量,与其他工具不同,它不限于任何特定的模型或研究设计(无论是介入性的还是非介入性的)。在设计和进行动物实验时也很容易应用,以确保在可复制性、透明度和有效性方面进行适当的报告和设计。
{"title":"CRIME-Q-a unifying tool for critical appraisal of methodological (technical) quality, quality of reporting and risk of bias in animal research.","authors":"Mikkel Schou Andersen, Mikkel Seremet Kofoed, Asger Sand Paludan-Müller, Christian Bonde Pedersen, Tiit Mathiesen, Christian Mawrin, Birgitte Brinkmann Olsen, Bo Halle, Frantz Rom Poulsen","doi":"10.1186/s12874-024-02413-0","DOIUrl":"10.1186/s12874-024-02413-0","url":null,"abstract":"<p><strong>Background: </strong>Systematic reviews within the field of animal research are becoming more common. However, in animal translational research, issues related to methodological quality and quality of reporting continue to arise, potentially leading to underestimation or overestimation of the effects of interventions or prevent studies from being replicated. The various tools and checklists available to ensure good-quality studies and proper reporting include both unique and/or overlapping items and/or simply lack necessary elements or are too situational to certain conditions or diseases. Currently, there is no tool available, which covers all aspects of animal models, from bench-top activities to animal facilities, hence a new tool is needed. This tool should be designed to be able to assess all kinds of animal studies such as old, new, low quality, high quality, interventional and noninterventional on. It should do this on multiple levels through items on quality of reporting, methodological (technical) quality, and risk of bias, for use in assessing the overall quality of studies involving animal research.</p><p><strong>Methods: </strong>During a systematic review of meningioma models in animals, we developed a novel unifying tool that can assess all types of animal studies from multiple perspectives. The tool was inspired by the Collaborative Approach to Meta Analysis and Review of Animal Data from Experimental Studies (CAMARADES) checklist, the ARRIVE 2.0 guidelines, and SYRCLE's risk of bias tool, while also incorporating unique items. We used the interrater agreement percentage and Cohen's kappa index to test the interrater agreement between two independent reviewers for the items in the tool.</p><p><strong>Results: </strong>There was high interrater agreement across all items (92.9%, 95% CI 91.0-94.8). Cohen's kappa index showed quality of reporting had the best mean index of 0.86 (95%-CI 0.78-0.94), methodological quality had a mean index of 0.83 (95%-CI 0.78-0.94) and finally the items from SYRCLE's risk of bias had a mean kappa index of 0.68 (95%-CI 0.57-0.79).</p><p><strong>Conclusions: </strong>The Critical Appraisal of Methodological (technical) Quality, Quality of Reporting and Risk of Bias in Animal Research (CRIME-Q) tool unifies a broad spectrum of information (both unique items and items inspired by other methods) about the quality of reporting and methodological (technical) quality, and contains items from SYRCLE's risk of bias. The tool is intended for use in assessing overall study quality across multiple domains and items and is not, unlike other tools, restricted to any particular model or study design (whether interventional or noninterventional). It is also easy to apply when designing and conducting animal experiments to ensure proper reporting and design in terms of replicability, transparency, and validity.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"306"},"PeriodicalIF":3.9,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11656974/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The use of sequential multiple assignment randomized trials (SMARTs) in physical activity interventions: a systematic review. 在体育锻炼干预中使用连续多次分配随机试验(SMART):系统综述。
IF 3.9 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-12-19 DOI: 10.1186/s12874-024-02439-4
Aoife Whiston, K M Kidwell, S O'Reilly, C Walsh, J C Walsh, L Glynn, K Robinson, S Hayes

Background: Physical activity (PA) is often the cornerstone in risk-reduction interventions for the prevention and treatment of many chronic health conditions. PA interventions are inherently multi-dimensional and complex in nature. Thus, study designs used in the evaluation of PA interventions must be adaptive to intervention components and individual capacities. A Sequential Multiple Assignment Randomised Trial (SMART) is a factorial design in a sequential setting used to build effective adaptive interventions. SMARTs represent a relatively new design for PA intervention research. This systematic review aims to examine the state-of-the-art of SMARTs used to develop PA interventions, with a focus on study characteristics, design, and analyses.

Methods: PubMed, Embase, PsychINFO, CENTRAL, and CinAHL were systematically searched through May 2023 for studies wherein PA SMARTs were conducted. Methodological quality was assessed using the Cochrane Risk of Bias 2 Tool.

Results: Twenty studies across a variety of populations - e.g., obesity, chronic pain, and cardiovascular conditions, were included. All PA SMARTs involved two decision stages, with the majority including two initial treatment options. PA interventions most commonly consisted of individual aerobic exercise with strategies such as goal setting, wearable technology, and motivational interviewing also used to promote PA. Variation was observed across tailoring variables and timing of tailoring variables. Non-response strategies primarily involved augmenting and switching treatment options, and for responders to continue with initial treatment options. For analyses, most sample size estimations and outcome analyses accounted for the SMART aims specified. Techniques such as linear mixed models, weighted regressions, and Q-learning regression were frequently used. Risk of bias was high across the majority of included studies.

Conclusions: Individual-based aerobic exercise interventions supported by behaviour change techniques and wearable sensing technology may play a key role in the future development of SMARTs addressing PA intervention development. Clearer rationale for the selection of tailoring variables, timing of tailoring variables, and included measures is essential to advance PA SMART designs. Collaborative efforts from researchers, clinicians, and patients are needed in order to bridge the gap between adaptive research designs and personalised treatment pathways observed in clinical practice.

背景:身体活动(PA)往往是预防和治疗许多慢性健康状况的降低风险干预措施的基石。PA干预措施本质上是多维的和复杂的。因此,用于评估PA干预措施的研究设计必须适应干预成分和个人能力。顺序多任务随机试验(SMART)是在顺序设置中用于构建有效适应性干预措施的析因设计。smart代表了PA干预研究的一种相对较新的设计。本系统综述旨在研究用于开发PA干预措施的smart的最新技术,重点关注研究特征、设计和分析。方法:系统检索PubMed、Embase、PsychINFO、CENTRAL和CinAHL到2023年5月进行PA smart的研究。采用Cochrane风险偏倚2工具评估方法学质量。结果:20项研究涵盖了不同的人群,如肥胖、慢性疼痛和心血管疾病。所有的PA smart都涉及两个决策阶段,其中大多数包括两个初始治疗方案。PA干预最常见的是个体有氧运动,目标设定、可穿戴技术和动机性访谈等策略也用于促进PA。在裁剪变量和裁剪变量的时间上观察到差异。无反应策略主要包括增加和转换治疗方案,以及反应者继续使用最初的治疗方案。对于分析,大多数样本量估计和结果分析都说明了指定的SMART目标。经常使用线性混合模型、加权回归和q -学习回归等技术。在大多数纳入的研究中,偏倚风险很高。结论:在行为改变技术和可穿戴传感技术的支持下,基于个人的有氧运动干预可能在smart解决PA干预发展的未来发展中发挥关键作用。更清晰的选择裁剪变量的基本原理,裁剪变量的时间,包括措施是必不可少的,以推进PA SMART设计。为了弥合适应性研究设计与临床实践中观察到的个性化治疗途径之间的差距,需要研究人员、临床医生和患者的合作努力。
{"title":"The use of sequential multiple assignment randomized trials (SMARTs) in physical activity interventions: a systematic review.","authors":"Aoife Whiston, K M Kidwell, S O'Reilly, C Walsh, J C Walsh, L Glynn, K Robinson, S Hayes","doi":"10.1186/s12874-024-02439-4","DOIUrl":"10.1186/s12874-024-02439-4","url":null,"abstract":"<p><strong>Background: </strong>Physical activity (PA) is often the cornerstone in risk-reduction interventions for the prevention and treatment of many chronic health conditions. PA interventions are inherently multi-dimensional and complex in nature. Thus, study designs used in the evaluation of PA interventions must be adaptive to intervention components and individual capacities. A Sequential Multiple Assignment Randomised Trial (SMART) is a factorial design in a sequential setting used to build effective adaptive interventions. SMARTs represent a relatively new design for PA intervention research. This systematic review aims to examine the state-of-the-art of SMARTs used to develop PA interventions, with a focus on study characteristics, design, and analyses.</p><p><strong>Methods: </strong>PubMed, Embase, PsychINFO, CENTRAL, and CinAHL were systematically searched through May 2023 for studies wherein PA SMARTs were conducted. Methodological quality was assessed using the Cochrane Risk of Bias 2 Tool.</p><p><strong>Results: </strong>Twenty studies across a variety of populations - e.g., obesity, chronic pain, and cardiovascular conditions, were included. All PA SMARTs involved two decision stages, with the majority including two initial treatment options. PA interventions most commonly consisted of individual aerobic exercise with strategies such as goal setting, wearable technology, and motivational interviewing also used to promote PA. Variation was observed across tailoring variables and timing of tailoring variables. Non-response strategies primarily involved augmenting and switching treatment options, and for responders to continue with initial treatment options. For analyses, most sample size estimations and outcome analyses accounted for the SMART aims specified. Techniques such as linear mixed models, weighted regressions, and Q-learning regression were frequently used. Risk of bias was high across the majority of included studies.</p><p><strong>Conclusions: </strong>Individual-based aerobic exercise interventions supported by behaviour change techniques and wearable sensing technology may play a key role in the future development of SMARTs addressing PA intervention development. Clearer rationale for the selection of tailoring variables, timing of tailoring variables, and included measures is essential to advance PA SMART designs. Collaborative efforts from researchers, clinicians, and patients are needed in order to bridge the gap between adaptive research designs and personalised treatment pathways observed in clinical practice.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"308"},"PeriodicalIF":3.9,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11658464/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142863446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multivariate filter methods for feature selection with the γ -metric. 用 γ 度量选择特征的多元滤波方法。
IF 3.9 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-12-19 DOI: 10.1186/s12874-024-02426-9
Nicolas Ngo, Pierre Michel, Roch Giorgi

Background: The γ -metric value is generally used as the importance score of a feature (or a set of features) in a classification context. This study aimed to go further by creating a new methodology for multivariate feature selection for classification, whereby the γ -metric is associated with a specific search direction (and therefore a specific stopping criterion). As three search directions are used, we effectively created three distinct methods.

Methods: We assessed the performance of our new methodology through a simulation study, comparing them against more conventional methods. Classification performance indicators, number of selected features, stability and execution time were used to evaluate the performance of the methods. We also evaluated how well the proposed methodology selected relevant features for the detection of atrial fibrillation, which is a cardiac arrhythmia.

Results: We found that in the simulation study as well as the detection of AF task, our methods were able to select informative features and maintain a good level of predictive performance; however in a case of strong correlation and large datasets, the γ -metric based methods were less efficient to exclude non-informative features.

Conclusions: Results highlighted a good combination of both the forward search direction and the γ -metric as an evaluation function. However, using the backward search direction, the feature selection algorithm could fall into a local optima and can be improved.

背景:γ -度量值通常用作分类上下文中一个特征(或一组特征)的重要性评分。本研究旨在通过创建一种用于分类的多变量特征选择的新方法来更进一步,其中γ -度量与特定的搜索方向(因此是特定的停止标准)相关联。由于使用了三个搜索方向,我们有效地创建了三种不同的方法。方法:我们通过模拟研究评估了我们的新方法的性能,并将其与更传统的方法进行了比较。采用分类性能指标、选择特征的数量、稳定性和执行时间来评价方法的性能。我们还评估了所提出的方法选择心房颤动(一种心律失常)检测的相关特征。结果:我们发现,在模拟研究和AF任务检测中,我们的方法能够选择信息特征并保持良好的预测性能;然而,在强相关性和大型数据集的情况下,基于γ -metric的方法在排除非信息特征方面效率较低。结论:结果突出了正向搜索方向和γ -度量作为评价函数的良好结合。然而,使用反向搜索方向,特征选择算法可能陷入局部最优,可以改进。
{"title":"<ArticleTitle xmlns:ns0=\"http://www.w3.org/1998/Math/MathML\">Multivariate filter methods for feature selection with the <ns0:math><ns0:mrow><ns0:mi>γ</ns0:mi></ns0:mrow> </ns0:math> -metric.","authors":"Nicolas Ngo, Pierre Michel, Roch Giorgi","doi":"10.1186/s12874-024-02426-9","DOIUrl":"10.1186/s12874-024-02426-9","url":null,"abstract":"<p><strong>Background: </strong>The <math><mi>γ</mi></math> -metric value is generally used as the importance score of a feature (or a set of features) in a classification context. This study aimed to go further by creating a new methodology for multivariate feature selection for classification, whereby the <math><mi>γ</mi></math> -metric is associated with a specific search direction (and therefore a specific stopping criterion). As three search directions are used, we effectively created three distinct methods.</p><p><strong>Methods: </strong>We assessed the performance of our new methodology through a simulation study, comparing them against more conventional methods. Classification performance indicators, number of selected features, stability and execution time were used to evaluate the performance of the methods. We also evaluated how well the proposed methodology selected relevant features for the detection of atrial fibrillation, which is a cardiac arrhythmia.</p><p><strong>Results: </strong>We found that in the simulation study as well as the detection of AF task, our methods were able to select informative features and maintain a good level of predictive performance; however in a case of strong correlation and large datasets, the <math><mi>γ</mi></math> -metric based methods were less efficient to exclude non-informative features.</p><p><strong>Conclusions: </strong>Results highlighted a good combination of both the forward search direction and the <math><mi>γ</mi></math> -metric as an evaluation function. However, using the backward search direction, the feature selection algorithm could fall into a local optima and can be improved.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"307"},"PeriodicalIF":3.9,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11657396/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142863427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting cardiovascular diseases using unsupervised machine learning clustering based on electronic medical records. 利用基于电子病历的无监督机器学习聚类检测心血管疾病。
IF 3.9 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-12-19 DOI: 10.1186/s12874-024-02422-z
Ying Hu, Hai Yan, Ming Liu, Jing Gao, Lianhong Xie, Chunyu Zhang, Lili Wei, Yinging Ding, Hong Jiang

Background: Electronic medical records (EMR)-trained machine learning models have the potential in CVD risk prediction by integrating a range of medical data from patients, facilitate timely diagnosis and classification of CVDs. We tested the hypothesis that unsupervised ML approach utilizing EMR could be used to develop a new model for detecting prevalent CVD in clinical settings.

Methods: We included 155,894 patients (aged ≥ 18 years) discharged between January 2014 and July 2022, from Xuhui Hospital, Shanghai, China, including 64,916 CVD cases and 90,979 non-CVD cases. K-means clustering was used to generate the clustering models with k = 2, 4, and 8 as predetermined number of clusters k = 2, 4, and 8. Bayesian theorem was used to estimate the models' predictive accuracy.

Results: The overall predictive accuracy of the 2-, 4-, and 8-classification clustering models in the training set was 0.856, 0.8634, and 0.8506, respectively. Similarly, the predictive accuracy of the 2-, 4-, and 8-classification clustering models in the testing set was 0.8598, 0.8659, and 0.8525, respectively. After reducing from 19 dimensions to 2 dimensions by principal component analysis, significant separation was observed for CVD cases and non-CVD cases in both training and testing sets.

Conclusion: Our findings indicate that the utilization of EMR data can support the development of a robust model for CVD detection through an unsupervised ML approach. Further investigation using longitudinal design is needed to refine the model for its applications in clinical settings.

背景:电子病历(EMR)训练的机器学习模型通过整合来自患者的一系列医疗数据,促进心血管疾病的及时诊断和分类,在心血管疾病风险预测方面具有潜力。我们验证了假设,即利用EMR的无监督ML方法可用于开发一种新的模型,用于检测临床环境中流行的心血管疾病。方法:纳入2014年1月至2022年7月在中国上海徐汇医院出院的155894例患者(年龄≥18岁),其中CVD患者64916例,非CVD患者90979例。使用k -means聚类生成k = 2、4和8作为预定簇数k = 2、4和8的聚类模型。利用贝叶斯定理对模型的预测精度进行了估计。结果:2类、4类和8类聚类模型在训练集中的总体预测准确率分别为0.856、0.8634和0.8506。同样,2类、4类和8类聚类模型在测试集中的预测准确率分别为0.8598、0.8659和0.8525。通过主成分分析将19维降至2维后,在训练集和测试集上,CVD病例和非CVD病例都有显著的分离。结论:我们的研究结果表明,利用EMR数据可以通过无监督ML方法支持开发健壮的CVD检测模型。需要使用纵向设计进行进一步的研究,以完善其在临床应用的模型。
{"title":"Detecting cardiovascular diseases using unsupervised machine learning clustering based on electronic medical records.","authors":"Ying Hu, Hai Yan, Ming Liu, Jing Gao, Lianhong Xie, Chunyu Zhang, Lili Wei, Yinging Ding, Hong Jiang","doi":"10.1186/s12874-024-02422-z","DOIUrl":"10.1186/s12874-024-02422-z","url":null,"abstract":"<p><strong>Background: </strong>Electronic medical records (EMR)-trained machine learning models have the potential in CVD risk prediction by integrating a range of medical data from patients, facilitate timely diagnosis and classification of CVDs. We tested the hypothesis that unsupervised ML approach utilizing EMR could be used to develop a new model for detecting prevalent CVD in clinical settings.</p><p><strong>Methods: </strong>We included 155,894 patients (aged ≥ 18 years) discharged between January 2014 and July 2022, from Xuhui Hospital, Shanghai, China, including 64,916 CVD cases and 90,979 non-CVD cases. K-means clustering was used to generate the clustering models with k = 2, 4, and 8 as predetermined number of clusters k = 2, 4, and 8. Bayesian theorem was used to estimate the models' predictive accuracy.</p><p><strong>Results: </strong>The overall predictive accuracy of the 2-, 4-, and 8-classification clustering models in the training set was 0.856, 0.8634, and 0.8506, respectively. Similarly, the predictive accuracy of the 2-, 4-, and 8-classification clustering models in the testing set was 0.8598, 0.8659, and 0.8525, respectively. After reducing from 19 dimensions to 2 dimensions by principal component analysis, significant separation was observed for CVD cases and non-CVD cases in both training and testing sets.</p><p><strong>Conclusion: </strong>Our findings indicate that the utilization of EMR data can support the development of a robust model for CVD detection through an unsupervised ML approach. Further investigation using longitudinal design is needed to refine the model for its applications in clinical settings.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"309"},"PeriodicalIF":3.9,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11658374/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142863432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robustness assessment of regressions using cluster analysis typologies: a bootstrap procedure with application in state sequence analysis. 使用聚类分析类型学的回归稳健性评估:一个在状态序列分析中应用的自举过程。
IF 3.9 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-12-18 DOI: 10.1186/s12874-024-02435-8
Leonard Roth, Matthias Studer, Emilie Zuercher, Isabelle Peytremann-Bridevaux

Background: In standard Sequence Analysis, similar trajectories are clustered together to create a typology of trajectories, which is then often used to evaluate the association between sequence patterns and covariates inside regression models. The sampling uncertainty, which affects both the derivation of the typology and the associated regressions, is typically ignored in this analysis, an oversight that may lead to wrong statistical conclusions. We propose utilising sampling variation to derive new estimates that further inform on the association of interest.

Methods: We introduce a novel procedure to assess the robustness of regression results obtained from the standard analysis. Bootstrap samples are drawn from the data, and for each bootstrap, a new typology replicating the original one is constructed, followed by the estimation of the corresponding regression models. The bootstrap estimates are then combined using a multilevel modelling framework that mimics a meta-analysis. The fitted values from this multilevel model allow to account for the sampling uncertainty in the inferential analysis. We illustrate the methodology by applying it to the study of healthcare utilisation trajectories in a Swiss cohort of diabetic patients.

Results: The procedure provides robust estimates for an association of interest, along with 95% prediction intervals, representing the range of expected values if the clustering and associated regressions were performed on a new sample from the same underlying distribution. It also identifies central and borderline trajectories within each cluster. Regarding the illustrative application, while there was evidence of an association between regular lipid testing and subsequent healthcare utilisation patterns in the original analysis, this is not supported in the robustness assessment.

Conclusions: Investigating the relationship between trajectory patterns and covariates is of interest in many situations. However, it is a challenging task with potential pitfalls. Our Robustness Assessment of Regression using Cluster Analysis Typologies (RARCAT) may assist in ensuring the robustness of such association studies. The method is applicable wherever clustering is combined with regression analysis, so its relevance goes beyond State Sequence Analysis.

背景:在标准序列分析中,相似的轨迹被聚集在一起以创建轨迹的类型,然后通常用于评估回归模型中序列模式和协变量之间的关联。抽样的不确定性会影响类型学的推导和相关的回归,在这种分析中通常被忽略,这种疏忽可能导致错误的统计结论。我们建议利用抽样变化来得出新的估计,以进一步了解兴趣关联。方法:我们引入了一种新的方法来评估从标准分析中获得的回归结果的稳健性。从数据中提取Bootstrap样本,对于每个Bootstrap,构建一个复制原始类型的新类型,然后对相应的回归模型进行估计。然后使用模拟元分析的多级建模框架将自举估计结合起来。该多层模型的拟合值考虑了推理分析中的抽样不确定性。我们通过将其应用于瑞士糖尿病患者队列中医疗保健利用轨迹的研究来说明该方法。结果:该程序为感兴趣的关联提供了稳健的估计,以及95%的预测区间,如果对来自相同底层分布的新样本进行聚类和相关回归,则表示期望值的范围。它还确定每个集群内的中心和边界轨迹。关于说明性应用,虽然在原始分析中有证据表明定期脂质测试与随后的医疗保健利用模式之间存在关联,但这在稳健性评估中没有得到支持。结论:在许多情况下,研究轨迹模式和协变量之间的关系是有意义的。然而,这是一项具有挑战性的任务,有潜在的陷阱。我们使用聚类分析类型学(RARCAT)对回归的稳健性评估可能有助于确保此类关联研究的稳健性。该方法适用于聚类与回归分析相结合的任何地方,因此它的相关性超越了状态序列分析。
{"title":"Robustness assessment of regressions using cluster analysis typologies: a bootstrap procedure with application in state sequence analysis.","authors":"Leonard Roth, Matthias Studer, Emilie Zuercher, Isabelle Peytremann-Bridevaux","doi":"10.1186/s12874-024-02435-8","DOIUrl":"10.1186/s12874-024-02435-8","url":null,"abstract":"<p><strong>Background: </strong>In standard Sequence Analysis, similar trajectories are clustered together to create a typology of trajectories, which is then often used to evaluate the association between sequence patterns and covariates inside regression models. The sampling uncertainty, which affects both the derivation of the typology and the associated regressions, is typically ignored in this analysis, an oversight that may lead to wrong statistical conclusions. We propose utilising sampling variation to derive new estimates that further inform on the association of interest.</p><p><strong>Methods: </strong>We introduce a novel procedure to assess the robustness of regression results obtained from the standard analysis. Bootstrap samples are drawn from the data, and for each bootstrap, a new typology replicating the original one is constructed, followed by the estimation of the corresponding regression models. The bootstrap estimates are then combined using a multilevel modelling framework that mimics a meta-analysis. The fitted values from this multilevel model allow to account for the sampling uncertainty in the inferential analysis. We illustrate the methodology by applying it to the study of healthcare utilisation trajectories in a Swiss cohort of diabetic patients.</p><p><strong>Results: </strong>The procedure provides robust estimates for an association of interest, along with 95% prediction intervals, representing the range of expected values if the clustering and associated regressions were performed on a new sample from the same underlying distribution. It also identifies central and borderline trajectories within each cluster. Regarding the illustrative application, while there was evidence of an association between regular lipid testing and subsequent healthcare utilisation patterns in the original analysis, this is not supported in the robustness assessment.</p><p><strong>Conclusions: </strong>Investigating the relationship between trajectory patterns and covariates is of interest in many situations. However, it is a challenging task with potential pitfalls. Our Robustness Assessment of Regression using Cluster Analysis Typologies (RARCAT) may assist in ensuring the robustness of such association studies. The method is applicable wherever clustering is combined with regression analysis, so its relevance goes beyond State Sequence Analysis.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"303"},"PeriodicalIF":3.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11654397/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A PubMed search filter for efficiently retrieving exercise training studies. PubMed搜索过滤器,用于有效检索运动训练研究。
IF 3.9 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-12-18 DOI: 10.1186/s12874-024-02414-z
Dawei Yin, Mikaela V Engracia, Matthew K Edema, David C Clarke

Background: A barrier to evidence-informed exercise programming is locating studies of exercise training programs. The purpose of this study was to create a search filter for studies of exercise training programs for the PubMed electronic bibliographic database.

Methods: Candidate search terms were identified from three sources: exercise-relevant MeSH terms and their corresponding Entry terms, word frequency analysis of articles in a gold-standard reference set curated from systematic reviews focused on exercise training, and retrospective searching of articles retrieved in the search filter development and testing steps. These terms were assembled into an exercise training search filter, and its performance was assessed against a basic search string applied to six case studies. Search string performance was measured as sensitivity (relative recall), precision, and number needed to read (NNR). We aimed to achieve relative recall ≥ 85%, and a NNR ≥ 2.

Results: The reference set consisted of 71 articles drawn from six systematic reviews. Sixty-one candidate search terms were evaluated for inclusion, 21 of which were included in the finalized exercise-training search filter. The relative recall of the search filter was 96% for the reference set and the precision mean ± SD was 54 ± 16% across the case studies, with the corresponding NNR = ~ 2. The exercise training search filter consistently outperformed the basic search string.

Conclusion: The exercise training search filter fosters more efficient searches for studies of exercise training programs in the PubMed electronic bibliographic database. This search string may therefore support evidence-informed practice in exercise programming.

背景:循证运动规划的一个障碍是运动训练计划的定位研究。本研究的目的是为PubMed电子书目数据库的运动训练项目研究创建一个搜索过滤器。方法:候选搜索词从三个来源确定:与运动相关的MeSH术语及其相应的词条术语,对黄金标准参考集中的文章进行词频分析,这些参考集中于运动训练的系统综述,以及对搜索过滤器开发和测试步骤中检索到的文章进行回顾性搜索。这些术语被组装成一个运动训练搜索过滤器,并根据应用于六个案例研究的基本搜索字符串评估其性能。搜索字符串性能通过灵敏度(相对召回率)、精度和需要读取的数量(NNR)来衡量。我们的目标是达到相对召回率≥85%,NNR≥2。结果:参考文献集包括来自6篇系统综述的71篇文章。61个候选搜索词被评估纳入,其中21个被纳入最终的运动-训练搜索过滤器。参考集的相对查全率为96%,各案例的查全精度均值±SD为54±16%,相应的NNR = ~ 2。运动训练搜索过滤器的性能始终优于基本搜索字符串。结论:运动训练搜索过滤器促进了对PubMed电子书目数据库中运动训练项目研究的更有效的搜索。因此,这个搜索字符串可能支持在练习编程中循证实践。
{"title":"A PubMed search filter for efficiently retrieving exercise training studies.","authors":"Dawei Yin, Mikaela V Engracia, Matthew K Edema, David C Clarke","doi":"10.1186/s12874-024-02414-z","DOIUrl":"10.1186/s12874-024-02414-z","url":null,"abstract":"<p><strong>Background: </strong>A barrier to evidence-informed exercise programming is locating studies of exercise training programs. The purpose of this study was to create a search filter for studies of exercise training programs for the PubMed electronic bibliographic database.</p><p><strong>Methods: </strong>Candidate search terms were identified from three sources: exercise-relevant MeSH terms and their corresponding Entry terms, word frequency analysis of articles in a gold-standard reference set curated from systematic reviews focused on exercise training, and retrospective searching of articles retrieved in the search filter development and testing steps. These terms were assembled into an exercise training search filter, and its performance was assessed against a basic search string applied to six case studies. Search string performance was measured as sensitivity (relative recall), precision, and number needed to read (NNR). We aimed to achieve relative recall ≥ 85%, and a NNR ≥ 2.</p><p><strong>Results: </strong>The reference set consisted of 71 articles drawn from six systematic reviews. Sixty-one candidate search terms were evaluated for inclusion, 21 of which were included in the finalized exercise-training search filter. The relative recall of the search filter was 96% for the reference set and the precision mean ± SD was 54 ± 16% across the case studies, with the corresponding NNR = ~ 2. The exercise training search filter consistently outperformed the basic search string.</p><p><strong>Conclusion: </strong>The exercise training search filter fosters more efficient searches for studies of exercise training programs in the PubMed electronic bibliographic database. This search string may therefore support evidence-informed practice in exercise programming.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"302"},"PeriodicalIF":3.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11653657/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142851945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A modified and weighted Gower distance-based clustering analysis for mixed type data: a simulation and empirical analyses. 混合类型数据的改进加权高尔距离聚类分析:模拟与实证分析。
IF 3.9 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-12-18 DOI: 10.1186/s12874-024-02427-8
Pinyan Liu, Han Yuan, Yilin Ning, Bibhas Chakraborty, Nan Liu, Marco Aurélio Peres

Background: Traditional clustering techniques are typically restricted to either continuous or categorical variables. However, most real-world clinical data are mixed type. This study aims to introduce a clustering technique specifically designed for datasets containing both continuous and categorical variables to offer better clustering compatibility, adaptability, and interpretability than other mixed type techniques.

Methods: This paper proposed a modified Gower distance incorporating feature importance as weights to maintain equal contributions between continuous and categorical features. The algorithm (DAFI) was evaluated using five simulated datasets with varying proportions of important features and real-world datasets from the 2011-2014 National Health and Nutrition Examination Survey (NHANES). Effectiveness was demonstrated through comparisons with 13 clustering techniques. Clustering performance was assessed using the adjusted Rand index (ARI) for accuracy in simulation studies and the silhouette score for cohesion and separation in NHANES. Additionally, multivariable logistic regression estimated the association between periodontitis (PD) and cardiovascular diseases (CVDs), adjusting for clusters in NHANES.

Results: In simulation studies, the DAFI-Gower algorithm consistently performs better than baseline methods according to the adjusted Rand index in settings investigated, especially on datasets with more redundant features. In NHANES, 3,760 people were analyzed. DAFI-Gower achieves the highest silhouette score (0.79). Four distinct clusters with diverse health profiles were identified. By incorporating feature importance, we found that cluster formations were more strongly influenced by CVD-related factors. The association between periodontitis and cardiovascular diseases, after adjusting for clusters, reveals significant insights (adjusted OR 1.95, 95% CI 1.50 to 2.55, p = 0.012), highlighting severe periodontitis as a potential risk factor for cardiovascular diseases.

Conclusions: DAFI performed better than classic clustering baselines on both simulated and real-world datasets. It effectively captures cluster characteristics by considering feature importance, which is crucial in clinical settings where many variables may be similar or irrelevant. We envisage that DAFI offers an effective solution for mixed type clustering.

背景:传统的聚类技术通常局限于连续变量或分类变量。然而,大多数现实世界的临床数据是混合型的。本研究旨在引入一种专门为包含连续变量和分类变量的数据集设计的聚类技术,以提供比其他混合类型技术更好的聚类兼容性、适应性和可解释性。方法:本文提出了一种将特征重要度作为权重的改进高尔距离,以保持连续特征和分类特征之间的贡献相等。该算法(DAFI)使用五个具有不同比例重要特征的模拟数据集和来自2011-2014年国家健康与营养检查调查(NHANES)的真实数据集进行评估。通过与13种聚类技术的比较,证明了有效性。在模拟研究中使用调整后的兰德指数(ARI)来评估聚类性能的准确性,在NHANES中使用剪影评分来评估聚类和分离。此外,多变量logistic回归估计了牙周炎(PD)和心血管疾病(cvd)之间的关联,调整了NHANES中的聚类。结果:在模拟研究中,在调查的设置中,根据调整后的Rand指数,DAFI-Gower算法始终优于基线方法,特别是在冗余特征较多的数据集上。NHANES对3760人进行了分析。DAFI-Gower的剪影评分最高(0.79)。确定了四个具有不同健康概况的不同群集。结合特征重要性,我们发现簇的形成更强烈地受到cvd相关因素的影响。在调整聚类后,牙周炎和心血管疾病之间的关联揭示了重要的洞察力(调整OR 1.95, 95% CI 1.50至2.55,p = 0.012),突出了严重的牙周炎是心血管疾病的潜在危险因素。结论:DAFI在模拟和真实数据集上的表现都优于经典聚类基线。它通过考虑特征的重要性来有效地捕获集群特征,这在许多变量可能相似或不相关的临床环境中至关重要。我们设想DAFI为混合型聚类提供了一个有效的解决方案。
{"title":"A modified and weighted Gower distance-based clustering analysis for mixed type data: a simulation and empirical analyses.","authors":"Pinyan Liu, Han Yuan, Yilin Ning, Bibhas Chakraborty, Nan Liu, Marco Aurélio Peres","doi":"10.1186/s12874-024-02427-8","DOIUrl":"10.1186/s12874-024-02427-8","url":null,"abstract":"<p><strong>Background: </strong>Traditional clustering techniques are typically restricted to either continuous or categorical variables. However, most real-world clinical data are mixed type. This study aims to introduce a clustering technique specifically designed for datasets containing both continuous and categorical variables to offer better clustering compatibility, adaptability, and interpretability than other mixed type techniques.</p><p><strong>Methods: </strong>This paper proposed a modified Gower distance incorporating feature importance as weights to maintain equal contributions between continuous and categorical features. The algorithm (DAFI) was evaluated using five simulated datasets with varying proportions of important features and real-world datasets from the 2011-2014 National Health and Nutrition Examination Survey (NHANES). Effectiveness was demonstrated through comparisons with 13 clustering techniques. Clustering performance was assessed using the adjusted Rand index (ARI) for accuracy in simulation studies and the silhouette score for cohesion and separation in NHANES. Additionally, multivariable logistic regression estimated the association between periodontitis (PD) and cardiovascular diseases (CVDs), adjusting for clusters in NHANES.</p><p><strong>Results: </strong>In simulation studies, the DAFI-Gower algorithm consistently performs better than baseline methods according to the adjusted Rand index in settings investigated, especially on datasets with more redundant features. In NHANES, 3,760 people were analyzed. DAFI-Gower achieves the highest silhouette score (0.79). Four distinct clusters with diverse health profiles were identified. By incorporating feature importance, we found that cluster formations were more strongly influenced by CVD-related factors. The association between periodontitis and cardiovascular diseases, after adjusting for clusters, reveals significant insights (adjusted OR 1.95, 95% CI 1.50 to 2.55, p = 0.012), highlighting severe periodontitis as a potential risk factor for cardiovascular diseases.</p><p><strong>Conclusions: </strong>DAFI performed better than classic clustering baselines on both simulated and real-world datasets. It effectively captures cluster characteristics by considering feature importance, which is crucial in clinical settings where many variables may be similar or irrelevant. We envisage that DAFI offers an effective solution for mixed type clustering.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"305"},"PeriodicalIF":3.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11654179/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing Medical Record Abstraction (MRA) error rates in an observational study to pooled rates identified in the data quality literature. 比较一项观察性研究中的病历摘要(MRA)错误率与数据质量文献中确定的汇总错误率。
IF 3.9 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-12-18 DOI: 10.1186/s12874-024-02424-x
Maryam Y Garza, Tremaine B Williams, Songthip Ounpraseuth, Zhuopei Hu, Jeannette Lee, Jessica Snowden, Anita C Walden, Alan E Simon, Lori A Devlin, Leslie W Young, Meredith N Zozus

Background: Medical record abstraction (MRA) is a commonly used method for data collection in clinical research, but is prone to error, and the influence of quality control (QC) measures is seldom and inconsistently assessed during the course of a study. We employed a novel, standardized MRA-QC framework as part of an ongoing observational study in an effort to control MRA error rates. In order to assess the effectiveness of our framework, we compared our error rates against traditional MRA studies that had not reported using formalized MRA-QC methods. Thus, the objective of this study was to compare the MRA error rates derived from the literature with the error rates found in a study using MRA as the sole method of data collection that employed an MRA-QC framework.

Methods: A comparison of the error rates derived from MRA-centric studies identified as part of a systematic literature review was conducted against those derived from an MRA-centric study that employed an MRA-QC framework to evaluate the effectiveness of the MRA-QC framework. An inverse variance-weighted meta-analytical method with Freeman-Tukey transformation was used to compute pooled effect size for both the MRA studies identified in the literature and the study that implemented the MRA-QC framework. The level of heterogeneity was assessed using the Q-statistic and Higgins and Thompson's I2 statistic.

Results: The overall error rate from the MRA literature was 6.57%. Error rates for the study using our MRA-QC framework were between 1.04% (optimistic, all-field rate) and 2.57% (conservative, populated-field rate), 4.00-5.53% points less than the observed rate from the literature (p < 0.0001).

Conclusions: Review of the literature indicated that the accuracy associated with MRA varied widely across studies. However, our results demonstrate that, with appropriate training and continuous QC, MRA error rates can be significantly controlled during the course of a clinical research study.

背景:病历抽象化(MRA)是临床研究中常用的数据收集方法,但该方法容易出错,且在研究过程中对质量控制(QC)措施的影响评估很少且不一致。为了控制MRA错误率,我们采用了一种新的标准化MRA- qc框架作为正在进行的观察性研究的一部分。为了评估我们的框架的有效性,我们将我们的错误率与传统的MRA研究进行了比较,这些研究没有使用形式化的MRA- qc方法。因此,本研究的目的是比较文献中得出的MRA错误率与采用MRA- qc框架的MRA作为唯一数据收集方法的研究中发现的错误率。方法:将系统性文献综述中以核磁共振为中心的研究得出的错误率与采用核磁共振-质量控制框架评估核磁共振-质量控制框架有效性的核磁共振为中心的研究得出的错误率进行比较。采用Freeman-Tukey变换的方差加权逆元分析方法计算文献中确定的MRA研究和实施MRA- qc框架的研究的综合效应大小。使用q统计量和Higgins和Thompson的I2统计量评估异质性水平。结果:MRA文献的总体错误率为6.57%。使用我们的MRA- qc框架的研究错误率在1.04%(乐观,全场率)和2.57%(保守,全场率)之间,比从文献中观察到的错误率低4.00-5.53% (p结论:文献回顾表明,MRA的准确性在不同的研究中差异很大。然而,我们的研究结果表明,通过适当的培训和持续的质量控制,在临床研究过程中,MRA错误率可以得到显著控制。
{"title":"Comparing Medical Record Abstraction (MRA) error rates in an observational study to pooled rates identified in the data quality literature.","authors":"Maryam Y Garza, Tremaine B Williams, Songthip Ounpraseuth, Zhuopei Hu, Jeannette Lee, Jessica Snowden, Anita C Walden, Alan E Simon, Lori A Devlin, Leslie W Young, Meredith N Zozus","doi":"10.1186/s12874-024-02424-x","DOIUrl":"10.1186/s12874-024-02424-x","url":null,"abstract":"<p><strong>Background: </strong>Medical record abstraction (MRA) is a commonly used method for data collection in clinical research, but is prone to error, and the influence of quality control (QC) measures is seldom and inconsistently assessed during the course of a study. We employed a novel, standardized MRA-QC framework as part of an ongoing observational study in an effort to control MRA error rates. In order to assess the effectiveness of our framework, we compared our error rates against traditional MRA studies that had not reported using formalized MRA-QC methods. Thus, the objective of this study was to compare the MRA error rates derived from the literature with the error rates found in a study using MRA as the sole method of data collection that employed an MRA-QC framework.</p><p><strong>Methods: </strong>A comparison of the error rates derived from MRA-centric studies identified as part of a systematic literature review was conducted against those derived from an MRA-centric study that employed an MRA-QC framework to evaluate the effectiveness of the MRA-QC framework. An inverse variance-weighted meta-analytical method with Freeman-Tukey transformation was used to compute pooled effect size for both the MRA studies identified in the literature and the study that implemented the MRA-QC framework. The level of heterogeneity was assessed using the Q-statistic and Higgins and Thompson's I<sup>2</sup> statistic.</p><p><strong>Results: </strong>The overall error rate from the MRA literature was 6.57%. Error rates for the study using our MRA-QC framework were between 1.04% (optimistic, all-field rate) and 2.57% (conservative, populated-field rate), 4.00-5.53% points less than the observed rate from the literature (p < 0.0001).</p><p><strong>Conclusions: </strong>Review of the literature indicated that the accuracy associated with MRA varied widely across studies. However, our results demonstrate that, with appropriate training and continuous QC, MRA error rates can be significantly controlled during the course of a clinical research study.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"304"},"PeriodicalIF":3.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11653794/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An empirical study on 209 networks of treatments revealed intransitivity to be common and multiple statistical tests suboptimal to assess transitivity. 对209个治疗网络的实证研究表明,不可传递性是常见的,多个统计测试不适合评估传递性。
IF 3.9 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-12-16 DOI: 10.1186/s12874-024-02436-7
Loukia M Spineli

Background: Transitivity assumption is the cornerstone of network meta-analysis (NMA). Investigating the plausibility of transitivity can unveil the credibility of NMA results. The commonness of transitivity was examined based on study dissimilarities regarding several study-level aggregate clinical and methodological characteristics reported in the systematic reviews. The present study also demonstrated the disadvantages of using multiple statistical tests to assess transitivity and compared the conclusions drawn from multiple statistical tests with those from the approach of study dissimilarities for transitivity assessment.

Methods: An empirical study was conducted using 209 published systematic reviews with NMA to create a database of study-level aggregate clinical and methodological characteristics found in the tracenma R package. For each systematic review, the network of the primary outcome was considered to create a dataset with extracted study-level aggregate clinical and methodological characteristics reported in the systematic review that may act as effect modifiers. Transitivity was evaluated by calculating study dissimilarities based on the extracted characteristics to provide a measure of overall dissimilarity within and between the observed treatment comparisons. Empirically driven thresholds of low dissimilarity were employed to determine the proportion of datasets with evidence of likely intransitivity. One-way ANOVA and chi-squared test were employed for each characteristic to investigate comparison dissimilarity at a significance level of 5%.

Results: Study dissimilarities covered a wide range of possible values across the datasets. A 'likely concerning' extent of study dissimilarities, both intra-comparison and inter-comparison, dominated the analysed datasets. Using a higher dissimilarity threshold, a 'likely concerning' extent of study dissimilarities persisted for objective outcomes but decreased substantially for subjective outcomes. A likely intransitivity prevailed in all datasets; however, using a higher dissimilarity threshold resulted in few networks with transitivity for semi-objective and subjective outcomes. Statistical tests were feasible in 127 (61%) datasets, yielding conflicting conclusions with the approach of study dissimilarities in many datasets.

Conclusions: Study dissimilarity, manifested from variations in the effect modifiers' distribution across the studies, should be expected and properly quantified. Measuring the overall study dissimilarity between observed comparisons and comparing it with a proper threshold can aid in determining whether concerns of likely intransitivity are warranted.

背景:及物性假设是网络元分析的基础。研究及物性的合理性可以揭示NMA结果的可信度。及物性的共性是根据系统综述中报告的几个研究水平的临床和方法学特征的研究差异来检验的。本研究还指出了使用多个统计检验评估及物性的不足,并将多个统计检验得出的结论与研究差异评估方法得出的结论进行了比较。方法:使用NMA发表的209篇系统综述进行实证研究,以创建一个研究水平的tracenma R包中临床和方法学特征的综合数据库。对于每个系统评价,主要结果的网络被认为是创建一个数据集,其中提取了系统评价中报告的研究级总体临床和方法学特征,这些特征可能作为效果调节剂。传递性通过计算基于提取特征的研究差异来评估,以提供观察到的治疗比较内部和之间总体差异的度量。采用经验驱动的低不相似性阈值来确定具有可能不可传递性证据的数据集的比例。各特征采用单因素方差分析和卡方检验,在5%的显著性水平上研究比较差异。结果:研究差异涵盖了数据集的广泛可能值。在分析的数据集中,“可能关注”的研究差异程度,包括内部比较和内部比较。使用较高的差异阈值,研究差异的“可能关注”程度在客观结果中持续存在,但在主观结果中大幅下降。在所有数据集中都可能存在不可及性;然而,使用较高的差异阈值导致半客观和主观结果具有及物性的网络很少。统计检验在127个(61%)数据集中是可行的,但与许多数据集中研究差异的方法产生了相互矛盾的结论。结论:研究差异表现在研究中效果调节剂分布的差异,应该预期并适当量化。测量观察到的比较之间的整体研究差异,并将其与适当的阈值进行比较,可以帮助确定是否有必要关注可能的不及物性。
{"title":"An empirical study on 209 networks of treatments revealed intransitivity to be common and multiple statistical tests suboptimal to assess transitivity.","authors":"Loukia M Spineli","doi":"10.1186/s12874-024-02436-7","DOIUrl":"10.1186/s12874-024-02436-7","url":null,"abstract":"<p><strong>Background: </strong>Transitivity assumption is the cornerstone of network meta-analysis (NMA). Investigating the plausibility of transitivity can unveil the credibility of NMA results. The commonness of transitivity was examined based on study dissimilarities regarding several study-level aggregate clinical and methodological characteristics reported in the systematic reviews. The present study also demonstrated the disadvantages of using multiple statistical tests to assess transitivity and compared the conclusions drawn from multiple statistical tests with those from the approach of study dissimilarities for transitivity assessment.</p><p><strong>Methods: </strong>An empirical study was conducted using 209 published systematic reviews with NMA to create a database of study-level aggregate clinical and methodological characteristics found in the tracenma R package. For each systematic review, the network of the primary outcome was considered to create a dataset with extracted study-level aggregate clinical and methodological characteristics reported in the systematic review that may act as effect modifiers. Transitivity was evaluated by calculating study dissimilarities based on the extracted characteristics to provide a measure of overall dissimilarity within and between the observed treatment comparisons. Empirically driven thresholds of low dissimilarity were employed to determine the proportion of datasets with evidence of likely intransitivity. One-way ANOVA and chi-squared test were employed for each characteristic to investigate comparison dissimilarity at a significance level of 5%.</p><p><strong>Results: </strong>Study dissimilarities covered a wide range of possible values across the datasets. A 'likely concerning' extent of study dissimilarities, both intra-comparison and inter-comparison, dominated the analysed datasets. Using a higher dissimilarity threshold, a 'likely concerning' extent of study dissimilarities persisted for objective outcomes but decreased substantially for subjective outcomes. A likely intransitivity prevailed in all datasets; however, using a higher dissimilarity threshold resulted in few networks with transitivity for semi-objective and subjective outcomes. Statistical tests were feasible in 127 (61%) datasets, yielding conflicting conclusions with the approach of study dissimilarities in many datasets.</p><p><strong>Conclusions: </strong>Study dissimilarity, manifested from variations in the effect modifiers' distribution across the studies, should be expected and properly quantified. Measuring the overall study dissimilarity between observed comparisons and comparing it with a proper threshold can aid in determining whether concerns of likely intransitivity are warranted.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"301"},"PeriodicalIF":3.9,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648297/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142833894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction: Tips from an expert panel on the development of a clinical research protocol. 更正:来自专家小组关于临床研究方案发展的提示。
IF 3.9 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-12-13 DOI: 10.1186/s12874-024-02438-5
Abdelrahman M Makram, Randa Elsheikh, Omar M Makram, Nguyen Thanh Van, Nguyen Hai Nam, Nguyen Khoi Quan, Nguyen Tran Minh Duc, Ngoc Quynh Tram Nguyen, Gibson Omwansa Javes, Sara S Elsheikh, Atsuko Imoto, Peter Lee, Norio Ohmagari, Hirotsugu Aiga, Yasuhiko Kamiya, Patricia Takako Endo, Nguyen Tien Huy
{"title":"Correction: Tips from an expert panel on the development of a clinical research protocol.","authors":"Abdelrahman M Makram, Randa Elsheikh, Omar M Makram, Nguyen Thanh Van, Nguyen Hai Nam, Nguyen Khoi Quan, Nguyen Tran Minh Duc, Ngoc Quynh Tram Nguyen, Gibson Omwansa Javes, Sara S Elsheikh, Atsuko Imoto, Peter Lee, Norio Ohmagari, Hirotsugu Aiga, Yasuhiko Kamiya, Patricia Takako Endo, Nguyen Tien Huy","doi":"10.1186/s12874-024-02438-5","DOIUrl":"10.1186/s12874-024-02438-5","url":null,"abstract":"","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"300"},"PeriodicalIF":3.9,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639107/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142817037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
BMC Medical Research Methodology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1