倾向评分匹配：我们应该在设计观察性研究中使用它吗？

IF 3.4 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES BMC Medical Research Methodology Pub Date : 2025-01-29 DOI:10.1186/s12874-025-02481-w

Fei Wan

{"title":"倾向评分匹配：我们应该在设计观察性研究中使用它吗？","authors":"Fei Wan","doi":"10.1186/s12874-025-02481-w","DOIUrl":null,"url":null,"abstract":"Background: Propensity Score Matching (PSM) stands as a widely embraced method in comparative effectiveness research. PSM crafts matched datasets, mimicking some attributes of randomized designs, from observational data. In a valid PSM design where all baseline confounders are measured and matched, the confounders would be balanced, allowing the treatment status to be considered as if it were randomly assigned. Nevertheless, recent research has unveiled a different facet of PSM, termed \"the PSM paradox\". As PSM approaches exact matching by progressively pruning matched sets in order of decreasing propensity score distance, it can paradoxically lead to greater covariate imbalance, heightened model dependence, and increased bias, contrary to its intended purpose.Methods: We used analytic formula, simulation, and literature to demonstrate that this paradox stems from the misuse of metrics for assessing chance imbalance and bias.Results: Firstly, matched pairs typically exhibit different covariate values despite having identical propensity scores. However, this disparity represents a \"chance\" difference and will average to zero over a large number of matched pairs. Common distance metrics cannot capture this \"chance\" nature in covariate imbalance, instead reflecting increasing variability in chance imbalance as units are pruned and the sample size diminishes. Secondly, the largest estimate among numerous fitted models, because of uncertainty among researchers over the correct model, was used to determine statistical bias. This cherry-picking procedure ignores the most significant benefit of matching design-reducing model dependence based on its robustness against model misspecification bias.Conclusions: We conclude that the PSM paradox is not a legitimate concern and should not stop researchers from using PSM designs.","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"25 1","pages":"25"},"PeriodicalIF":3.4000,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11776168/pdf/","citationCount":"0","resultStr":"{\"title\":\"Propensity Score Matching: should we use it in designing observational studies?\",\"authors\":\"Fei Wan\",\"doi\":\"10.1186/s12874-025-02481-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Propensity Score Matching (PSM) stands as a widely embraced method in comparative effectiveness research. PSM crafts matched datasets, mimicking some attributes of randomized designs, from observational data. In a valid PSM design where all baseline confounders are measured and matched, the confounders would be balanced, allowing the treatment status to be considered as if it were randomly assigned. Nevertheless, recent research has unveiled a different facet of PSM, termed \\\"the PSM paradox\\\". As PSM approaches exact matching by progressively pruning matched sets in order of decreasing propensity score distance, it can paradoxically lead to greater covariate imbalance, heightened model dependence, and increased bias, contrary to its intended purpose.Methods: We used analytic formula, simulation, and literature to demonstrate that this paradox stems from the misuse of metrics for assessing chance imbalance and bias.Results: Firstly, matched pairs typically exhibit different covariate values despite having identical propensity scores. However, this disparity represents a \\\"chance\\\" difference and will average to zero over a large number of matched pairs. Common distance metrics cannot capture this \\\"chance\\\" nature in covariate imbalance, instead reflecting increasing variability in chance imbalance as units are pruned and the sample size diminishes. Secondly, the largest estimate among numerous fitted models, because of uncertainty among researchers over the correct model, was used to determine statistical bias. This cherry-picking procedure ignores the most significant benefit of matching design-reducing model dependence based on its robustness against model misspecification bias.Conclusions: We conclude that the PSM paradox is not a legitimate concern and should not stop researchers from using PSM designs.\",\"PeriodicalId\":9114,\"journal\":{\"name\":\"BMC Medical Research Methodology\",\"volume\":\"25 1\",\"pages\":\"25\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-01-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11776168/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Research Methodology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12874-025-02481-w\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Research Methodology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12874-025-02481-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

摘要

背景：倾向得分匹配（PSM）是比较有效性研究中被广泛采用的一种方法。PSM工艺匹配的数据集，模仿随机设计的一些属性，从观测数据。在有效的PSM设计中，所有基线混杂因素都被测量和匹配，混杂因素将是平衡的，允许将治疗状态视为随机分配。然而，最近的研究揭示了PSM的另一个方面，称为“PSM悖论”。由于PSM通过按倾向得分距离递减的顺序逐步修剪匹配集来接近精确匹配，它可能矛盾地导致更大的协变量不平衡、模型依赖性增强和偏差增加，这与其预期目的相反。方法：我们使用分析公式、模拟和文献来证明这种悖论源于误用评估机会不平衡和偏差的指标。结果：首先，尽管具有相同的倾向得分，匹配对通常表现出不同的协变量值。然而，这种差异代表了一种“机会”差异，并且在大量匹配对中平均为零。普通距离度量不能捕捉协变量不平衡中的这种“机会”性质，而是反映了随着单位被修剪和样本量减少而增加的机会不平衡的可变性。其次，在众多拟合模型中，由于研究人员对正确模型的不确定性，使用最大估计值来确定统计偏差。这种挑选樱桃的过程忽略了匹配设计减少模型依赖的最重要的好处，这是基于它对模型错误规范偏差的鲁棒性。结论：我们得出结论，PSM悖论不是一个合理的关注，不应该阻止研究人员使用PSM设计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Propensity Score Matching: should we use it in designing observational studies?

Background: Propensity Score Matching (PSM) stands as a widely embraced method in comparative effectiveness research. PSM crafts matched datasets, mimicking some attributes of randomized designs, from observational data. In a valid PSM design where all baseline confounders are measured and matched, the confounders would be balanced, allowing the treatment status to be considered as if it were randomly assigned. Nevertheless, recent research has unveiled a different facet of PSM, termed "the PSM paradox". As PSM approaches exact matching by progressively pruning matched sets in order of decreasing propensity score distance, it can paradoxically lead to greater covariate imbalance, heightened model dependence, and increased bias, contrary to its intended purpose.

Methods: We used analytic formula, simulation, and literature to demonstrate that this paradox stems from the misuse of metrics for assessing chance imbalance and bias.

Results: Firstly, matched pairs typically exhibit different covariate values despite having identical propensity scores. However, this disparity represents a "chance" difference and will average to zero over a large number of matched pairs. Common distance metrics cannot capture this "chance" nature in covariate imbalance, instead reflecting increasing variability in chance imbalance as units are pruned and the sample size diminishes. Secondly, the largest estimate among numerous fitted models, because of uncertainty among researchers over the correct model, was used to determine statistical bias. This cherry-picking procedure ignores the most significant benefit of matching design-reducing model dependence based on its robustness against model misspecification bias.

Conclusions: We conclude that the PSM paradox is not a legitimate concern and should not stop researchers from using PSM designs.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

BMC Medical Research Methodology 医学-卫生保健

CiteScore

6.50

自引率

2.50%

发文量

298

审稿时长

3-8 weeks

期刊介绍： BMC Medical Research Methodology is an open access journal publishing original peer-reviewed research articles in methodological approaches to healthcare research. Articles on the methodology of epidemiological research, clinical trials and meta-analysis/systematic review are particularly encouraged, as are empirical studies of the associations between choice of methodology and study outcomes. BMC Medical Research Methodology does not aim to publish articles describing scientific methods or techniques: these should be directed to the BMC journal covering the relevant biomedical subject area.