Confirmation of previously identified plasma microRNA ratios for breast cancer detection in a nested case-control study within a screening setting

IF 6.8 1区医学 Q1 MEDICINE, RESEARCH & EXPERIMENTAL Clinical and Translational Medicine Pub Date : 2024-11-15 DOI:10.1002/ctm2.70068

Emir Sehovic, Ilaria Gregnanin, Maurizia Mello-Grand, Paola Ostano, Viviana Vergini, Andrea Ortale, Angela Amoruso, Elisabetta Favettini, Nereo Segnan, Giovanna Chiorino, Livia Giordano, Elisabetta Petracci

{"title":"Confirmation of previously identified plasma microRNA ratios for breast cancer detection in a nested case-control study within a screening setting","authors":"Emir Sehovic, Ilaria Gregnanin, Maurizia Mello-Grand, Paola Ostano, Viviana Vergini, Andrea Ortale, Angela Amoruso, Elisabetta Favettini, Nereo Segnan, Giovanna Chiorino, Livia Giordano, Elisabetta Petracci","doi":"10.1002/ctm2.70068","DOIUrl":null,"url":null,"abstract":"Dear Editor,Circulating cell-free microRNAs (miRNAs) were rarely explored as biomarkers for early detection of breast cancer (BC) within a screening setting or in prospectively sampled cohorts.1 In this study, we confirmed the discriminatory ability of a combination of novel and reliable circulating miRNA-ratio biomarkers, with and without nonmolecular predictors, that could be used for BC early detection in the context of mammographic screening using a standard, affordable, noninvasive and reproducible technique such as quantitative Reverse Transcription Polymerase Chain Reaction (RT-qPCR). The models were built on a discovery case-control set (n = 131) nested within a large mammographic screening cohort,2 where more than 14 000 out of 26 640 enrolled women filled an extensive questionnaire on lifestyle habits, hormonal and reproductive history and familiarity for BC, underwent anthropometric measurements and blood sampling. A model with candidate predictors was obtained through penalised logistic regression, and the selected variables were seven plasma miRNA-ratios, breast density, lifestyle score, menopausal status (MS), body mass index (BMI) and their interaction (BMI × MS). Area under the receiver operating characteristic curve (ROC AUC) of .79 for the complete model and of .73 for the miRNA-only model were obtained.3 Here, we applied the two models to a new set of women (validation set, n = 159) nested within the same cohort, including cases diagnosed up to four years after blood collection. Table 1 shows the main characteristics of the sample, with a similar distribution of factors between cases and controls except for the number of previous breast biopsies, breastfeeding and waist circumference. The flowchart of the validation study is visualised in Figure 1 and the methods are detailed in the Supplementary Information. Investigating associations between sample characteristics and studied miRNAs, we found weak correlations between BMI and three miRNA-ratios and between WCRF lifestyle score and one miRNA-ratio (Figure S1).The cancer characteristics in the discovery and validation sets were similar. However, in the discovery set the diagnosis occurred earlier relative to blood sampling (average 3 months vs. average 25 months), and the proportion of ki-67 positive tumours was lower (23 .5% vs. 82%) (Table S1). Moreover, unlike the discovery set, the validation set included 31 controls that underwent second-level investigation after a suspicious mammogram but then had a negative biopsy. The variables selected in the discovery model were comparable between the two control subgroups (Table S2). Two miRNA-ratios (miR-199a-3p/let-7a-5p and miR-26b-5p/miR-142-5p) were associated with ER status, with p-values of .049 and .027, respectively. Additionally, miR-93-5p/miR-19b-3p was associated with PgR status (p = .036) and let-7b-5p/miR-19b-3p with Tabar's classification of breast density (p = .025) (Figure S2).We applied the coefficients of nonmolecular variables and miRNA-ratios (obtained in the discovery set) to the validation set, yielding subpar discriminatory ability (Figure S3A), with ROC AUC = .63 (95% CI: .53–.74) and Brier score of .43. To assess the model calibration, a calibration curve was computed, and its intercept and slope were analysed. The predicted probabilities were miscalibrated, with a substantial overestimation of BC risk (Figure S3B), probably due to the differences between the two sets. The closed testing procedure indicated that the most appropriate model updating method was the re-estimation of the intercept and coefficients. After model recalibration using penalised ridge logistic regression, we obtained an ROC AUC of .87 (.81–.93) (Figure 2A), a Brier score of .11 and robust estimates after bootstrapping (Figure 2B). The sensitivity and specificity at Youden's cut-off (.17) were .97 and .70, respectively, and the calibration of the predicted probabilities was improved (Figure 2C). Using univariate logistic regression, we also investigated the seven miRNA-ratios in other publicly available circulating miRNA datasets, and despite the technological and population differences, subsets of the seven miRNA-ratios were associated with BC (Table S3).Furthermore, we merged the individual patient data of the discovery and validation sets and performed the internal external cross-validation (IECV). Using this approach, we created a new discriminatory model and accounted for the two datasets. The IECV model on all merged predictors had a relatively large heterogeneity on the meta-analysed Brier score (tau2 = .054). Therefore, the model on the combined dataset, based on all selected predictors, was not more informative than the models obtained from individual sets. A notable limitation of the IECV method in this study is the inclusion of only two cohorts, resulting in relatively unstable meta-analysis estimates. Nevertheless, we utilised the IECV method to create a model with the most generalisable predictors, obtaining a relatively low heterogeneity on the meta-analysed Brier score (tau2 = .001). This model included five predictors (miR-21-5p/miR-23a-3p, miR-199a-3p/let-7a-5p, MS, breast density and BMI), and had well-calibrated predicted probabilities (Figure S4) with a Brier score of .17 and an ROC AUC of .79 (.73–.85). The two miRNA-ratios were the predictors with coefficients of highest magnitude in the updated model (Table 2), suggesting a stronger diagnostic potential. A model combining three circulating small RNAs including miR-23a-3p and a miR-21-5p isoform was shown to discriminate stage 0 BCs from controls (ROC AUC of .92), although the study was not conducted within a screening set nor with prospective enrolment.4 Additionally, among the analysed datasets in Table S3, the most concordant miRNA-ratios (based on the direction of the relationship as summarised by the odds ratio) were the two with highest generalisability according to the IECV.Analogous results were obtained for the miRNA-ratio-only model, with relatively poor performance after initial application and miscalibrated predictive probabilities (Figure S5). After model updating, we obtained an ROC AUC of .77 (.69–.85), with a substantial improvement of the predicted probabilities (Brier score = .14) (Figure S6).We also assessed the performance of the identified plasma miRNA-ratios in 103 paired samples from breast cancer TCGA dataset, and all but let-7a-5p/miR-22-3p were significantly deregulated in breast tumours relative to normal adjacent tissues (Table S4). Hence, it is plausible that these miRNA-ratios have a role in BC onset or progression.Most of the thus-far published results of diagnostic cell free circulating miRNAs in the context of BC have focused on miRNA levels in patients already diagnosed with BC and lack methodological standardisation.1, 5 Thus, it remains unclear whether these biomarkers can be used for risk stratification or if they are merely a consequence of cancer progression, and whether they are generalisable. The key strengths of this study are the prospective sampling before any kind of treatment or diagnosis and the usage of miRNA-ratios for which RT-qPCR normalisers are not necessary.6In conclusion, we validated the discriminatory ability, in a screening setting, of candidate miRNA-ratio biomarkers that could easily be applied in the clinics and identified which of them are most generalisable. Although building a model on a larger dataset with more BC cases is needed, we highlighted the potential of plasma miRNAs, alone or combined with lifestyle and individual characteristics, for BC precision screening (Graphical abstract).ES: statistical data analysis, microRNA assessment, manuscript conceptualisation, writing and revision; IG: sample collection, processing, storing, microRNA assessment, manuscript revision; MMG: sample collection, processing, storing; PO: data analysis and data upload to Zenodo; VV: database management and case-control selection; AO: database generation and data elaboration; AA: mammogram analysis and breast density evaluation; EF: mammogram analysis; NS: funding and manuscript revision; GC: funding, supervision of experimental and analysis work, manuscript conceptualisation, writing and revision; LG: supervision of clinical data analysis, manuscript revision; EP: statistical data analysis supervision, manuscript conceptualisation, writing and revision.The project was funded by an investigator grant from the Italian Association for Cancer Research (AIRC IG 2014 Ref No 15374) to NS and LG, by the European Union's Horizon 2020 Research and Innovation Programme, Marie Skłodowska-Curie (Grant Number 859860) to ES and GC and by the 106562/RF 2023.1638 grant from Fondazione CRT to IG, MM-G and GC.Ethical approval was obtained from the Ethics Committee of each participating centre (Ethical and deontological institutional review board of the A.O.U Città della Salute e della Scienza of Turin, with the protocol number 78326 on 11.07.2013 and Ethical Committee of Novara with the protocol number 248/CE and study number CE 27/15).","PeriodicalId":10189,"journal":{"name":"Clinical and Translational Medicine","volume":"14 11","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11567874/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and Translational Medicine","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ctm2.70068","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

Abstract

Dear Editor,

Circulating cell-free microRNAs (miRNAs) were rarely explored as biomarkers for early detection of breast cancer (BC) within a screening setting or in prospectively sampled cohorts.¹ In this study, we confirmed the discriminatory ability of a combination of novel and reliable circulating miRNA-ratio biomarkers, with and without nonmolecular predictors, that could be used for BC early detection in the context of mammographic screening using a standard, affordable, noninvasive and reproducible technique such as quantitative Reverse Transcription Polymerase Chain Reaction (RT-qPCR). The models were built on a discovery case-control set (n = 131) nested within a large mammographic screening cohort,² where more than 14 000 out of 26 640 enrolled women filled an extensive questionnaire on lifestyle habits, hormonal and reproductive history and familiarity for BC, underwent anthropometric measurements and blood sampling. A model with candidate predictors was obtained through penalised logistic regression, and the selected variables were seven plasma miRNA-ratios, breast density, lifestyle score, menopausal status (MS), body mass index (BMI) and their interaction (BMI × MS). Area under the receiver operating characteristic curve (ROC AUC) of .79 for the complete model and of .73 for the miRNA-only model were obtained.³ Here, we applied the two models to a new set of women (validation set, n = 159) nested within the same cohort, including cases diagnosed up to four years after blood collection. Table 1 shows the main characteristics of the sample, with a similar distribution of factors between cases and controls except for the number of previous breast biopsies, breastfeeding and waist circumference. The flowchart of the validation study is visualised in Figure 1 and the methods are detailed in the Supplementary Information. Investigating associations between sample characteristics and studied miRNAs, we found weak correlations between BMI and three miRNA-ratios and between WCRF lifestyle score and one miRNA-ratio (Figure S1).

The cancer characteristics in the discovery and validation sets were similar. However, in the discovery set the diagnosis occurred earlier relative to blood sampling (average 3 months vs. average 25 months), and the proportion of ki-67 positive tumours was lower (23 .5% vs. 82%) (Table S1). Moreover, unlike the discovery set, the validation set included 31 controls that underwent second-level investigation after a suspicious mammogram but then had a negative biopsy. The variables selected in the discovery model were comparable between the two control subgroups (Table S2). Two miRNA-ratios (miR-199a-3p/let-7a-5p and miR-26b-5p/miR-142-5p) were associated with ER status, with p-values of .049 and .027, respectively. Additionally, miR-93-5p/miR-19b-3p was associated with PgR status (p = .036) and let-7b-5p/miR-19b-3p with Tabar's classification of breast density (p = .025) (Figure S2).

We applied the coefficients of nonmolecular variables and miRNA-ratios (obtained in the discovery set) to the validation set, yielding subpar discriminatory ability (Figure S3A), with ROC AUC = .63 (95% CI: .53–.74) and Brier score of .43. To assess the model calibration, a calibration curve was computed, and its intercept and slope were analysed. The predicted probabilities were miscalibrated, with a substantial overestimation of BC risk (Figure S3B), probably due to the differences between the two sets. The closed testing procedure indicated that the most appropriate model updating method was the re-estimation of the intercept and coefficients. After model recalibration using penalised ridge logistic regression, we obtained an ROC AUC of .87 (.81–.93) (Figure 2A), a Brier score of .11 and robust estimates after bootstrapping (Figure 2B). The sensitivity and specificity at Youden's cut-off (.17) were .97 and .70, respectively, and the calibration of the predicted probabilities was improved (Figure 2C). Using univariate logistic regression, we also investigated the seven miRNA-ratios in other publicly available circulating miRNA datasets, and despite the technological and population differences, subsets of the seven miRNA-ratios were associated with BC (Table S3).

Furthermore, we merged the individual patient data of the discovery and validation sets and performed the internal external cross-validation (IECV). Using this approach, we created a new discriminatory model and accounted for the two datasets. The IECV model on all merged predictors had a relatively large heterogeneity on the meta-analysed Brier score (tau² = .054). Therefore, the model on the combined dataset, based on all selected predictors, was not more informative than the models obtained from individual sets. A notable limitation of the IECV method in this study is the inclusion of only two cohorts, resulting in relatively unstable meta-analysis estimates. Nevertheless, we utilised the IECV method to create a model with the most generalisable predictors, obtaining a relatively low heterogeneity on the meta-analysed Brier score (tau² = .001). This model included five predictors (miR-21-5p/miR-23a-3p, miR-199a-3p/let-7a-5p, MS, breast density and BMI), and had well-calibrated predicted probabilities (Figure S4) with a Brier score of .17 and an ROC AUC of .79 (.73–.85). The two miRNA-ratios were the predictors with coefficients of highest magnitude in the updated model (Table 2), suggesting a stronger diagnostic potential. A model combining three circulating small RNAs including miR-23a-3p and a miR-21-5p isoform was shown to discriminate stage 0 BCs from controls (ROC AUC of .92), although the study was not conducted within a screening set nor with prospective enrolment.⁴ Additionally, among the analysed datasets in Table S3, the most concordant miRNA-ratios (based on the direction of the relationship as summarised by the odds ratio) were the two with highest generalisability according to the IECV.

Analogous results were obtained for the miRNA-ratio-only model, with relatively poor performance after initial application and miscalibrated predictive probabilities (Figure S5). After model updating, we obtained an ROC AUC of .77 (.69–.85), with a substantial improvement of the predicted probabilities (Brier score = .14) (Figure S6).

We also assessed the performance of the identified plasma miRNA-ratios in 103 paired samples from breast cancer TCGA dataset, and all but let-7a-5p/miR-22-3p were significantly deregulated in breast tumours relative to normal adjacent tissues (Table S4). Hence, it is plausible that these miRNA-ratios have a role in BC onset or progression.

Most of the thus-far published results of diagnostic cell free circulating miRNAs in the context of BC have focused on miRNA levels in patients already diagnosed with BC and lack methodological standardisation.^{1, 5} Thus, it remains unclear whether these biomarkers can be used for risk stratification or if they are merely a consequence of cancer progression, and whether they are generalisable. The key strengths of this study are the prospective sampling before any kind of treatment or diagnosis and the usage of miRNA-ratios for which RT-qPCR normalisers are not necessary.⁶

In conclusion, we validated the discriminatory ability, in a screening setting, of candidate miRNA-ratio biomarkers that could easily be applied in the clinics and identified which of them are most generalisable. Although building a model on a larger dataset with more BC cases is needed, we highlighted the potential of plasma miRNAs, alone or combined with lifestyle and individual characteristics, for BC precision screening (Graphical abstract).

ES: statistical data analysis, microRNA assessment, manuscript conceptualisation, writing and revision; IG: sample collection, processing, storing, microRNA assessment, manuscript revision; MMG: sample collection, processing, storing; PO: data analysis and data upload to Zenodo; VV: database management and case-control selection; AO: database generation and data elaboration; AA: mammogram analysis and breast density evaluation; EF: mammogram analysis; NS: funding and manuscript revision; GC: funding, supervision of experimental and analysis work, manuscript conceptualisation, writing and revision; LG: supervision of clinical data analysis, manuscript revision; EP: statistical data analysis supervision, manuscript conceptualisation, writing and revision.

The project was funded by an investigator grant from the Italian Association for Cancer Research (AIRC IG 2014 Ref No 15374) to NS and LG, by the European Union's Horizon 2020 Research and Innovation Programme, Marie Skłodowska-Curie (Grant Number 859860) to ES and GC and by the 106562/RF 2023.1638 grant from Fondazione CRT to IG, MM-G and GC.

Ethical approval was obtained from the Ethics Committee of each participating centre (Ethical and deontological institutional review board of the A.O.U Città della Salute e della Scienza of Turin, with the protocol number 78326 on 11.07.2013 and Ethical Committee of Novara with the protocol number 248/CE and study number CE 27/15).

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

在筛查环境下的一项巢式病例对照研究中，确认先前确定的用于检测乳腺癌的血浆 microRNA 比率。

亲爱的编辑，循环中的无细胞microRNA（miRNA）很少作为生物标志物在筛查或前瞻性取样队列中用于乳腺癌（BC）的早期检测1。在这项研究中，我们证实了新颖可靠的循环 miRNA 比率生物标志物组合的鉴别能力，无论是否有非分子预测因子，这些生物标志物都可用于乳腺 X 线照相筛查中的 BC 早期检测，使用的是标准、经济、无创和可重复的技术，如定量反转录聚合酶链反应（RT-qPCR）。在 26 640 名登记妇女中，有 14 000 多人填写了有关生活习惯、荷尔蒙和生殖史以及对 BC 的熟悉程度的详细问卷，并接受了人体测量和血液采样。通过惩罚性逻辑回归得出了一个包含候选预测因子的模型，所选变量包括七种血浆 miRNA 比率、乳腺密度、生活方式评分、绝经状态（MS）、体重指数（BMI）及其交互作用（BMI × MS）。完整模型的接收者操作特征曲线下面积（ROC AUC）为 0.79，纯 miRNA 模型的接收者操作特征曲线下面积（ROC AUC）为 0.73。3 在此，我们将这两个模型应用于嵌套在同一队列中的一组新女性（验证集，n = 159），包括采血后四年内确诊的病例。表 1 显示了样本的主要特征，除既往乳腺活检次数、母乳喂养和腰围外，病例和对照组的其他因素分布相似。验证研究的流程图见图 1，方法详见补充信息。在调查样本特征与所研究的 miRNA 之间的关系时，我们发现体重指数与三个 miRNA 比率之间以及 WCRF 生活方式评分与一个 miRNA 比率之间存在微弱的相关性（图 S1）。然而，在发现集中，相对于血液采样，诊断发生得更早（平均 3 个月对平均 25 个月），而且 ki-67 阳性肿瘤的比例较低（23.5% 对 82%）（表 S1）。此外，与发现集不同的是，验证集包括31个对照组，这些对照组在可疑乳房X光检查后接受了二级检查，但活检结果为阴性。发现模型中选择的变量在两个对照亚组中具有可比性（表 S2）。两个 miRNA 比率（miR-199a-3p/let-7a-5p 和 miR-26b-5p/miR-142-5p）与 ER 状态相关，p 值分别为 0.049 和 0.027。此外，miR-93-5p/miR-19b-3p 与 PgR 状态相关（p = .036），let-7b-5p/miR-19b-3p 与 Tabar 的乳腺密度分类相关（p = .025）（图 S2）。我们将非分子变量系数和 miRNA 比率（在发现集中获得）应用于验证集中，结果显示判别能力不佳（图 S3A），ROC AUC = .63 (95% CI: .53-.74) 和 Brier 分数为 .43。为评估模型校准，计算了校准曲线，并分析了其截距和斜率。预测的概率被误判，BC 风险被大幅高估（图 S3B），这可能是由于两组数据之间的差异造成的。封闭测试程序表明，最合适的模型更新方法是重新估计截距和系数。使用惩罚性脊逻辑回归对模型进行重新校准后，我们得到的 ROC AUC 为 0.87（0.81-0.93）（图 2A），Brier 得分为 0.11，自举后的估计值稳健（图 2B）。尤登截断值（.17）的灵敏度和特异度分别为 0.97 和 0.70，预测概率的校准也得到了改善（图 2C）。尽管存在技术和人群差异，但这七种 miRNA 比率的子集与 BC 相关（表 S3）。通过这种方法，我们创建了一个新的判别模型，并对两个数据集进行了核算。所有合并预测因子的 IECV 模型在荟萃分析的 Brier 评分上具有相对较大的异质性（tau2 = .054）。因此，基于所有选定预测因子的合并数据集模型并不比单个数据集的模型更有参考价值。在本研究中，IECV 方法的一个显著局限是只纳入了两个队列，导致荟萃分析估计值相对不稳定。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Clinical and Translational Medicine Multiple-

CiteScore

15.90

自引率

1.90%

发文量

450

审稿时长

4 weeks

期刊介绍： Clinical and Translational Medicine (CTM) is an international, peer-reviewed, open-access journal dedicated to accelerating the translation of preclinical research into clinical applications and fostering communication between basic and clinical scientists. It highlights the clinical potential and application of various fields including biotechnologies, biomaterials, bioengineering, biomarkers, molecular medicine, omics science, bioinformatics, immunology, molecular imaging, drug discovery, regulation, and health policy. With a focus on the bench-to-bedside approach, CTM prioritizes studies and clinical observations that generate hypotheses relevant to patients and diseases, guiding investigations in cellular and molecular medicine. The journal encourages submissions from clinicians, researchers, policymakers, and industry professionals.