基于人工智能的HCC预后分层，走向个体化治疗途径

IF 6.2 2区医学 Q1 GASTROENTEROLOGY & HEPATOLOGY Liver International Pub Date : 2025-03-14 DOI:10.1111/liv.16153

Olivier Sutter, Lorenzo-Carlo Pescatori

{"title":"基于人工智能的HCC预后分层，走向个体化治疗途径","authors":"Olivier Sutter, Lorenzo-Carlo Pescatori","doi":"10.1111/liv.16153","DOIUrl":null,"url":null,"abstract":"Early-stage hepatocellular carcinoma (HCC), defined as a single tumour or up to three lesions < 3 cm, according to the Barcelona Clinic Liver Cancer (BCLC) classification, is eligible for curative treatment [1]. Curative options include liver transplantation, surgical resection or percutaneous ablation. Ablation can be selected as a first-line therapy over surgery for tumour(s) < 3 cm due to its advantages, such as lower complication rates, reduced mortality and minimal invasiveness, making it feasible even when liver function is slightly impaired or in the presence of portal hypertension. Moreover, for transplant-eligible patients, the strategy of first-line percutaneous ablation followed by salvage transplantation in case of recurrence is gaining traction worldwide in the context of graft shortages [2]. However, unlike surgery, ablation does not allow for a full histopathological analysis of the resected specimen, limiting the tumour tissue analysis to samples obtained via micro-biopsies prior to the ablation. This is a limitation for prognostic stratification following a first curative treatment, especially for transplantable patients, as recognised histological markers of tumour aggressiveness, such as microvascular invasion and satellites, cannot be captured by biopsies. Intratumoral heterogeneity (ITH) is another promising marker with the potential to provide valuable information on prognosis and the risk of early recurrence. As ITH can only be fully assessed through histological and genomic evaluation of the entire lesion, imaging features of ITH, using modalities such as radiomics and/or deep-learning, could serve as surrogate markers in advanced HCC or in early-stage HCCs treated with ablation where, by definition, no surgical specimen is available.In this issue of Liver International, Zhang et al. describe a transformer-based quantitative ITH model that integrates signatures extracted from ultrasound (US), contrast-enhanced US (CEUS) and magnetic resonance imaging (MRI), acquired before ablation, along with demographic, clinicopathological and laboratory variables to predict individual early recurrence risk [3]. The model was tested on cohorts of patients treated with radiofrequency ablation (RFA) and microwave ablation (MWA), and then validated on external cohorts undergoing RFA, laser ablation (LA) and irreversible electroporation (IRE) treatments. The primary structure of the network used to extract ITH-related features from imaging is referred to as the vision-transformer-based quantitative intratumoral heterogeneity (ViT-Q-ITH) model. This deep learning model segments images into patches and employs a self-attention mechanism to analyse correlations between these patches, capturing both global and local relationships to enhance the understanding of complex visual structures. A combined model was then developed by integrating the ViT-Q-ITH score with clinical factors. The study results show that the combined model achieved high performance in both the internal validation cohort (AUC 0.86) and the external test cohort (AUC 0.83), with sensitivities of 76% and 74% and specificities of 88% and 84%, respectively, outperforming both the traditional clinical model and the standalone ViT-Q-ITH model. Recurrence-free survival analyses further confirmed the superior stratification capability of the combined model, clearly distinguishing high-risk from low-risk patients more accurately than clinical models alone. In all test cohorts, high-risk patients identified by the combined model had a significantly higher probability of early recurrence (local or distant) compared to low-risk patients.One of the study's strengths lies in the generalisability of the combined model when applied to ‘real-world’ data from external cohorts. Indeed, these external cohorts differed not only in the therapeutic modalities employed (some patients treated with IRE or LA instead of traditional RFA/MWA) but also in baseline clinical and biological characteristics (e.g., much smaller tumours and less advanced liver disease) compared to the training cohort. Despite these unfavourable conditions on paper, the developed combined model showed excellent performance, achieving AUCs over 0.8. This suggests that the model may be applicable across different clinical settings and ablation modalities, broadening its potential for clinical use. An innovative aspect of this study is the development of a model that integrates traditional clinical data, known to be linked to HCC recurrence, with readily available images such as single pictures from US and CEUS. It is also worth noting that the two MRI sequences used were unenhanced acquisitions (T2- and diffusion-weighted imaging), as opposed to numerous studies on tumour heterogeneity on imaging, which generally analyse vascularity/texture on contrast-enhanced volumes. This choice was likely made to limit variations related to image contrast, injection type and timing, thereby improving the model's generalisability across centres.However, one recurrent challenge for the widespread adoption of radiomics or AI-based models lies in their interpretability. Although the vision-transformer offers superior performance, its neural network nature makes it more complex to interpret compared to traditional approaches based on simple clinical variables or static images. This is critical because predictive models must be understandable not only to AI specialists but also to the clinicians who want to use them in daily practice. Despite these challenges, this model appears robust, and there are several areas for potential development to improve its clinical applicability, particularly by extending validation to multi-institutional and international datasets to test the model's robustness in diverse populations. A potential target population for AI-based imaging biomarkers could be transplant-eligible patients treated with first-line ablation, in order to stratify, based on their predicted early recurrence risk, whether they should be fast-tracked for transplantation rather than following an ‘ablate and wait’ strategy. Additionally, integrating other data sources, such as genetic sequencing or circulating biomarkers, could further enhance the model's predictive power. Combining genetic data with imaging and clinical data could provide a more complete picture of tumour biology, improving prediction accuracy and enabling even more personalised risk stratification. Lastly, the challenge of interpretability must be addressed more thoroughly. Clinician involvement in model training and validation could help develop more transparent tools that can be used in daily practice, reducing the common feeling among physicians of dealing with a ‘black box’ with AI-based tools. In this context, explainable AI (XAI) techniques will play a crucial role, enabling deep learning models to maintain high accuracy without sacrificing an understanding of their internal mechanisms [4].In conclusion, the study by Zhang et al. contributes to the growing body of evidence supporting the use of AI-based imaging biomarkers in the prognostic stratification of HCC [5]. While challenges remain, particularly in terms of interpretability and large-scale validation, these approaches have the potential to improve clinicians' ability to predict recurrence and tailor patient management accordingly.The authors declare no conflicts of interest.","PeriodicalId":18101,"journal":{"name":"Liver International","volume":"45 4","pages":""},"PeriodicalIF":6.2000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/liv.16153","citationCount":"0","resultStr":"{\"title\":\"AI-Based Prognostic Stratification in HCC, Towards a Personalised Treatment Approach\",\"authors\":\"Olivier Sutter, Lorenzo-Carlo Pescatori\",\"doi\":\"10.1111/liv.16153\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Early-stage hepatocellular carcinoma (HCC), defined as a single tumour or up to three lesions < 3 cm, according to the Barcelona Clinic Liver Cancer (BCLC) classification, is eligible for curative treatment [1]. Curative options include liver transplantation, surgical resection or percutaneous ablation. Ablation can be selected as a first-line therapy over surgery for tumour(s) < 3 cm due to its advantages, such as lower complication rates, reduced mortality and minimal invasiveness, making it feasible even when liver function is slightly impaired or in the presence of portal hypertension. Moreover, for transplant-eligible patients, the strategy of first-line percutaneous ablation followed by salvage transplantation in case of recurrence is gaining traction worldwide in the context of graft shortages [2]. However, unlike surgery, ablation does not allow for a full histopathological analysis of the resected specimen, limiting the tumour tissue analysis to samples obtained via micro-biopsies prior to the ablation. This is a limitation for prognostic stratification following a first curative treatment, especially for transplantable patients, as recognised histological markers of tumour aggressiveness, such as microvascular invasion and satellites, cannot be captured by biopsies. Intratumoral heterogeneity (ITH) is another promising marker with the potential to provide valuable information on prognosis and the risk of early recurrence. As ITH can only be fully assessed through histological and genomic evaluation of the entire lesion, imaging features of ITH, using modalities such as radiomics and/or deep-learning, could serve as surrogate markers in advanced HCC or in early-stage HCCs treated with ablation where, by definition, no surgical specimen is available.In this issue of Liver International, Zhang et al. describe a transformer-based quantitative ITH model that integrates signatures extracted from ultrasound (US), contrast-enhanced US (CEUS) and magnetic resonance imaging (MRI), acquired before ablation, along with demographic, clinicopathological and laboratory variables to predict individual early recurrence risk [3]. The model was tested on cohorts of patients treated with radiofrequency ablation (RFA) and microwave ablation (MWA), and then validated on external cohorts undergoing RFA, laser ablation (LA) and irreversible electroporation (IRE) treatments. The primary structure of the network used to extract ITH-related features from imaging is referred to as the vision-transformer-based quantitative intratumoral heterogeneity (ViT-Q-ITH) model. This deep learning model segments images into patches and employs a self-attention mechanism to analyse correlations between these patches, capturing both global and local relationships to enhance the understanding of complex visual structures. A combined model was then developed by integrating the ViT-Q-ITH score with clinical factors. The study results show that the combined model achieved high performance in both the internal validation cohort (AUC 0.86) and the external test cohort (AUC 0.83), with sensitivities of 76% and 74% and specificities of 88% and 84%, respectively, outperforming both the traditional clinical model and the standalone ViT-Q-ITH model. Recurrence-free survival analyses further confirmed the superior stratification capability of the combined model, clearly distinguishing high-risk from low-risk patients more accurately than clinical models alone. In all test cohorts, high-risk patients identified by the combined model had a significantly higher probability of early recurrence (local or distant) compared to low-risk patients.One of the study's strengths lies in the generalisability of the combined model when applied to ‘real-world’ data from external cohorts. Indeed, these external cohorts differed not only in the therapeutic modalities employed (some patients treated with IRE or LA instead of traditional RFA/MWA) but also in baseline clinical and biological characteristics (e.g., much smaller tumours and less advanced liver disease) compared to the training cohort. Despite these unfavourable conditions on paper, the developed combined model showed excellent performance, achieving AUCs over 0.8. This suggests that the model may be applicable across different clinical settings and ablation modalities, broadening its potential for clinical use. An innovative aspect of this study is the development of a model that integrates traditional clinical data, known to be linked to HCC recurrence, with readily available images such as single pictures from US and CEUS. It is also worth noting that the two MRI sequences used were unenhanced acquisitions (T2- and diffusion-weighted imaging), as opposed to numerous studies on tumour heterogeneity on imaging, which generally analyse vascularity/texture on contrast-enhanced volumes. This choice was likely made to limit variations related to image contrast, injection type and timing, thereby improving the model's generalisability across centres.However, one recurrent challenge for the widespread adoption of radiomics or AI-based models lies in their interpretability. Although the vision-transformer offers superior performance, its neural network nature makes it more complex to interpret compared to traditional approaches based on simple clinical variables or static images. This is critical because predictive models must be understandable not only to AI specialists but also to the clinicians who want to use them in daily practice. Despite these challenges, this model appears robust, and there are several areas for potential development to improve its clinical applicability, particularly by extending validation to multi-institutional and international datasets to test the model's robustness in diverse populations. A potential target population for AI-based imaging biomarkers could be transplant-eligible patients treated with first-line ablation, in order to stratify, based on their predicted early recurrence risk, whether they should be fast-tracked for transplantation rather than following an ‘ablate and wait’ strategy. Additionally, integrating other data sources, such as genetic sequencing or circulating biomarkers, could further enhance the model's predictive power. Combining genetic data with imaging and clinical data could provide a more complete picture of tumour biology, improving prediction accuracy and enabling even more personalised risk stratification. Lastly, the challenge of interpretability must be addressed more thoroughly. Clinician involvement in model training and validation could help develop more transparent tools that can be used in daily practice, reducing the common feeling among physicians of dealing with a ‘black box’ with AI-based tools. In this context, explainable AI (XAI) techniques will play a crucial role, enabling deep learning models to maintain high accuracy without sacrificing an understanding of their internal mechanisms [4].In conclusion, the study by Zhang et al. contributes to the growing body of evidence supporting the use of AI-based imaging biomarkers in the prognostic stratification of HCC [5]. While challenges remain, particularly in terms of interpretability and large-scale validation, these approaches have the potential to improve clinicians' ability to predict recurrence and tailor patient management accordingly.The authors declare no conflicts of interest.\",\"PeriodicalId\":18101,\"journal\":{\"name\":\"Liver International\",\"volume\":\"45 4\",\"pages\":\"\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-03-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/liv.16153\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Liver International\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/liv.16153\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GASTROENTEROLOGY & HEPATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Liver International","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/liv.16153","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

这种选择可能是为了限制与图像对比度、注射类型和时间相关的变化，从而提高模型在各个中心的通用性。然而，广泛采用放射组学或基于人工智能的模型的一个反复出现的挑战在于它们的可解释性。尽管视觉转换器提供了优越的性能，但与基于简单临床变量或静态图像的传统方法相比，其神经网络特性使其更复杂。这一点至关重要，因为预测模型不仅要让人工智能专家理解，而且要让想要在日常实践中使用它们的临床医生理解。尽管存在这些挑战，但该模型似乎是稳健的，并且有几个潜在的发展领域可以提高其临床适用性，特别是通过将验证扩展到多机构和国际数据集，以测试该模型在不同人群中的稳健性。基于人工智能的成像生物标志物的潜在目标人群可能是接受一线消融治疗的符合移植条件的患者，以便根据他们预测的早期复发风险，对他们进行分层，是否应该快速追踪移植，而不是遵循“消融和等待”策略。此外，整合其他数据源，如基因测序或循环生物标志物，可以进一步提高模型的预测能力。将遗传数据与成像和临床数据相结合，可以提供更完整的肿瘤生物学图景，提高预测的准确性，并实现更个性化的风险分层。最后，必须更彻底地解决可解释性的挑战。临床医生参与模型训练和验证可以帮助开发更透明的工具，可以在日常实践中使用，减少医生使用基于人工智能的工具处理“黑盒子”的普遍感觉。在这种情况下，可解释的人工智能（XAI）技术将发挥至关重要的作用，使深度学习模型能够在不牺牲对其内部机制的理解的情况下保持高准确性。总之，Zhang等人的研究为越来越多的证据支持在HCC预后分层中使用基于人工智能的成像生物标志物做出了贡献。尽管挑战仍然存在，特别是在可解释性和大规模验证方面，这些方法有可能提高临床医生预测复发的能力，并相应地调整患者管理。作者声明无利益冲突。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

AI-Based Prognostic Stratification in HCC, Towards a Personalised Treatment Approach

Early-stage hepatocellular carcinoma (HCC), defined as a single tumour or up to three lesions < 3 cm, according to the Barcelona Clinic Liver Cancer (BCLC) classification, is eligible for curative treatment [1]. Curative options include liver transplantation, surgical resection or percutaneous ablation. Ablation can be selected as a first-line therapy over surgery for tumour(s) < 3 cm due to its advantages, such as lower complication rates, reduced mortality and minimal invasiveness, making it feasible even when liver function is slightly impaired or in the presence of portal hypertension. Moreover, for transplant-eligible patients, the strategy of first-line percutaneous ablation followed by salvage transplantation in case of recurrence is gaining traction worldwide in the context of graft shortages [2]. However, unlike surgery, ablation does not allow for a full histopathological analysis of the resected specimen, limiting the tumour tissue analysis to samples obtained via micro-biopsies prior to the ablation. This is a limitation for prognostic stratification following a first curative treatment, especially for transplantable patients, as recognised histological markers of tumour aggressiveness, such as microvascular invasion and satellites, cannot be captured by biopsies. Intratumoral heterogeneity (ITH) is another promising marker with the potential to provide valuable information on prognosis and the risk of early recurrence. As ITH can only be fully assessed through histological and genomic evaluation of the entire lesion, imaging features of ITH, using modalities such as radiomics and/or deep-learning, could serve as surrogate markers in advanced HCC or in early-stage HCCs treated with ablation where, by definition, no surgical specimen is available.

In this issue of Liver International, Zhang et al. describe a transformer-based quantitative ITH model that integrates signatures extracted from ultrasound (US), contrast-enhanced US (CEUS) and magnetic resonance imaging (MRI), acquired before ablation, along with demographic, clinicopathological and laboratory variables to predict individual early recurrence risk [3]. The model was tested on cohorts of patients treated with radiofrequency ablation (RFA) and microwave ablation (MWA), and then validated on external cohorts undergoing RFA, laser ablation (LA) and irreversible electroporation (IRE) treatments. The primary structure of the network used to extract ITH-related features from imaging is referred to as the vision-transformer-based quantitative intratumoral heterogeneity (ViT-Q-ITH) model. This deep learning model segments images into patches and employs a self-attention mechanism to analyse correlations between these patches, capturing both global and local relationships to enhance the understanding of complex visual structures. A combined model was then developed by integrating the ViT-Q-ITH score with clinical factors. The study results show that the combined model achieved high performance in both the internal validation cohort (AUC 0.86) and the external test cohort (AUC 0.83), with sensitivities of 76% and 74% and specificities of 88% and 84%, respectively, outperforming both the traditional clinical model and the standalone ViT-Q-ITH model. Recurrence-free survival analyses further confirmed the superior stratification capability of the combined model, clearly distinguishing high-risk from low-risk patients more accurately than clinical models alone. In all test cohorts, high-risk patients identified by the combined model had a significantly higher probability of early recurrence (local or distant) compared to low-risk patients.

One of the study's strengths lies in the generalisability of the combined model when applied to ‘real-world’ data from external cohorts. Indeed, these external cohorts differed not only in the therapeutic modalities employed (some patients treated with IRE or LA instead of traditional RFA/MWA) but also in baseline clinical and biological characteristics (e.g., much smaller tumours and less advanced liver disease) compared to the training cohort. Despite these unfavourable conditions on paper, the developed combined model showed excellent performance, achieving AUCs over 0.8. This suggests that the model may be applicable across different clinical settings and ablation modalities, broadening its potential for clinical use. An innovative aspect of this study is the development of a model that integrates traditional clinical data, known to be linked to HCC recurrence, with readily available images such as single pictures from US and CEUS. It is also worth noting that the two MRI sequences used were unenhanced acquisitions (T2- and diffusion-weighted imaging), as opposed to numerous studies on tumour heterogeneity on imaging, which generally analyse vascularity/texture on contrast-enhanced volumes. This choice was likely made to limit variations related to image contrast, injection type and timing, thereby improving the model's generalisability across centres.

However, one recurrent challenge for the widespread adoption of radiomics or AI-based models lies in their interpretability. Although the vision-transformer offers superior performance, its neural network nature makes it more complex to interpret compared to traditional approaches based on simple clinical variables or static images. This is critical because predictive models must be understandable not only to AI specialists but also to the clinicians who want to use them in daily practice. Despite these challenges, this model appears robust, and there are several areas for potential development to improve its clinical applicability, particularly by extending validation to multi-institutional and international datasets to test the model's robustness in diverse populations. A potential target population for AI-based imaging biomarkers could be transplant-eligible patients treated with first-line ablation, in order to stratify, based on their predicted early recurrence risk, whether they should be fast-tracked for transplantation rather than following an ‘ablate and wait’ strategy. Additionally, integrating other data sources, such as genetic sequencing or circulating biomarkers, could further enhance the model's predictive power. Combining genetic data with imaging and clinical data could provide a more complete picture of tumour biology, improving prediction accuracy and enabling even more personalised risk stratification. Lastly, the challenge of interpretability must be addressed more thoroughly. Clinician involvement in model training and validation could help develop more transparent tools that can be used in daily practice, reducing the common feeling among physicians of dealing with a ‘black box’ with AI-based tools. In this context, explainable AI (XAI) techniques will play a crucial role, enabling deep learning models to maintain high accuracy without sacrificing an understanding of their internal mechanisms [4].

In conclusion, the study by Zhang et al. contributes to the growing body of evidence supporting the use of AI-based imaging biomarkers in the prognostic stratification of HCC [5]. While challenges remain, particularly in terms of interpretability and large-scale validation, these approaches have the potential to improve clinicians' ability to predict recurrence and tailor patient management accordingly.

The authors declare no conflicts of interest.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Liver International 医学-胃肠肝病学

CiteScore

13.90

自引率

4.50%

发文量

348

审稿时长

2 months

期刊介绍： Liver International promotes all aspects of the science of hepatology from basic research to applied clinical studies. Providing an international forum for the publication of high-quality original research in hepatology, it is an essential resource for everyone working on normal and abnormal structure and function in the liver and its constituent cells, including clinicians and basic scientists involved in the multi-disciplinary field of hepatology. The journal welcomes articles from all fields of hepatology, which may be published as original articles, brief definitive reports, reviews, mini-reviews, images in hepatology and letters to the Editor.