Recent advances in diagnostic imaging have led to an increase in the detection of early-stage breast cancer, prompting growing interest in minimally invasive breast surgery. These approaches include the omission of axillary lymph node dissection, use of endoscopic or robot-assisted surgery, and application of non-surgical ablation techniques. Non-surgical ablation therapy aims to eradicate tumor tissue in situ by delivering localized thermal or cryogenic energy under imaging guidance without surgical resection. Among these, radiofrequency ablation (RFA) and cryoablation have been the most extensively studied, showing promising results for small early-stage tumors. Clinical studies have reported high rates of complete tumor ablation, favorable safety profiles, and excellent cosmetic outcomes. Moreover, single-arm prospective trials have provided 5-year follow-up data, offering valuable insights into long-term efficacy. Despite these encouraging results, non-surgical ablation has not been internationally adopted as a standard treatment because of concerns regarding oncological surveillance and outcome validation. However, in Japan, RFA for early-stage breast cancer was approved for insurance coverage in December 2023. This review summarizes the current trends and clinical evidence for RFA and cryoablation, and discusses the regulatory pathways required for RFA insurance approval in Japan.
{"title":"Emerging advances in non-surgical ablation for early-stage breast cancer.","authors":"Shin Takayama, Takeshi Murata, Chikashi Watase, Hinako Maeda, Ayako Nakashoji, Ayumi Ogawa, Mengge Chen, Natsuko Ogi, Takayuki Kinoshita","doi":"10.1007/s11604-026-01951-5","DOIUrl":"https://doi.org/10.1007/s11604-026-01951-5","url":null,"abstract":"<p><p>Recent advances in diagnostic imaging have led to an increase in the detection of early-stage breast cancer, prompting growing interest in minimally invasive breast surgery. These approaches include the omission of axillary lymph node dissection, use of endoscopic or robot-assisted surgery, and application of non-surgical ablation techniques. Non-surgical ablation therapy aims to eradicate tumor tissue in situ by delivering localized thermal or cryogenic energy under imaging guidance without surgical resection. Among these, radiofrequency ablation (RFA) and cryoablation have been the most extensively studied, showing promising results for small early-stage tumors. Clinical studies have reported high rates of complete tumor ablation, favorable safety profiles, and excellent cosmetic outcomes. Moreover, single-arm prospective trials have provided 5-year follow-up data, offering valuable insights into long-term efficacy. Despite these encouraging results, non-surgical ablation has not been internationally adopted as a standard treatment because of concerns regarding oncological surveillance and outcome validation. However, in Japan, RFA for early-stage breast cancer was approved for insurance coverage in December 2023. This review summarizes the current trends and clinical evidence for RFA and cryoablation, and discusses the regulatory pathways required for RFA insurance approval in Japan.</p>","PeriodicalId":14691,"journal":{"name":"Japanese Journal of Radiology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146124903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-05DOI: 10.1007/s11604-026-01953-3
S Dhanya Dedeepya, Vaishali Goel, Nivedita Nikhil Desai
{"title":"Comment on \"the usefulness of the follicle-preserving sign in differentiating between benign, borderline, and malignant ovarian tumors on magnetic resonance imaging\".","authors":"S Dhanya Dedeepya, Vaishali Goel, Nivedita Nikhil Desai","doi":"10.1007/s11604-026-01953-3","DOIUrl":"https://doi.org/10.1007/s11604-026-01953-3","url":null,"abstract":"","PeriodicalId":14691,"journal":{"name":"Japanese Journal of Radiology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146124882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: To evaluate the distribution patterns of sarcoidosis involvement on FDG-PET/CT in patients with known or suspected cardiac sarcoidosis (CS), with a particular focus on upper abdominal lymph nodes (LN) (periportal LN [PLN], anterior pancreaticoduodenal LN [APDLN], and posterior pancreaticoduodenal LN [PPDLN]) and the association of them with other lesions and myocardium.
Methods: We identified 861 FDG-PET/CT scans performed between July 2016 and August 2024 in patients with known or suspected CS, and included 178 cases for analysis of FDG uptake patterns suggestive of sarcoid involvement. FDG-positive LNs or regions were classified as sarcoidosis-related based on treatment response, characteristic uptake patterns, or histological confirmation. The occurrence ratio of FDG-positive lymph nodes or regions was also assessed in relation to myocardial FDG uptake patterns.
Results: FDG uptake was observed most frequently in hilar and mediastinal LNs (79% and 76%, respectively). Upper abdominal LN uptake was observed in 49.4% of patients, most commonly in the PLN (31.5%), APDLN (38.2%), and PPDLN (37.1%). Heatmap analyses revealed strong co-occurrence between thoracic and upper abdominal LNs, suggesting a lymphatic dissemination pattern. Peripheral LNs such as axillary, subclavian, and inguinal/pelvic stations demonstrated low uptake and minimal co-occurrence.
Conclusions: FDG-PET/CT provides valuable insight into the structured lymphatic dissemination of sarcoidosis. Frequent FDG uptake in upper abdominal lymph nodes, particularly when accompanied by thoracic involvement, represents a characteristic finding in sarcoidosis. Recognition of this pattern can improve diagnostic accuracy and help differentiate sarcoidosis from other systemic diseases. This study assessed lymph node involvement on FDG-PET/CT in patients with suspected or known cardiac sarcoidosis, revealing distinct dissemination patterns into the upper abdomen. These findings enhance understanding of disease pathophysiology and may improve diagnostic evaluation.
目的:评估已知或疑似心脏结节病(CS)患者的FDG-PET/CT结节病累及分布模式,特别关注上腹部淋巴结(门静脉周围淋巴结[PLN]、胰十二指肠前淋巴结[APDLN]和胰十二指肠后淋巴结[PPDLN])及其与其他病变和心肌的关系。方法:我们确定了2016年7月至2024年8月期间进行的861例已知或疑似CS患者的FDG- pet /CT扫描,其中178例用于分析提示肉瘤累及的FDG摄取模式。基于治疗反应、特征性摄取模式或组织学证实,fdg阳性的LNs或区域被归类为结节病相关。FDG阳性淋巴结或区域的发生率也与心肌FDG摄取模式有关。结果:FDG摄取在肺门和纵隔ln中最为常见(分别为79%和76%)。49.4%的患者观察到上腹部LN摄取,最常见的是PLN (31.5%), APDLN(38.2%)和PPDLN(37.1%)。热图分析显示,胸部和上腹部的LNs之间有很强的共发性,提示淋巴传播模式。外周淋巴结如腋窝、锁骨下和腹股沟/盆腔淋巴结的吸收较低,且极少合并。结论:FDG-PET/CT为结节病的结构性淋巴传播提供了有价值的见解。上腹部淋巴结频繁摄取FDG,特别是当伴有胸部受累时,是结节病的特征性表现。识别这种模式可以提高诊断的准确性,并有助于将结节病与其他全身性疾病区分开来。本研究评估了疑似或已知心脏结节病患者的FDG-PET/CT淋巴结受累情况,显示明显的上腹部播散模式。这些发现增强了对疾病病理生理学的认识,并可能改善诊断评价。
{"title":"FDG uptake in upper abdominal lymph node as a distinctive pattern in sarcoidosis.","authors":"Ryogo Minamimoto, Yumi Abe, Ryota Morimoto, Rintato Ito, Naotoshi Fujita, Toyoaki Murohara, Katsuhiko Kato, Shinji Naganawa","doi":"10.1007/s11604-025-01935-x","DOIUrl":"https://doi.org/10.1007/s11604-025-01935-x","url":null,"abstract":"<p><strong>Objectives: </strong>To evaluate the distribution patterns of sarcoidosis involvement on FDG-PET/CT in patients with known or suspected cardiac sarcoidosis (CS), with a particular focus on upper abdominal lymph nodes (LN) (periportal LN [PLN], anterior pancreaticoduodenal LN [APDLN], and posterior pancreaticoduodenal LN [PPDLN]) and the association of them with other lesions and myocardium.</p><p><strong>Methods: </strong>We identified 861 FDG-PET/CT scans performed between July 2016 and August 2024 in patients with known or suspected CS, and included 178 cases for analysis of FDG uptake patterns suggestive of sarcoid involvement. FDG-positive LNs or regions were classified as sarcoidosis-related based on treatment response, characteristic uptake patterns, or histological confirmation. The occurrence ratio of FDG-positive lymph nodes or regions was also assessed in relation to myocardial FDG uptake patterns.</p><p><strong>Results: </strong>FDG uptake was observed most frequently in hilar and mediastinal LNs (79% and 76%, respectively). Upper abdominal LN uptake was observed in 49.4% of patients, most commonly in the PLN (31.5%), APDLN (38.2%), and PPDLN (37.1%). Heatmap analyses revealed strong co-occurrence between thoracic and upper abdominal LNs, suggesting a lymphatic dissemination pattern. Peripheral LNs such as axillary, subclavian, and inguinal/pelvic stations demonstrated low uptake and minimal co-occurrence.</p><p><strong>Conclusions: </strong>FDG-PET/CT provides valuable insight into the structured lymphatic dissemination of sarcoidosis. Frequent FDG uptake in upper abdominal lymph nodes, particularly when accompanied by thoracic involvement, represents a characteristic finding in sarcoidosis. Recognition of this pattern can improve diagnostic accuracy and help differentiate sarcoidosis from other systemic diseases. This study assessed lymph node involvement on FDG-PET/CT in patients with suspected or known cardiac sarcoidosis, revealing distinct dissemination patterns into the upper abdomen. These findings enhance understanding of disease pathophysiology and may improve diagnostic evaluation.</p>","PeriodicalId":14691,"journal":{"name":"Japanese Journal of Radiology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146124949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-05DOI: 10.1007/s11604-026-01946-2
Bökebatur Ahmet Raşit Mendi, Halitcan Batur, Nurdan Çay
Purpose: This study aimed to develop and validate machine learning models based on quantitative radiomics parameters extracted from T1-weighted MRI to differentiate enchondromas from atypical cartilaginous tumours (ACTs).
Methods: A retrospective cohort comprising 66 patients (35 with histopathologically confirmed enchondroma and 31 with ACT) was included in the study. T1-weighted MRI images were used for 2D segmentation, performed independently by two experienced observers on all visible slices of each lesion. A comprehensive set of 107 radiomics features was extracted from these segmented regions of interest. LASSO regression was applied for dimensionality reduction. Four distinct machine learning algorithms-Support Vector Machine (SVM), Random Forest Classifier (RFC), Extreme Gradient Boosting (XGBoost), and Decision Tree Analysis-were trained and validated using a 70:30 data split.
Results: The radiomics features demonstrated high inter- and intra-observer reproducibility. All evaluated machine learning models exhibited strong diagnostic performance, with Area Under the Curve (AUC) values exceeding 0.90. Specifically, SVM achieved an AUC of 0.922 (95% CI 0.893-0.951), RFC yielded an AUC of 0.920 (95% CI 0.881-0.963), and Decision Tree Analysis showed an AUC of 0.949 (95% CI 0.927-0.972). Notably, the XGBoost model achieved the highest diagnostic efficacy, boasting an impressive AUC of 0.987 (95% CI 0.976-0.999), coupled with a sensitivity of 89.35% and a specificity of 96.55%.
Conclusion: Our results indicate that the combination of MRI-based radiomics and machine learning algorithms, particularly XGBoost, offers a non-invasive and highly accurate method for distinguishing enchondroma from ACT.
目的:本研究旨在开发和验证基于从t1加权MRI中提取的定量放射组学参数的机器学习模型,以区分内生软骨瘤和非典型软骨瘤(ACTs)。方法:回顾性队列研究包括66例患者(35例经组织病理学证实为内生纤维瘤,31例为ACT)。使用t1加权MRI图像进行二维分割,由两名经验丰富的观察者对每个病变的所有可见切片独立执行。从这些感兴趣的分割区域中提取了107个放射组学特征。采用LASSO回归进行降维。四种不同的机器学习算法——支持向量机(SVM)、随机森林分类器(RFC)、极端梯度增强(XGBoost)和决策树分析——使用70:30的数据分割进行训练和验证。结果:放射组学特征表现出高度的观察者之间和观察者内部的可重复性。所有被评估的机器学习模型都表现出很强的诊断性能,曲线下面积(AUC)值超过0.90。具体而言,SVM的AUC为0.922 (95% CI 0.893-0.951), RFC的AUC为0.920 (95% CI 0.881-0.963),决策树分析的AUC为0.949 (95% CI 0.927-0.972)。值得注意的是,XGBoost模型的诊断效果最高,AUC为0.987 (95% CI 0.976-0.999),灵敏度为89.35%,特异性为96.55%。结论:我们的研究结果表明,结合基于mri的放射组学和机器学习算法,特别是XGBoost,提供了一种非侵入性和高度准确的方法来区分内软骨瘤和ACT。
{"title":"The radiomics fingerprint of cartilage tumours: radiomics-based MRI differentiation of enchondroma and atypical cartilaginous tumour.","authors":"Bökebatur Ahmet Raşit Mendi, Halitcan Batur, Nurdan Çay","doi":"10.1007/s11604-026-01946-2","DOIUrl":"https://doi.org/10.1007/s11604-026-01946-2","url":null,"abstract":"<p><strong>Purpose: </strong>This study aimed to develop and validate machine learning models based on quantitative radiomics parameters extracted from T1-weighted MRI to differentiate enchondromas from atypical cartilaginous tumours (ACTs).</p><p><strong>Methods: </strong>A retrospective cohort comprising 66 patients (35 with histopathologically confirmed enchondroma and 31 with ACT) was included in the study. T1-weighted MRI images were used for 2D segmentation, performed independently by two experienced observers on all visible slices of each lesion. A comprehensive set of 107 radiomics features was extracted from these segmented regions of interest. LASSO regression was applied for dimensionality reduction. Four distinct machine learning algorithms-Support Vector Machine (SVM), Random Forest Classifier (RFC), Extreme Gradient Boosting (XGBoost), and Decision Tree Analysis-were trained and validated using a 70:30 data split.</p><p><strong>Results: </strong>The radiomics features demonstrated high inter- and intra-observer reproducibility. All evaluated machine learning models exhibited strong diagnostic performance, with Area Under the Curve (AUC) values exceeding 0.90. Specifically, SVM achieved an AUC of 0.922 (95% CI 0.893-0.951), RFC yielded an AUC of 0.920 (95% CI 0.881-0.963), and Decision Tree Analysis showed an AUC of 0.949 (95% CI 0.927-0.972). Notably, the XGBoost model achieved the highest diagnostic efficacy, boasting an impressive AUC of 0.987 (95% CI 0.976-0.999), coupled with a sensitivity of 89.35% and a specificity of 96.55%.</p><p><strong>Conclusion: </strong>Our results indicate that the combination of MRI-based radiomics and machine learning algorithms, particularly XGBoost, offers a non-invasive and highly accurate method for distinguishing enchondroma from ACT.</p>","PeriodicalId":14691,"journal":{"name":"Japanese Journal of Radiology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146119002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-05DOI: 10.1007/s11604-026-01950-6
Ji Su Ko, Hwon Heo, Chong Hyun Suh, Jeho Yi, Woo Hyun Shim
Purpose: To evaluate the capability of large language models (LLM), specifically GPT-4 and o1, in assessing adherence to the MI-CLEAR-LLM checklist in previously published studies.
Materials and methods: A total of 159 medical research articles related to LLM applications were analyzed. Two models-GPT-4o and o1-were tested in both text-based and image-based modalities. Structured prompts incorporating reasoning strategies such as chain-of-thought and few-shot learning were used to extract information corresponding to the six core items of the MI-CLEAR-LLM checklist. Human evaluations from a prior study served as the reference standard. Each model was evaluated across three independent trials to assess consistency. Accuracy and inter-trial agreement were calculated for each checklist item.
Results: Both GPT-4o and o1 demonstrated high accuracy in extracting objective, explicitly reported items, such as LLM specifications (name, manufacturer, web access, 85.9-100%) and stochasticity parameters (63.6-95%). However, performance declined for context-dependent items, including prompt session handling (Item4, 51.5-70.7%) and test data independence (Item6, 59.6-76.8%). Text-based models generally showed superior inter-trial consistency, with GPT-4o-text achieving the highest Fleiss' kappa (κ = 0.926). In contrast, image-based models exhibited greater variability (κ = 0.402-0.772).
Conclusion: LLMs show strong potential for automating the evaluation of reporting quality in medical research, particularly for clearly structured content. However, they still face substantial challenges in extracting context-dependent or inferential information.
{"title":"Evaluating guideline adherence in LLM studies using LLMs.","authors":"Ji Su Ko, Hwon Heo, Chong Hyun Suh, Jeho Yi, Woo Hyun Shim","doi":"10.1007/s11604-026-01950-6","DOIUrl":"https://doi.org/10.1007/s11604-026-01950-6","url":null,"abstract":"<p><strong>Purpose: </strong>To evaluate the capability of large language models (LLM), specifically GPT-4 and o1, in assessing adherence to the MI-CLEAR-LLM checklist in previously published studies.</p><p><strong>Materials and methods: </strong>A total of 159 medical research articles related to LLM applications were analyzed. Two models-GPT-4o and o1-were tested in both text-based and image-based modalities. Structured prompts incorporating reasoning strategies such as chain-of-thought and few-shot learning were used to extract information corresponding to the six core items of the MI-CLEAR-LLM checklist. Human evaluations from a prior study served as the reference standard. Each model was evaluated across three independent trials to assess consistency. Accuracy and inter-trial agreement were calculated for each checklist item.</p><p><strong>Results: </strong>Both GPT-4o and o1 demonstrated high accuracy in extracting objective, explicitly reported items, such as LLM specifications (name, manufacturer, web access, 85.9-100%) and stochasticity parameters (63.6-95%). However, performance declined for context-dependent items, including prompt session handling (Item4, 51.5-70.7%) and test data independence (Item6, 59.6-76.8%). Text-based models generally showed superior inter-trial consistency, with GPT-4o-text achieving the highest Fleiss' kappa (κ = 0.926). In contrast, image-based models exhibited greater variability (κ = 0.402-0.772).</p><p><strong>Conclusion: </strong>LLMs show strong potential for automating the evaluation of reporting quality in medical research, particularly for clearly structured content. However, they still face substantial challenges in extracting context-dependent or inferential information.</p>","PeriodicalId":14691,"journal":{"name":"Japanese Journal of Radiology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146124908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Unresectable giant hepatocellular carcinoma (HCC) has a poor prognosis, and few effective treatment methods are available. In this study, the clinical efficacy of intensity-modulated proton therapy (IMPT) was assessed.
Methods: A retrospective review was conducted on 71 consecutive patients with giant HCC treated with IMPT at two medical centers from September 2019 to December 2022. The outcomes analyzed included liver and gastrointestinal toxicity, objective response rate (ORR), local control (LC), overall survival (OS), intrahepatic recurrence-free survival (IHRFS), and distant metastasis-free survival (DMFS).
Results: The ORR was 90.2%. The one-year and two-year LC rates were 95.0% and 86.5%, respectively. The one-year and two-year OS rates were 80.0% and 53.8%, respectively, with a median OS of 26.7 months (95% CI 20.8-32.7). At one and two years, the IHRFS rates were 47.4% and 26.0%, and the DMFS rates were 57.7% and 36.8%, respectively. Multivariate analysis indicated that ECOG performance status (p = 0.008) and complete response (CR) after IMPT were independently associated with OS (p = 0.039). Two cases (2.8%) of Grade 3 gastrointestinal toxicity were observed. The incidence of liver failure after IMPT was 12.7%, of which 6.1% were considered radiation related.
Conclusions: IMPT has potential for treating giant HCC. Despite the excellent LC and ORR, intrahepatic recurrence and distant metastasis remain significant factors affecting survival. More aggressive follow-up and additional therapeutic strategies are necessary, particularly for patients who do not achieve CR after treatment.
背景:不可切除的巨大肝细胞癌(HCC)预后较差,目前有效的治疗方法很少。本研究评估调强质子治疗(IMPT)的临床疗效。方法:对2019年9月至2022年12月在两家医疗中心连续接受IMPT治疗的71例巨型HCC患者进行回顾性分析。结果分析包括肝脏和胃肠道毒性、客观缓解率(ORR)、局部控制(LC)、总生存期(OS)、肝内无复发生存期(IHRFS)和远端无转移生存期(DMFS)。结果:ORR为90.2%。一年期和两年的贷款利率分别为95.0%和86.5%。1年和2年的OS率分别为80.0%和53.8%,中位OS为26.7个月(95% CI 20.8-32.7)。1年和2年时,IHRFS率分别为47.4%和26.0%,DMFS率分别为57.7%和36.8%。多因素分析显示,IMPT后ECOG性能状态(p = 0.008)和完全缓解(CR)与OS独立相关(p = 0.039)。3级胃肠道毒性2例(2.8%)。IMPT后肝功能衰竭的发生率为12.7%,其中6.1%被认为与放疗有关。结论:IMPT治疗巨大肝癌有潜力。尽管LC和ORR很好,但肝内复发和远处转移仍然是影响生存的重要因素。更积极的随访和额外的治疗策略是必要的,特别是对于治疗后未达到CR的患者。
{"title":"Intensity-modulated proton therapy for patients with unresectable giant (≥ 10 cm) hepatocellular carcinoma: a retrospective analysis from two high-volume centers.","authors":"Jen-Yu Cheng, Yu-Ming Wang, Yen-Hao Chen, Chieh-Min Liu, Eng-Yen Huang, Hsin-You Ou, Tsung-Hui Hu, Sheng-Nan Lu, Chih-Chi Wang, Jeng-Hwei Tseng, Shi-Ming Lin, Chen-Chun Lin, Kun-Ming Chan, Wan-Yu Chen, Bing-Shen Huang","doi":"10.1007/s11604-026-01947-1","DOIUrl":"https://doi.org/10.1007/s11604-026-01947-1","url":null,"abstract":"<p><strong>Background: </strong>Unresectable giant hepatocellular carcinoma (HCC) has a poor prognosis, and few effective treatment methods are available. In this study, the clinical efficacy of intensity-modulated proton therapy (IMPT) was assessed.</p><p><strong>Methods: </strong>A retrospective review was conducted on 71 consecutive patients with giant HCC treated with IMPT at two medical centers from September 2019 to December 2022. The outcomes analyzed included liver and gastrointestinal toxicity, objective response rate (ORR), local control (LC), overall survival (OS), intrahepatic recurrence-free survival (IHRFS), and distant metastasis-free survival (DMFS).</p><p><strong>Results: </strong>The ORR was 90.2%. The one-year and two-year LC rates were 95.0% and 86.5%, respectively. The one-year and two-year OS rates were 80.0% and 53.8%, respectively, with a median OS of 26.7 months (95% CI 20.8-32.7). At one and two years, the IHRFS rates were 47.4% and 26.0%, and the DMFS rates were 57.7% and 36.8%, respectively. Multivariate analysis indicated that ECOG performance status (p = 0.008) and complete response (CR) after IMPT were independently associated with OS (p = 0.039). Two cases (2.8%) of Grade 3 gastrointestinal toxicity were observed. The incidence of liver failure after IMPT was 12.7%, of which 6.1% were considered radiation related.</p><p><strong>Conclusions: </strong>IMPT has potential for treating giant HCC. Despite the excellent LC and ORR, intrahepatic recurrence and distant metastasis remain significant factors affecting survival. More aggressive follow-up and additional therapeutic strategies are necessary, particularly for patients who do not achieve CR after treatment.</p>","PeriodicalId":14691,"journal":{"name":"Japanese Journal of Radiology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146125045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-01Epub Date: 2025-10-04DOI: 10.1007/s11604-025-01884-5
Haruto Sugawara, Akiyo Takada, Shimpei Kato
Purpose: To compare the accuracy and reproducibility of lesion-diameter measurements performed by three state-of-the-art LLMs with those obtained by radiologists.
Materials and methods: In this retrospective study using a public database, 83 patients with solitary colorectal-cancer liver metastases were identified. From each CT series, a radiologist extracted the single axial slice showing the maximal tumor diameter and converted it to a 512 × 512-pixel PNG image (window level 50 HU, window width 400 HU) with pixel size encoded in the filename. Three LLMs-ChatGPT-o3 (OpenAI), Gemini 2.5 Pro (Google), and Claude 4 Opus (Anthropic)-were prompted to estimate the longest lesion diameter twice, ≥ 1 week apart. Two board-certified radiologists (12 years' experience each) independently measured the same single slice images and one radiologist repeated the measurements after ≥ 1 week. Agreement was assessed with intraclass correlation coefficients (ICC); 95% confidence intervals were obtained by bootstrap resampling (5 000 iterations).
Results: Radiologist inter-observer agreement was excellent (ICC = 0.95, 95% CI 0.86-0.99); intra-observer agreement was 0.98 (95% CI 0.94-0.99). Gemini achieved good model-to-radiologist agreement (ICC = 0.81, 95% CI 0.68-0.89) and intra-model reproducibility (ICC = 0.78, 95% CI 0.65-0.87). GPT-o3 showed moderate agreement (ICC = 0.52) and poor reproducibility (ICC = 0.25); Claude showed poor agreement (ICC = 0.07) and reproducibility (ICC = 0.47).
Conclusion: LLMs do not yet match radiologists in measuring colorectal cancer liver metastasis; however, Gemini's good agreement and reproducibility highlight the rapid progress of image interpretation capability of LLMs.
目的:比较三个最先进的LLMs与放射科医生获得的病变直径测量的准确性和可重复性。材料和方法:在这项使用公共数据库的回顾性研究中,确定了83例孤立性结直肠癌肝转移患者。从每个CT序列中,放射科医生提取显示最大肿瘤直径的单轴切片,并将其转换为512 × 512像素的PNG图像(窗高50 HU,窗宽400 HU),像素大小在文件名中编码。三个llms - chatgpt - 03 (OpenAI), Gemini 2.5 Pro(谷歌)和Claude 4 Opus (Anthropic)-提示估计最长病变直径两次,间隔≥1周。两名委员会认证的放射科医生(每人有12年的经验)独立测量相同的单片图像,一名放射科医生在≥1周后重复测量。用类内相关系数(ICC)评估一致性;95%置信区间采用自举重采样(5 000次迭代)。结果:放射科医师间观察者一致性极好(ICC = 0.95, 95% CI 0.86-0.99);观察者间一致性为0.98 (95% CI 0.94-0.99)。Gemini获得了良好的模型-放射科医师一致性(ICC = 0.81, 95% CI 0.68-0.89)和模型内可重复性(ICC = 0.78, 95% CI 0.65-0.87)。GPT-o3一致性中等(ICC = 0.52),重现性较差(ICC = 0.25);Claude显示较差的一致性(ICC = 0.07)和可重复性(ICC = 0.47)。结论:LLMs在测量结直肠癌肝转移方面与放射科医师尚不一致;然而,Gemini良好的一致性和可重复性突出了llm图像判读能力的快速发展。
{"title":"Accuracy and reproducibility of large language model measurements of liver metastases: comparison with radiologist measurements.","authors":"Haruto Sugawara, Akiyo Takada, Shimpei Kato","doi":"10.1007/s11604-025-01884-5","DOIUrl":"10.1007/s11604-025-01884-5","url":null,"abstract":"<p><strong>Purpose: </strong>To compare the accuracy and reproducibility of lesion-diameter measurements performed by three state-of-the-art LLMs with those obtained by radiologists.</p><p><strong>Materials and methods: </strong>In this retrospective study using a public database, 83 patients with solitary colorectal-cancer liver metastases were identified. From each CT series, a radiologist extracted the single axial slice showing the maximal tumor diameter and converted it to a 512 × 512-pixel PNG image (window level 50 HU, window width 400 HU) with pixel size encoded in the filename. Three LLMs-ChatGPT-o3 (OpenAI), Gemini 2.5 Pro (Google), and Claude 4 Opus (Anthropic)-were prompted to estimate the longest lesion diameter twice, ≥ 1 week apart. Two board-certified radiologists (12 years' experience each) independently measured the same single slice images and one radiologist repeated the measurements after ≥ 1 week. Agreement was assessed with intraclass correlation coefficients (ICC); 95% confidence intervals were obtained by bootstrap resampling (5 000 iterations).</p><p><strong>Results: </strong>Radiologist inter-observer agreement was excellent (ICC = 0.95, 95% CI 0.86-0.99); intra-observer agreement was 0.98 (95% CI 0.94-0.99). Gemini achieved good model-to-radiologist agreement (ICC = 0.81, 95% CI 0.68-0.89) and intra-model reproducibility (ICC = 0.78, 95% CI 0.65-0.87). GPT-o3 showed moderate agreement (ICC = 0.52) and poor reproducibility (ICC = 0.25); Claude showed poor agreement (ICC = 0.07) and reproducibility (ICC = 0.47).</p><p><strong>Conclusion: </strong>LLMs do not yet match radiologists in measuring colorectal cancer liver metastasis; however, Gemini's good agreement and reproducibility highlight the rapid progress of image interpretation capability of LLMs.</p>","PeriodicalId":14691,"journal":{"name":"Japanese Journal of Radiology","volume":" ","pages":"339-345"},"PeriodicalIF":2.1,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12860874/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145225284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-01Epub Date: 2025-10-25DOI: 10.1007/s11604-025-01879-2
Luguang Chen, Pengyi Xing, Tiegong Wang, Xiaoyu Huang, Caixia Fu, Robert Grimm, Chengwei Shao, Jianping Lu
Purpose: The purpose is to evaluate the utility of whole-lesion and whole-prostate gland histogram and texture analysis based on biparametric MRI (bp-MRI) for differentiating clinically significant prostate cancer (csPCa) from non-clinically significant prostate cancer (ncsPCa). We further compared the diagnostic performance of these quantitative features with PI-RADS assessment, clinical parameters, and combined models.
Materials and methods: This retrospective study enrolled 337 patients (primary cohort, 260; validation cohort, 77) with pathologically proven prostate lesions. All patients underwent preoperative prostate bp-MRI [T2-weighted imaging and apparent diffusion coefficient (ADC) maps]. Histogram and texture features were extracted from both the whole lesion and the whole-prostate gland. Diagnostic models were constructed using multivariate logistic regression, incorporating PI-RADS scores, clinical parameters, and quantitative imaging features. Their performance was evaluated using the area under the receiver operating characteristic curve (AUC) and validated on an internal cohort.
Results: Multiple histogram and texture parameters from both whole-lesion and whole-prostate analyses significantly differed between csPCa and ncsPCa groups (p < 0.05), with ADC-derived features generally outperforming T2WI-derived ones. The combined model integrating texture features, clinical parameters, and PI-RADS (Texture&Clinics&PI-RADS) demonstrated the highest diagnostic performance for both whole-lesion analysis (AUCs: 0.938 for peripheral-zone or transitional-zone (PZ + TZ), 0.894 for peripheral-zone (PZ), 0.971 for transitional-zone (TZ) lesions) and whole-prostate analysis (AUCs: 0.926 for PZ + TZ, 0.804 for PZ, 0.981 for TZ lesions) in the primary cohort. This superior performance was consistently replicated in the validation cohort. Notably, no significant difference in diagnostic efficacy was found between whole-lesion and whole-prostate analyses for TZ lesions.
Conclusion: Both whole-lesion and whole-prostate histogram and texture analysis based on bp-MRI are promising non-invasive tools for identifying csPCa. The combination of texture features, clinical parameters, and PI-RADS scores achieved the best diagnostic performance. These findings indicate that whole-lesion and whole-prostate histogram and texture analyses may improve the detection of csPCa above conventional PI-RADS assessment.
{"title":"Biparametric MRI in prostate cancer: utility of whole-prostate and whole-lesion histogram and texture analysis for clinically significant prostate cancer.","authors":"Luguang Chen, Pengyi Xing, Tiegong Wang, Xiaoyu Huang, Caixia Fu, Robert Grimm, Chengwei Shao, Jianping Lu","doi":"10.1007/s11604-025-01879-2","DOIUrl":"10.1007/s11604-025-01879-2","url":null,"abstract":"<p><strong>Purpose: </strong>The purpose is to evaluate the utility of whole-lesion and whole-prostate gland histogram and texture analysis based on biparametric MRI (bp-MRI) for differentiating clinically significant prostate cancer (csPCa) from non-clinically significant prostate cancer (ncsPCa). We further compared the diagnostic performance of these quantitative features with PI-RADS assessment, clinical parameters, and combined models.</p><p><strong>Materials and methods: </strong>This retrospective study enrolled 337 patients (primary cohort, 260; validation cohort, 77) with pathologically proven prostate lesions. All patients underwent preoperative prostate bp-MRI [T2-weighted imaging and apparent diffusion coefficient (ADC) maps]. Histogram and texture features were extracted from both the whole lesion and the whole-prostate gland. Diagnostic models were constructed using multivariate logistic regression, incorporating PI-RADS scores, clinical parameters, and quantitative imaging features. Their performance was evaluated using the area under the receiver operating characteristic curve (AUC) and validated on an internal cohort.</p><p><strong>Results: </strong>Multiple histogram and texture parameters from both whole-lesion and whole-prostate analyses significantly differed between csPCa and ncsPCa groups (p < 0.05), with ADC-derived features generally outperforming T2WI-derived ones. The combined model integrating texture features, clinical parameters, and PI-RADS (Texture&Clinics&PI-RADS) demonstrated the highest diagnostic performance for both whole-lesion analysis (AUCs: 0.938 for peripheral-zone or transitional-zone (PZ + TZ), 0.894 for peripheral-zone (PZ), 0.971 for transitional-zone (TZ) lesions) and whole-prostate analysis (AUCs: 0.926 for PZ + TZ, 0.804 for PZ, 0.981 for TZ lesions) in the primary cohort. This superior performance was consistently replicated in the validation cohort. Notably, no significant difference in diagnostic efficacy was found between whole-lesion and whole-prostate analyses for TZ lesions.</p><p><strong>Conclusion: </strong>Both whole-lesion and whole-prostate histogram and texture analysis based on bp-MRI are promising non-invasive tools for identifying csPCa. The combination of texture features, clinical parameters, and PI-RADS scores achieved the best diagnostic performance. These findings indicate that whole-lesion and whole-prostate histogram and texture analyses may improve the detection of csPCa above conventional PI-RADS assessment.</p>","PeriodicalId":14691,"journal":{"name":"Japanese Journal of Radiology","volume":" ","pages":"360-375"},"PeriodicalIF":2.1,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145368003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-01Epub Date: 2025-10-16DOI: 10.1007/s11604-025-01889-0
Gouling Zhan, Endong Zhao, Xuehuan Liu, Xiao Gao, Dahe Zhan, Zhibo Zhou, Zuoxi Li, Jun Liu
Background: Accurate prediction of metachronous liver metastasis (MLM) within the 24 months remains a clinical challenge in rectal cancer. While radiomics offers noninvasive insights into tumor characteristics, few studies have investigated multi-sequence MRI-based habitat radiomics with interpretable modeling strategies.
Methods: This retrospective study enrolled 391 patients with pathologically confirmed rectal cancer. K-means clustering was applied to pretreatment T1-weighted imaging (T1WI) and T2-weighted imaging (T2WI) MRI to generate tumor subregions. Radiomic features were extracted from both sequences, and clinical variables were also included. Support vector machine (SVM) classifiers were used to construct radiomics, habitat, and combined models. Model performance was assessed using area under the ROC curve (AUC) and decision curve analysis (DCA). SHapley Additive exPlanations (SHAP) were employed to interpret the contribution of individual features.
Results: The habitat model demonstrated superior predictive performance compared to conventional radiomics, achieving AUCs of 0.875 in the training cohort, 0.829 in the internal validation cohort, and 0.810 in the external test cohort. The combined model, incorporating clinical variables and habitat features, achieved the highest performance in the validation cohort (AUC = 0.870) and external test cohort (AUC = 0.862). SHAP analysis revealed complementary contributions from T1WI and T2WI features, highlighting the intratumoral heterogeneity interpretability of the multi-sequence habitat approach.
Conclusion: Multi-sequence MRI-based habitat radiomics demonstrated strong performance in predicting MLM, and the integration with clinical variables further improved accuracy, providing a practical tool for individualized risk assessment and treatment planning.
{"title":"Interpretable habitat radiomics model based on multi-sequence MRI for risk prediction of metachronous liver metastasis in rectal cancer: a multicenter study.","authors":"Gouling Zhan, Endong Zhao, Xuehuan Liu, Xiao Gao, Dahe Zhan, Zhibo Zhou, Zuoxi Li, Jun Liu","doi":"10.1007/s11604-025-01889-0","DOIUrl":"10.1007/s11604-025-01889-0","url":null,"abstract":"<p><strong>Background: </strong>Accurate prediction of metachronous liver metastasis (MLM) within the 24 months remains a clinical challenge in rectal cancer. While radiomics offers noninvasive insights into tumor characteristics, few studies have investigated multi-sequence MRI-based habitat radiomics with interpretable modeling strategies.</p><p><strong>Methods: </strong>This retrospective study enrolled 391 patients with pathologically confirmed rectal cancer. K-means clustering was applied to pretreatment T1-weighted imaging (T1WI) and T2-weighted imaging (T2WI) MRI to generate tumor subregions. Radiomic features were extracted from both sequences, and clinical variables were also included. Support vector machine (SVM) classifiers were used to construct radiomics, habitat, and combined models. Model performance was assessed using area under the ROC curve (AUC) and decision curve analysis (DCA). SHapley Additive exPlanations (SHAP) were employed to interpret the contribution of individual features.</p><p><strong>Results: </strong>The habitat model demonstrated superior predictive performance compared to conventional radiomics, achieving AUCs of 0.875 in the training cohort, 0.829 in the internal validation cohort, and 0.810 in the external test cohort. The combined model, incorporating clinical variables and habitat features, achieved the highest performance in the validation cohort (AUC = 0.870) and external test cohort (AUC = 0.862). SHAP analysis revealed complementary contributions from T1WI and T2WI features, highlighting the intratumoral heterogeneity interpretability of the multi-sequence habitat approach.</p><p><strong>Conclusion: </strong>Multi-sequence MRI-based habitat radiomics demonstrated strong performance in predicting MLM, and the integration with clinical variables further improved accuracy, providing a practical tool for individualized risk assessment and treatment planning.</p>","PeriodicalId":14691,"journal":{"name":"Japanese Journal of Radiology","volume":" ","pages":"346-359"},"PeriodicalIF":2.1,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145300965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}