Establishing CT-derived Normative Liver and Spleen Volumes for Children: Validation in Regional and International Datasets.
建立儿童ct衍生的标准肝脏和脾脏体积:区域和国际数据集的验证。
IF 15.2
1区 医学
Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Pub Date : 2026-01-01
DOI: 10.1148/radiol.251953
Jae-Yeon Hwang, Young Hun Choi, Ah Young Jung, Young Ah Cho, Jin Seong Lee, Hee Mang Yoon, Seung Soo Lee, Seonok Kim, Jun Su Park, Mi Young Kim, Hwon Heo, Woo Hyun Shim, Ki Seok Choo, Elanchezhian Somasundaram, Vinicius de Padua V Alves, Pyeong Hwa Kim
Background Volumetric evaluation of liver and spleen size is useful, yet pediatric reference values remain limited and lack external validation. Purpose To establish normative liver and spleen volumes from contrast-enhanced CT in children and validate them in external regional and international datasets. Materials and Methods In this retrospective study, contrast-enhanced abdominal CT scans (patient age, 2-18 years) from Asan Medical Center and Pusan National University Yangsan Hospital (January 2005 to June 2021) were collected for the derivation dataset. Patients with chronic diseases, liver and/or spleen imaging abnormalities, or missing height and/or weight measurement within 3 months were excluded. Portal-phase CT images were segmented automatically with manual correction. Data from Seoul National University Children's Hospital (January 2018 to December 2021) and Cincinnati Children's Hospital Medical Center (January 2018 to July 2021) were used for regional and international validation, respectively. Model performance was assessed with the average pinball loss and absolute errors for predicting 50th percentiles. Results A total of 1298 children (median age, 11.3 years [IQR, 8.0-15.0 years]; 660 female) were included (derivation set, n = 1030 [all Asian]; regional validation set, n = 114 [all Asian]; international validation set, n = 154 [racially diverse]). The model including sex, height, and weight had the lowest average pinball loss for predicting 50th percentile liver (54.50 mL) and spleen (18.64 mL) volume and was therefore selected. Compared with previously published formulas, the model showed lower average pinball loss in the regional validation dataset for both liver (5th, 11.55 mL vs 11.93 mL; 50th, 50.88 mL vs 55.47-116.02 mL; 95th, 15.85 mL vs 19.49 mL) and spleen volumes (5th, 3.69 mL vs 3.74 mL; 50th, 15.19 mL vs 16.03 mL; 95th, 5.01 mL vs 5.59 mL). In international external validation, the average pinball loss was lower for the 50th percentiles (liver, 50.28 mL vs 50.30 mL; spleen, 20.97 mL vs 21.42 mL), and absolute errors were comparable (liver, 100.56 mL vs 100.60 mL [P = .53]; spleen, 41.93 mL vs 42.83 mL [P = .69]). Conclusion Normative liver and spleen volumes in children derived using quantile regression models and the respective predictive performance were presented. © RSNA, 2026 Supplemental material is available for this article.
求助PDF
{"title":"Establishing CT-derived Normative Liver and Spleen Volumes for Children: Validation in Regional and International Datasets.","authors":"Jae-Yeon Hwang, Young Hun Choi, Ah Young Jung, Young Ah Cho, Jin Seong Lee, Hee Mang Yoon, Seung Soo Lee, Seonok Kim, Jun Su Park, Mi Young Kim, Hwon Heo, Woo Hyun Shim, Ki Seok Choo, Elanchezhian Somasundaram, Vinicius de Padua V Alves, Pyeong Hwa Kim","doi":"10.1148/radiol.251953","DOIUrl":"https://doi.org/10.1148/radiol.251953","url":null,"abstract":"<p><p>Background Volumetric evaluation of liver and spleen size is useful, yet pediatric reference values remain limited and lack external validation. Purpose To establish normative liver and spleen volumes from contrast-enhanced CT in children and validate them in external regional and international datasets. Materials and Methods In this retrospective study, contrast-enhanced abdominal CT scans (patient age, 2-18 years) from Asan Medical Center and Pusan National University Yangsan Hospital (January 2005 to June 2021) were collected for the derivation dataset. Patients with chronic diseases, liver and/or spleen imaging abnormalities, or missing height and/or weight measurement within 3 months were excluded. Portal-phase CT images were segmented automatically with manual correction. Data from Seoul National University Children's Hospital (January 2018 to December 2021) and Cincinnati Children's Hospital Medical Center (January 2018 to July 2021) were used for regional and international validation, respectively. Model performance was assessed with the average pinball loss and absolute errors for predicting 50th percentiles. Results A total of 1298 children (median age, 11.3 years [IQR, 8.0-15.0 years]; 660 female) were included (derivation set, <i>n</i> = 1030 [all Asian]; regional validation set, <i>n</i> = 114 [all Asian]; international validation set, <i>n</i> = 154 [racially diverse]). The model including sex, height, and weight had the lowest average pinball loss for predicting 50th percentile liver (54.50 mL) and spleen (18.64 mL) volume and was therefore selected. Compared with previously published formulas, the model showed lower average pinball loss in the regional validation dataset for both liver (5th, 11.55 mL vs 11.93 mL; 50th, 50.88 mL vs 55.47-116.02 mL; 95th, 15.85 mL vs 19.49 mL) and spleen volumes (5th, 3.69 mL vs 3.74 mL; 50th, 15.19 mL vs 16.03 mL; 95th, 5.01 mL vs 5.59 mL). In international external validation, the average pinball loss was lower for the 50th percentiles (liver, 50.28 mL vs 50.30 mL; spleen, 20.97 mL vs 21.42 mL), and absolute errors were comparable (liver, 100.56 mL vs 100.60 mL [<i>P</i> = .53]; spleen, 41.93 mL vs 42.83 mL [<i>P</i> = .69]). Conclusion Normative liver and spleen volumes in children derived using quantile regression models and the respective predictive performance were presented. © RSNA, 2026 <i>Supplemental material is available for this article.</i></p>","PeriodicalId":20896,"journal":{"name":"Radiology","volume":"318 1","pages":"e251953"},"PeriodicalIF":15.2,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146053483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
引用
批量引用
Evaluating National Emergency X-Radiography Utilization Study and Canadian C-Spine Rule Criteria and Their Clinical Impact on Cervical Spine Imaging: Best Practice.
评估国家紧急x线摄影应用研究和加拿大颈椎规则标准及其对颈椎成像的临床影响:最佳实践。
IF 19.7
1区 医学
Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Pub Date : 2026-01-01
DOI: 10.1148/radiol.243834
Shadi Asadollahi,Parisa Arjmand,Arjun Chanmugam,Matthew Dattwyler,Caline Azzi,Mahla Radmard,David M Yousem
求助PDF
{"title":"Evaluating National Emergency X-Radiography Utilization Study and Canadian C-Spine Rule Criteria and Their Clinical Impact on Cervical Spine Imaging: Best Practice.","authors":"Shadi Asadollahi,Parisa Arjmand,Arjun Chanmugam,Matthew Dattwyler,Caline Azzi,Mahla Radmard,David M Yousem","doi":"10.1148/radiol.243834","DOIUrl":"https://doi.org/10.1148/radiol.243834","url":null,"abstract":"This article evaluates the effectiveness and implementation of two clinical decision rules for assessing cervical spine injuries after blunt trauma: the National Emergency X-Radiography Utilization Study (NEXUS) criteria and the Canadian C-Spine Rule (CCR). Both rules aim to reduce unnecessary imaging, which can be costly and expose patients to radiation, by identifying individuals at low risk who do not require cervical spine CT. The five NEXUS criteria are easy to apply but may have inconsistent interpretations. Although more complex and time-consuming, the CCR provides a structured algorithm to more effectively reduce unnecessary imaging. Both rules have high sensitivity for detecting clinically significant injuries. Potential pitfalls include incomplete documentation and variability in clinician interpretation, emphasizing the importance of proper training in implementing these rules. The American College of Radiology recommends CT as the initial imaging method for adults meeting NEXUS or CCR criteria. Pediatric clinical decision support rules vary by patient age and may use radiography and MRI in different instances, but older children are typically assessed like adults. With the increased availability of cervical spine CT, it is essential to improve imaging decisions with respect to patient impact and health care costs while minimizing radiation exposure, especially in younger patients.","PeriodicalId":20896,"journal":{"name":"Radiology","volume":"14 1","pages":"e243834"},"PeriodicalIF":19.7,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145907543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
引用
批量引用
Pretreatment CT Identification of Extranodal Extension in Laryngeal and Hypopharyngeal Cancers Using Deep Learning.
基于深度学习的喉癌和下咽癌结外延伸的预处理CT识别。
IF 15.2
1区 医学
Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Pub Date : 2026-01-01
DOI: 10.1148/radiol.250332
Na Shen, Yirui Wang, Cheng Yan, Jian Wang, Dandan Zheng, Xuewei Wang, Dazhou Guo, Haoshen Li, Qinji Yu, Zi Li, Yuzhen Chen, Ke Yan, Le Lu, Xianghua Ye, Mengsu Zeng, Xinsheng Huang, Tsung-Ying Ho, Fang Zhang, Dakai Jin
Background Accurate preoperative identification of pathologic extranodal extension (ENE) at CT is essential for precise treatment decisions in laryngeal and hypopharyngeal squamous cell cancer (LHSCC). However, human interpretation of ENE is neither reliable nor reproducible. Purpose To develop and evaluate the diagnostic performance of a new deep learning tool, DeepENE, in detecting metastatic and ENE lymph nodes on preoperative CT scans in patients with LHSCC in a multicenter cohort. Materials and Methods In this retrospective study, patients with LHSCC from Zhongshan Hospital, Fudan University (April 2011-August 2022), were included in training, validation, and internal test sets to develop DeepENE. For the reference standard, lymph nodes were segmented on CT scans and labeled for metastasis and ENE status based on pathologic findings. DeepENE was tested using three external cohorts of patients with LHSCC (external test sets 1-3) and one external cohort of patients with oral squamous cell carcinoma. The primary diagnostic metric was the area under the receiver operating characteristic curve (AUC). The performance of DeepENE was compared with that of five board-certified head and neck cancer specialists using the DeLong method. Results Overall, 289 patients with LHSCC with 1954 pathologically confirmed lymph nodes were evaluated. DeepENE achieved an AUC of 0.93 for ENE diagnosis in the internal test set under fivefold cross-validation, and AUCs of 0.96, 0.87, and 0.90 in external test sets 1, 2, and 3, respectively. DeepENE outperformed the five experts, especially in early-stage ENE detection in external test set 2 (AUC of 0.87 for DeepENE vs mean AUC of 0.66 for readers; P < .001). In external test set 1, DeepENE maintained a high sensitivity of 97% at specificity of 90%, compared with experts' mean sensitivity of 77% (P = .003). In external test sets 2 and 3, DeepENE had sensitivity of 78% and 80%, compared with experts' mean sensitivity of 36% (P < .001) and 46% (P < .001), respectively. Conclusion DeepENE accurately detected ENE on preoperative CT scans in patients with LHSCC and outperformed head and neck cancer specialists. © RSNA, 2026 Supplemental material is available for this article.
求助PDF
{"title":"Pretreatment CT Identification of Extranodal Extension in Laryngeal and Hypopharyngeal Cancers Using Deep Learning.","authors":"Na Shen, Yirui Wang, Cheng Yan, Jian Wang, Dandan Zheng, Xuewei Wang, Dazhou Guo, Haoshen Li, Qinji Yu, Zi Li, Yuzhen Chen, Ke Yan, Le Lu, Xianghua Ye, Mengsu Zeng, Xinsheng Huang, Tsung-Ying Ho, Fang Zhang, Dakai Jin","doi":"10.1148/radiol.250332","DOIUrl":"10.1148/radiol.250332","url":null,"abstract":"<p><p>Background Accurate preoperative identification of pathologic extranodal extension (ENE) at CT is essential for precise treatment decisions in laryngeal and hypopharyngeal squamous cell cancer (LHSCC). However, human interpretation of ENE is neither reliable nor reproducible. Purpose To develop and evaluate the diagnostic performance of a new deep learning tool, DeepENE, in detecting metastatic and ENE lymph nodes on preoperative CT scans in patients with LHSCC in a multicenter cohort. Materials and Methods In this retrospective study, patients with LHSCC from Zhongshan Hospital, Fudan University (April 2011-August 2022), were included in training, validation, and internal test sets to develop DeepENE. For the reference standard, lymph nodes were segmented on CT scans and labeled for metastasis and ENE status based on pathologic findings. DeepENE was tested using three external cohorts of patients with LHSCC (external test sets 1-3) and one external cohort of patients with oral squamous cell carcinoma. The primary diagnostic metric was the area under the receiver operating characteristic curve (AUC). The performance of DeepENE was compared with that of five board-certified head and neck cancer specialists using the DeLong method. Results Overall, 289 patients with LHSCC with 1954 pathologically confirmed lymph nodes were evaluated. DeepENE achieved an AUC of 0.93 for ENE diagnosis in the internal test set under fivefold cross-validation, and AUCs of 0.96, 0.87, and 0.90 in external test sets 1, 2, and 3, respectively. DeepENE outperformed the five experts, especially in early-stage ENE detection in external test set 2 (AUC of 0.87 for DeepENE vs mean AUC of 0.66 for readers; <i>P</i> < .001). In external test set 1, DeepENE maintained a high sensitivity of 97% at specificity of 90%, compared with experts' mean sensitivity of 77% (<i>P</i> = .003). In external test sets 2 and 3, DeepENE had sensitivity of 78% and 80%, compared with experts' mean sensitivity of 36% (<i>P</i> < .001) and 46% (<i>P</i> < .001), respectively. Conclusion DeepENE accurately detected ENE on preoperative CT scans in patients with LHSCC and outperformed head and neck cancer specialists. © RSNA, 2026 <i>Supplemental material is available for this article.</i></p>","PeriodicalId":20896,"journal":{"name":"Radiology","volume":"318 1","pages":"e250332"},"PeriodicalIF":15.2,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145960115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
引用
批量引用
Evaluation of Tibial Nerve Microcirculation in Diabetes Mellitus at Superresolution US.
糖尿病患者胫神经微循环的超分辨超声评价。
IF 15.2
1区 医学
Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Pub Date : 2026-01-01
DOI: 10.1148/radiol.250347
Fang Liu, ShuJun Xia, Qing Hua, CongCong Yuan, YuHang Zheng, RuoLin Tao, JiaLe Xu, YuLu Zhang, FangGang Wu, Wei Guo, Yuan Tian, YiJie Dong, JianQiao Zhou
Background Dysfunction of the peripheral nerve microcirculation is crucial in the onset and progression of diabetic neuropathy. Purpose To evaluate the feasibility of superresolution US for quantifying tibial nerve microvasculature in diabetic adults with and without peripheral neuropathy, and in control participants. Materials and Methods In this prospective single-center study, consecutive participants were enrolled between June 2024 and October 2024 and divided into three groups: group I, diabetic peripheral neuropathy with ulcers; group II, type 2 diabetes without peripheral neuropathy; and group III, the control group. All participants underwent conventional US in the tibial nerves followed by superresolution US with intravenous microbubble injection. Regions of interest were delineated manually along the tibial nerve. Quantitative parameters, including interfascicular and intrafascicular vessel ratio, complexity, density, velocity, perfusion index, and single-vessel curvature were compared using one-way analysis of variance (parametric) and Dunn-corrected Kruskal-Wallis (nonparametric) tests. Results A total of 100 participants were enrolled, including 30 participants in group I (mean age, 67 years ± 10 [SD]; 23 men), 35 in group II (mean age, 61 years ± 11; 20 men), and 35 in group III (mean age, 54 years ± 14; 15 men). Superresolution US showed group I had higher vessel ratio (23% vs 10% vs 6%), complexity level (1.3 vs 1.2 vs 1.1), maximum density (25 vs 20 vs 12), mean velocity (12.6 vs 8.7 vs 8.4 mm/sec), perfusion index (4.5 vs 1.4 vs 0.9), and curvature (1.19 vs 1.09 vs 1.05) compared with groups II and III, respectively (all P < .05). Microvascular tortuosity was observed in 83% (25 of 30), 43% (15 of 35), and 14% (five of 35) of groups I, II, and III, respectively (P < .001). Conclusion Superresolution US provided a 10-μm scale resolution view of tibial nerve microvascular structure, helping to identify microvascular alterations in participants with diabetic peripheral neuropathy with ulcers. © RSNA, 2026 Supplemental material is available for this article.
求助PDF
{"title":"Evaluation of Tibial Nerve Microcirculation in Diabetes Mellitus at Superresolution US.","authors":"Fang Liu, ShuJun Xia, Qing Hua, CongCong Yuan, YuHang Zheng, RuoLin Tao, JiaLe Xu, YuLu Zhang, FangGang Wu, Wei Guo, Yuan Tian, YiJie Dong, JianQiao Zhou","doi":"10.1148/radiol.250347","DOIUrl":"https://doi.org/10.1148/radiol.250347","url":null,"abstract":"<p><p>Background Dysfunction of the peripheral nerve microcirculation is crucial in the onset and progression of diabetic neuropathy. Purpose To evaluate the feasibility of superresolution US for quantifying tibial nerve microvasculature in diabetic adults with and without peripheral neuropathy, and in control participants. Materials and Methods In this prospective single-center study, consecutive participants were enrolled between June 2024 and October 2024 and divided into three groups: group I, diabetic peripheral neuropathy with ulcers; group II, type 2 diabetes without peripheral neuropathy; and group III, the control group. All participants underwent conventional US in the tibial nerves followed by superresolution US with intravenous microbubble injection. Regions of interest were delineated manually along the tibial nerve. Quantitative parameters, including interfascicular and intrafascicular vessel ratio, complexity, density, velocity, perfusion index, and single-vessel curvature were compared using one-way analysis of variance (parametric) and Dunn-corrected Kruskal-Wallis (nonparametric) tests. Results A total of 100 participants were enrolled, including 30 participants in group I (mean age, 67 years ± 10 [SD]; 23 men), 35 in group II (mean age, 61 years ± 11; 20 men), and 35 in group III (mean age, 54 years ± 14; 15 men). Superresolution US showed group I had higher vessel ratio (23% vs 10% vs 6%), complexity level (1.3 vs 1.2 vs 1.1), maximum density (25 vs 20 vs 12), mean velocity (12.6 vs 8.7 vs 8.4 mm/sec), perfusion index (4.5 vs 1.4 vs 0.9), and curvature (1.19 vs 1.09 vs 1.05) compared with groups II and III, respectively (all <i>P</i> < .05). Microvascular tortuosity was observed in 83% (25 of 30), 43% (15 of 35), and 14% (five of 35) of groups I, II, and III, respectively (<i>P</i> < .001). Conclusion Superresolution US provided a 10-μm scale resolution view of tibial nerve microvascular structure, helping to identify microvascular alterations in participants with diabetic peripheral neuropathy with ulcers. © RSNA, 2026 <i>Supplemental material is available for this article.</i></p>","PeriodicalId":20896,"journal":{"name":"Radiology","volume":"318 1","pages":"e250347"},"PeriodicalIF":15.2,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146012151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
引用
批量引用
Optimizing Large Language Models for Automated Protocoling of Abdominal and Pelvic CT Scans: The Power of Context.
优化腹部和骨盆CT扫描自动协议的大型语言模型:上下文的力量。
IF 19.7
1区 医学
Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Pub Date : 2026-01-01
DOI: 10.1148/radiol.252105
Bryan W Buckley,Adriano B Dias,Yangqing Deng,Heidi Schmidt,Ania Kielar,Satheesh Krishna,Rajesh Bhayana
Background Accurate protocoling is critical for imaging accuracy. Manual protocoling is time-consuming and error prone. Purpose To evaluate the performance of large language models (LLMs) in automatically assigning protocols for abdominal and pelvic CT scans after optimization with context engineering and fine-tuning and to compare performance with that of radiologists in practice. Materials and Methods This retrospective study included patients with abdominal or pelvic CT scans obtained between January 2024 and June 2024. Requisition data, human-selected protocol, and training level (resident, fellow, or radiologist) were extracted. Reference standard protocols were defined by radiologists in consultation with institutional guidelines. Context engineering involved detailed prompt instructions using a prompt set with GPT-4o (version 2024-08-06; Open AI). A subset of patients was reserved for fine-tuning (training set and validation set) and another for testing (internal test set). Two models were tested (prompting-only and fine-tuned). Model-selected protocols and original human-selected protocols were categorized compared with the reference standard after review by blinded radiologists as follows: exact match, equal alternative, reasonable but inferior, or inappropriate. Exact match and equal alternative were considered optimal. Performance of models and radiologists were compared using the McNemar test. Results This study included 1448 patients (mean age, 61 years ± 17 [SD]; 728 female patients). GPT-4o with prompting only selected optimal protocols more frequently than humans (96.2% [527 of 548 patients] vs 88.3% [484 of 548 patients]; P < .001), but there was no evidence of a difference in inappropriate protocols (1.3% [seven of 548 patients] vs 2.4% [13 of 548 patients]; P = .21). Fine-tuning GPT-4o did not improve the proportion of optimal protocols over prompting only (96.2% [527 of 548 patients] vs 96.2% [527 of 548 patients]; P > .99). In subgroup analyses, the proportion of protocols matching the reference standard was similar among radiologists (79.4% [173 of 218 patients]), fellows (74.9% [164 of 219 patients]), and residents (72.1% [80 of 111 patients]; P = .30). Conclusion For protocoling abdominal and pelvic CT scans, the LLM, GPT-4o, selected optimal protocols more frequently than radiologists when optimized with detailed prompting, and fine-tuning of the model did not further improve performance. © RSNA, 2026 Supplemental material is available for this article.
求助PDF
{"title":"Optimizing Large Language Models for Automated Protocoling of Abdominal and Pelvic CT Scans: The Power of Context.","authors":"Bryan W Buckley,Adriano B Dias,Yangqing Deng,Heidi Schmidt,Ania Kielar,Satheesh Krishna,Rajesh Bhayana","doi":"10.1148/radiol.252105","DOIUrl":"https://doi.org/10.1148/radiol.252105","url":null,"abstract":"Background Accurate protocoling is critical for imaging accuracy. Manual protocoling is time-consuming and error prone. Purpose To evaluate the performance of large language models (LLMs) in automatically assigning protocols for abdominal and pelvic CT scans after optimization with context engineering and fine-tuning and to compare performance with that of radiologists in practice. Materials and Methods This retrospective study included patients with abdominal or pelvic CT scans obtained between January 2024 and June 2024. Requisition data, human-selected protocol, and training level (resident, fellow, or radiologist) were extracted. Reference standard protocols were defined by radiologists in consultation with institutional guidelines. Context engineering involved detailed prompt instructions using a prompt set with GPT-4o (version 2024-08-06; Open AI). A subset of patients was reserved for fine-tuning (training set and validation set) and another for testing (internal test set). Two models were tested (prompting-only and fine-tuned). Model-selected protocols and original human-selected protocols were categorized compared with the reference standard after review by blinded radiologists as follows: exact match, equal alternative, reasonable but inferior, or inappropriate. Exact match and equal alternative were considered optimal. Performance of models and radiologists were compared using the McNemar test. Results This study included 1448 patients (mean age, 61 years ± 17 [SD]; 728 female patients). GPT-4o with prompting only selected optimal protocols more frequently than humans (96.2% [527 of 548 patients] vs 88.3% [484 of 548 patients]; P < .001), but there was no evidence of a difference in inappropriate protocols (1.3% [seven of 548 patients] vs 2.4% [13 of 548 patients]; P = .21). Fine-tuning GPT-4o did not improve the proportion of optimal protocols over prompting only (96.2% [527 of 548 patients] vs 96.2% [527 of 548 patients]; P > .99). In subgroup analyses, the proportion of protocols matching the reference standard was similar among radiologists (79.4% [173 of 218 patients]), fellows (74.9% [164 of 219 patients]), and residents (72.1% [80 of 111 patients]; P = .30). Conclusion For protocoling abdominal and pelvic CT scans, the LLM, GPT-4o, selected optimal protocols more frequently than radiologists when optimized with detailed prompting, and fine-tuning of the model did not further improve performance. © RSNA, 2026 Supplemental material is available for this article.","PeriodicalId":20896,"journal":{"name":"Radiology","volume":"84 1","pages":"e252105"},"PeriodicalIF":19.7,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145907544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
引用
批量引用
Clinical Guidelines and Their Role in Trauma-related Cervical Spine Imaging.
临床指南及其在创伤相关颈椎影像学中的作用。
IF 19.7
1区 医学
Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Pub Date : 2026-01-01
DOI: 10.1148/radiol.252327
Masis Isikbay,Jason Talbott
求助PDF
{"title":"Clinical Guidelines and Their Role in Trauma-related Cervical Spine Imaging.","authors":"Masis Isikbay,Jason Talbott","doi":"10.1148/radiol.252327","DOIUrl":"https://doi.org/10.1148/radiol.252327","url":null,"abstract":"","PeriodicalId":20896,"journal":{"name":"Radiology","volume":"125 1","pages":"e252327"},"PeriodicalIF":19.7,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145907556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
引用
批量引用
How I Do It: Using the Dixon Method and Fat-Water Imaging in Musculoskeletal MRI.
我是怎么做的:在肌肉骨骼MRI中使用Dixon方法和脂肪-水成像。
IF 15.2
1区 医学
Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Pub Date : 2026-01-01
DOI: 10.1148/radiol.250374
Patrick Omoumi, Charbel Mourad, Stanislas Rapacchi, Maxime Pastor
The Dixon method is a fat suppression method based on the separation of fat and water signals. Originally limited to gradient-echo sequences, it can now be integrated with fast/turbo spin-echo MRI sequences, unlocking its potential for musculoskeletal imaging. Key advantages include robust fat suppression and the ability to generate four image contrasts from a single acquisition-including fat-only images, which are specific to the signal of fat-thereby providing opportunities for protocol optimization. Additionally, quantitative data readily available from Dixon sequences provide information on intralesional fat, which may serve as a diagnostic aid complementing the morphologic assessment. The Dixon method has been increasingly used in musculoskeletal MRI, primarily in bone marrow imaging to help detect and characterize focal marrow lesions, vertebral compression fractures, and degenerative spine conditions. Other applications include whole-body imaging, imaging of rheumatologic and neuromuscular disorders, tumor characterization, and imaging near metallic implants. Limitations include fat-water swapping artifacts and unreliable fat quantification (eg, when mineralization is increased). This article covers the strengths, applications, and limitations of the Dixon method in musculoskeletal MRI, with practical tips for incorporating this method into routine protocols, highlighting its most effective applications and noting areas where it is less reliable.
求助PDF
{"title":"How I Do It: Using the Dixon Method and Fat-Water Imaging in Musculoskeletal MRI.","authors":"Patrick Omoumi, Charbel Mourad, Stanislas Rapacchi, Maxime Pastor","doi":"10.1148/radiol.250374","DOIUrl":"https://doi.org/10.1148/radiol.250374","url":null,"abstract":"<p><p>The Dixon method is a fat suppression method based on the separation of fat and water signals. Originally limited to gradient-echo sequences, it can now be integrated with fast/turbo spin-echo MRI sequences, unlocking its potential for musculoskeletal imaging. Key advantages include robust fat suppression and the ability to generate four image contrasts from a single acquisition-including fat-only images, which are specific to the signal of fat-thereby providing opportunities for protocol optimization. Additionally, quantitative data readily available from Dixon sequences provide information on intralesional fat, which may serve as a diagnostic aid complementing the morphologic assessment. The Dixon method has been increasingly used in musculoskeletal MRI, primarily in bone marrow imaging to help detect and characterize focal marrow lesions, vertebral compression fractures, and degenerative spine conditions. Other applications include whole-body imaging, imaging of rheumatologic and neuromuscular disorders, tumor characterization, and imaging near metallic implants. Limitations include fat-water swapping artifacts and unreliable fat quantification (eg, when mineralization is increased). This article covers the strengths, applications, and limitations of the Dixon method in musculoskeletal MRI, with practical tips for incorporating this method into routine protocols, highlighting its most effective applications and noting areas where it is less reliable.</p>","PeriodicalId":20896,"journal":{"name":"Radiology","volume":"318 1","pages":"e250374"},"PeriodicalIF":15.2,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146012149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
引用
批量引用
Middle Meningeal Artery Embolization for Subdural Hematoma: CT/MRI End Points of the EMBOLISE Trial.
脑膜中动脉栓塞治疗硬膜下血肿:栓塞试验的CT/MRI终点。
IF 15.2
1区 医学
Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Pub Date : 2026-01-01
DOI: 10.1148/radiol.251746
Helge Kniep, Susanne Gellissen, Matthias Bechstein, Lukas Meyer, Gabriel Broocks, Christian Heitkamp, Laurens Winkelmeier, Fabian Flottmann, Vincent Geest, Uta Hanning, Christian Thaler, Maxim Bester, Jason M Davies, Jared Knopman, Maxim Mokin, Ameer E Hassan, Robert E Harbaugh, Alexander Khalessi, Adnan H Siddiqui, Bradley A Gross, Ramesh Grandhi, Jason Tarpley, Walavan Sivakumar, Mark Bain, R Webster Crowley, Thomas W Link, Justin F Fraser, Michael R Levitt, Peng Roc Chen, Ricardo A Hanel, Joe D Bernard, Mouhammad Jumaa, Patrick Youssef, Marshall C Cress, Mohammad Imran Chaudry, Hakeem J Shakir, Walter S Lesley, Joshua Billingsley, Jesse Jones, Matthew J Koch, Alexandra R Paul, William J Mack, Joshua W Osbun, Kathleen Dlouhy, Jonathan A Grossberg, Christopher P Kellner, Daniel H Sahlein, Justin Santarelli, Clemens M Schirmer, Justin Singer, Jesse J Liu, Aniel Q Majjhoo, Thomas Wolfe, Neil V Patel, Christopher Roark, Jens Fiehler
Background Chronic subdural hematomas (cSDHs) are associated with high recurrence risks following surgical evacuation. The EMBOLISE trial demonstrated that, compared with surgery alone, adjunctive middle meningeal artery embolization (MMAE) significantly reduced reoperation rates. However, given the limitations of the clinical end points of the trial, which may be subject to interrater variability and certain biases, the quantitative imaging metrics need to be evaluated. Purpose To evaluate the prespecified imaging end points of the EMBOLISE trial and assess the long-term resolution of cSDH through quantitative imaging analyses. Materials and Methods EMBOLISE was a multicenter, randomized, interventional trial conducted across 39 U.S. sites between December 2020 and August 2023. Prespecified secondary imaging end points included changes in hematoma volume and thickness and midline shifts from 24 hours to 90 days after the procedure at CT and MRI. The post hoc analyses performed herein extended the assessment to 180 days and included absolute hematoma metrics. Mixed-effects modeling was employed to adjust for confounders. Results Four hundred patients were enrolled in the EMBOLISE study, among whom 352 were included (mean age, 72 years ± 10.4 [SD]; 256 men). The mean cSDH volume was 126 mL at screening, with no intergroup differences. At 90 and 180 days, the MMAE plus surgery group had lower cSDH volumes (20.6 mL vs 28.9 mL [P = .03] and 19.4 mL vs 31.5 mL [P = .04], respectively). Mixed-effects models revealed a 6.9 mL (95% CI: -13.5, -0.40; approximately 25%) greater volume reduction and an 8.4 mL (95% CI: -15.2, -1.6; approximately 30%) lower absolute volume at 90 days in the MMAE group There was no evidence of a difference in the prespecified secondary imaging end points between the groups. Conclusion While the prespecified secondary imaging end points did not significantly differ, the absolute 90- and 180-day hematoma volumes were significantly lower in patients who received MMAE and surgery. Confounder-adjusted mixed-effects analysis indicated a greater reduction in hematoma volume with adjunctive MMAE. ClinicalTrials.gov identifier NCT04402632 © RSNA, 2026 Supplemental material is available for this article. See also the editorial by Ramasamy and Baker in this issue.
求助PDF
{"title":"Middle Meningeal Artery Embolization for Subdural Hematoma: CT/MRI End Points of the EMBOLISE Trial.","authors":"Helge Kniep, Susanne Gellissen, Matthias Bechstein, Lukas Meyer, Gabriel Broocks, Christian Heitkamp, Laurens Winkelmeier, Fabian Flottmann, Vincent Geest, Uta Hanning, Christian Thaler, Maxim Bester, Jason M Davies, Jared Knopman, Maxim Mokin, Ameer E Hassan, Robert E Harbaugh, Alexander Khalessi, Adnan H Siddiqui, Bradley A Gross, Ramesh Grandhi, Jason Tarpley, Walavan Sivakumar, Mark Bain, R Webster Crowley, Thomas W Link, Justin F Fraser, Michael R Levitt, Peng Roc Chen, Ricardo A Hanel, Joe D Bernard, Mouhammad Jumaa, Patrick Youssef, Marshall C Cress, Mohammad Imran Chaudry, Hakeem J Shakir, Walter S Lesley, Joshua Billingsley, Jesse Jones, Matthew J Koch, Alexandra R Paul, William J Mack, Joshua W Osbun, Kathleen Dlouhy, Jonathan A Grossberg, Christopher P Kellner, Daniel H Sahlein, Justin Santarelli, Clemens M Schirmer, Justin Singer, Jesse J Liu, Aniel Q Majjhoo, Thomas Wolfe, Neil V Patel, Christopher Roark, Jens Fiehler","doi":"10.1148/radiol.251746","DOIUrl":"https://doi.org/10.1148/radiol.251746","url":null,"abstract":"<p><p>Background Chronic subdural hematomas (cSDHs) are associated with high recurrence risks following surgical evacuation. The EMBOLISE trial demonstrated that, compared with surgery alone, adjunctive middle meningeal artery embolization (MMAE) significantly reduced reoperation rates. However, given the limitations of the clinical end points of the trial, which may be subject to interrater variability and certain biases, the quantitative imaging metrics need to be evaluated. Purpose To evaluate the prespecified imaging end points of the EMBOLISE trial and assess the long-term resolution of cSDH through quantitative imaging analyses. Materials and Methods EMBOLISE was a multicenter, randomized, interventional trial conducted across 39 U.S. sites between December 2020 and August 2023. Prespecified secondary imaging end points included changes in hematoma volume and thickness and midline shifts from 24 hours to 90 days after the procedure at CT and MRI. The post hoc analyses performed herein extended the assessment to 180 days and included absolute hematoma metrics. Mixed-effects modeling was employed to adjust for confounders. Results Four hundred patients were enrolled in the EMBOLISE study, among whom 352 were included (mean age, 72 years ± 10.4 [SD]; 256 men). The mean cSDH volume was 126 mL at screening, with no intergroup differences. At 90 and 180 days, the MMAE plus surgery group had lower cSDH volumes (20.6 mL vs 28.9 mL [<i>P</i> = .03] and 19.4 mL vs 31.5 mL [<i>P</i> = .04], respectively). Mixed-effects models revealed a 6.9 mL (95% CI: -13.5, -0.40; approximately 25%) greater volume reduction and an 8.4 mL (95% CI: -15.2, -1.6; approximately 30%) lower absolute volume at 90 days in the MMAE group There was no evidence of a difference in the prespecified secondary imaging end points between the groups. Conclusion While the prespecified secondary imaging end points did not significantly differ, the absolute 90- and 180-day hematoma volumes were significantly lower in patients who received MMAE and surgery. Confounder-adjusted mixed-effects analysis indicated a greater reduction in hematoma volume with adjunctive MMAE. ClinicalTrials.gov identifier NCT04402632 © RSNA, 2026 <i>Supplemental material is available for this article.</i> See also the editorial by Ramasamy and Baker in this issue.</p>","PeriodicalId":20896,"journal":{"name":"Radiology","volume":"318 1","pages":"e251746"},"PeriodicalIF":15.2,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146053558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
引用
批量引用
Prospective Multicenter Evaluation of Hepatocellular Carcinoma Surveillance under American Association for the Study of Liver Diseases Version 2023 Guidance.
根据美国肝病研究协会2023版指南对肝细胞癌监测的前瞻性多中心评估
IF 15.2
1区 医学
Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Pub Date : 2026-01-01
DOI: 10.1148/radiol.252252
Mei-Qing Cheng, Hang-Tong Hu, Shao-Hong Wu, Hui Huang, Ya-Dan Lin, Wen-Juan Tong, Xiao-Zhou Lu, Wei-Ping Ke, Rui-Fang Lu, Qian-Qian Xu, Yu Wang, Xin-Xin Lin, Ze-Zhi Liu, Zhi-Rong Lu, Ming-De Lu, Hong Yang, Wei Wang, Li-Da Chen
Background The 2023 update of the American Association for the Study of Liver Diseases (AASLD version 2023 [hereafter, v2023]) guidance introduced new triggers for hepatocellular carcinoma (HCC) surveillance, incorporating US visualization score, increasing α-fetoprotein (AFP) level, and lesion growth, but has not yet been validated. Purpose To assess the surveillance performance of AASLD v2023 for HCC detection in comparison to Liver Imaging Reporting and Data System (LI-RADS) version 2017 (hereafter, v2017) and AASLD version 2018 (hereafter, v2018) in participants at high risk of HCC. Materials and Methods This prospective study consecutively enrolled high-risk participants undergoing US and AFP surveillance across three institutions between July 2023 and October 2024. Surveillance performance of US LI-RADS v2017, AASLD v2018, and AASLD v2023 for HCC detection was compared through sensitivity, specificity, positive predictive value, and negative predictive value (NPV). Multivariable logistic regression helped identify predictors of false-negative and false-positive classifications. Results Among 953 participants (median age, 51 years; IQR, 45-58 years; 658 men), 5% (50 of 953) had HCC. The new surveillance triggers (visualization score VIS-C, increasing AFP level, and lesion growth) demonstrated sensitivities ranging from 8% (four of 50 participants) to 48% (24 of 50 participants), with high specificities of 94% (849 of 903 participants) to 99.4% (898 of 903 participants). As an integrated algorithm, AASLD v2023 achieved a sensitivity of 94% (47 of 50 participants) and NPV of 99.6% (758 of 761 participants) for HCC detection, surpassing US LI-RADS v2017 (sensitivity, 60% [30 of 50 participants], P < .001; NPV, 98% [814 of 834 participants], P = .001) and AASLD v2018 (sensitivity, 76% [38 of 50 participants], P = .02; NPV, 98.5% [805 of 817 participants], P = .04). Specificity was lower than that with US LI-RADS v2017 (84% [758 of 903 participants] vs 90% [814 of 903 participants], P < .001) and AASLD v2018 (84% [758 of 903 participants] vs 89% [805 of 903 participants], P = .001). For early HCC, sensitivity remained superior to that of other algorithms (P < .001, P = .04). At multivariable analysis, AFP level of less than 20 ng/mL (odds ratio, 11.76; P < .001) and absence of cirrhosis (odds ratio, 2.45; P = .03) were independently associated with false-positive findings for AASLD v2023. Conclusion AASLD v2023 improved the sensitivity and NPV for HCC surveillance, outperforming US LI-RADS v2017 and AASLD v2018. Chinese Clinical Trial Registry no. ChiCTR2100054330 © RSNA, 2026 Supplemental material is available for this article.
求助PDF
{"title":"Prospective Multicenter Evaluation of Hepatocellular Carcinoma Surveillance under American Association for the Study of Liver Diseases Version 2023 Guidance.","authors":"Mei-Qing Cheng, Hang-Tong Hu, Shao-Hong Wu, Hui Huang, Ya-Dan Lin, Wen-Juan Tong, Xiao-Zhou Lu, Wei-Ping Ke, Rui-Fang Lu, Qian-Qian Xu, Yu Wang, Xin-Xin Lin, Ze-Zhi Liu, Zhi-Rong Lu, Ming-De Lu, Hong Yang, Wei Wang, Li-Da Chen","doi":"10.1148/radiol.252252","DOIUrl":"10.1148/radiol.252252","url":null,"abstract":"<p><p>Background The 2023 update of the American Association for the Study of Liver Diseases (AASLD version 2023 [hereafter, v2023]) guidance introduced new triggers for hepatocellular carcinoma (HCC) surveillance, incorporating US visualization score, increasing α-fetoprotein (AFP) level, and lesion growth, but has not yet been validated. Purpose To assess the surveillance performance of AASLD v2023 for HCC detection in comparison to Liver Imaging Reporting and Data System (LI-RADS) version 2017 (hereafter, v2017) and AASLD version 2018 (hereafter, v2018) in participants at high risk of HCC. Materials and Methods This prospective study consecutively enrolled high-risk participants undergoing US and AFP surveillance across three institutions between July 2023 and October 2024. Surveillance performance of US LI-RADS v2017, AASLD v2018, and AASLD v2023 for HCC detection was compared through sensitivity, specificity, positive predictive value, and negative predictive value (NPV). Multivariable logistic regression helped identify predictors of false-negative and false-positive classifications. Results Among 953 participants (median age, 51 years; IQR, 45-58 years; 658 men), 5% (50 of 953) had HCC. The new surveillance triggers (visualization score VIS-C, increasing AFP level, and lesion growth) demonstrated sensitivities ranging from 8% (four of 50 participants) to 48% (24 of 50 participants), with high specificities of 94% (849 of 903 participants) to 99.4% (898 of 903 participants). As an integrated algorithm, AASLD v2023 achieved a sensitivity of 94% (47 of 50 participants) and NPV of 99.6% (758 of 761 participants) for HCC detection, surpassing US LI-RADS v2017 (sensitivity, 60% [30 of 50 participants], <i>P</i> < .001; NPV, 98% [814 of 834 participants], <i>P</i> = .001) and AASLD v2018 (sensitivity, 76% [38 of 50 participants], <i>P</i> = .02; NPV, 98.5% [805 of 817 participants], <i>P</i> = .04). Specificity was lower than that with US LI-RADS v2017 (84% [758 of 903 participants] vs 90% [814 of 903 participants], <i>P</i> < .001) and AASLD v2018 (84% [758 of 903 participants] vs 89% [805 of 903 participants], <i>P</i> = .001). For early HCC, sensitivity remained superior to that of other algorithms (<i>P</i> < .001, <i>P</i> = .04). At multivariable analysis, AFP level of less than 20 ng/mL (odds ratio, 11.76; <i>P</i> < .001) and absence of cirrhosis (odds ratio, 2.45; <i>P</i> = .03) were independently associated with false-positive findings for AASLD v2023. Conclusion AASLD v2023 improved the sensitivity and NPV for HCC surveillance, outperforming US LI-RADS v2017 and AASLD v2018. Chinese Clinical Trial Registry no. ChiCTR2100054330 © RSNA, 2026 <i>Supplemental material is available for this article.</i></p>","PeriodicalId":20896,"journal":{"name":"Radiology","volume":"318 1","pages":"e252252"},"PeriodicalIF":15.2,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146012185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
引用
批量引用