分析美国人群中乳腺癌筛查的地理空间和社会经济差异:机器学习方法。

IF 3.3 Q2 ONCOLOGY JMIR Cancer Pub Date : 2025-01-16 DOI:10.2196/59882
Soheil Hashtarkhani, Yiwang Zhou, Fekede Asefa Kumsa, Shelley White-Means, David L Schwartz, Arash Shaban-Nejad
{"title":"分析美国人群中乳腺癌筛查的地理空间和社会经济差异:机器学习方法。","authors":"Soheil Hashtarkhani, Yiwang Zhou, Fekede Asefa Kumsa, Shelley White-Means, David L Schwartz, Arash Shaban-Nejad","doi":"10.2196/59882","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Breast cancer screening plays a pivotal role in early detection and subsequent effective management of the disease, impacting patient outcomes and survival rates.</p><p><strong>Objective: </strong>This study aims to assess breast cancer screening rates nationwide in the United States and investigate the impact of social determinants of health on these screening rates.</p><p><strong>Methods: </strong>Data on mammography screening at the census tract level for 2018 and 2020 were collected from the Behavioral Risk Factor Surveillance System. We developed a large-scale dataset of social determinants of health, comprising 13 variables for 72,337 census tracts. Spatial analysis employing Getis-Ord Gi statistics was used to identify clusters of high and low breast cancer screening rates. To evaluate the influence of these social determinants, we implemented a random forest model, with the aim of comparing its performance to linear regression and support vector machine models. The models were evaluated using R2 and root mean squared error metrics. Shapley Additive Explanations values were subsequently used to assess the significance of variables and direction of their influence.</p><p><strong>Results: </strong>Geospatial analysis revealed elevated screening rates in the eastern and northern United States, while central and midwestern regions exhibited lower rates. The random forest model demonstrated superior performance, with an R2=64.53 and root mean squared error of 2.06, compared to linear regression and support vector machine models. Shapley Additive Explanations values indicated that the percentage of the Black population, the number of mammography facilities within a 10-mile radius, and the percentage of the population with at least a bachelor's degree were the most influential variables, all positively associated with mammography screening rates.</p><p><strong>Conclusions: </strong>These findings underscore the significance of social determinants and the accessibility of mammography services in explaining the variability of breast cancer screening rates in the United States, emphasizing the need for targeted policy interventions in areas with relatively lower screening rates.</p>","PeriodicalId":45538,"journal":{"name":"JMIR Cancer","volume":"11 ","pages":"e59882"},"PeriodicalIF":3.3000,"publicationDate":"2025-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11756836/pdf/","citationCount":"0","resultStr":"{\"title\":\"Analyzing Geospatial and Socioeconomic Disparities in Breast Cancer Screening Among Populations in the United States: Machine Learning Approach.\",\"authors\":\"Soheil Hashtarkhani, Yiwang Zhou, Fekede Asefa Kumsa, Shelley White-Means, David L Schwartz, Arash Shaban-Nejad\",\"doi\":\"10.2196/59882\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Breast cancer screening plays a pivotal role in early detection and subsequent effective management of the disease, impacting patient outcomes and survival rates.</p><p><strong>Objective: </strong>This study aims to assess breast cancer screening rates nationwide in the United States and investigate the impact of social determinants of health on these screening rates.</p><p><strong>Methods: </strong>Data on mammography screening at the census tract level for 2018 and 2020 were collected from the Behavioral Risk Factor Surveillance System. We developed a large-scale dataset of social determinants of health, comprising 13 variables for 72,337 census tracts. Spatial analysis employing Getis-Ord Gi statistics was used to identify clusters of high and low breast cancer screening rates. To evaluate the influence of these social determinants, we implemented a random forest model, with the aim of comparing its performance to linear regression and support vector machine models. The models were evaluated using R2 and root mean squared error metrics. Shapley Additive Explanations values were subsequently used to assess the significance of variables and direction of their influence.</p><p><strong>Results: </strong>Geospatial analysis revealed elevated screening rates in the eastern and northern United States, while central and midwestern regions exhibited lower rates. The random forest model demonstrated superior performance, with an R2=64.53 and root mean squared error of 2.06, compared to linear regression and support vector machine models. Shapley Additive Explanations values indicated that the percentage of the Black population, the number of mammography facilities within a 10-mile radius, and the percentage of the population with at least a bachelor's degree were the most influential variables, all positively associated with mammography screening rates.</p><p><strong>Conclusions: </strong>These findings underscore the significance of social determinants and the accessibility of mammography services in explaining the variability of breast cancer screening rates in the United States, emphasizing the need for targeted policy interventions in areas with relatively lower screening rates.</p>\",\"PeriodicalId\":45538,\"journal\":{\"name\":\"JMIR Cancer\",\"volume\":\"11 \",\"pages\":\"e59882\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-01-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11756836/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR Cancer\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2196/59882\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Cancer","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/59882","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:乳腺癌筛查在疾病的早期发现和随后的有效管理中起着关键作用,影响患者的预后和生存率。目的:本研究旨在评估美国全国乳腺癌筛查率,并调查健康的社会决定因素对这些筛查率的影响。方法:从行为危险因素监测系统中收集2018年和2020年普查区乳腺x线检查数据。我们开发了一个大规模的健康社会决定因素数据集,包括72,337个人口普查区的13个变量。采用Getis-Ord Gi统计数据进行空间分析,以确定乳腺癌筛查率高和低的群集。为了评估这些社会决定因素的影响,我们实施了一个随机森林模型,目的是将其性能与线性回归和支持向量机模型进行比较。使用R2和均方根误差指标对模型进行评估。随后使用沙普利加性解释值来评估变量的重要性及其影响的方向。结果:地理空间分析显示,美国东部和北部的筛查率较高,而中部和中西部地区的筛查率较低。与线性回归和支持向量机模型相比,随机森林模型表现出更好的性能,R2=64.53,均方根误差为2.06。Shapley加性解释值表明,黑人人口的百分比、10英里半径内乳房x光检查设施的数量以及至少拥有学士学位的人口的百分比是最具影响力的变量,它们都与乳房x光检查率呈正相关。结论:这些发现强调了社会决定因素和乳房x光检查服务的可及性在解释美国乳腺癌筛查率差异方面的重要性,强调了筛查率相对较低地区有针对性政策干预的必要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Analyzing Geospatial and Socioeconomic Disparities in Breast Cancer Screening Among Populations in the United States: Machine Learning Approach.

Background: Breast cancer screening plays a pivotal role in early detection and subsequent effective management of the disease, impacting patient outcomes and survival rates.

Objective: This study aims to assess breast cancer screening rates nationwide in the United States and investigate the impact of social determinants of health on these screening rates.

Methods: Data on mammography screening at the census tract level for 2018 and 2020 were collected from the Behavioral Risk Factor Surveillance System. We developed a large-scale dataset of social determinants of health, comprising 13 variables for 72,337 census tracts. Spatial analysis employing Getis-Ord Gi statistics was used to identify clusters of high and low breast cancer screening rates. To evaluate the influence of these social determinants, we implemented a random forest model, with the aim of comparing its performance to linear regression and support vector machine models. The models were evaluated using R2 and root mean squared error metrics. Shapley Additive Explanations values were subsequently used to assess the significance of variables and direction of their influence.

Results: Geospatial analysis revealed elevated screening rates in the eastern and northern United States, while central and midwestern regions exhibited lower rates. The random forest model demonstrated superior performance, with an R2=64.53 and root mean squared error of 2.06, compared to linear regression and support vector machine models. Shapley Additive Explanations values indicated that the percentage of the Black population, the number of mammography facilities within a 10-mile radius, and the percentage of the population with at least a bachelor's degree were the most influential variables, all positively associated with mammography screening rates.

Conclusions: These findings underscore the significance of social determinants and the accessibility of mammography services in explaining the variability of breast cancer screening rates in the United States, emphasizing the need for targeted policy interventions in areas with relatively lower screening rates.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
JMIR Cancer
JMIR Cancer ONCOLOGY-
CiteScore
4.10
自引率
0.00%
发文量
64
审稿时长
12 weeks
期刊最新文献
Exploring the Impact of the Multimodal CAPABLE eHealth Intervention on Health-Related Quality of Life in Patients With Melanoma Undergoing Immune-Checkpoint Inhibition: Prospective Pilot Study. Spatiotemporal Correlation Analysis for the Incidence of Esophageal and Gastric Cancer From 2010 to 2019: Ecological Study. User and Developer Views on Using AI Technologies to Facilitate the Early Detection of Skin Cancers in Primary Care Settings: Qualitative Semistructured Interview Study. Exploring the Social Media Discussion of Breast Cancer Treatment Choices: Quantitative Natural Language Processing Study. A Machine Learning Approach Using Topic Modeling to Identify and Assess Experiences of Patients With Colorectal Cancer: Explorative Study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1