首页 > 最新文献

Machine learning with applications最新文献

英文 中文
Robust image classification system via cloud computing, aligned multimodal embeddings, centroids and neighbours 通过云计算、对齐多模态嵌入、中心点和邻域实现稳健的图像分类系统
Pub Date : 2024-09-01 DOI: 10.1016/j.mlwa.2024.100583
Wei Lun Koh, James Boon Yong Koh, Bing Tian Dai

We propose a framework for a cloud-based application of an image classification system that is highly accessible, maintains data confidentiality, and robust to incorrect training labels. The end-to-end system is implemented using Amazon Web Services (AWS), with a detailed guide provided for replication, enhancing the ways which researchers can collaborate with a community of users for mutual benefits. A front-end web application allows users across the world to securely log in, contribute labelled training images conveniently via a drag-and-drop approach, and use that same application to query an up-to-date model that has knowledge of images from the community of users. This resulting system demonstrates that theory can be effectively interlaced with practice, with various considerations addressed by our architecture. Users will have access to an image classification model that can be updated and automatically deployed within minutes, gaining benefits from and at the same time providing benefits to the community of users. At the same time, researchers, who will act as administrators, will be able to conveniently and securely engage a large number of users with their respective machine learning models and build up a labelled database over time, paying only variable costs that is proportional to utilization.

我们为基于云的图像分类系统应用提出了一个框架,该框架具有高度可访问性、数据保密性和对错误训练标签的鲁棒性。端到端系统使用亚马逊网络服务(AWS)实施,并提供了详细的复制指南,从而增强了研究人员与用户社区互利合作的方式。前端网络应用程序允许世界各地的用户安全登录,通过拖放方式方便地提供标记过的训练图像,并使用同一应用程序查询最新模型,该模型拥有来自用户社区的图像知识。该系统的成果表明,理论可以有效地与实践相结合,我们的架构可以解决各种问题。用户可以访问可在数分钟内更新和自动部署的图像分类模型,从而从用户群中获益,同时也为用户群带来益处。与此同时,作为管理员的研究人员将能够方便、安全地让大量用户使用他们各自的机器学习模型,并随着时间的推移建立一个标记数据库,只需支付与使用率成正比的可变成本。
{"title":"Robust image classification system via cloud computing, aligned multimodal embeddings, centroids and neighbours","authors":"Wei Lun Koh,&nbsp;James Boon Yong Koh,&nbsp;Bing Tian Dai","doi":"10.1016/j.mlwa.2024.100583","DOIUrl":"10.1016/j.mlwa.2024.100583","url":null,"abstract":"<div><p>We propose a framework for a cloud-based application of an image classification system that is highly accessible, maintains data confidentiality, and robust to incorrect training labels. The end-to-end system is implemented using Amazon Web Services (AWS), with a detailed guide provided for replication, enhancing the ways which researchers can collaborate with a community of users for mutual benefits. A front-end web application allows users across the world to securely log in, contribute labelled training images conveniently via a drag-and-drop approach, and use that same application to query an up-to-date model that has knowledge of images from the community of users. This resulting system demonstrates that theory can be effectively interlaced with practice, with various considerations addressed by our architecture. Users will have access to an image classification model that can be updated and automatically deployed within minutes, gaining benefits from and at the same time providing benefits to the community of users. At the same time, researchers, who will act as administrators, will be able to conveniently and securely engage a large number of users with their respective machine learning models and build up a labelled database over time, paying only variable costs that is proportional to utilization.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"17 ","pages":"Article 100583"},"PeriodicalIF":0.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827024000598/pdfft?md5=e2b0096db158ce87d429b6d8deb1da6e&pid=1-s2.0-S2666827024000598-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142129217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of bike-sharing station demand using explainable artificial intelligence 利用可解释人工智能预测共享单车站点需求
Pub Date : 2024-08-09 DOI: 10.1016/j.mlwa.2024.100582
Frank Ngeni , Boniphace Kutela , Tumlumbe Juliana Chengula , Cuthbert Ruseruka , Hannah Musau , Norris Novat , Debbie Aisiana Indah , Sarah Kasomi

Bike-sharing systems have grown in popularity in metropolitan areas, providing a handy and environmentally friendly transportation choice for commuters and visitors alike. As demand for bike-sharing programs grows, efficient capacity planning becomes critical to ensuring good user experience and system sustainability in terms of demand. The random forest model was used in this study to predict bike-sharing station demand and is considered a strong ensemble learning approach that can successfully capture complicated nonlinear correlations and interactions between input variables. This study employed data from the Smart Location Database (SLD) to test the model accuracy in estimating station demand and used a form of explainable artificial intelligence (XAI) function to further understand machine learning (ML) prediction outcomes owing to the blackbox tendencies of ML models. Vehicle Miles of Travel (VMT) and Greenhouse Gas (GHG) emissions were the most important features in predicting docking station demand individually but not holistically based on the datasets. The percentage of zero-car households, gross residential density, road network density, aggregate frequency of transit service, and gross activity density were found to have a moderate influence on the prediction model. Further, there may be a better prediction model generating sensible results for every type of explanatory variable, but their contributions are minimum to the prediction outcome. By measuring each feature's contribution to demand prediction in feature engineering, bike-sharing operators can acquire a better understanding of the bike-sharing station capacity and forecast future demands during planning. At the same time, ML models will need further assessment before a holistic conclusion.

共享单车系统在大都市地区越来越受欢迎,为通勤者和游客提供了便捷、环保的交通选择。随着共享单车需求的增长,高效的容量规划对确保良好的用户体验和系统需求的可持续性至关重要。本研究采用随机森林模型来预测共享单车站点的需求,该模型被认为是一种强大的集合学习方法,能够成功捕捉输入变量之间复杂的非线性关联和相互作用。本研究使用智能地点数据库(SLD)中的数据来测试模型在估算站点需求方面的准确性,并使用一种可解释人工智能(XAI)函数来进一步理解机器学习(ML)预测结果,因为ML模型具有黑箱倾向。车辆行驶里程(VMT)和温室气体(GHG)排放量是单独预测停靠站需求的最重要特征,但不是基于数据集的整体预测。零汽车家庭比例、住宅总密度、路网密度、公交服务总频率和活动总密度对预测模型的影响不大。此外,每一种解释变量都可能有一个更好的预测模型来产生合理的结果,但它们对预测结果的贡献都是最小的。通过在特征工程中衡量每个特征对需求预测的贡献,共享单车运营商可以更好地了解共享单车站点的容量,并在规划过程中预测未来的需求。同时,ML 模型还需要进一步评估才能得出整体结论。
{"title":"Prediction of bike-sharing station demand using explainable artificial intelligence","authors":"Frank Ngeni ,&nbsp;Boniphace Kutela ,&nbsp;Tumlumbe Juliana Chengula ,&nbsp;Cuthbert Ruseruka ,&nbsp;Hannah Musau ,&nbsp;Norris Novat ,&nbsp;Debbie Aisiana Indah ,&nbsp;Sarah Kasomi","doi":"10.1016/j.mlwa.2024.100582","DOIUrl":"10.1016/j.mlwa.2024.100582","url":null,"abstract":"<div><p>Bike-sharing systems have grown in popularity in metropolitan areas, providing a handy and environmentally friendly transportation choice for commuters and visitors alike. As demand for bike-sharing programs grows, efficient capacity planning becomes critical to ensuring good user experience and system sustainability in terms of demand. The random forest model was used in this study to predict bike-sharing station demand and is considered a strong ensemble learning approach that can successfully capture complicated nonlinear correlations and interactions between input variables. This study employed data from the Smart Location Database (SLD) to test the model accuracy in estimating station demand and used a form of explainable artificial intelligence (XAI) function to further understand machine learning (ML) prediction outcomes owing to the blackbox tendencies of ML models. Vehicle Miles of Travel (VMT) and Greenhouse Gas (GHG) emissions were the most important features in predicting docking station demand individually but not holistically based on the datasets. The percentage of zero-car households, gross residential density, road network density, aggregate frequency of transit service, and gross activity density were found to have a moderate influence on the prediction model. Further, there may be a better prediction model generating sensible results for every type of explanatory variable, but their contributions are minimum to the prediction outcome. By measuring each feature's contribution to demand prediction in feature engineering, bike-sharing operators can acquire a better understanding of the bike-sharing station capacity and forecast future demands during planning. At the same time, ML models will need further assessment before a holistic conclusion.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"17 ","pages":"Article 100582"},"PeriodicalIF":0.0,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827024000586/pdfft?md5=bf46aecfa5d4b69f24c5a8d196610032&pid=1-s2.0-S2666827024000586-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142012918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing advanced driver assistance systems through explainable artificial intelligence for driver anomaly detection 通过用于驾驶员异常检测的可解释人工智能增强高级驾驶员辅助系统
Pub Date : 2024-08-05 DOI: 10.1016/j.mlwa.2024.100580
Tumlumbe Juliana Chengula , Judith Mwakalonge , Gurcan Comert , Methusela Sulle , Saidi Siuhi , Eric Osei

The recent advancements in Advanced Driver Assistance Systems (ADAS) have significantly contributed to road safety and driving comfort. An integral aspect of these systems is the detection of driver anomalies such as drowsiness, distraction, and impairment, which are crucial for preventing accidents. Building upon previous studies that utilized ensemble model learning (XGBoost) with deep learning models (ResNet50, DenseNet201, and InceptionV3) for anomaly detection, this study introduces a comprehensive feature importance analysis using the SHAP (SHapley Additive exPlanations) technique. The technique is implemented through explainable artificial intelligence (XAI). The primary objective is to unravel the complex decision-making process of the ensemble model, which has previously demonstrated near-perfect performance metrics in classifying driver behaviors using in-vehicle cameras. By applying SHAP, the study aims to identify and quantify the contribution of each feature – such as facial expressions, head position, yawning, and sleeping – in predicting driver states. This analysis offers insights into the model’s inner workings and guides the enhancement of feature engineering for more precise and reliable anomaly detection. The findings of this study are expected to impact the development of future ADAS technologies significantly. By pinpointing the most influential features and understanding their dynamics, a model can be optimized for various driving scenarios, ensuring that ADAS systems are robust, accurate, and tailored to real-world conditions. Ultimately, this study contributes to the overarching goal of enhancing road safety through technologically advanced, data-driven approaches.

先进驾驶辅助系统(ADAS)的最新进展极大地促进了道路安全和驾驶舒适性。这些系统不可或缺的一个方面是检测驾驶员的异常情况,如嗜睡、分心和损伤,这对预防事故至关重要。以往的研究利用集合模型学习(XGBoost)和深度学习模型(ResNet50、DenseNet201 和 InceptionV3)进行异常检测,本研究在此基础上采用 SHAP(SHapley Additive exPlanations)技术引入了全面的特征重要性分析。该技术是通过可解释人工智能(XAI)实现的。其主要目的是揭示集合模型的复杂决策过程,该模型之前在使用车载摄像头对驾驶员行为进行分类时已展示出近乎完美的性能指标。通过应用 SHAP,该研究旨在识别和量化每个特征(如面部表情、头部位置、打哈欠和睡眠)在预测驾驶员状态方面的贡献。这种分析有助于深入了解模型的内部运作,并指导特征工程的改进,从而实现更精确、更可靠的异常检测。这项研究的结果有望对未来 ADAS 技术的发展产生重大影响。通过精确定位最具影响力的特征并了解其动态变化,可以针对各种驾驶场景对模型进行优化,从而确保 ADAS 系统的稳健性、准确性,并符合实际情况。最终,这项研究有助于实现通过技术先进、数据驱动的方法提高道路安全的总体目标。
{"title":"Enhancing advanced driver assistance systems through explainable artificial intelligence for driver anomaly detection","authors":"Tumlumbe Juliana Chengula ,&nbsp;Judith Mwakalonge ,&nbsp;Gurcan Comert ,&nbsp;Methusela Sulle ,&nbsp;Saidi Siuhi ,&nbsp;Eric Osei","doi":"10.1016/j.mlwa.2024.100580","DOIUrl":"10.1016/j.mlwa.2024.100580","url":null,"abstract":"<div><p>The recent advancements in Advanced Driver Assistance Systems (ADAS) have significantly contributed to road safety and driving comfort. An integral aspect of these systems is the detection of driver anomalies such as drowsiness, distraction, and impairment, which are crucial for preventing accidents. Building upon previous studies that utilized ensemble model learning (XGBoost) with deep learning models (ResNet50, DenseNet201, and InceptionV3) for anomaly detection, this study introduces a comprehensive feature importance analysis using the SHAP (SHapley Additive exPlanations) technique. The technique is implemented through explainable artificial intelligence (XAI). The primary objective is to unravel the complex decision-making process of the ensemble model, which has previously demonstrated near-perfect performance metrics in classifying driver behaviors using in-vehicle cameras. By applying SHAP, the study aims to identify and quantify the contribution of each feature – such as facial expressions, head position, yawning, and sleeping – in predicting driver states. This analysis offers insights into the model’s inner workings and guides the enhancement of feature engineering for more precise and reliable anomaly detection. The findings of this study are expected to impact the development of future ADAS technologies significantly. By pinpointing the most influential features and understanding their dynamics, a model can be optimized for various driving scenarios, ensuring that ADAS systems are robust, accurate, and tailored to real-world conditions. Ultimately, this study contributes to the overarching goal of enhancing road safety through technologically advanced, data-driven approaches.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"17 ","pages":"Article 100580"},"PeriodicalIF":0.0,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827024000562/pdfft?md5=f80e9f1f7d3e0f04778d0e4fbebf27c0&pid=1-s2.0-S2666827024000562-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141953205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Navigating interpretability and alpha control in GF-KCSD testing with measurement error: A Kernel approach 在有测量误差的 GF-KCSD 测试中控制可解释性和阿尔法:核方法
Pub Date : 2024-08-05 DOI: 10.1016/j.mlwa.2024.100581
Elham Afzali, Saman Muthukumarana, Liqun Wang

The Gradient-Free Kernel Conditional Stein Discrepancy (GF-KCSD), presented in our prior work, represents a significant advancement in goodness-of-fit testing for conditional distributions. This method offers a robust alternative to previous gradient-based techniques, specially when the gradient calculation is intractable or computationally expensive. In this study, we explore previously unexamined aspects of GF-KCSD, with a particular focus on critical values and test power—essential components for effective hypothesis testing. We also present novel investigation on the impact of measurement errors on the performance of GF-KCSD in comparison to established benchmarks, enhancing our understanding of its resilience to these errors. Through controlled experiments using synthetic data, we demonstrate GF-KCSD’s superior ability to control type-I error rates and maintain high statistical power, even in the presence of measurement inaccuracies. Our empirical evaluation extends to real-world datasets, including brain MRI data. The findings confirm that GF-KCSD performs comparably to KCSD in hypothesis testing effectiveness while requiring significantly less computational time. This demonstrates GF-KCSD’s capability as an efficient tool for analyzing complex data, enhancing its value for scenarios that demand rapid and robust statistical analysis.

无梯度核条件斯泰因差异(GF-KCSD)在我们之前的工作中已经提出,它代表了条件分布拟合优度测试的一大进步。这种方法为以前基于梯度的技术提供了一种稳健的替代方法,尤其是在梯度计算难以实现或计算成本高昂的情况下。在本研究中,我们探索了 GF-KCSD 以前未曾研究过的方面,特别是临界值和检验功率--有效假设检验的重要组成部分。与已有的基准相比,我们还对测量误差对 GF-KCSD 性能的影响进行了新颖的研究,从而加深了我们对 GF-KCSD 抵御这些误差能力的理解。通过使用合成数据进行受控实验,我们证明了 GF-KCSD 即使在存在测量误差的情况下,也能控制 I 类错误率并保持较高的统计能力。我们的实证评估扩展到了真实世界的数据集,包括脑磁共振成像数据。结果证实,GF-KCSD 在假设检验有效性方面的表现与 KCSD 不相上下,而所需的计算时间却大大减少。这证明了 GF-KCSD 作为分析复杂数据的高效工具的能力,提高了它在需要快速、稳健统计分析的应用场景中的价值。
{"title":"Navigating interpretability and alpha control in GF-KCSD testing with measurement error: A Kernel approach","authors":"Elham Afzali,&nbsp;Saman Muthukumarana,&nbsp;Liqun Wang","doi":"10.1016/j.mlwa.2024.100581","DOIUrl":"10.1016/j.mlwa.2024.100581","url":null,"abstract":"<div><p>The Gradient-Free Kernel Conditional Stein Discrepancy (GF-KCSD), presented in our prior work, represents a significant advancement in goodness-of-fit testing for conditional distributions. This method offers a robust alternative to previous gradient-based techniques, specially when the gradient calculation is intractable or computationally expensive. In this study, we explore previously unexamined aspects of GF-KCSD, with a particular focus on critical values and test power—essential components for effective hypothesis testing. We also present novel investigation on the impact of measurement errors on the performance of GF-KCSD in comparison to established benchmarks, enhancing our understanding of its resilience to these errors. Through controlled experiments using synthetic data, we demonstrate GF-KCSD’s superior ability to control type-I error rates and maintain high statistical power, even in the presence of measurement inaccuracies. Our empirical evaluation extends to real-world datasets, including brain MRI data. The findings confirm that GF-KCSD performs comparably to KCSD in hypothesis testing effectiveness while requiring significantly less computational time. This demonstrates GF-KCSD’s capability as an efficient tool for analyzing complex data, enhancing its value for scenarios that demand rapid and robust statistical analysis.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"17 ","pages":"Article 100581"},"PeriodicalIF":0.0,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827024000574/pdfft?md5=64343827db2919a23c638fc11f2df65c&pid=1-s2.0-S2666827024000574-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141978188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Supervised machine learning in drug discovery and development: Algorithms, applications, challenges, and prospects 药物发现和开发中的监督机器学习:算法、应用、挑战和前景
Pub Date : 2024-07-24 DOI: 10.1016/j.mlwa.2024.100576
George Obaido , Ibomoiye Domor Mienye , Oluwaseun F. Egbelowo , Ikiomoye Douglas Emmanuel , Adeola Ogunleye , Blessing Ogbuokiri , Pere Mienye , Kehinde Aruleba

Drug discovery and development is a time-consuming process that involves identifying, designing, and testing new drugs to address critical medical needs. In recent years, machine learning (ML) has played a vital role in technological advancements and has shown promising results in various drug discovery and development stages. ML can be categorized into supervised, unsupervised, semi-supervised, and reinforcement learning. Supervised learning is the most used category, helping organizations solve several real-world problems. This study presents a comprehensive survey of supervised learning algorithms in drug design and development, focusing on their learning process and succinct mathematical formulations, which are lacking in the literature. Additionally, the study discusses widely encountered challenges in applying supervised learning for drug discovery and potential solutions. This study will be beneficial to researchers and practitioners in the pharmaceutical industry as it provides a simplified yet comprehensive review of the main concepts, algorithms, challenges, and prospects in supervised learning.

药物发现和开发是一个耗时的过程,包括识别、设计和测试新药,以满足关键的医疗需求。近年来,机器学习(ML)在技术进步中发挥了重要作用,并在药物发现和开发的各个阶段取得了可喜的成果。机器学习可分为监督学习、无监督学习、半监督学习和强化学习。监督学习是使用最多的一类,可以帮助企业解决一些实际问题。本研究对药物设计与开发中的监督学习算法进行了全面调查,重点关注其学习过程和简洁的数学公式,这在文献中是缺乏的。此外,本研究还讨论了将监督学习应用于药物发现过程中广泛遇到的挑战以及潜在的解决方案。本研究对监督学习的主要概念、算法、挑战和前景进行了简明而全面的评述,对制药行业的研究人员和从业人员大有裨益。
{"title":"Supervised machine learning in drug discovery and development: Algorithms, applications, challenges, and prospects","authors":"George Obaido ,&nbsp;Ibomoiye Domor Mienye ,&nbsp;Oluwaseun F. Egbelowo ,&nbsp;Ikiomoye Douglas Emmanuel ,&nbsp;Adeola Ogunleye ,&nbsp;Blessing Ogbuokiri ,&nbsp;Pere Mienye ,&nbsp;Kehinde Aruleba","doi":"10.1016/j.mlwa.2024.100576","DOIUrl":"10.1016/j.mlwa.2024.100576","url":null,"abstract":"<div><p>Drug discovery and development is a time-consuming process that involves identifying, designing, and testing new drugs to address critical medical needs. In recent years, machine learning (ML) has played a vital role in technological advancements and has shown promising results in various drug discovery and development stages. ML can be categorized into supervised, unsupervised, semi-supervised, and reinforcement learning. Supervised learning is the most used category, helping organizations solve several real-world problems. This study presents a comprehensive survey of supervised learning algorithms in drug design and development, focusing on their learning process and succinct mathematical formulations, which are lacking in the literature. Additionally, the study discusses widely encountered challenges in applying supervised learning for drug discovery and potential solutions. This study will be beneficial to researchers and practitioners in the pharmaceutical industry as it provides a simplified yet comprehensive review of the main concepts, algorithms, challenges, and prospects in supervised learning.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"17 ","pages":"Article 100576"},"PeriodicalIF":0.0,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827024000525/pdfft?md5=96d7cf4045bc85637f8a61b12ff40fe9&pid=1-s2.0-S2666827024000525-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141952685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of presence or absence of metastasis in WSI patches of breast cancer using the dual-enhanced convolutional ensemble neural network 利用双增强卷积集合神经网络检测乳腺癌 WSI 斑块中是否存在转移灶
Pub Date : 2024-07-23 DOI: 10.1016/j.mlwa.2024.100579
Ruigang Ge , Guoyue Chen , Kazuki Saruta , Yuki Terata

Breast cancer (BC) is a prevalent malignancy worldwide, posing a significant public health burden due to its high incidence rate. Accurate detection is crucial for improving survival rates, and pathological diagnosis through biopsy is essential for detailed BC detection. Convolutional Neural Network (CNN)-based methods have been proposed to support this detection, utilizing patches from Whole Slide Imaging (WSI) combined with sophisticated CNNs. In this research, we introduced DECENN, a novel deep learning architecture designed to overcome the limitations of single CNN models under fixed pre-trained parameter transfer learning settings. DECENN employs an ensemble of VGG16 and DenseNet121, integrated with innovative modules such as Multi-Scale Feature Extraction, Heterogeneous Convolution Enhancement, Feature Harmonization and Fusion, and Feature Integration Output. Through progressive stages – from baseline models, intermediate DCNN and DCNN+ models, to the fully integrated DECENN model – significant performance improvements were observed in experiments using 5-fold cross-validation on the Patch Camelyon(PCam) dataset. DECENN achieved an AUC of 99.70% ± 0.12%, an F-score of 98.93% ± 0.06%, and an Accuracy of 98.92% ± 0.06%, (p<0.001). These results highlight DECENN’s potential to significantly enhance the automated detection and diagnostic accuracy of BC metastasis in biopsy specimens.

乳腺癌(BC)是一种全球流行的恶性肿瘤,由于发病率高,给公共卫生带来了巨大负担。准确的检测对于提高生存率至关重要,而通过活检进行病理诊断对于乳腺癌的详细检测至关重要。有人提出了基于卷积神经网络(CNN)的方法,利用全切片成像(WSI)的斑块与复杂的 CNN 相结合来支持这种检测。在这项研究中,我们引入了 DECENN,这是一种新型深度学习架构,旨在克服单一 CNN 模型在固定预训练参数迁移学习设置下的局限性。DECENN 采用了 VGG16 和 DenseNet121 的集合,并集成了多尺度特征提取、异构卷积增强、特征协调与融合以及特征集成输出等创新模块。通过从基线模型、中间 DCNN 和 DCNN+ 模型到完全集成的 DECENN 模型等渐进阶段,在 Patch Camelyon(PCam) 数据集上使用 5 倍交叉验证进行的实验中观察到了显著的性能改进。DECENN 的 AUC 为 99.70% ± 0.12%,F-score 为 98.93% ± 0.06%,准确率为 98.92% ± 0.06%,(p<0.001)。这些结果凸显了 DECENN 在显著提高活检样本中 BC 转移的自动检测和诊断准确性方面的潜力。
{"title":"Detection of presence or absence of metastasis in WSI patches of breast cancer using the dual-enhanced convolutional ensemble neural network","authors":"Ruigang Ge ,&nbsp;Guoyue Chen ,&nbsp;Kazuki Saruta ,&nbsp;Yuki Terata","doi":"10.1016/j.mlwa.2024.100579","DOIUrl":"10.1016/j.mlwa.2024.100579","url":null,"abstract":"<div><p>Breast cancer (BC) is a prevalent malignancy worldwide, posing a significant public health burden due to its high incidence rate. Accurate detection is crucial for improving survival rates, and pathological diagnosis through biopsy is essential for detailed BC detection. Convolutional Neural Network (CNN)-based methods have been proposed to support this detection, utilizing patches from Whole Slide Imaging (WSI) combined with sophisticated CNNs. In this research, we introduced DECENN, a novel deep learning architecture designed to overcome the limitations of single CNN models under fixed pre-trained parameter transfer learning settings. DECENN employs an ensemble of VGG16 and DenseNet121, integrated with innovative modules such as Multi-Scale Feature Extraction, Heterogeneous Convolution Enhancement, Feature Harmonization and Fusion, and Feature Integration Output. Through progressive stages – from baseline models, intermediate DCNN and DCNN+ models, to the fully integrated DECENN model – significant performance improvements were observed in experiments using 5-fold cross-validation on the Patch Camelyon(PCam) dataset. DECENN achieved an AUC of 99.70% ± 0.12%, an F-score of 98.93% ± 0.06%, and an Accuracy of 98.92% ± 0.06%, (<span><math><mrow><mi>p</mi><mo>&lt;</mo><mn>0</mn><mo>.</mo><mn>001</mn></mrow></math></span>). These results highlight DECENN’s potential to significantly enhance the automated detection and diagnostic accuracy of BC metastasis in biopsy specimens.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"17 ","pages":"Article 100579"},"PeriodicalIF":0.0,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827024000550/pdfft?md5=5deed5e31245e0c9768d99b90207c738&pid=1-s2.0-S2666827024000550-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141844498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data efficient contrastive learning in histopathology using active sampling 利用主动采样在组织病理学中进行数据高效对比学习
Pub Date : 2024-07-22 DOI: 10.1016/j.mlwa.2024.100577
Tahsin Reasat , Asif Sushmit , David S. Smith

Deep learning (DL) based diagnostics systems can provide accurate and robust quantitative analysis in digital pathology. These algorithms require large amounts of annotated training data which is impractical in pathology due to the high resolution of histopathological images. Hence, self-supervised methods have been proposed to learn features using ad-hoc pretext tasks. The self-supervised training process uses a large unlabeled dataset which makes the learning process time consuming. In this work, we propose a new method for actively sampling informative members from the training set using a small proxy network, decreasing sample requirement by 93% and training time by 62% while maintaining the same performance of the traditional self-supervised learning method. The code is available on github.

基于深度学习(DL)的诊断系统可在数字病理学中提供准确、稳健的定量分析。这些算法需要大量有注释的训练数据,而由于组织病理学图像的高分辨率,这在病理学中是不切实际的。因此,有人提出了利用临时借口任务学习特征的自监督方法。自我监督训练过程使用大量未标记的数据集,这使得学习过程非常耗时。在这项工作中,我们提出了一种利用小型代理网络从训练集中主动抽取信息成员的新方法,在保持与传统自我监督学习方法相同性能的同时,将样本要求降低了 93%,将训练时间缩短了 62%。代码可在 github 上获取。
{"title":"Data efficient contrastive learning in histopathology using active sampling","authors":"Tahsin Reasat ,&nbsp;Asif Sushmit ,&nbsp;David S. Smith","doi":"10.1016/j.mlwa.2024.100577","DOIUrl":"10.1016/j.mlwa.2024.100577","url":null,"abstract":"<div><p>Deep learning (DL) based diagnostics systems can provide accurate and robust quantitative analysis in digital pathology. These algorithms require large amounts of annotated training data which is impractical in pathology due to the high resolution of histopathological images. Hence, self-supervised methods have been proposed to learn features using ad-hoc pretext tasks. The self-supervised training process uses a large unlabeled dataset which makes the learning process time consuming. In this work, we propose a new method for actively sampling informative members from the training set using a small proxy network, decreasing sample requirement by 93% and training time by 62% while maintaining the same performance of the traditional self-supervised learning method. The code is available on <span><span>github</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"17 ","pages":"Article 100577"},"PeriodicalIF":0.0,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827024000537/pdfft?md5=3a2df5b10799802c10eef16a87edcda2&pid=1-s2.0-S2666827024000537-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141961574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A framework for modelling customer invoice payment predictions 客户发票付款预测建模框架
Pub Date : 2024-07-19 DOI: 10.1016/j.mlwa.2024.100578
Willem Roux Moore, Jan H. van Vuuren

By offering clients attractive credit terms on sales, a company may increase its turnover, but granting credit also incurs the cost of money tied up in accounts receivable (AR), increased administration and a heightened probability of incurring bad debt. The management of credit sales, although eminently important to any business, is often performed manually, which may be time-consuming, expensive and inaccurate. Such an administrative workload becomes increasingly cumbersome as the number of credit sales increases. As a result, a new approach towards proactively identifying invoices from AR accounts that are likely to be paid late, or not at all, has recently been proposed in the literature, with the aim of employing intervention strategies more effectively. Several computational techniques from the credit scoring literature and particularly techniques from the realms of survival analysis or machine learning have been embedded in the aforementioned approach. This body of work is, however, lacking due to the limited guidance provided during the data preparation phase of the model development process and because survival analytic and machine learning techniques have not yet been ensembled. In this paper, we propose a generic framework for modelling invoice payment predictions with the aim of facilitating the process of preparing transaction data for analysis, generating relevant features from past customer behaviours, and selecting and ensembling suitable models for predicting the time to payment associated with invoices. We also introduce a new sequential ensembling approach, called the Survival Boost algorithm. The rationale behind this method is that features generated by a survival analytic model can enhance the efficacy of a machine learning classification algorithm.

通过向客户提供有吸引力的赊销条件,公司可以提高营业额,但提供赊销也会产生应收账款(AR)的资金占用成本、管理费用增加以及产生坏账的可能性增大。赊销管理虽然对任何企业都非常重要,但通常都是手工操作,可能费时、费钱且不准确。随着赊销数量的增加,这种管理工作也变得越来越繁琐。因此,最近有文献提出了一种新方法,即从应收账款账户中主动识别可能逾期付款或根本不付款的发票,以便更有效地采用干预策略。上述方法中包含了信用评分文献中的一些计算技术,特别是生存分析或机器学习领域的技术。然而,由于在模型开发过程的数据准备阶段所提供的指导有限,而且生存分析和机器学习技术还没有被组合起来,因此这方面的工作还很欠缺。在本文中,我们提出了建立发票付款预测模型的通用框架,目的是简化准备交易数据进行分析的过程,从过去的客户行为中生成相关特征,并选择和组合合适的模型来预测与发票相关的付款时间。我们还引入了一种新的顺序集合方法,称为 "生存提升算法"。这种方法的原理是,生存分析模型生成的特征可以提高机器学习分类算法的效率。
{"title":"A framework for modelling customer invoice payment predictions","authors":"Willem Roux Moore,&nbsp;Jan H. van Vuuren","doi":"10.1016/j.mlwa.2024.100578","DOIUrl":"10.1016/j.mlwa.2024.100578","url":null,"abstract":"<div><p>By offering clients attractive credit terms on sales, a company may increase its turnover, but granting credit also incurs the cost of money tied up in <em>accounts receivable</em> (AR), increased administration and a heightened probability of incurring bad debt. The management of credit sales, although eminently important to any business, is often performed manually, which may be time-consuming, expensive and inaccurate. Such an administrative workload becomes increasingly cumbersome as the number of credit sales increases. As a result, a new approach towards proactively identifying invoices from AR accounts that are likely to be paid late, or not at all, has recently been proposed in the literature, with the aim of employing intervention strategies more effectively. Several computational techniques from the credit scoring literature and particularly techniques from the realms of survival analysis or machine learning have been embedded in the aforementioned approach. This body of work is, however, lacking due to the limited guidance provided during the data preparation phase of the model development process and because survival analytic and machine learning techniques have not yet been ensembled. In this paper, we propose a generic framework for modelling invoice payment predictions with the aim of facilitating the process of preparing transaction data for analysis, generating relevant features from past customer behaviours, and selecting and ensembling suitable models for predicting the time to payment associated with invoices. We also introduce a new sequential ensembling approach, called the Survival Boost algorithm. The rationale behind this method is that features generated by a survival analytic model can enhance the efficacy of a machine learning classification algorithm.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"17 ","pages":"Article 100578"},"PeriodicalIF":0.0,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827024000549/pdfft?md5=bf6f5de7c56807d70f8402b9b60468a6&pid=1-s2.0-S2666827024000549-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141839227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DeepCKID: A Multi-Head Attention-Based Deep Neural Network Model Leveraging Classwise Knowledge to Handle Imbalanced Textual Data DeepCKID:利用分类知识处理不平衡文本数据的基于多头注意力的深度神经网络模型
Pub Date : 2024-07-14 DOI: 10.1016/j.mlwa.2024.100575
Amit Kumar Sah , Muhammad Abulaish

This paper presents DeepCKID, a Multi-Head Attention (MHA)-based deep learning model that exploits statistical and semantic knowledge corresponding to documents across different classes in the datasets to improve the model’s ability to detect minority class instances in imbalanced text classification. In this process, corresponding to each document, DeepCKID extracts — (i) word-level statistical and semantic knowledge, namely, class correlation and class similarity corresponding to each word, based on its association with different classes in the dataset, and (ii) class-level knowledge from the document using n-grams and relation triplets corresponding to classwise keywords present, identified using cosine similarity utilizing Transformers-based Pre-trained Language Models (PLMs). DeepCKID encodes the word-level and class-level features using deep convolutional networks, which can learn meaningful patterns from them. At first, DeepCKID combines the semantically meaningful Sentence-BERT document embeddings and word-level feature matrix to give the final document representation, which it further fuses to the different classwise encoded representations to strengthen feature propagation. DeepCKID then passes the encoded document representation and its different classwise representations through an MHA layer to identify the important features at different positions of the feature subspaces, resulting in a latent dense vector accentuating its association with a particular class. Finally, DeepCKID passes the latent vector to the softmax layer to learn the corresponding class label. We evaluate DeepCKID over six publicly available Amazon reviews datasets using four Transformers-based PLMs. We compare DeepCKID with three approaches and four ablation-like baselines. Our study suggests that in most cases, DeepCKID outperforms all the comparison approaches, including baselines.

本文介绍了基于多头注意力(MHA)的深度学习模型 DeepCKID,该模型利用数据集中不同类别文档对应的统计和语义知识,提高模型在不平衡文本分类中检测少数类别实例的能力。在此过程中,DeepCKID 会针对每篇文档提取:(i) 词语级统计和语义知识,即根据每个词与数据集中不同类别的关联度,提取与之对应的类别相关性和类别相似性;(ii) 类别级知识,即利用基于变换器的预训练语言模型(PLMs),使用余弦相似性识别文档中与存在的类别关键字相对应的 n-grams 和关系三元组。DeepCKID 利用深度卷积网络对词级和类级特征进行编码,并从中学习有意义的模式。首先,DeepCKID 将具有语义意义的 Sentence-BERT 文档嵌入和词级特征矩阵结合起来,给出最终的文档表示,并进一步将其融合到不同的类级编码表示中,以加强特征传播。然后,DeepCKID 将编码后的文档表示及其不同的分类表示通过一个 MHA 层,以识别特征子空间不同位置的重要特征,从而产生一个强调与特定类别关联的潜在密集向量。最后,DeepCKID 将潜向量传递给 softmax 层,以学习相应的类标签。我们使用四个基于 Transformers 的 PLM,对六个公开的亚马逊评论数据集进行了 DeepCKID 评估。我们将 DeepCKID 与三种方法和四种类似消融的基线进行了比较。我们的研究表明,在大多数情况下,DeepCKID 优于包括基线在内的所有比较方法。
{"title":"DeepCKID: A Multi-Head Attention-Based Deep Neural Network Model Leveraging Classwise Knowledge to Handle Imbalanced Textual Data","authors":"Amit Kumar Sah ,&nbsp;Muhammad Abulaish","doi":"10.1016/j.mlwa.2024.100575","DOIUrl":"10.1016/j.mlwa.2024.100575","url":null,"abstract":"<div><p>This paper presents DeepCKID, a Multi-Head Attention (MHA)-based deep learning model that exploits statistical and semantic knowledge corresponding to documents across different classes in the datasets to improve the model’s ability to detect minority class instances in imbalanced text classification. In this process, corresponding to each document, DeepCKID extracts — (i) word-level statistical and semantic knowledge, namely, class correlation and class similarity corresponding to each word, based on its association with different classes in the dataset, and (ii) class-level knowledge from the document using <span><math><mi>n</mi></math></span>-grams and relation triplets corresponding to classwise keywords present, identified using cosine similarity utilizing Transformers-based Pre-trained Language Models (PLMs). DeepCKID encodes the word-level and class-level features using deep convolutional networks, which can learn meaningful patterns from them. At first, DeepCKID combines the semantically meaningful Sentence-BERT document embeddings and word-level feature matrix to give the final document representation, which it further fuses to the different classwise encoded representations to strengthen feature propagation. DeepCKID then passes the encoded document representation and its different classwise representations through an MHA layer to identify the important features at different positions of the feature subspaces, resulting in a latent dense vector accentuating its association with a particular class. Finally, DeepCKID passes the latent vector to the softmax layer to learn the corresponding class label. We evaluate DeepCKID over six publicly available Amazon reviews datasets using four Transformers-based PLMs. We compare DeepCKID with three approaches and four ablation-like baselines. Our study suggests that in most cases, DeepCKID outperforms all the comparison approaches, including baselines.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"17 ","pages":"Article 100575"},"PeriodicalIF":0.0,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827024000513/pdfft?md5=8efb9f85f258bdd00899e0b78ef5e189&pid=1-s2.0-S2666827024000513-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141716561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial instability of crash prediction models: A case of scooter crashes 碰撞预测模型的空间不稳定性:滑板车碰撞事故案例
Pub Date : 2024-07-14 DOI: 10.1016/j.mlwa.2024.100574
Tumlumbe Juliana Chengula , Boniphace Kutela , Norris Novat , Hellen Shita , Abdallah Kinero , Reuben Tamakloe , Sarah Kasomi

Scooters have gained widespread popularity in recent years due to their accessibility and affordability, but safety concerns persist due to the vulnerability of riders. Researchers are actively investigating the safety implications associated with scooters, given their relatively new status as transportation options. However, analyzing scooter safety presents a unique challenge due to the complexity of determining safe riding environments. This study presents a comprehensive analysis of scooter crash risk within various buffer zones, utilizing the Extreme Gradient Boosting (XGBoost) machine learning algorithm. The core objective was to unravel the multifaceted factors influencing scooter crashes and assess the predictive model’s performance across different buffers or spatial proximity to crash sites. After evaluating the model’s accuracy, sensitivity, and specificity across buffer distances ranging from 5 ft to 250 ft with the scooter crash as a reference point, a discernible trend emerged: as the buffer distance decreases, the model’s sensitivity increases, although at the expense of accuracy and specificity, which exhibit a gradual decline. Notably, at the widest buffer of 250 ft, the model achieved a high accuracy of 97% and specificity of 99%, but with a lower sensitivity of 31%. Contrastingly, at the closest buffer of 5 ft, sensitivity peaked at 95%, albeit with slightly reduced accuracy and specificity. Feature importance analysis highlighted the most significant predictor across all buffer distances, emphasizing the impact of vehicle interactions on scooter crash likelihood. Explainable Artificial Intelligence through SHAP value analysis provided deeper insights into each feature’s contribution to the predictive model, revealing passenger vehicle types of significantly escalated crash risks. Intriguingly, specific vehicular maneuvers, notably stopping in traffic lanes, alongside the absence of Traffic Control Devices (TCDs), were identified as the major contributors to increased crash occurrences. Road conditions, particularly wet and dry, also emerged as substantial risk factors. Furthermore, the study highlights the significance of road design, where elements like junction types and horizontal alignments – specifically 4 and 5-legged intersections and curves – are closely associated with heightened crash risks. These findings articulate a complex and spatially detailed framework of factors impacting scooter crashes, offering vital insights for urban planning and policymaking.

近年来,滑板车因其方便和经济实惠而受到广泛欢迎,但由于骑行者易受伤害,安全问题依然存在。鉴于滑板车作为交通工具的地位相对较新,研究人员正在积极调查与滑板车相关的安全问题。然而,由于确定安全骑行环境的复杂性,分析滑板车的安全性是一项独特的挑战。本研究利用极端梯度提升(XGBoost)机器学习算法,对各种缓冲区内的滑板车碰撞风险进行了全面分析。研究的核心目标是揭示影响滑板车碰撞事故的多方面因素,并评估预测模型在不同缓冲区或碰撞地点附近空间的性能。以滑板车撞车事故为参照点,在 5 英尺到 250 英尺的缓冲距离范围内评估模型的准确性、灵敏度和特异性后,发现了一个明显的趋势:随着缓冲距离的减小,模型的灵敏度增加,但准确性和特异性却逐渐下降。值得注意的是,在最宽的 250 英尺缓冲区内,模型的准确性和特异性分别高达 97% 和 99%,但灵敏度却较低,仅为 31%。相反,在最近的 5 英尺缓冲区内,灵敏度达到了 95% 的峰值,但准确率和特异性略有下降。特征重要性分析突出显示了所有缓冲距离上最重要的预测因素,强调了车辆相互作用对滑板车碰撞可能性的影响。通过 SHAP 值分析的可解释人工智能对每个特征对预测模型的贡献提供了更深入的见解,揭示了碰撞风险显著增加的乘用车类型。耐人寻味的是,特定的车辆操作,特别是在车道上停车,以及交通控制装置(TCD)的缺失,被认为是导致碰撞事故增加的主要因素。路况,尤其是干湿路况,也是重要的风险因素。此外,该研究还强调了道路设计的重要性,其中路口类型和横向排列(特别是四脚和五脚交叉路口和弯道)等要素与碰撞风险的增加密切相关。这些发现阐明了影响滑板车碰撞事故的复杂而详细的空间因素框架,为城市规划和政策制定提供了重要启示。
{"title":"Spatial instability of crash prediction models: A case of scooter crashes","authors":"Tumlumbe Juliana Chengula ,&nbsp;Boniphace Kutela ,&nbsp;Norris Novat ,&nbsp;Hellen Shita ,&nbsp;Abdallah Kinero ,&nbsp;Reuben Tamakloe ,&nbsp;Sarah Kasomi","doi":"10.1016/j.mlwa.2024.100574","DOIUrl":"10.1016/j.mlwa.2024.100574","url":null,"abstract":"<div><p>Scooters have gained widespread popularity in recent years due to their accessibility and affordability, but safety concerns persist due to the vulnerability of riders. Researchers are actively investigating the safety implications associated with scooters, given their relatively new status as transportation options. However, analyzing scooter safety presents a unique challenge due to the complexity of determining safe riding environments. This study presents a comprehensive analysis of scooter crash risk within various buffer zones, utilizing the Extreme Gradient Boosting (XGBoost) machine learning algorithm. The core objective was to unravel the multifaceted factors influencing scooter crashes and assess the predictive model’s performance across different buffers or spatial proximity to crash sites. After evaluating the model’s accuracy, sensitivity, and specificity across buffer distances ranging from 5 ft to 250 ft with the scooter crash as a reference point, a discernible trend emerged: as the buffer distance decreases, the model’s sensitivity increases, although at the expense of accuracy and specificity, which exhibit a gradual decline. Notably, at the widest buffer of 250 ft, the model achieved a high accuracy of 97% and specificity of 99%, but with a lower sensitivity of 31%. Contrastingly, at the closest buffer of 5 ft, sensitivity peaked at 95%, albeit with slightly reduced accuracy and specificity. Feature importance analysis highlighted the most significant predictor across all buffer distances, emphasizing the impact of vehicle interactions on scooter crash likelihood. Explainable Artificial Intelligence through SHAP value analysis provided deeper insights into each feature’s contribution to the predictive model, revealing passenger vehicle types of significantly escalated crash risks. Intriguingly, specific vehicular maneuvers, notably stopping in traffic lanes, alongside the absence of Traffic Control Devices (TCDs), were identified as the major contributors to increased crash occurrences. Road conditions, particularly wet and dry, also emerged as substantial risk factors. Furthermore, the study highlights the significance of road design, where elements like junction types and horizontal alignments – specifically 4 and 5-legged intersections and curves – are closely associated with heightened crash risks. These findings articulate a complex and spatially detailed framework of factors impacting scooter crashes, offering vital insights for urban planning and policymaking.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"17 ","pages":"Article 100574"},"PeriodicalIF":0.0,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827024000501/pdfft?md5=c3afb02a60606c22b0434ac053d3571a&pid=1-s2.0-S2666827024000501-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141630842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Machine learning with applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1