首页 > 最新文献

Machine learning with applications最新文献

英文 中文
ChatReview: A ChatGPT-enabled natural language processing framework to study domain-specific user reviews ChatReview:支持 ChatGPT 的自然语言处理框架,用于研究特定领域的用户评论
Pub Date : 2023-12-28 DOI: 10.1016/j.mlwa.2023.100522
Brittany Ho, Ta’Rhonda Mayberry, Khanh Linh Nguyen, Manohar Dhulipala, Vivek Krishnamani Pallipuram

Intelligent search engines including pre-trained generative transformers (GPT) have revolutionized the user search experience. Several fields including e-commerce, education, and hospitality are increasingly exploring GPT tools to study user reviews and gain critical insights to improve their service quality. However, massive user-review data and imprecise prompt engineering lead to biased, irrelevant, and impersonal search results. In addition, exposing user data to these search engines may pose privacy issues. Motivated by these factors, we present ChatReview, a ChatGPT-enabled natural language processing (NLP) framework that effectively studies domain-specific user reviews to offer relevant and personalized search results at multiple levels of granularity. The framework accomplishes this task using four phases including data collection, tokenization, query construction, and response generation. The data collection phase involves gathering domain-specific user reviews from public and private repositories. In the tokenization phase, ChatReview applies sentiment analysis to extract keywords and categorize them into various sentiment classes. This process creates a token repository that best describes the user sentiments for a given user-review data. In the query construction phase, the framework uses the token repository and domain knowledge to construct three types of ChatGPT prompts including explicit, implicit, and creative. In the response generation phase, ChatReview pipelines these prompts into ChatGPT to generate search results at varying levels of granularity. We analyze our framework using three real-world domains including education, local restaurants, and hospitality. We assert that our framework simplifies prompt engineering for general users to produce effective results while minimizing the exposure of sensitive user data to search engines. We also present a one-of-a-kind Large Language Model (LLM) peer assessment of the ChatReview framework. Specifically, we employ Google’s Bard to objectively and qualitatively analyze the various ChatReview outputs. Our Bard-based analyses yield over 90% satisfaction, establishing ChatReview as a viable survey analysis tool.

包括预训练生成变换器(GPT)在内的智能搜索引擎彻底改变了用户的搜索体验。包括电子商务、教育和酒店业在内的多个领域正越来越多地探索使用 GPT 工具来研究用户评论并获得重要见解,从而提高服务质量。然而,海量的用户评论数据和不精确的提示工程会导致搜索结果出现偏差、不相关和不人性化。此外,向这些搜索引擎公开用户数据可能会带来隐私问题。在这些因素的推动下,我们提出了一个支持 ChatGPT 的自然语言处理(NLP)框架--ChatReview,它能有效地研究特定领域的用户评论,从而在多个粒度级别上提供相关的个性化搜索结果。该框架通过数据收集、标记化、查询构建和响应生成等四个阶段来完成这项任务。数据收集阶段包括从公共和私人资源库中收集特定领域的用户评论。在标记化阶段,ChatReview 应用情感分析来提取关键词,并将其归类为各种情感类别。这一过程创建了一个标记库,能最好地描述给定用户评论数据的用户情感。在查询构建阶段,该框架使用标记库和领域知识构建三种类型的 ChatGPT 提示,包括显式、隐式和创意提示。在生成回复阶段,ChatReview 将这些提示导入 ChatGPT,生成不同粒度的搜索结果。我们使用三个真实世界领域分析了我们的框架,包括教育、本地餐馆和酒店。我们认为,我们的框架简化了一般用户的提示工程,从而产生了有效的结果,同时最大限度地减少了向搜索引擎暴露敏感用户数据的情况。我们还对 ChatReview 框架进行了独一无二的大语言模型(LLM)同行评估。具体来说,我们利用谷歌的 Bard 对 ChatReview 的各种输出结果进行了客观的定性分析。我们基于 Bard 的分析获得了 90% 以上的满意度,从而将 ChatReview 确立为一种可行的调查分析工具。
{"title":"ChatReview: A ChatGPT-enabled natural language processing framework to study domain-specific user reviews","authors":"Brittany Ho,&nbsp;Ta’Rhonda Mayberry,&nbsp;Khanh Linh Nguyen,&nbsp;Manohar Dhulipala,&nbsp;Vivek Krishnamani Pallipuram","doi":"10.1016/j.mlwa.2023.100522","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100522","url":null,"abstract":"<div><p>Intelligent search engines including pre-trained generative transformers (GPT) have revolutionized the user search experience. Several fields including e-commerce, education, and hospitality are increasingly exploring GPT tools to study user reviews and gain critical insights to improve their service quality. However, massive user-review data and imprecise prompt engineering lead to biased, irrelevant, and impersonal search results. In addition, exposing user data to these search engines may pose privacy issues. Motivated by these factors, we present ChatReview, a ChatGPT-enabled natural language processing (NLP) framework that effectively studies domain-specific user reviews to offer relevant and personalized search results at multiple levels of granularity. The framework accomplishes this task using four phases including data collection, tokenization, query construction, and response generation. The data collection phase involves gathering domain-specific user reviews from public and private repositories. In the tokenization phase, ChatReview applies sentiment analysis to extract keywords and categorize them into various sentiment classes. This process creates a token repository that best describes the user sentiments for a given user-review data. In the query construction phase, the framework uses the token repository and domain knowledge to construct three types of ChatGPT prompts including explicit, implicit, and creative. In the response generation phase, ChatReview pipelines these prompts into ChatGPT to generate search results at varying levels of granularity. We analyze our framework using three real-world domains including education, local restaurants, and hospitality. We assert that our framework simplifies prompt engineering for general users to produce effective results while minimizing the exposure of sensitive user data to search engines. We also present a one-of-a-kind Large Language Model (LLM) peer assessment of the ChatReview framework. Specifically, we employ Google’s Bard to objectively and qualitatively analyze the various ChatReview outputs. Our Bard-based analyses yield over 90% satisfaction, establishing ChatReview as a viable survey analysis tool.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100522"},"PeriodicalIF":0.0,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000750/pdfft?md5=82dd36b16ed5d43b7a9134111f9ce072&pid=1-s2.0-S2666827023000750-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139109222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning nonlinear integral operators via recurrent neural networks and its application in solving integro-differential equations 通过递归神经网络学习非线性积分算子及其在求解积分微分方程中的应用
Pub Date : 2023-12-27 DOI: 10.1016/j.mlwa.2023.100524
Hardeep Bassi , Yuanran Zhu , Senwei Liang , Jia Yin , Cian C. Reeves , Vojtěch Vlček , Chao Yang

In this paper, we propose using LSTM-RNNs (Long Short-Term Memory-Recurrent Neural Networks) to learn and represent nonlinear integral operators that appear in nonlinear integro-differential equations (IDEs). The LSTM-RNN representation of the nonlinear integral operator allows us to turn a system of nonlinear integro-differential equations into a system of ordinary differential equations for which many efficient solvers are available. Furthermore, because the use of LSTM-RNN representation of the nonlinear integral operator in an IDE eliminates the need to perform a numerical integration in each numerical time evolution step, the overall temporal cost of the LSTM-RNN-based IDE solver can be reduced to O(nT) from O(nT2) if a nT-step trajectory is to be computed. We illustrate the efficiency and robustness of this LSTM-RNN-based numerical IDE solver with a model problem. Additionally, we highlight the generalizability of the learned integral operator by applying it to IDEs driven by different external forces. As a practical application, we show how this methodology can effectively solve the Dyson’s equation for quantum many-body systems.

在本文中,我们建议使用 LSTM-RNN(长短期记忆-递归神经网络)来学习和表示出现在非线性积分微分方程(IDE)中的非线性积分算子。通过 LSTM-RNN 表示非线性积分算子,我们可以将非线性积分微分方程系转化为常微分方程系,而常微分方程系有许多高效的求解器。此外,由于在 IDE 中使用 LSTM-RNN 表示非线性积分算子,无需在每个数值时间演化步中执行数值积分,因此如果要计算 nT 步轨迹,基于 LSTM-RNN 的 IDE 求解器的总体时间成本可从 O(nT2) 降至 O(nT)。我们通过一个模型问题说明了这种基于 LSTM-RNN 的数值 IDE 求解器的效率和鲁棒性。此外,我们还将学习到的积分算子应用于由不同外力驱动的 IDE,从而突出了它的通用性。在实际应用中,我们展示了这种方法如何有效地求解量子多体系统的戴森方程。
{"title":"Learning nonlinear integral operators via recurrent neural networks and its application in solving integro-differential equations","authors":"Hardeep Bassi ,&nbsp;Yuanran Zhu ,&nbsp;Senwei Liang ,&nbsp;Jia Yin ,&nbsp;Cian C. Reeves ,&nbsp;Vojtěch Vlček ,&nbsp;Chao Yang","doi":"10.1016/j.mlwa.2023.100524","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100524","url":null,"abstract":"<div><p>In this paper, we propose using LSTM-RNNs (Long Short-Term Memory-Recurrent Neural Networks) to learn and represent nonlinear integral operators that appear in nonlinear integro-differential equations (IDEs). The LSTM-RNN representation of the nonlinear integral operator allows us to turn a system of nonlinear integro-differential equations into a system of ordinary differential equations for which many efficient solvers are available. Furthermore, because the use of LSTM-RNN representation of the nonlinear integral operator in an IDE eliminates the need to perform a numerical integration in each numerical time evolution step, the overall temporal cost of the LSTM-RNN-based IDE solver can be reduced to <span><math><mrow><mi>O</mi><mrow><mo>(</mo><msub><mrow><mi>n</mi></mrow><mrow><mi>T</mi></mrow></msub><mo>)</mo></mrow></mrow></math></span> from <span><math><mrow><mi>O</mi><mrow><mo>(</mo><msubsup><mrow><mi>n</mi></mrow><mrow><mi>T</mi></mrow><mrow><mn>2</mn></mrow></msubsup><mo>)</mo></mrow></mrow></math></span> if a <span><math><msub><mrow><mi>n</mi></mrow><mrow><mi>T</mi></mrow></msub></math></span>-step trajectory is to be computed. We illustrate the efficiency and robustness of this LSTM-RNN-based numerical IDE solver with a model problem. Additionally, we highlight the generalizability of the learned integral operator by applying it to IDEs driven by different external forces. As a practical application, we show how this methodology can effectively solve the Dyson’s equation for quantum many-body systems.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100524"},"PeriodicalIF":0.0,"publicationDate":"2023-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000774/pdfft?md5=e7dece6581dbf391df9d505308d5c9d8&pid=1-s2.0-S2666827023000774-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139109224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatiotemporal integration of GCN and E-LSTM networks for PM2.5 forecasting GCN 和 E-LSTM 网络的时空整合用于 PM2.5 预报
Pub Date : 2023-12-15 DOI: 10.1016/j.mlwa.2023.100521
Ali Kamali Mohammadzadeh , Halima Salah , Roohollah Jahanmahin , Abd E Ali Hussain , Sara Masoud , Yaoxian Huang

PM2.5, inhalable particles, with a size of 2.5 micrometers or less, negatively impact the environment as well as our health. Monitoring PM2.5 is critical to guard against extreme events by alerting people and initiating actions to alleviate PM2.5′s impacts. Developing PM2.5 forecasting frameworks empowers the authorities to predict extremely polluted events in advance and gives them time to implement necessary strategies in advance (e.g., Action! Days). Understanding the spatiotemporal behavior of PM2.5 and meteorological factors is of significance for having accurate predictions. This study utilizes EPA sensor data to quantify the PM2.5 air quality index (AQI) and meteorological factors such as temperature over 2015–2019 across Michigan, USA. Here, a spatiotemporal deep neural structure is proposed through integrating graph convolutional neural (GCN) and exogenous long short-term memory (E-LSTM) networks to incorporate spatial and temporal patterns within PM2.5 AQI and meteorological factors for predicting PM2.5 AQI. Results illustrate that not only does our proposed framework outperform the traditional approaches such as LSTM and E-LSTM, but also it is robust against the network structure of EPA stations. The study's findings demonstrate that the integration of GCN with E-LSTM significantly enhances the accuracy of PM2.5 AQI predictions compared to traditional models. This advancement indicates a promising direction for environmental monitoring, offering improved forecasting tools that can aid in timely and effective decision-making for air quality management and public health protection.

PM2.5是可吸入颗粒物,大小为2.5微米或更小,对环境和我们的健康都有负面影响。监测 PM2.5 对防范极端事件至关重要,可以提醒人们并采取行动减轻 PM2.5 的影响。开发PM2.5预报框架可使当局提前预测极端污染事件,并有时间提前实施必要的策略(如 "行动!日")。了解 PM2.5 的时空行为和气象因素对于准确预测具有重要意义。本研究利用美国环保署的传感器数据,量化了 2015-2019 年美国密歇根州的 PM2.5 空气质量指数(AQI)和气象因素(如温度)。在此,通过整合图卷积神经(GCN)和外源长短期记忆(ESTM)网络,提出了一种时空深度神经结构,将 PM2.5 空气质量指数和气象因素中的时空模式纳入预测 PM2.5 空气质量指数。结果表明,我们提出的框架不仅优于 LSTM 和 E-LSTM 等传统方法,而且对 EPA 站点的网络结构具有鲁棒性。研究结果表明,与传统模型相比,GCN 与 E-LSTM 的集成大大提高了 PM2.5 空气质量指数预测的准确性。这一进展为环境监测指明了一个大有可为的方向,提供了更好的预测工具,有助于为空气质量管理和公共健康保护做出及时有效的决策。
{"title":"Spatiotemporal integration of GCN and E-LSTM networks for PM2.5 forecasting","authors":"Ali Kamali Mohammadzadeh ,&nbsp;Halima Salah ,&nbsp;Roohollah Jahanmahin ,&nbsp;Abd E Ali Hussain ,&nbsp;Sara Masoud ,&nbsp;Yaoxian Huang","doi":"10.1016/j.mlwa.2023.100521","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100521","url":null,"abstract":"<div><p>PM<sub>2.5</sub>, inhalable particles, with a size of 2.5 micrometers or less, negatively impact the environment as well as our health. Monitoring PM<sub>2.5</sub> is critical to guard against extreme events by alerting people and initiating actions to alleviate PM<sub>2.5′</sub>s impacts. Developing PM<sub>2.5</sub> forecasting frameworks empowers the authorities to predict extremely polluted events in advance and gives them time to implement necessary strategies in advance (e.g., Action! Days). Understanding the spatiotemporal behavior of PM<sub>2.5</sub> and meteorological factors is of significance for having accurate predictions. This study utilizes EPA sensor data to quantify the PM<sub>2.5</sub> air quality index (AQI) and meteorological factors such as temperature over 2015–2019 across Michigan, USA. Here, a spatiotemporal deep neural structure is proposed through integrating graph convolutional neural (GCN) and exogenous long short-term memory (E-LSTM) networks to incorporate spatial and temporal patterns within PM<sub>2.5</sub> AQI and meteorological factors for predicting PM<sub>2.5</sub> AQI. Results illustrate that not only does our proposed framework outperform the traditional approaches such as LSTM and E-LSTM, but also it is robust against the network structure of EPA stations. The study's findings demonstrate that the integration of GCN with E-LSTM significantly enhances the accuracy of PM<sub>2.5</sub> AQI predictions compared to traditional models. This advancement indicates a promising direction for environmental monitoring, offering improved forecasting tools that can aid in timely and effective decision-making for air quality management and public health protection.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100521"},"PeriodicalIF":0.0,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000749/pdfft?md5=c4a01ec68a9b50044fed32950ca15dc5&pid=1-s2.0-S2666827023000749-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138839832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving top-N recommendations using batch approximation for weighted pair-wise loss 使用批量近似加权配对损失改进 Top-N 推荐
Pub Date : 2023-12-13 DOI: 10.1016/j.mlwa.2023.100520
Sofia Aftab, Heri Ramampiaro

In collaborative filtering, matrix factorization and collaborative metric learning are challenged by situations where non-preferred items may appear so close to a user in the feature embedding space that they lead to degrading the recommendation performance. We call such items ‘potential impostor’ risks. Addressing the issues with ‘potential impostor’ is important because it can result in inefficient learning and poor feature extraction. To achieve this, we propose a novel loss function formulation designed to enhance learning efficiency by actively identifying and addressing impostors, leveraging item associations and learning the distribution of negative items. This approach is crucial for models to differentiate between positive and negative items effectively, even when they are closely aligned in the feature space. Here, a loss function is generally an objective optimization function that is defined based on user–item interaction data, through either implicit or explicit feedback. The loss function essentially decides how well a recommendation algorithm performs. In this paper, we introduce and define the concept of ‘potential impostor’, highlighting its impact on learned representation quality and algorithmic efficiency. We tackle the limitations of non-metric methods, like the Weighted Approximate Rank Pairwise Loss (WARP) method, which struggles to capture item–item similarities, by using a ‘similarity propagation’ strategy with a new loss term. Similarly, we address fixed margin inefficiencies in Weighted Collaborative Metric Learning (WCML), through density distribution approximation. This moves potential impostors away from the margin for more robust learning. Additionally, we propose a large-scale batch approximation algorithm for increased detection of impostors, coupled with an active learning strategy for improved top-N recommendation performance. Our extensive empirical analysis across five major and diverse datasets demonstrates the effectiveness and feasibility of our methods, compared to existing techniques with respect to improving AUC, reducing impostor rate, and increasing the average distance metrics. More specifically, our evaluation shows that our two proposed methods outperform the existing state-of-the-art techniques, with an improvement of AUC by 3.5% and 3.7%, NDCG by 1.0% and 9.1% and HR by 1.3% and 3.6%, respectively. Similarly, the impostor rate is decreased by 35% and 18%, and their average distance is increased by 33% and 37%, respectively.

在协同过滤中,当非首选项在特征嵌入空间中与用户非常接近而导致推荐性能下降时,矩阵分解和协同度量学习受到挑战。我们把这类产品称为“潜在的冒充者”风险。解决“潜在的冒充者”的问题很重要,因为它可能导致低效的学习和糟糕的特征提取。为了实现这一目标,我们提出了一种新的损失函数公式,旨在通过主动识别和处理冒名顶替者、利用项目关联和学习负面项目的分布来提高学习效率。这种方法对于模型有效区分正负项至关重要,即使它们在特征空间中紧密对齐。在这里,损失函数通常是基于用户-物品交互数据,通过隐式或显式反馈定义的目标优化函数。损失函数本质上决定了推荐算法的性能。在本文中,我们引入并定义了“潜在的冒充者”的概念,强调了它对学习表征质量和算法效率的影响。我们通过使用带有新损失项的“相似性传播”策略,解决了非度量方法的局限性,如加权近似秩成对损失(WARP)方法,该方法难以捕获项与项之间的相似性。同样,我们通过密度分布近似解决加权协同度量学习(WCML)中的固定边际低效问题。这让潜在的冒牌货远离了更强大的学习空间。此外,我们提出了一种大规模批处理近似算法来增加对冒名顶替者的检测,并结合主动学习策略来提高top-N推荐性能。我们对五个主要和不同的数据集进行了广泛的实证分析,与现有技术相比,我们的方法在提高AUC、降低冒名顶替率和增加平均距离指标方面具有有效性和可行性。更具体地说,我们的评估表明,我们提出的两种方法优于现有的最先进技术,AUC分别提高了3.5%和3.7%,NDCG分别提高了1.0%和9.1%,HR分别提高了1.3%和3.6%。同样,骗子率降低了35%和18%,他们的平均距离分别增加了33%和37%。
{"title":"Improving top-N recommendations using batch approximation for weighted pair-wise loss","authors":"Sofia Aftab,&nbsp;Heri Ramampiaro","doi":"10.1016/j.mlwa.2023.100520","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100520","url":null,"abstract":"<div><p>In collaborative filtering, matrix factorization and collaborative metric learning are challenged by situations where non-preferred items may appear so close to a user in the feature embedding space that they lead to degrading the recommendation performance. We call such items ‘potential impostor’ risks. Addressing the issues with ‘potential impostor’ is important because it can result in inefficient learning and poor feature extraction. To achieve this, we propose a novel loss function formulation designed to enhance learning efficiency by actively identifying and addressing impostors, leveraging item associations and learning the distribution of negative items. This approach is crucial for models to differentiate between positive and negative items effectively, even when they are closely aligned in the feature space. Here, a loss function is generally an objective optimization function that is defined based on user–item interaction data, through either implicit or explicit feedback. The loss function essentially decides how well a recommendation algorithm performs. In this paper, we introduce and define the concept of ‘potential impostor’, highlighting its impact on learned representation quality and algorithmic efficiency. We tackle the limitations of non-metric methods, like the Weighted Approximate Rank Pairwise Loss (WARP) method, which struggles to capture item–item similarities, by using a ‘similarity propagation’ strategy with a new loss term. Similarly, we address fixed margin inefficiencies in Weighted Collaborative Metric Learning (WCML), through density distribution approximation. This moves potential impostors away from the margin for more robust learning. Additionally, we propose a large-scale batch approximation algorithm for increased detection of impostors, coupled with an active learning strategy for improved top-<span><math><mi>N</mi></math></span> recommendation performance. Our extensive empirical analysis across five major and diverse datasets demonstrates the effectiveness and feasibility of our methods, compared to existing techniques with respect to improving AUC, reducing impostor rate, and increasing the average distance metrics. More specifically, our evaluation shows that our two proposed methods outperform the existing state-of-the-art techniques, with an improvement of AUC by 3.5% and 3.7%, NDCG by 1.0% and 9.1% and HR by 1.3% and 3.6%, respectively. Similarly, the impostor rate is decreased by 35% and 18%, and their average distance is increased by 33% and 37%, respectively.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100520"},"PeriodicalIF":0.0,"publicationDate":"2023-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000737/pdfft?md5=9ed329936dd4420c5fffd4c4464c6908&pid=1-s2.0-S2666827023000737-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138656881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive survey on machine learning applications for drilling and blasting in surface mining 露天采矿中钻孔和爆破的机器学习应用综合调查
Pub Date : 2023-12-11 DOI: 10.1016/j.mlwa.2023.100517
Venkat Munagala , Srikanth Thudumu , Irini Logothetis , Sushil Bhandari , Rajesh Vasa , Kon Mouzakis

Drilling and blasting operations are pivotal for productivity and safety in hard rock surface mining. These operations are restricted due to complexities such as site-specific uncertainties, safety risks, and environmental and economic constraints. Machine Learning (ML) is a transformative approach to tackle these complexities resulting in significant cost reductions. ML applications can reduce overall blasting costs by up to 23% and decrease the amount of explosives by as much as 89% compared to traditional methods. This survey presents a comprehensive review of how ML can be applied to optimize drill and blast designs while accounting for its operational challenges. Our research highlights the difficulties in collecting quality site-specific data, the complexity of interpreting this data into insightful information, the selection of ML models relating to mining objectives, and the need for established methods to assess blast efficiency quantitatively. We provide a synthesis of ML model development practices in drilling and blasting and demonstrate the value of ML methodologies. Based on our survey, we present actionable recommendations for developing ML methodologies to improve safety, reduce costs, and enhance efficiency in drilling and blasting processes. This includes establishing standardized data schematics, multiobjective model optimization, and comprehensive evaluation metrics. These benefits can guide mine management and engineers to adopt ML techniques and improve on-ground operational practices. This survey aims to serve as a resource for both practitioners and researchers shaping the future research direction in ML applications for drilling and blasting practices.

钻孔和爆破作业对硬岩露天采矿的生产率和安全性至关重要。这些作业因其复杂性而受到限制,例如具体地点的不确定性、安全风险以及环境和经济限制。机器学习(ML)是解决这些复杂问题的变革性方法,可显著降低成本。与传统方法相比,ML 应用可将总体爆破成本最多降低 23%,炸药用量最多减少 89%。本调查全面回顾了如何应用 ML 来优化钻孔和爆破设计,同时考虑到其操作方面的挑战。我们的研究强调了收集高质量特定地点数据的困难、将这些数据解释为有洞察力的信息的复杂性、选择与采矿目标相关的 ML 模型,以及采用既定方法定量评估爆破效率的必要性。我们综述了钻孔爆破中的 ML 模型开发实践,并展示了 ML 方法的价值。在调查的基础上,我们提出了开发 ML 方法的可行建议,以提高钻孔和爆破过程的安全性、降低成本并提高效率。这包括建立标准化数据图表、多目标模型优化和综合评估指标。这些优势可以指导矿山管理层和工程师采用 ML 技术,改进现场操作实践。本调查旨在为从业人员和研究人员提供资源,为钻孔和爆破实践中的 ML 应用确定未来的研究方向。
{"title":"A comprehensive survey on machine learning applications for drilling and blasting in surface mining","authors":"Venkat Munagala ,&nbsp;Srikanth Thudumu ,&nbsp;Irini Logothetis ,&nbsp;Sushil Bhandari ,&nbsp;Rajesh Vasa ,&nbsp;Kon Mouzakis","doi":"10.1016/j.mlwa.2023.100517","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100517","url":null,"abstract":"<div><p>Drilling and blasting operations are pivotal for productivity and safety in hard rock surface mining. These operations are restricted due to complexities such as site-specific uncertainties, safety risks, and environmental and economic constraints. Machine Learning (ML) is a transformative approach to tackle these complexities resulting in significant cost reductions. ML applications can reduce overall blasting costs by up to 23% and decrease the amount of explosives by as much as 89% compared to traditional methods. This survey presents a comprehensive review of how ML can be applied to optimize drill and blast designs while accounting for its operational challenges. Our research highlights the difficulties in collecting quality site-specific data, the complexity of interpreting this data into insightful information, the selection of ML models relating to mining objectives, and the need for established methods to assess blast efficiency quantitatively. We provide a synthesis of ML model development practices in drilling and blasting and demonstrate the value of ML methodologies. Based on our survey, we present actionable recommendations for developing ML methodologies to improve safety, reduce costs, and enhance efficiency in drilling and blasting processes. This includes establishing standardized data schematics, multiobjective model optimization, and comprehensive evaluation metrics. These benefits can guide mine management and engineers to adopt ML techniques and improve on-ground operational practices. This survey aims to serve as a resource for both practitioners and researchers shaping the future research direction in ML applications for drilling and blasting practices.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100517"},"PeriodicalIF":0.0,"publicationDate":"2023-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000701/pdfft?md5=5f0687a7294aa89ab47bf61d3c8a02e7&pid=1-s2.0-S2666827023000701-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138577447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An inpatient fall risk assessment tool: Application of machine learning models on intrinsic and extrinsic risk factors 住院病人跌倒风险评估工具:将机器学习模型应用于内在和外在风险因素
Pub Date : 2023-12-08 DOI: 10.1016/j.mlwa.2023.100519
Sonia Jahangiri, Masoud Abdollahi, Rasika Patil, Ehsan Rashedi, Nasibeh Azadeh-Fard

Background

This study aimed to identify the most impactful set of intrinsic and extrinsic fall risk factors and develop a data-driven inpatient fall risk assessment tool (FRAT).

Methods

The dataset used for the study comprised in-hospital fall records from 2012 to 2017. Four machine learning (ML) algorithms, Support Vector Machine (SVM), Random Forest (RF), Gradient Boosting (Gboost), and Deep Neural Network (DNN) were utilized to predict the inpatient fall risk level. To enhance the performance of the prediction models, two approaches were implemented, including (1) feature selection to identify the optimal feature set and (2) the development of three distinct shift-wise models. Furthermore, the optimal feature sets in the shift-wise models were extracted.

Results

According to the results, DNN outperformed other methods by reaching an accuracy, sensitivity, specificity, and AUC of 0.71, 0.8, 0.6, and 0.7, respectively, considering the full set of features. The performance of the models was further improved (by 3-5 %) by conducting a feature selection process for all models. Specifically, the DNN model achieved an accuracy of 0.74 while considering the optimal set of predictors. Moreover, the shift-wise RF models demonstrated higher accuracies (by 4-10 %) compared to the same model using a full feature set.

Conclusions

This study's outcome confirms ML models' compelling capability in developing an inpatient FRAT while considering intrinsic and extrinsic factors. The insight from such models could form a foundation to (1) monitor the inpatients’ fall risk, (2) identify the major factors involved in inpatient falls, and (3) create subject-specific self-care plans.

背景本研究旨在确定一组最有影响的内在和外在跌倒风险因素,并开发一种数据驱动的住院病人跌倒风险评估工具(FRAT)。研究采用了四种机器学习(ML)算法:支持向量机(SVM)、随机森林(RF)、梯度提升(Gboost)和深度神经网络(DNN)来预测住院病人跌倒风险水平。为提高预测模型的性能,采用了两种方法,包括(1)特征选择以确定最佳特征集;(2)开发三种不同的移位模型。结果根据结果,考虑到全套特征,DNN 的准确度、灵敏度、特异度和 AUC 分别达到 0.71、0.8、0.6 和 0.7,优于其他方法。通过对所有模型进行特征选择,模型的性能得到了进一步提高(提高了 3-5%)。具体来说,DNN 模型在考虑最优预测因子集时,准确率达到了 0.74。此外,与使用完整特征集的同一模型相比,移位 RF 模型的准确率更高(4-10%)。此类模型的洞察力可为以下工作奠定基础:(1) 监控住院病人的跌倒风险;(2) 识别住院病人跌倒的主要因素;(3) 制定针对特定对象的自我护理计划。
{"title":"An inpatient fall risk assessment tool: Application of machine learning models on intrinsic and extrinsic risk factors","authors":"Sonia Jahangiri,&nbsp;Masoud Abdollahi,&nbsp;Rasika Patil,&nbsp;Ehsan Rashedi,&nbsp;Nasibeh Azadeh-Fard","doi":"10.1016/j.mlwa.2023.100519","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100519","url":null,"abstract":"<div><h3>Background</h3><p>This study aimed to identify the most impactful set of intrinsic and extrinsic fall risk factors and develop a data-driven inpatient fall risk assessment tool (FRAT).</p></div><div><h3>Methods</h3><p>The dataset used for the study comprised in-hospital fall records from 2012 to 2017. Four machine learning (ML) algorithms, Support Vector Machine (SVM), Random Forest (RF), Gradient Boosting (Gboost), and Deep Neural Network (DNN) were utilized to predict the inpatient fall risk level. To enhance the performance of the prediction models, two approaches were implemented, including (1) feature selection to identify the optimal feature set and (2) the development of three distinct shift-wise models. Furthermore, the optimal feature sets in the shift-wise models were extracted.</p></div><div><h3>Results</h3><p>According to the results, DNN outperformed other methods by reaching an accuracy, sensitivity, specificity, and AUC of 0.71, 0.8, 0.6, and 0.7, respectively, considering the full set of features. The performance of the models was further improved (by 3-5 %) by conducting a feature selection process for all models. Specifically, the DNN model achieved an accuracy of 0.74 while considering the optimal set of predictors. Moreover, the shift-wise RF models demonstrated higher accuracies (by 4-10 %) compared to the same model using a full feature set.</p></div><div><h3>Conclusions</h3><p>This study's outcome confirms ML models' compelling capability in developing an inpatient FRAT while considering intrinsic and extrinsic factors. The insight from such models could form a foundation to (1) monitor the inpatients’ fall risk, (2) identify the major factors involved in inpatient falls, and (3) create subject-specific self-care plans.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100519"},"PeriodicalIF":0.0,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000725/pdfft?md5=4a2d470f18c0e62a81481dcea8beab5f&pid=1-s2.0-S2666827023000725-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138582125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comparison of machine learning surrogate models of street-scale flooding in Norfolk, Virginia 弗吉尼亚州诺福克市街头洪水的机器学习替代模型的比较
Pub Date : 2023-11-29 DOI: 10.1016/j.mlwa.2023.100518
Diana McSpadden , Steven Goldenberg , Binata Roy , Malachi Schram , Jonathan L. Goodall , Heather Richter

Low-lying coastal cities, exemplified by Norfolk, Virginia, face the challenge of street flooding caused by rainfall and tides, which strain transportation and sewer systems and can lead to personal and property damage. While high-fidelity, physics-based simulations provide accurate predictions of urban pluvial flooding, their computational complexity renders them unsuitable for real-time applications. Using data from Norfolk rainfall events between 2016 and 2018, this study compares the performance of a previous surrogate model based on a random forest algorithm with two deep learning models: Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU). The comparison of deep learning to the random forest algorithm is motivated by the desire to utilize a machine learning architecture that allows for the future inclusion of common uncertainty quantification techniques and the effective integration of relevant, multi-modal features.

以弗吉尼亚州诺福克为例的低洼沿海城市面临着由降雨和潮汐引起的街道洪水的挑战,这给交通和下水道系统带来了压力,并可能导致人身和财产损失。虽然高保真度、基于物理的模拟提供了城市洪水的准确预测,但其计算复杂性使其不适合实时应用。本研究利用2016年至2018年诺福克降雨事件的数据,将之前基于随机森林算法的替代模型与两种深度学习模型(长短期记忆(LSTM)和门控循环单元(GRU))的性能进行了比较。将深度学习与随机森林算法进行比较的动机是希望利用一种机器学习架构,该架构允许未来包含常见的不确定性量化技术,并有效整合相关的多模态特征。
{"title":"A comparison of machine learning surrogate models of street-scale flooding in Norfolk, Virginia","authors":"Diana McSpadden ,&nbsp;Steven Goldenberg ,&nbsp;Binata Roy ,&nbsp;Malachi Schram ,&nbsp;Jonathan L. Goodall ,&nbsp;Heather Richter","doi":"10.1016/j.mlwa.2023.100518","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100518","url":null,"abstract":"<div><p>Low-lying coastal cities, exemplified by Norfolk, Virginia, face the challenge of street flooding caused by rainfall and tides, which strain transportation and sewer systems and can lead to personal and property damage. While high-fidelity, physics-based simulations provide accurate predictions of urban pluvial flooding, their computational complexity renders them unsuitable for real-time applications. Using data from Norfolk rainfall events between 2016 and 2018, this study compares the performance of a previous surrogate model based on a random forest algorithm with two deep learning models: Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU). The comparison of deep learning to the random forest algorithm is motivated by the desire to utilize a machine learning architecture that allows for the future inclusion of common uncertainty quantification techniques and the effective integration of relevant, multi-modal features.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100518"},"PeriodicalIF":0.0,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000713/pdfft?md5=0b7cf337c2324211c4a189837657851e&pid=1-s2.0-S2666827023000713-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138466741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Considerations for adapting higher education technology courses for AI large language models: A critical review of the impact of ChatGPT 为人工智能大型语言模型调整高等教育技术课程的考虑因素:对 ChatGPT 影响的批判性评论
Pub Date : 2023-11-25 DOI: 10.1016/j.mlwa.2023.100513
Omar Tayan , Ali Hassan , Khaled Khankan , Sanaa Askool

Following the very recent launch of the ChatGPT chatbot, numerous comments and speculations were posted concerning the potential aspects of society that are expected to benefit from this AI revolution. In particular, the education sector is considered as one of the primary domains affected by this application, the impact of which remains yet to be fully understood. Furthermore, many Higher Education institutions are required to get to terms with its impact on teaching and learning, and to clarify their stances on the use of ChatGPT software. This study was developed to investigate some critical case studies considered as relevant to the inevitable re-evaluation of educational aspects needed, ranging from academic missions to student and course learning outcomes and its ethical uses. Following a review of some of the pros and cons of ChatGPT in the higher educational sector, this paper shall demonstrate several case studies of early trials in teaching and learning assessments related to various specializations. Next, the ability of some well-known AI detector software and analyzed in terms of their capacity to successfully detect AI-generated content. Analysis shall be made of the foreseen impact on important aspects including challenges and benefits related to its use in course assessments as well as academic integrity and ethical use. The study concludes with a set of recommendations made from our findings and benchmarks obtained from top universities in order to assist faculty members and decision makers at Higher Education institutions concerning their response strategy and use of ChatGPT.

在最近推出ChatGPT聊天机器人之后,关于社会可能从这场人工智能革命中受益的潜在方面,发表了许多评论和猜测。特别是,教育部门被认为是受这一应用影响的主要领域之一,其影响仍有待充分了解。此外,许多高等教育机构被要求接受它对教学和学习的影响,并澄清他们对使用ChatGPT软件的立场。本研究旨在调查一些重要的案例研究,这些案例研究被认为与必要的教育方面的不可避免的重新评估有关,从学术任务到学生和课程学习成果及其道德用途。在回顾了ChatGPT在高等教育领域的一些优点和缺点之后,本文将展示与各种专业相关的教与学评估早期试验的几个案例研究。接下来,分析了一些知名的AI检测软件的能力,并分析了它们成功检测AI生成内容的能力。应分析可预见的对重要方面的影响,包括在课程评估中使用该技术的挑战和利益,以及学术诚信和道德使用。该研究总结了我们从顶尖大学获得的调查结果和基准得出的一系列建议,以帮助高等教育机构的教师和决策者制定应对策略和使用ChatGPT。
{"title":"Considerations for adapting higher education technology courses for AI large language models: A critical review of the impact of ChatGPT","authors":"Omar Tayan ,&nbsp;Ali Hassan ,&nbsp;Khaled Khankan ,&nbsp;Sanaa Askool","doi":"10.1016/j.mlwa.2023.100513","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100513","url":null,"abstract":"<div><p>Following the very recent launch of the ChatGPT chatbot, numerous comments and speculations were posted concerning the potential aspects of society that are expected to benefit from this AI revolution. In particular, the education sector is considered as one of the primary domains affected by this application, the impact of which remains yet to be fully understood. Furthermore, many Higher Education institutions are required to get to terms with its impact on teaching and learning, and to clarify their stances on the use of ChatGPT software. This study was developed to investigate some critical case studies considered as relevant to the inevitable re-evaluation of educational aspects needed, ranging from academic missions to student and course learning outcomes and its ethical uses. Following a review of some of the pros and cons of ChatGPT in the higher educational sector, this paper shall demonstrate several case studies of early trials in teaching and learning assessments related to various specializations. Next, the ability of some well-known AI detector software and analyzed in terms of their capacity to successfully detect AI-generated content. Analysis shall be made of the foreseen impact on important aspects including challenges and benefits related to its use in course assessments as well as academic integrity and ethical use. The study concludes with a set of recommendations made from our findings and benchmarks obtained from top universities in order to assist faculty members and decision makers at Higher Education institutions concerning their response strategy and use of ChatGPT.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100513"},"PeriodicalIF":0.0,"publicationDate":"2023-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266682702300066X/pdfft?md5=e38f604707a5a8e7810d6cdf19695918&pid=1-s2.0-S266682702300066X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138489575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning for an explainable cost prediction of medical insurance 用于医疗保险可解释成本预测的机器学习
Pub Date : 2023-11-24 DOI: 10.1016/j.mlwa.2023.100516
Ugochukwu Orji , Elochukwu Ukwandu

Predictive modeling in healthcare continues to be an active actuarial research topic as more insurance companies aim to maximize the potential of Machine Learning (ML) approaches to increase their productivity and efficiency. In this paper, the authors deployed three regression-based ensemble ML models that combine variations of decision trees through Extreme Gradient Boosting (XGBoost), Gradient-boosting Machine (GBM), and Random Forest (RF) methods in predicting medical insurance costs. Explainable Artificial Intelligence (XAi) methods SHapley Additive exPlanations (SHAP) and Individual Conditional Expectation (ICE) plots were deployed to discover and explain the key determinant factors that influence medical insurance premium prices in the dataset. The dataset used comprised 986 records and is publicly available in the KAGGLE repository. The models were evaluated using four performance evaluation metrics, including R-squared (R2), Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and Mean Absolute Percentage Error (MAPE). The results show that all models produced impressive outcomes; however, the XGBoost model achieved a better overall performance although it also expanded more computational resources, while the RF model recorded a lesser prediction error and consumed far fewer computing resources than the XGBoost model. Furthermore, we compared the outcome of both XAi methods in identifying the key determinant features that influenced the PremiumPrices for each model and whereas both XAi methods produced similar outcomes, we found that the ICE plots showed in more detail the interactions between each variable than the SHAP analysis which seemed to be more high-level. It is the aim of the authors that the contributions of this study will help policymakers, insurers, and potential medical insurance buyers in their decision-making process for selecting the right policies that meet their specific needs.

随着越来越多的保险公司致力于最大限度地发挥机器学习(ML)方法的潜力,以提高其生产力和效率,医疗保健领域的预测建模仍然是一个活跃的精算研究主题。在本文中,作者部署了三种基于回归的集成ML模型,这些模型通过极端梯度增强(XGBoost)、梯度增强机(GBM)和随机森林(RF)方法结合决策树的变化来预测医疗保险费用。采用可解释人工智能(XAi)方法SHapley加性解释(SHAP)和个体条件期望(ICE)图来发现和解释影响数据集中医疗保险保费价格的关键决定因素。使用的数据集包含986条记录,并且在KAGGLE存储库中公开可用。采用四种性能评价指标对模型进行评价,包括r平方(R2)、平均绝对误差(MAE)、均方根误差(RMSE)和平均绝对百分比误差(MAPE)。结果表明,所有模型都产生了令人印象深刻的结果;然而,XGBoost模型获得了更好的整体性能,尽管它也扩展了更多的计算资源,而RF模型记录的预测误差更小,消耗的计算资源远少于XGBoost模型。此外,我们比较了两种XAi方法的结果,以确定影响每种模型的保费价格的关键决定因素,尽管两种XAi方法产生了相似的结果,但我们发现ICE图比SHAP分析更详细地显示了每个变量之间的相互作用,而SHAP分析似乎更高级。作者的目的是,本研究的贡献将有助于决策者,保险公司和潜在的医疗保险购买者在他们的决策过程中选择正确的政策,以满足他们的具体需求。
{"title":"Machine learning for an explainable cost prediction of medical insurance","authors":"Ugochukwu Orji ,&nbsp;Elochukwu Ukwandu","doi":"10.1016/j.mlwa.2023.100516","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100516","url":null,"abstract":"<div><p>Predictive modeling in healthcare continues to be an active actuarial research topic as more insurance companies aim to maximize the potential of Machine Learning (ML) approaches to increase their productivity and efficiency. In this paper, the authors deployed three regression-based ensemble ML models that combine variations of decision trees through Extreme Gradient Boosting (XGBoost), Gradient-boosting Machine (GBM), and Random Forest (RF) methods in predicting medical insurance costs. Explainable Artificial Intelligence (XAi) methods SHapley Additive exPlanations (SHAP) and Individual Conditional Expectation (ICE) plots were deployed to discover and explain the key determinant factors that influence medical insurance premium prices in the dataset. The dataset used comprised 986 records and is publicly available in the KAGGLE repository. The models were evaluated using four performance evaluation metrics, including R-squared (R<sup>2</sup>), Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and Mean Absolute Percentage Error (MAPE). The results show that all models produced impressive outcomes; however, the XGBoost model achieved a better overall performance although it also expanded more computational resources, while the RF model recorded a lesser prediction error and consumed far fewer computing resources than the XGBoost model. Furthermore, we compared the outcome of both XAi methods in identifying the key determinant features that influenced the PremiumPrices for each model and whereas both XAi methods produced similar outcomes, we found that the ICE plots showed in more detail the interactions between each variable than the SHAP analysis which seemed to be more high-level. It is the aim of the authors that the contributions of this study will help policymakers, insurers, and potential medical insurance buyers in their decision-making process for selecting the right policies that meet their specific needs.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100516"},"PeriodicalIF":0.0,"publicationDate":"2023-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000695/pdfft?md5=fcd73094ae2ec3d7f5d01f086997c258&pid=1-s2.0-S2666827023000695-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138480780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting aggression in clinical treatment videos 在临床治疗视频中检测攻击性
Pub Date : 2023-11-22 DOI: 10.1016/j.mlwa.2023.100515
Walker S. Arce , Seth G. Walker , Jordan DeBrine , Benjamin S. Riggan , James E. Gehringer

Many clinical spaces are outfitted with centralized video recording systems to monitor patient–client interactions. Considering the increasing interest in video-based machine learning methods, the potential of using these clinical recordings to automate observational data collection is apparent. To explore this, seven patients had videos of their functional assessment and treatment sessions annotated by coders trained by our clinical team. Commonly used clinical software has inherent limitations aligning behavioral and video data, so a custom software tool was employed to address this functionality gap. After developing a Canvas-based coder training course for this tool, a team of six trained coders annotated 82.33 h of data. Two machine learning approaches were considered, where both used a convolutional neural network as a video feature extractor. The first approach used a recurrent network as the classifier on the extracted features and the second used a Transformer architecture. Both models produced promising metrics indicating that the capability of detecting aggression from clinical videos is possible and generalizable. Model performance is directly tied to the feature extractor’s performance on ImageNet, where ConvNeXtXL produced the best performing models. This has applications in automating patient incident response to improve patient and clinician safety and could be directly integrated into existing video management systems for real-time analysis.

许多临床空间配备了集中的视频记录系统来监控病人与病人之间的互动。考虑到人们对基于视频的机器学习方法的兴趣日益增加,使用这些临床记录来自动收集观察数据的潜力是显而易见的。为了探索这一点,我们的临床团队训练了编码员,并对7名患者的功能评估和治疗过程进行了视频注释。常用的临床软件在调整行为和视频数据方面存在固有的局限性,因此采用自定义软件工具来解决这一功能差距。在为这个工具开发了一个基于canvas的编码员培训课程后,一个由六名训练有素的编码员组成的团队注释了82.33小时的数据。考虑了两种机器学习方法,其中都使用卷积神经网络作为视频特征提取器。第一种方法使用循环网络作为提取特征的分类器,第二种方法使用Transformer架构。这两个模型都产生了有希望的指标,表明从临床视频中检测攻击的能力是可能的和可推广的。模型性能与特征提取器在ImageNet上的性能直接相关,在ImageNet上,ConvNeXtXL产生了性能最好的模型。这可以应用于自动化患者事件响应,以提高患者和临床医生的安全性,并可以直接集成到现有的视频管理系统中进行实时分析。
{"title":"Detecting aggression in clinical treatment videos","authors":"Walker S. Arce ,&nbsp;Seth G. Walker ,&nbsp;Jordan DeBrine ,&nbsp;Benjamin S. Riggan ,&nbsp;James E. Gehringer","doi":"10.1016/j.mlwa.2023.100515","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100515","url":null,"abstract":"<div><p>Many clinical spaces are outfitted with centralized video recording systems to monitor patient–client interactions. Considering the increasing interest in video-based machine learning methods, the potential of using these clinical recordings to automate observational data collection is apparent. To explore this, seven patients had videos of their functional assessment and treatment sessions annotated by coders trained by our clinical team. Commonly used clinical software has inherent limitations aligning behavioral and video data, so a custom software tool was employed to address this functionality gap. After developing a Canvas-based coder training course for this tool, a team of six trained coders annotated 82.33 h of data. Two machine learning approaches were considered, where both used a convolutional neural network as a video feature extractor. The first approach used a recurrent network as the classifier on the extracted features and the second used a Transformer architecture. Both models produced promising metrics indicating that the capability of detecting aggression from clinical videos is possible and generalizable. Model performance is directly tied to the feature extractor’s performance on ImageNet, where ConvNeXtXL produced the best performing models. This has applications in automating patient incident response to improve patient and clinician safety and could be directly integrated into existing video management systems for real-time analysis.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"14 ","pages":"Article 100515"},"PeriodicalIF":0.0,"publicationDate":"2023-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000683/pdfft?md5=7be193e80aa9244b29f8609ccc55e9e6&pid=1-s2.0-S2666827023000683-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138355934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Machine learning with applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1