
Machine learning with applications: latest publications

Intrinsic Dimension Estimating Autoencoder (IDEA) using CancelOut layer and a projected loss
IF 4.9 Pub Date: 2026-03-01 Epub Date: 2026-01-20 DOI: 10.1016/j.mlwa.2026.100850
Antoine Oriou, Philipp Krah, Julian Koellermeier
This paper introduces the Intrinsic Dimension Estimating Autoencoder (IDEA), which identifies the underlying intrinsic dimension of a wide range of datasets whose samples lie on either linear or nonlinear manifolds. Beyond estimating the intrinsic dimension, IDEA is also able to reconstruct the original dataset after projecting it onto the corresponding latent space, which is structured using re-weighted double CancelOut layers. Our key contribution is the introduction of the projected reconstruction loss term, guiding the training of the model by continuously assessing the reconstruction quality under the removal of an additional latent dimension.
We first assess the performance of IDEA on a series of theoretical benchmarks to validate its robustness. These experiments allow us to test its reconstruction ability and compare its performance with state-of-the-art intrinsic dimension estimators. The benchmarks show good accuracy and high versatility of our approach. Subsequently, we apply our model to data generated from the numerical solution of a vertically resolved one-dimensional free-surface flow, following a pointwise discretization of the vertical velocity profile in the horizontal direction, vertical direction, and time. IDEA succeeds in estimating the dataset’s intrinsic dimension and then reconstructs the original solution by working directly within the projection space identified by the network.
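The projected reconstruction loss can be illustrated with a toy sketch: measure how much the reconstruction degrades when one latent dimension is zeroed before decoding. If the data truly lies on a lower-dimensional manifold, dropping a spurious latent dimension costs nothing, while dropping a needed one raises the error. The linear decoder, the names, and the data below are illustrative assumptions, not the IDEA architecture itself:

```python
# Toy sketch of a "projected reconstruction loss" (illustrative only;
# IDEA uses a trained nonlinear autoencoder with CancelOut layers).

def decode(z, W):
    """Toy linear decoder: x_hat = W @ z, with W given as a list of rows."""
    return [sum(w_ij * z_j for w_ij, z_j in zip(row, z)) for row in W]

def mse(x, y):
    return sum((a - b) ** 2 for a, b in zip(x, y)) / len(x)

def projected_loss(x, z, W, drop_dim):
    """Reconstruction error after zeroing one latent dimension."""
    z_proj = [0.0 if i == drop_dim else v for i, v in enumerate(z)]
    return mse(x, decode(z_proj, W))

# Example: a 2-D latent code for data that truly lies on a 1-D manifold,
# so the second latent dimension carries no signal.
W = [[1.0, 0.0], [0.0, 0.0], [2.0, 0.0]]
z = [1.5, 0.7]
x = decode([1.5, 0.0], W)            # ground-truth sample on the manifold
full = mse(x, decode(z, W))          # loss with the full latent code
dropped = projected_loss(x, z, W, drop_dim=1)  # unchanged: dim 1 is spurious
```

Here `dropped` equals `full`, signaling that the latent space can shrink by one dimension, whereas dropping the informative dimension 0 increases the error.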
Citations: 0
Estimation of the remaining charge retention time of an electric vehicle battery
IF 4.9 Pub Date: 2026-03-01 Epub Date: 2025-12-05 DOI: 10.1016/j.mlwa.2025.100813
Chourik Fousseni, Martin Otis, Khaled Ziane
Accurately estimating the remaining driving time (RDT) of an electric vehicle (EV) battery is essential for optimizing energy management and enhancing user experience. However, traditional estimation methods do not adequately account for the influence of temperature, driving characteristics, and vehicle driving time, leading to less accurate predictions and suboptimal range management. To address these limitations, this study presents a method for estimating the remaining charge retention time by integrating temperature and driving characteristics, which refines predictions and improves model reliability. Furthermore, data from the National Big Data Alliance for New Energy Vehicles (NDANEV) were employed to develop a predictive model based on machine learning (ML) models. The ML models compared in this study are Linear Regression, LSTM, RF, Prophet, LightGBM, and XGBoost. Model performance was evaluated using mean absolute error (MAE), root mean square error (RMSE), the coefficient of determination (R²), and the prediction runtime to assess prediction accuracy. The results show that the R² values for Prophet, Random Forest, LSTM, XGBoost, and LightGBM are 0.91, 0.94, 0.95, 0.94, and 0.94, respectively. This suggests that XGBoost outperforms the other models, providing the most accurate estimate of the remaining driving time. In addition, the result confirms that considering driving characteristics and ambient temperature improves the reliability and robustness of estimations. These advancements contribute to more efficient energy management and optimized charging strategies.
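The evaluation metrics named in this abstract (MAE, RMSE, R²) have standard definitions, sketched below in plain Python; the example values are made up for illustration and are not from the study's dataset:

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean square error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Hypothetical remaining-driving-time values in minutes (illustrative).
y_true = [30.0, 45.0, 60.0, 75.0]
y_pred = [32.0, 44.0, 58.0, 76.0]
```

RMSE penalizes large deviations more heavily than MAE, which is why the two are usually reported together for regression models like these.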
Citations: 0
Predictive modeling of error categories in English-Slovak machine translation using automatic evaluation metrics
IF 4.9 Pub Date: 2026-03-01 Epub Date: 2025-12-04 DOI: 10.1016/j.mlwa.2025.100810
Dasa Munkova, Lucia Benkova, Michal Munk, Lubomir Benko, Petr Hajek
This paper presents a language-specific adaptation for automatic identification of machine translation (MT) errors using a comprehensive set of open-source evaluation metrics. The approach focuses on the English–Slovak translation direction, addressing challenges posed by Slovak’s highly inflectional and low-resource nature. Predictive models were developed for five key error categories (Predication, Modal and communication sentence framework, Syntactic-semantic correlativeness, Compound/complex sentences, and Lexical semantics) by employing forward stepwise regression and validated through bootstrapping techniques. The models estimate the probability of error occurrence in MT segments, demonstrating stable and comparable performance across training and test datasets, as measured by Somers’ D. While human expert evaluation remains essential for verifying flagged segments, the proposed approach significantly reduces evaluator workload by prioritizing likely error-containing segments. This methodology offers a scalable and adaptable framework for MT quality assessment across languages and text styles, with potential to improve automated translation evaluation and post-editing processes.
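Somers' D, the validation statistic used above, is a standard rank measure of association between predicted probabilities and binary outcomes, computed from concordant and discordant pairs. A minimal pairwise sketch (the outcome/probability values are illustrative, not the study's MT segments):

```python
def somers_d(y, p):
    """Somers' D of predictions p with respect to binary outcomes y:
    (concordant - discordant) over all pairs with differing outcomes."""
    conc = disc = ties = 0
    n = len(y)
    for i in range(n):
        for j in range(i + 1, n):
            if y[i] == y[j]:
                continue  # only pairs with different outcomes count
            # orient the pair so `hi` is the prediction for the positive outcome
            hi, lo = (p[i], p[j]) if y[i] > y[j] else (p[j], p[i])
            if hi > lo:
                conc += 1
            elif hi < lo:
                disc += 1
            else:
                ties += 1
    total = conc + disc + ties
    return (conc - disc) / total if total else 0.0

# 1 = segment contains the error category, 0 = it does not (hypothetical data)
d = somers_d([1, 1, 0, 0], [0.9, 0.6, 0.7, 0.2])
```

A value of 1.0 means the model ranks every error-containing segment above every clean one; 0.0 means no better than chance, which is why comparable Somers' D on training and test sets indicates stable models.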
Citations: 0
Leveraging multimodal large language models to extract mechanistic insights from biomedical visuals: A case study on COVID-19 and neurodegenerative diseases
IF 4.9 Pub Date: 2026-03-01 Epub Date: 2025-12-06 DOI: 10.1016/j.mlwa.2025.100816
Elizaveta Popova, Marc Jacobs, Martin Hofmann-Apitius, Negin Sadat Babaiha

Background

The COVID-19 pandemic has intensified concerns about its long-term neurological impact, with evidence linking SARS-CoV-2 infection to neurodegenerative diseases (NDDs) such as Alzheimer’s (AD) and Parkinson’s (PD). Patients with these conditions face higher risk of severe COVID-19 outcomes and may experience accelerated cognitive or motor decline following infection. Proposed mechanisms—including neuroinflammation, blood–brain barrier (BBB) disruption, and abnormal protein aggregation—closely mirror core features of neurodegenerative pathology. However, current knowledge remains fragmented across text, figures, and pathway diagrams, limiting integration into computational models that could reveal systemic patterns.

Results

To address this gap, we applied GPT-4 Omni (GPT-4o), a multimodal large language model (LLM), to extract mechanistic insights from biomedical figures. Over 10,000 images were retrieved through targeted searches on COVID-19 and neurodegeneration; after automated and manual filtering, a curated subset was analyzed. GPT-4o extracted biological relationships as semantic triples, grouped into six mechanistic categories—including microglial activation and barrier disruption—using ontology-guided similarity and assembled into a Neo4j knowledge graph (KG). Accuracy was evaluated against a gold-standard dataset of expert-annotated images using Biomedical Bidirectional Encoder Representations from Transformers (BioBERT)–based semantic matching. This evaluation enabled prompt tuning, threshold optimization, and hyperparameter assessment. Results show that GPT-4o successfully recovers both established and novel mechanisms, yielding interpretable outputs that illuminate complex biological links between SARS-CoV-2 and neurodegeneration.

Conclusions

This study demonstrates the potential of multimodal LLMs to mine biomedical visual data at scale. By complementing text mining and integrating figure-derived knowledge, our framework advances understanding of COVID-19–related neurodegeneration and supports future translational research.
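The triple-extraction step described in the Results can be sketched as collecting (subject, predicate, object) triples under mechanistic categories before loading them into a graph store such as Neo4j. The triples and category labels below are illustrative placeholders, not the paper's curated output:

```python
# Minimal sketch: group extracted semantic triples by mechanistic category.
# Contents are hypothetical examples, not the study's annotated data.
from collections import defaultdict

def build_graph(triples):
    """Map each mechanistic category to its (subject, predicate, object) triples."""
    graph = defaultdict(list)
    for subj, pred, obj, category in triples:
        graph[category].append((subj, pred, obj))
    return dict(graph)

triples = [
    ("SARS-CoV-2", "activates", "microglia", "microglial activation"),
    ("cytokines", "disrupt", "blood-brain barrier", "barrier disruption"),
    ("SARS-CoV-2", "induces", "cytokines", "neuroinflammation"),
]
kg = build_graph(triples)
```

In the actual pipeline each triple would additionally carry provenance (source figure, ontology match score) before being written to the Neo4j knowledge graph.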
Citations: 0
Oncology data extraction with large language models from real-world breast cancer electronic health records in Spanish
IF 4.9 Pub Date: 2026-03-01 Epub Date: 2026-01-05 DOI: 10.1016/j.mlwa.2026.100837
Julio Montes-Torres, Francisco J. Moreno-Barea, Leonardo Franco, Nuria Ribelles, Emilio Alba, José M. Jerez
The integration of Artificial Intelligence (AI) in healthcare systems has the potential to significantly enhance patient care and streamline clinical processes. This research investigates the use of generative AI and large language models (LLMs) for oncological information extraction (IE) from real Spanish electronic health records (EHRs) to enhance clinical decision-making and research. We conducted a comparative analysis of GPT-4.5 and 11 state-of-the-art, locally executable LLM-based chatbots, including Llama 3.2, Mistral-Small 3.2, and Phi-4, to extract specific clinical entities from real EHR narratives. Our evaluation workflow aimed to assess the performance of these models under computational constraints, specifically targeting the extraction of breast cancer prognostic factors. Initial findings indicate that while open-source LLMs are improving, they are not yet equivalent to human specialists in terms of Named Entity Recognition (NER) accuracy. The language of the clinical records notably influences performance, revealing that smaller models particularly struggle with Spanish text. However, with careful model selection and output post-processing, Mistral-Small 3.2 achieved a detection F1 score of over 74.7% for critical TNM information. This study highlights significant potential for generative AI in clinical IE but underscores the need for ongoing improvements, particularly in handling linguistic diversity. Locally managed open-source models are still far from performing like a human specialist, but addressing common model shortcomings can facilitate the integration of AI-driven solutions into public healthcare systems, thereby improving patient outcomes and fostering efficient data utilisation.
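The detection F1 score cited above combines precision and recall in the usual way; a minimal sketch with hypothetical entity counts (not the study's actual confusion matrix):

```python
def f1_score(tp, fp, fn):
    """F1 from true-positive, false-positive, and false-negative counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Hypothetical counts for TNM entity detection (illustrative only).
f1 = f1_score(75, 30, 20)
```

Because F1 is the harmonic mean, a model cannot score well by over-extracting (inflating recall at the cost of precision) or by extracting only safe entities (the reverse), which makes it a common headline metric for NER.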
Citations: 0
A traffic-aware federated learning prediction framework with custom aggregation
IF 4.9 Pub Date: 2026-03-01 Epub Date: 2025-12-29 DOI: 10.1016/j.mlwa.2025.100829
Seerat Kaur, Sukhjit Singh Sehra, Darisuh Ebrahimi, Emad A. Mohammed
Reliable traffic predictions are essential for managing congestion, optimizing routes, improving commuter safety, and advancing the performance of intelligent transportation systems (ITS). However, existing centralized systems often lack adaptability to real-world traffic patterns and fail to capture spatio-temporal variability and client-level heterogeneity. These systems require large amounts of sensitive data to be collected on central servers, intensifying privacy risks. This study proposes a privacy-preserving Federated Learning (FL) framework for traffic flow and speed prediction (5 to 60 minutes ahead) using non-independent and identically distributed (non-IID) traffic data. The objectives of this study are threefold: (1) design a client-aware custom FL aggregation strategy that accounts for traffic heterogeneity and client-specific dynamics, which are ignored in standard FL methods; (2) improve personalization by grouping clients based on real-world traffic pattern similarity via a clustering-based approach; and (3) enhance the convergence and predictive performance of global aggregation using dynamic, traffic-aware aggregation scores. The proposed framework designs a hybrid FL long short-term memory (FedLSTM) model augmented with an attention mechanism to effectively model both temporal and spatial traffic variations across junctions, while ensuring that all raw data remains local. To improve learning under traffic diversity and imbalanced traffic distribution patterns, we propose a custom traffic-aware aggregation strategy that dynamically weighs client contributions based on six traffic-based metrics. Evaluations on clustered client partitions demonstrate that our custom aggregation consistently outperformed the baseline strategies across multiple evaluation metrics. These results highlight the effectiveness of integrating traffic-aware aggregation in enhancing the performance and generalization capability of FL-based traffic prediction frameworks.
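The custom aggregation idea, weighting each client's parameters by a traffic-derived score, can be sketched as score-weighted federated averaging. The scores and parameter vectors below are placeholders; the paper's six traffic-based metrics are not reproduced here:

```python
# Minimal sketch of score-weighted federated averaging: the server combines
# client model parameters using per-client aggregation scores. In the paper
# these scores are derived from traffic metrics; here they are arbitrary.

def aggregate(client_params, scores):
    """Weighted average of flat parameter vectors (one list per client)."""
    total = sum(scores)
    weights = [s / total for s in scores]          # normalize scores to sum to 1
    n_params = len(client_params[0])
    return [
        sum(w * params[i] for w, params in zip(weights, client_params))
        for i in range(n_params)
    ]

# Two clients with 2-parameter models (hypothetical values).
clients = [[1.0, 2.0], [3.0, 4.0]]
global_model = aggregate(clients, scores=[1.0, 3.0])  # client 2 weighted 3x
```

With equal scores this reduces to plain FedAvg over equally sized clients; the traffic-aware scores let clients whose data better reflects current conditions pull the global model toward them.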
Citations: 0
Causal Digital Twins for cyber–physical security in water systems: A framework for robust anomaly detection
IF 4.9 Pub Date: 2026-03-01 Epub Date: 2025-12-17 DOI: 10.1016/j.mlwa.2025.100824
Mohammadhossein Homaei, Mehran Tarif, Pablo García Rodríguez, Andrés Caro, Mar Ávila
Industrial Control Systems (ICS) in water distribution and treatment face cyber–physical attacks exploiting network and physical vulnerabilities. Current water system anomaly detection methods rely on correlations, yielding high false alarms and poor root cause analysis. We propose a Causal Digital Twin (CDT) framework for water infrastructures, combining causal inference with digital twin modeling. CDT supports association for pattern detection, intervention for system response, and counterfactual analysis for water attack prevention. Evaluated on the water-related datasets SWaT, WADI, and HAI, CDT shows high compliance with physical constraints (90.8% for SWaT, 87.4%–90.8% across datasets) and a structural Hamming distance of 0.133 ± 0.02. F1-scores are 0.944 ± 0.014 (SWaT), 0.902 ± 0.021 (WADI), and 0.923 ± 0.018 (HAI, p < 0.0024). Multi-scale temporal detection strategies (τ ∈ {5, 10, 20}) enable 91.7% detection of stealthy attacks through cumulative causal discrepancy analysis. CDT reduces false positives by 48% compared to state-of-the-art methods (70% vs. statistical baselines), achieves 78.4% root cause accuracy, and enables counterfactual defenses reducing attack success by up to 89.1%. Real-time performance at 3.2 ms latency ensures safe and interpretable operation for medium-scale water systems.
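The structural Hamming distance reported above counts the edge edits needed to turn one causal graph into another. A minimal sketch over adjacency matrices, using the common convention that an edge reversal counts once (the paper's exact convention and normalization are assumptions here, not stated in the abstract):

```python
def shd(a, b):
    """Structural Hamming distance between two directed graphs given as
    adjacency matrices (no self-loops assumed): number of edge insertions,
    deletions, or reversals needed to turn `a` into `b`, with a reversal
    counted as a single edit."""
    n = len(a)
    d = 0
    for i in range(n):
        for j in range(i + 1, n):
            # compare the (i, j) node pair's edge state in both graphs
            e1 = (a[i][j], a[j][i])
            e2 = (b[i][j], b[j][i])
            if e1 != e2:
                d += 1
    return d

# Hypothetical 3-node causal graphs: a has edge 0->1; b has 1->0 and 1->2.
a = [[0, 1, 0], [0, 0, 0], [0, 0, 0]]
b = [[0, 0, 0], [1, 0, 1], [0, 0, 0]]
d_ab = shd(a, b)  # one reversal (0->1 vs 1->0) plus one insertion (1->2)
```

A low SHD against a reference structure, as reported for CDT, indicates that the discovered causal graph closely matches the ground-truth process topology.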
Machine learning with applications, Volume 23, Article 100824.
Citations: 0
SAFE AI metrics: An integrated approach
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2025-12-15 DOI: 10.1016/j.mlwa.2025.100821
Paolo Giudici, Vasily Kolesnikov
We contribute to the field of AI governance with the development of a unified compliance metric that integrates three key dimensions of SAFE Artificial Intelligence: Security, Accuracy, and Explainability. While these aspects are typically assessed in isolation, the proposed approach integrates them into a single, interpretable metric grounded in a consistent mathematical structure. To develop an integrated framework, the outputs of machine learning models are evaluated under three risk dimensions that correspond to different input data perturbations: data removal (for accuracy); data poisoning (for security); and feature removal (for explainability). Experiments with the methodology on both real and simulated datasets show that the integrated metric improves compliance monitoring and enables a consistent evaluation of AI risks.
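The paper's actual aggregation formula is not given in the abstract. The sketch below only illustrates the three named perturbations — data removal, label poisoning, and feature removal — applied to a toy nearest-centroid classifier, with the mean accuracy retained relative to the unperturbed model standing in for the unified score. All function names, the perturbation fractions, and the aggregation rule are assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

def fit_centroids(X, y):
    # toy "model": one mean vector per class (nearest-centroid classifier)
    return {c: X[y == c].mean(axis=0) for c in np.unique(y)}

def accuracy(model, X, y):
    classes = np.array(sorted(model))
    cents = np.stack([model[c] for c in classes])
    pred = classes[np.argmin(((X[:, None, :] - cents) ** 2).sum(-1), axis=1)]
    return (pred == y).mean()

def safe_score(X_tr, y_tr, X_te, y_te, frac=0.2):
    """Mean accuracy retained under the three SAFE perturbations."""
    base = accuracy(fit_centroids(X_tr, y_tr), X_te, y_te)
    n = len(y_tr)
    # accuracy dimension: remove a random fraction of the training data
    keep = rng.permutation(n)[int(frac * n):]
    acc_removal = accuracy(fit_centroids(X_tr[keep], y_tr[keep]), X_te, y_te)
    # security dimension: poison the training set by flipping binary labels
    flip = rng.permutation(n)[:int(frac * n)]
    y_poisoned = y_tr.copy()
    y_poisoned[flip] = 1 - y_poisoned[flip]
    acc_poison = accuracy(fit_centroids(X_tr, y_poisoned), X_te, y_te)
    # explainability dimension: drop the first input feature entirely
    acc_feature = accuracy(fit_centroids(np.delete(X_tr, 0, axis=1), y_tr),
                           np.delete(X_te, 0, axis=1), y_te)
    # integrate: mean accuracy retained relative to the unperturbed model
    return np.mean([acc_removal, acc_poison, acc_feature]) / base

# two well-separated Gaussian classes, shuffled into train/test splits
X = np.vstack([rng.normal(size=(200, 4)), rng.normal(size=(200, 4)) + 2.0])
y = np.repeat([0, 1], 200)
idx = rng.permutation(400)
X_tr, y_tr, X_te, y_te = X[idx[:300]], y[idx[:300]], X[idx[300:]], y[idx[300:]]
score = safe_score(X_tr, y_tr, X_te, y_te)
print(round(float(score), 2))
```

A score near 1 indicates the model degrades little under any of the three perturbations; a robust aggregation would likely also weight or threshold the individual dimensions rather than average them.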
Machine learning with applications, Volume 23, Article 100821.
Citations: 0
Trust but verify: Image-aware evaluation of radiology report generators
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2026-01-20 DOI: 10.1016/j.mlwa.2026.100851
Sayeh Gholipour Picha, Dawood Al Chanti, Alice Caplier
Large language and vision-language models have greatly advanced automated chest X-ray report generation (RRG), yet current evaluation practices remain largely text-based and detached from image evidence. Traditional machine translation metrics fail to determine whether generated findings are clinically correct or visually grounded, limiting their suitability for medical applications.
This study introduces a comprehensive, image-aware evaluation framework that integrates the VICCA (Visual Interpretation and Comprehension of Chest X-ray Anomalies) protocol with the domain-specific semantic metric MCSE (Medical Corpus Similarity Evaluation). VICCA combines visual grounding and text-guided image generation to assess visual-textual consistency, while MCSE measures semantic and factual fidelity through clinically meaningful entities, negations, and modifiers. Together, they provide a unified, semi-reference-free assessment of pathology-level accuracy, semantic coherence, and visual consistency.
Five representative RRG models, R2Gen, M2Trans, CXR-RePaiR, RGRG, and MedGemma, are benchmarked on 2461 MIMIC-CXR studies using a standardized pipeline. Results reveal systematic trade-offs: models with high pathology agreement often generate semantically weak or visually inconsistent reports, whereas textually fluent models may lack proper image grounding. By integrating clinical semantics and visual reliability within a single multimodal framework, VICCA establishes a robust paradigm for evaluating the trustworthiness and interpretability of AI-generated radiology reports.
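MCSE itself is not specified in the abstract beyond scoring "clinically meaningful entities, negations, and modifiers". As a toy illustration of that idea, the sketch below scores a generated report against a reference by F1 overlap of (finding, polarity) pairs, using a crude three-token negation window — the vocabularies, window size, and F1 aggregation are all assumptions, not the paper's method:

```python
FINDINGS = {"effusion", "pneumothorax", "consolidation", "cardiomegaly"}
NEGATORS = {"no", "without", "absent"}

def entity_set(report):
    """Extract (finding, polarity) pairs with a toy negation window."""
    tokens = report.lower().replace(".", " ").replace(",", " ").split()
    out = set()
    for i, tok in enumerate(tokens):
        if tok in FINDINGS:
            # finding is negated if a negator appears in the 3 prior tokens
            negated = any(t in NEGATORS for t in tokens[max(0, i - 3):i])
            out.add((tok, "neg" if negated else "pos"))
    return out

def entity_f1(reference, generated):
    ref, gen = entity_set(reference), entity_set(generated)
    if not ref and not gen:
        return 1.0
    tp = len(ref & gen)
    prec = tp / len(gen) if gen else 0.0
    rec = tp / len(ref) if ref else 0.0
    return 2 * prec * rec / (prec + rec) if prec + rec else 0.0

ref = "No pleural effusion. Mild cardiomegaly."
hyp = "Cardiomegaly is present. There is a small effusion."
print(entity_f1(ref, hyp))  # → 0.5: cardiomegaly matches, effusion polarity flips
```

Polarity-aware matching is what separates this family of metrics from surface n-gram overlap: the hypothesis above mentions both findings, but asserting an effusion the reference negates is a factual error, not a paraphrase.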
Machine learning with applications, Volume 23, Article 100851.
Citations: 0
A causal machine learning framework for analyzing the heterogeneous effects of weather conditions on arrival delays in public rail transportation
IF 4.9 Pub Date : 2026-03-01 Epub Date: 2026-02-04 DOI: 10.1016/j.mlwa.2026.100854
Elham Ahmadi, André Ludwig, Henrik Leopold
Extreme weather is often perceived as a major threat to the reliability of rail transportation. This study investigates regional rail operations in central Germany and examines whether severe weather conditions causally affect train arrival delays. We combine several years of operational data (2017–2022) with weather observations from the German Weather Service and define four component-specific treatments for adverse conditions: extreme temperature, strong wind, heavy rainfall, and snow presence. To estimate causal effects, we employ a Double Machine Learning framework based on partial linear regression (PLR–DML) and Causal Forest–DML. By reframing the weather–delay relationship as a causal question rather than a purely predictive one, the study provides evidence on whether extreme weather constitutes a materially relevant source of arrival delays. Across all four weather components, estimated average treatment effects are small in magnitude; analyses of individualized and group-level effects reveal no consistent or robust patterns of heterogeneity across lines, seasons, weekdays, or hours of the day. Robustness checks indicate that these operationally negligible estimates are not sensitive to outliers, rare-event imbalance, or to reasonable perturbations consistent with plausible unobserved confounding. Exploratory mediation analyses are consistent with the interpretation that passenger-flow variables do not materially amplify the already negligible estimated effects of severe weather on delays. Overall, the results suggest that, in this regional network, severe weather does not materially increase arrival delays. The findings underscore the importance of rigorous causal diagnostics for distinguishing perceived from materially relevant sources of delay in transportation reliability studies.
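The partialling-out PLR–DML estimator the abstract names can be sketched compactly: residualize the outcome and the treatment on the covariates with cross-fitted nuisance models, then regress residual on residual. Below, ordinary least squares stands in for the ML nuisance learners the authors would use in practice (libraries such as DoubleML implement the full procedure), and the data-generating process is synthetic:

```python
import numpy as np

def dml_plr(y, t, X, n_folds=2, rng=None):
    """Double ML for the partially linear model y = theta*t + g(X) + e.

    Nuisances E[y|X] and E[t|X] are fit with plain least squares here
    (stand-ins for ML learners), with cross-fitting: residuals for each
    fold come from models trained on the other folds. theta is the
    slope of the y-residuals on the t-residuals.
    """
    rng = rng or np.random.default_rng(0)
    n = len(y)
    folds = rng.permutation(n) % n_folds          # balanced random folds
    ry, rt = np.empty(n), np.empty(n)
    Xb = np.column_stack([np.ones(n), X])         # add an intercept
    for k in range(n_folds):
        tr, te = folds != k, folds == k
        by, *_ = np.linalg.lstsq(Xb[tr], y[tr], rcond=None)
        bt, *_ = np.linalg.lstsq(Xb[tr], t[tr], rcond=None)
        ry[te] = y[te] - Xb[te] @ by
        rt[te] = t[te] - Xb[te] @ bt
    return float(rt @ ry / (rt @ rt))             # orthogonalized estimate

# synthetic check: true treatment effect is 2.0, treatment confounded by X
rng = np.random.default_rng(1)
X = rng.normal(size=(2000, 3))
t = X @ np.array([1.0, -1.0, 0.5]) + rng.normal(size=2000)
y = 2.0 * t + X @ np.array([0.5, 0.5, 0.5]) + rng.normal(size=2000)
print(round(dml_plr(y, t, X), 2))
```

The orthogonalization is what makes the estimate robust to regularization bias in the nuisance fits; a naive regression of y on t here would absorb the confounding through X and overstate the effect.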
Machine learning with applications, Volume 23, Article 100854.
Citations: 0