首页 > 最新文献

Applied AI letters最新文献

英文 中文
On Training Spiking Neural Networks by Means of a Novel Quantum Inspired Machine Learning Method 用一种新的量子启发机器学习方法训练尖峰神经网络
Pub Date : 2025-03-06 DOI: 10.1002/ail2.114
Jean Michel Sellier, Alexandre Martini

In spite of the high potential shown by spiking neural networks (e.g., temporal patterns), training them remains an open and complex problem. In practice, while in theory these networks are computationally as powerful as mainstream artificial neural networks, they have not reached the same accuracy levels yet. The major reason for such a situation seems to be represented by the lack of adequate training algorithms for deep spiking neural networks, since spike signals are not differentiable, that is, no direct way to compute a gradient is provided. Recently, a novel training method, based on the (digital) simulation of certain quantum systems, has been suggested. It has already shown interesting advantages, among which is the fact that no gradient is required to be computed. In this work, we apply this approach to the problem of training spiking neural networks, and we show that this recent training method is capable of training deep and complex spiking neural networks on the MNIST data set.

尽管脉冲神经网络显示出很高的潜力(例如,时间模式),但训练它们仍然是一个开放和复杂的问题。在实践中,虽然理论上这些网络在计算上与主流人工神经网络一样强大,但它们还没有达到相同的精度水平。造成这种情况的主要原因似乎是缺乏足够的深度尖峰神经网络训练算法,因为尖峰信号是不可微的,也就是说,没有直接的方法来计算梯度。最近,人们提出了一种基于某些量子系统的(数字)模拟的新训练方法。它已经显示出一些有趣的优点,其中之一就是不需要计算梯度。在这项工作中,我们将这种方法应用于训练尖峰神经网络的问题,并且我们证明了这种最新的训练方法能够在MNIST数据集上训练深度和复杂的尖峰神经网络。
{"title":"On Training Spiking Neural Networks by Means of a Novel Quantum Inspired Machine Learning Method","authors":"Jean Michel Sellier,&nbsp;Alexandre Martini","doi":"10.1002/ail2.114","DOIUrl":"https://doi.org/10.1002/ail2.114","url":null,"abstract":"<p>In spite of the high potential shown by spiking neural networks (e.g., temporal patterns), training them remains an open and complex problem. In practice, while in theory these networks are computationally as powerful as mainstream artificial neural networks, they have not reached the same accuracy levels yet. The major reason for such a situation seems to be represented by the lack of adequate training algorithms for deep spiking neural networks, since spike signals are not differentiable, that is, no direct way to compute a gradient is provided. Recently, a novel training method, based on the (digital) simulation of certain quantum systems, has been suggested. It has already shown interesting advantages, among which is the fact that no gradient is required to be computed. In this work, we apply this approach to the problem of training spiking neural networks, and we show that this recent training method is capable of training deep and complex spiking neural networks on the MNIST data set.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"6 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.114","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143565188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fault Detection and Classification for Photovoltaic Panel System Using Machine Learning Techniques 基于机器学习技术的光伏板系统故障检测与分类
Pub Date : 2025-03-05 DOI: 10.1002/ail2.115
Ghalia Nassreddine, Amal El Arid, Mohamad Nassereddine, Obada Al Khatib

The deployment of solar photovoltaic (PV) panel systems, as renewable energy sources, has seen a rise recently. Consequently, it is imperative to implement efficient methods for the accurate detection and diagnosis of PV system faults to prevent unexpected power disruptions. This paper introduces a potential strategy for fault identification and classification through the utilization of machine learning (ML) techniques. The study aimed to use ML algorithms to identify and classify normal operations, seven different types of faults, in two operational modes (maximum power point tracking and intermediate power point tracking). Four machine learning algorithms and ensemble methods (decision trees, k-nearest neighbors, random forest, and extreme gradient boosting) were employed, followed by hyperparameter tuning and cross-validation to determine the best configuration. The results indicated that ensemble methods, particularly XGBoost, excelled in detecting and classifying faults in PV systems, achieving a 99% accuracy rate after hyperparameter adjustments. The TPR values show a high sensitivity of 0.999, with some achieving a perfect score of 1.000. The FPR shows very low values, with the majority of metrics indicating FPRs at or close to 0%. This performance is crucial in the solar energy context, as failing to detect faults can result in significant energy loss and increased maintenance costs.

作为可再生能源,太阳能光伏(PV)面板系统的部署近年来呈上升趋势。因此,建立有效的方法对光伏系统故障进行准确的检测和诊断,以防止意外停电是十分必要的。本文介绍了一种利用机器学习技术进行故障识别和分类的潜在策略。该研究旨在使用机器学习算法在两种运行模式(最大功率点跟踪和中间功率点跟踪)下识别和分类正常运行,七种不同类型的故障。采用了四种机器学习算法和集成方法(决策树、k近邻、随机森林和极端梯度增强),然后进行超参数调优和交叉验证以确定最佳配置。结果表明,集成方法,特别是XGBoost,在光伏系统故障检测和分类方面表现优异,经过超参数调整后的准确率达到99%。TPR值显示出0.999的高灵敏度,其中一些达到了1.000的满分。FPR显示非常低的值,大多数指标表明FPR等于或接近0%。这种性能在太阳能环境中至关重要,因为未能检测到故障可能导致重大的能量损失和维护成本的增加。
{"title":"Fault Detection and Classification for Photovoltaic Panel System Using Machine Learning Techniques","authors":"Ghalia Nassreddine,&nbsp;Amal El Arid,&nbsp;Mohamad Nassereddine,&nbsp;Obada Al Khatib","doi":"10.1002/ail2.115","DOIUrl":"https://doi.org/10.1002/ail2.115","url":null,"abstract":"<p>The deployment of solar photovoltaic (PV) panel systems, as renewable energy sources, has seen a rise recently. Consequently, it is imperative to implement efficient methods for the accurate detection and diagnosis of PV system faults to prevent unexpected power disruptions. This paper introduces a potential strategy for fault identification and classification through the utilization of machine learning (ML) techniques. The study aimed to use ML algorithms to identify and classify normal operations, seven different types of faults, in two operational modes (maximum power point tracking and intermediate power point tracking). Four machine learning algorithms and ensemble methods (decision trees, k-nearest neighbors, random forest, and extreme gradient boosting) were employed, followed by hyperparameter tuning and cross-validation to determine the best configuration. The results indicated that ensemble methods, particularly XGBoost, excelled in detecting and classifying faults in PV systems, achieving a 99% accuracy rate after hyperparameter adjustments. The TPR values show a high sensitivity of 0.999, with some achieving a perfect score of 1.000. The FPR shows very low values, with the majority of metrics indicating FPRs at or close to 0%. This performance is crucial in the solar energy context, as failing to detect faults can result in significant energy loss and increased maintenance costs.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"6 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.115","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143554350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Earnings Call Scripts Generation With Large Language Models Using Few-Shot Learning Prompt Engineering and Fine-Tuning Methods 使用少量学习提示工程和微调方法的大型语言模型生成收益电话脚本
Pub Date : 2025-02-13 DOI: 10.1002/ail2.110
Sovik Kumar Nath, Yanyan Zhang, Jia Vivian Li

Company earnings calls are pivotal events that offer crucial insights into a company's financial well-being and future outlook. Large language models (LLMs) present a promising avenue for automatically generating the initial draft of earnings call scripts, leveraging financial data and past examples. We evaluate two distinct methods: (1) few-shot learning prompt engineering with a large language model (LLM) and (2) fine-tuning a large language model on earnings call transcript data. Our findings indicate that both methods can produce coherent scripts encompassing key metrics, updates, and guidance. However, there are inherent trade-offs in comprehensiveness, potential hallucinations, writing style, ease of use, and cost. We discuss the pros and cons of each method to guide practitioners on effectively harnessing LLMs for earnings call script generation. Notably, we employ a human and two different LLMs to act as judges to compare the outcomes generated by the two approaches.

公司财报电话会议是了解公司财务状况和未来前景的关键事件。利用财务数据和过去的例子,大型语言模型(llm)为自动生成收益电话会议脚本的初始草案提供了一个有前途的途径。我们评估了两种不同的方法:(1)使用大型语言模型(LLM)进行少量学习提示工程,(2)对收益电话会议记录数据的大型语言模型进行微调。我们的发现表明,这两种方法都可以产生包含关键指标、更新和指导的连贯脚本。然而,在全面性、潜在的幻觉、写作风格、易用性和成本方面存在固有的权衡。我们讨论了每种方法的优点和缺点,以指导从业者有效地利用llm来生成收益电话会议脚本。值得注意的是,我们聘请了一个人和两个不同的法学硕士作为法官来比较两种方法产生的结果。
{"title":"Earnings Call Scripts Generation With Large Language Models Using Few-Shot Learning Prompt Engineering and Fine-Tuning Methods","authors":"Sovik Kumar Nath,&nbsp;Yanyan Zhang,&nbsp;Jia Vivian Li","doi":"10.1002/ail2.110","DOIUrl":"https://doi.org/10.1002/ail2.110","url":null,"abstract":"<p>Company earnings calls are pivotal events that offer crucial insights into a company's financial well-being and future outlook. Large language models (LLMs) present a promising avenue for automatically generating the initial draft of earnings call scripts, leveraging financial data and past examples. We evaluate two distinct methods: (1) few-shot learning prompt engineering with a large language model (LLM) and (2) fine-tuning a large language model on earnings call transcript data. Our findings indicate that both methods can produce coherent scripts encompassing key metrics, updates, and guidance. However, there are inherent trade-offs in comprehensiveness, potential hallucinations, writing style, ease of use, and cost. We discuss the pros and cons of each method to guide practitioners on effectively harnessing LLMs for earnings call script generation. Notably, we employ a human and two different LLMs to act as judges to compare the outcomes generated by the two approaches.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.110","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143404332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scaling Laws for Discriminative Classification in Large Language Models 大型语言模型判别分类的标度规律
Pub Date : 2025-02-13 DOI: 10.1002/ail2.109
Dean Wyatte, Fatemeh Tahmasbi, Ming Li, Thomas Markovich

Modern large language models (LLMs) represent a paradigm shift in what can plausibly be expected of machine learning models. The fact that LLMs can effectively generate sensible answers to a diverse range of queries suggests that they would be useful in customer support applications. While powerful, LLMs have been observed to be prone to hallucination which unfortunately makes their near-term use in customer support applications challenging. To address this issue, we present a system that allows us to use an LLM to augment our customer support advocates by re-framing the language modeling task as a discriminative classification task. In this framing, we seek to present the Top-K best template responses for a customer support advocate to use when responding to a customer. We present the result of both offline and online experiments where we observed offline gains and statistically significant online lifts for our experimental system. Along the way, we present observed scaling curves for validation loss and Top-K accuracy, resulted from model parameter ablation studies. We close by discussing the space of trade-offs with respect to model size, latency, and accuracy as well as and suggesting future applications to explore.

现代大型语言模型(llm)代表了对机器学习模型的合理期望的范式转变。llm可以有效地为各种查询生成合理的答案,这表明它们在客户支持应用程序中很有用。虽然llm功能强大,但人们观察到llm容易产生幻觉,不幸的是,这使得它们在客户支持应用程序中的短期应用具有挑战性。为了解决这个问题,我们提出了一个系统,通过将语言建模任务重新定义为判别分类任务,该系统允许我们使用LLM来增强我们的客户支持倡导者。在这个框架中,我们试图为客户支持倡导者提供Top-K最佳模板响应,以便在响应客户时使用。我们给出了离线和在线实验的结果,我们观察到我们的实验系统的离线增益和统计显着的在线提升。在此过程中,我们给出了观察到的验证损失和Top-K精度的缩放曲线,这是由模型参数消融研究产生的。最后,我们讨论了模型大小、延迟和准确性方面的权衡空间,并提出了未来需要探索的应用。
{"title":"Scaling Laws for Discriminative Classification in Large Language Models","authors":"Dean Wyatte,&nbsp;Fatemeh Tahmasbi,&nbsp;Ming Li,&nbsp;Thomas Markovich","doi":"10.1002/ail2.109","DOIUrl":"https://doi.org/10.1002/ail2.109","url":null,"abstract":"<p>Modern large language models (LLMs) represent a paradigm shift in what can plausibly be expected of machine learning models. The fact that LLMs can effectively generate sensible answers to a diverse range of queries suggests that they would be useful in customer support applications. While powerful, LLMs have been observed to be prone to hallucination which unfortunately makes their near-term use in customer support applications challenging. To address this issue, we present a system that allows us to use an LLM to augment our customer support advocates by re-framing the language modeling task as a discriminative classification task. In this framing, we seek to present the Top-K best template responses for a customer support advocate to use when responding to a customer. We present the result of both offline and online experiments where we observed offline gains and statistically significant online lifts for our experimental system. Along the way, we present observed scaling curves for validation loss and Top-K accuracy, resulted from model parameter ablation studies. We close by discussing the space of trade-offs with respect to model size, latency, and accuracy as well as and suggesting future applications to explore.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.109","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143404303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fingerprinting-Based Indoor Localization in a 3 × 3 Meter Grid Using OFDM Signals at Sub-6 GHz 基于指纹识别的3 × 3米栅格室内定位,使用sub - 6ghz OFDM信号
Pub Date : 2024-11-11 DOI: 10.1002/ail2.104
Jaspreet Kaur, Kang Tan, Muhammad Z. Khan, Olaoluwa R. Popoola, Muhammad A. Imran, Qammer H. Abbasi, Hasan T. Abbas

Accurately determining the indoor location of mobile devices has garnered significant interest due to the complex challenges posed by non-line-of-sight (NLOS) propagation and multipath effects. To address this challenge, this paper proposes a new approach to indoor positioning that utilises channel state information (CSI) and machine learning (ML) techniques to improve accuracy. The proposed method extracts the amplitude and phase differences of the subcarriers from the CSI data to create fingerprints. ML algorithms and network architecture are utilised to train the CSI data from two antennas, in the form of phase and amplitude. Experiments conducted in a standard indoor environment demonstrate the effectiveness of the proposed method.

由于非视距(NLOS)传播和多径效应带来的复杂挑战,准确确定移动设备的室内位置已经引起了人们的极大兴趣。为了解决这一挑战,本文提出了一种新的室内定位方法,该方法利用通道状态信息(CSI)和机器学习(ML)技术来提高准确性。该方法从CSI数据中提取子载波的幅差和相位差来创建指纹。利用ML算法和网络架构来训练来自两个天线的CSI数据,以相位和幅度的形式。在标准室内环境下进行的实验证明了该方法的有效性。
{"title":"Fingerprinting-Based Indoor Localization in a 3 × 3 Meter Grid Using OFDM Signals at Sub-6 GHz","authors":"Jaspreet Kaur,&nbsp;Kang Tan,&nbsp;Muhammad Z. Khan,&nbsp;Olaoluwa R. Popoola,&nbsp;Muhammad A. Imran,&nbsp;Qammer H. Abbasi,&nbsp;Hasan T. Abbas","doi":"10.1002/ail2.104","DOIUrl":"https://doi.org/10.1002/ail2.104","url":null,"abstract":"<p>Accurately determining the indoor location of mobile devices has garnered significant interest due to the complex challenges posed by non-line-of-sight (NLOS) propagation and multipath effects. To address this challenge, this paper proposes a new approach to indoor positioning that utilises channel state information (CSI) and machine learning (ML) techniques to improve accuracy. The proposed method extracts the amplitude and phase differences of the subcarriers from the CSI data to create fingerprints. ML algorithms and network architecture are utilised to train the CSI data from two antennas, in the form of phase and amplitude. Experiments conducted in a standard indoor environment demonstrate the effectiveness of the proposed method.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.104","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143404563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Predictive Pollution Control Through Traffic Flux Forecasting With Deep Learning: A Case Study in the City of Valencia 基于深度学习的交通流量预测对污染的预测控制——以巴伦西亚市为例
Pub Date : 2024-10-29 DOI: 10.1002/ail2.106
Miguel G. Folgado, Verónica Sanz, Johannes Hirn, Edgar Lorenzo-Sáez, Javier F. Urchueguía

Traffic congestion represents a significant urban challenge, with notable implications for public health and environmental well-being. Consequently, urban decision-makers prioritize the mitigation of congestion. This study delves into the efficacy of harnessing extensive data on urban traffic dynamics, coupled with comprehensive knowledge of road networks, to enable Artificial Intelligence (AI) in forecasting traffic flux well in advance. Such forecasts hold promise for informing emission reduction measures, particularly those aligned with Low Emission Zone policies. The investigation centers on Valencia, leveraging its robust traffic sensor infrastructure, one of the most densely deployed worldwide, encompassing approximately 3500 sensors strategically positioned across the city. Employing historical data spanning 2016 and 2017, we undertake the task of training and characterizing a Long Short-Term Memory (LSTM) Neural Network for the prediction of temporal traffic patterns. Our findings demonstrate the LSTM's efficacy in real-time forecasting of traffic flow evolution, facilitated by its ability to discern salient patterns within the dataset.

交通拥堵是城市面临的一项重大挑战,对公共健康和环境福祉具有显著影响。因此,城市决策者优先考虑缓解拥堵。这项研究深入探讨了利用大量城市交通动态数据,再加上对道路网络的全面了解,使人工智能(AI)能够提前预测交通流量的有效性。这种预测有望为减排措施提供信息,特别是那些与低排放区政策相一致的措施。调查以瓦伦西亚为中心,利用其强大的交通传感器基础设施,这是世界上部署最密集的基础设施之一,包括大约3500个传感器,战略性地分布在整个城市。利用2016年和2017年的历史数据,我们承担了训练和表征长短期记忆(LSTM)神经网络的任务,用于预测时间交通模式。我们的研究结果证明了LSTM在实时预测交通流演变方面的有效性,这得益于其在数据集中识别显著模式的能力。
{"title":"Towards Predictive Pollution Control Through Traffic Flux Forecasting With Deep Learning: A Case Study in the City of Valencia","authors":"Miguel G. Folgado,&nbsp;Verónica Sanz,&nbsp;Johannes Hirn,&nbsp;Edgar Lorenzo-Sáez,&nbsp;Javier F. Urchueguía","doi":"10.1002/ail2.106","DOIUrl":"https://doi.org/10.1002/ail2.106","url":null,"abstract":"<p>Traffic congestion represents a significant urban challenge, with notable implications for public health and environmental well-being. Consequently, urban decision-makers prioritize the mitigation of congestion. This study delves into the efficacy of harnessing extensive data on urban traffic dynamics, coupled with comprehensive knowledge of road networks, to enable Artificial Intelligence (AI) in forecasting traffic flux well in advance. Such forecasts hold promise for informing emission reduction measures, particularly those aligned with Low Emission Zone policies. The investigation centers on Valencia, leveraging its robust traffic sensor infrastructure, one of the most densely deployed worldwide, encompassing approximately 3500 sensors strategically positioned across the city. Employing historical data spanning 2016 and 2017, we undertake the task of training and characterizing a Long Short-Term Memory (LSTM) Neural Network for the prediction of temporal traffic patterns. Our findings demonstrate the LSTM's efficacy in real-time forecasting of traffic flow evolution, facilitated by its ability to discern salient patterns within the dataset.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.106","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143404766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mechanical Ventilator Settings Estimation From an AI Model 基于AI模型的机械呼吸机设置估计
Pub Date : 2024-10-24 DOI: 10.1002/ail2.103
Ali Moghadam, Ramana M. Pidaparti

Mechanical ventilation (MV) is used in subjects with respiratory problems for assistance in breathing. Various MV settings are adjusted at the clinician's discretion based on the patient's respiratory condition. In this study, an AI (artificial intelligence) model using artificial neural networks (ANNs) along with Bayesian Optimization (BO) was developed to estimate the desired MV settings for various subject scenarios. The ANN model with two hidden layers was trained with experimental data collected from subjects (canines and felines) in our previous work. Inverse mapping of the trained ANNs was conducted using BO to predict the acceptable MV settings for specific subject outcomes. Our results suggest that the model can support veterinarians in estimating the proper MV parameters for optimal subject outcome.

机械通气(MV)用于有呼吸问题的受试者,以帮助呼吸。根据患者的呼吸状况,临床医生可以自行调整各种MV设置。在这项研究中,使用人工神经网络(ann)和贝叶斯优化(BO)开发了一个AI(人工智能)模型,以估计各种主题场景的理想MV设置。具有两个隐藏层的人工神经网络模型是用我们之前的工作中从受试者(犬科动物和猫科动物)收集的实验数据进行训练的。使用BO对训练好的人工神经网络进行逆映射,以预测特定受试者结果的可接受MV设置。我们的结果表明,该模型可以支持兽医估计适当的MV参数,以获得最佳的受试者结果。
{"title":"Mechanical Ventilator Settings Estimation From an AI Model","authors":"Ali Moghadam,&nbsp;Ramana M. Pidaparti","doi":"10.1002/ail2.103","DOIUrl":"https://doi.org/10.1002/ail2.103","url":null,"abstract":"<p>Mechanical ventilation (MV) is used in subjects with respiratory problems for assistance in breathing. Various MV settings are adjusted at the clinician's discretion based on the patient's respiratory condition. In this study, an AI (artificial intelligence) model using artificial neural networks (ANNs) along with Bayesian Optimization (BO) was developed to estimate the desired MV settings for various subject scenarios. The ANN model with two hidden layers was trained with experimental data collected from subjects (canines and felines) in our previous work. Inverse mapping of the trained ANNs was conducted using BO to predict the acceptable MV settings for specific subject outcomes. Our results suggest that the model can support veterinarians in estimating the proper MV parameters for optimal subject outcome.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.103","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143404746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning-Driven Modeling of Dynamic Acoustic Sensing in Biomimetic Soft-Robotic Pinnae 仿生软体机器人Pinnae动态声传感的深度学习驱动建模
Pub Date : 2024-10-22 DOI: 10.1002/ail2.107
Sounak Chakrabarti, Rolf Müller

Biological function often depends on complex mechanisms of a dynamic, time-variant nature. An example is certain bat species (horseshoe bats—Rhinolophidae) that use intricate pinna musculatures to execute a variety of pinna deformations. While prior work has indicated the potential significance of these motions for sensory information encoding, it remains unclear how the complex time-variant pinna geometries could be controlled to enhance sensory performance. To address this issue, this work has investigated deep neural network models as digital twins for biomimetic pinnae. The networks were trained to predict the acoustic impacts of the deformed pinna geometries. A total of three network architectures have been evaluated for this purpose using physical numerical simulations (boundary element method) as ground truth. The networks predicted the acoustic beampattern function from pinna shape or even directly from the states of actuators that were used to deform the pinna shapes in simulation. Inserting prior knowledge in the form of beam-shaped basis functions did not improve network performance. The ability of the networks to produce beampattern predictions with low computational effort (in about three milliseconds each) should lend itself readily to supporting learning methods such as deep reinforcement learning that require many such functional evaluations.

生物功能往往依赖于动态的、时变性质的复杂机制。一个例子是某些蝙蝠物种(马蹄蝙蝠-鼻蝠科),它们使用复杂的耳廓肌肉组织来执行各种耳廓变形。虽然先前的工作已经表明了这些运动对感觉信息编码的潜在意义,但如何控制复杂的时变耳廓几何形状以提高感觉性能仍不清楚。为了解决这个问题,本研究将深度神经网络模型作为仿生耳廓的数字双胞胎进行了研究。这些网络被训练来预测变形的耳廓几何形状的声学影响。为此目的,使用物理数值模拟(边界元方法)作为基础真值对总共三种网络架构进行了评估。该网络根据耳廓形状甚至直接根据仿真中用于变形耳廓形状的致动器的状态预测声波束图函数。以梁型基函数的形式插入先验知识并不能提高网络性能。网络以较低的计算工作量(每次大约3毫秒)产生波束模式预测的能力,应该可以很容易地支持学习方法,如需要许多这样的功能评估的深度强化学习。
{"title":"Deep Learning-Driven Modeling of Dynamic Acoustic Sensing in Biomimetic Soft-Robotic Pinnae","authors":"Sounak Chakrabarti,&nbsp;Rolf Müller","doi":"10.1002/ail2.107","DOIUrl":"https://doi.org/10.1002/ail2.107","url":null,"abstract":"<p>Biological function often depends on complex mechanisms of a dynamic, time-variant nature. An example is certain bat species (horseshoe bats—Rhinolophidae) that use intricate pinna musculatures to execute a variety of pinna deformations. While prior work has indicated the potential significance of these motions for sensory information encoding, it remains unclear how the complex time-variant pinna geometries could be controlled to enhance sensory performance. To address this issue, this work has investigated deep neural network models as digital twins for biomimetic pinnae. The networks were trained to predict the acoustic impacts of the deformed pinna geometries. A total of three network architectures have been evaluated for this purpose using physical numerical simulations (boundary element method) as ground truth. The networks predicted the acoustic beampattern function from pinna shape or even directly from the states of actuators that were used to deform the pinna shapes in simulation. Inserting prior knowledge in the form of beam-shaped basis functions did not improve network performance. The ability of the networks to produce beampattern predictions with low computational effort (in about three milliseconds each) should lend itself readily to supporting learning methods such as deep reinforcement learning that require many such functional evaluations.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.107","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143404434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Hybrid Fuzzy Deep Belief Network Extreme Learning Machine Framework With Hyperbolic Secant Activation Function for Robust Semi-Supervised Sentiment Classification 基于双曲割线激活函数的混合模糊深度信念网络极限学习机框架用于鲁棒半监督情感分类
Pub Date : 2024-10-13 DOI: 10.1002/ail2.102
Maryam Mozafari, Mohammad Hossein Moattar

Sentiment classification deals with extracting and classifying the text sentiment. Fuzzy Deep Belief Network (DBN) has proved its efficiency in dealing with sentiment analysis and suitability for classifying unlabeled or semi-labeled data. Previous structures of deep belief networks are mostly made of traditional activation functions such as sigmoid. In this paper, a new activation function, which is referred to as hyperbolic secant function, is proposed. The new activation function not only solves gradient zeroing problem but also increases the accuracy and efficiency. Besides, extreme learning machine (ELM) is proposed as the decision layer to increase the accuracy and improve the generalizability through solving gradient-based learning problem. The efficiency of the proposed method has been experimented on “IMDB” movie critic dataset, 20-newspaper dataset and Sentiment Analysis dataset. The results of the proposed method are more accurate and precise as compared with the previous approaches.

情感分类处理的是文本情感的提取和分类。模糊深度信念网络(DBN)在处理情感分析方面的有效性和对未标记或半标记数据分类的适用性得到了证明。以往的深度信念网络结构多由传统的s型激活函数构成。本文提出了一种新的激活函数,称为双曲正割函数。新的激活函数不仅解决了梯度归零问题,而且提高了精度和效率。此外,提出极限学习机(ELM)作为决策层,通过解决基于梯度的学习问题来提高准确率和泛化能力。在“IMDB”影评人数据集、20家报纸数据集和情感分析数据集上进行了实验。与以往的方法相比,该方法的计算结果更加准确和精确。
{"title":"A Hybrid Fuzzy Deep Belief Network Extreme Learning Machine Framework With Hyperbolic Secant Activation Function for Robust Semi-Supervised Sentiment Classification","authors":"Maryam Mozafari,&nbsp;Mohammad Hossein Moattar","doi":"10.1002/ail2.102","DOIUrl":"https://doi.org/10.1002/ail2.102","url":null,"abstract":"<p>Sentiment classification deals with extracting and classifying the text sentiment. Fuzzy Deep Belief Network (DBN) has proved its efficiency in dealing with sentiment analysis and suitability for classifying unlabeled or semi-labeled data. Previous structures of deep belief networks are mostly made of traditional activation functions such as sigmoid. In this paper, a new activation function, which is referred to as hyperbolic secant function, is proposed. The new activation function not only solves gradient zeroing problem but also increases the accuracy and efficiency. Besides, extreme learning machine (ELM) is proposed as the decision layer to increase the accuracy and improve the generalizability through solving gradient-based learning problem. The efficiency of the proposed method has been experimented on “IMDB” movie critic dataset, 20-newspaper dataset and Sentiment Analysis dataset. The results of the proposed method are more accurate and precise as compared with the previous approaches.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.102","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143404388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Application of 3D Vision Transformers and Explainable AI in Prosthetic Dentistry 三维视觉变形与可解释人工智能在牙科修复中的应用
Pub Date : 2024-09-03 DOI: 10.1002/ail2.101
Faisal Ahmed Sifat, Md Sahadul Hasan Arian, Saif Ahmed, Taseef Hasan Farook, Nabeel Mohammed, James Dudley

To create and validate a transformer-based deep neural network architecture for classifying 3D scans of teeth for computer-assisted manufacturing and dental prosthetic rehabilitation surpassing previously reported validation accuracies obtained with convolutional neural networks (CNNs). Voxel-based representation and encoding input data in a high-dimensional space forms of preprocessing were investigated using 34 3D models of teeth obtained from intraoral scanning. Independent CNNs and vision transformers (ViTs), and their combination (CNN and ViT hybrid model) were implemented to classify the 3D scans directly from standard tessellation language (.stl) files and an Explainable AI (ExAI) model was generated to qualitatively explore the deterministic patterns that influenced the outcomes of the automation process. The results demonstrate that the CNN and ViT hybrid model architecture surpasses conventional supervised CNN, achieving a consistent validation accuracy of 90% through three-fold cross-validation. This process validated our initial findings, where each instance had the opportunity to be part of the validation set, ensuring it remained unseen during training. Furthermore, employing high-dimensional encoding of input data solely with 3DCNN yields a validation accuracy of 80%. When voxel data preprocessing is utilized, ViT outperforms CNN, achieving validation accuracies of 80% and 50%, respectively. The study also highlighted the saliency map's ability to identify areas of tooth cavity preparation of restorative importance, that can theoretically enable more accurate 3D printed prosthetic outputs. The investigation introduced a CNN and ViT hybrid model for classification of 3D tooth models in digital dentistry, and it was the first to employ ExAI in the efforts to automate the process of dental computer-assisted manufacturing.

创建并验证基于变压器的深度神经网络架构,用于对牙齿3D扫描进行分类,用于计算机辅助制造和牙科假肢康复,超过先前报道的卷积神经网络(cnn)获得的验证精度。利用口腔内扫描获得的34个牙齿三维模型,研究了基于体素的高维空间表示和编码输入数据的预处理形式。实现了独立的CNN和视觉转换器(ViT)及其组合(CNN和ViT混合模型),直接从标准细分语言(.stl)文件中对3D扫描进行分类,并生成了可解释的AI (ExAI)模型,以定性地探索影响自动化过程结果的确定性模式。结果表明,CNN和ViT混合模型架构优于传统的有监督CNN,通过三次交叉验证,验证准确率达到90%。这个过程验证了我们最初的发现,其中每个实例都有机会成为验证集的一部分,确保它在训练期间不可见。此外,仅使用3DCNN对输入数据进行高维编码,验证准确率达到80%。当使用体素数据预处理时,ViT优于CNN,验证准确率分别达到80%和50%。该研究还强调了显著性图识别牙腔准备修复重要性区域的能力,理论上可以实现更精确的3D打印假体输出。该研究引入了CNN和ViT混合模型,用于数字牙科中3D牙齿模型的分类,并且首次使用ExAI来实现牙科计算机辅助制造过程的自动化。
{"title":"An Application of 3D Vision Transformers and Explainable AI in Prosthetic Dentistry","authors":"Faisal Ahmed Sifat,&nbsp;Md Sahadul Hasan Arian,&nbsp;Saif Ahmed,&nbsp;Taseef Hasan Farook,&nbsp;Nabeel Mohammed,&nbsp;James Dudley","doi":"10.1002/ail2.101","DOIUrl":"https://doi.org/10.1002/ail2.101","url":null,"abstract":"<p>To create and validate a transformer-based deep neural network architecture for classifying 3D scans of teeth for computer-assisted manufacturing and dental prosthetic rehabilitation surpassing previously reported validation accuracies obtained with convolutional neural networks (CNNs). Voxel-based representation and encoding input data in a high-dimensional space forms of preprocessing were investigated using 34 3D models of teeth obtained from intraoral scanning. Independent CNNs and vision transformers (ViTs), and their combination (CNN and ViT hybrid model) were implemented to classify the 3D scans directly from standard tessellation language (.stl) files and an Explainable AI (ExAI) model was generated to qualitatively explore the deterministic patterns that influenced the outcomes of the automation process. The results demonstrate that the CNN and ViT hybrid model architecture surpasses conventional supervised CNN, achieving a consistent validation accuracy of 90% through three-fold cross-validation. This process validated our initial findings, where each instance had the opportunity to be part of the validation set, ensuring it remained unseen during training. Furthermore, employing high-dimensional encoding of input data solely with 3DCNN yields a validation accuracy of 80%. When voxel data preprocessing is utilized, ViT outperforms CNN, achieving validation accuracies of 80% and 50%, respectively. The study also highlighted the saliency map's ability to identify areas of tooth cavity preparation of restorative importance, that can theoretically enable more accurate 3D printed prosthetic outputs. The investigation introduced a CNN and ViT hybrid model for classification of 3D tooth models in digital dentistry, and it was the first to employ ExAI in the efforts to automate the process of dental computer-assisted manufacturing.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"5 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.101","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142859877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Applied AI letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1