首页 > 最新文献

Applied AI letters最新文献

英文 中文
Deep aspect extraction and classification for opinion mining in e-commerce applications using convolutional neural network feature extraction followed by long short term memory attention model 基于卷积神经网络特征提取和长短期记忆注意模型的深度方面提取和分类在电子商务应用中的意见挖掘
Pub Date : 2023-08-09 DOI: 10.1002/ail2.86
Kamal Sharbatian, Mohammad Hossein Moattar

Users of e-commerce websites review different aspects of a product in the comment section. In this research, an approach is proposed for opinion aspect extraction and recognition in selling systems. We have used the users' opinions from the Digikala website (www.Digikala.com), which is an Iranian e-commerce company. In this research, a language-independent framework is proposed that is adjustable to other languages. In this regard, after necessary text processing and preparation steps, the existence of an aspect in an opinion is determined using deep learning algorithms. The proposed model combines Convolutional Neural Network (CNN) and long-short-term memory (LSTM) deep learning approaches. CNN is one of the best algorithms for extracting latent features from data. On the other hand, LSTM can detect latent temporal relationships among different words in a text due to its memory ability and attention model. The approach is evaluated on six classes of opinion aspects. Based on the experiments, the proposed model's accuracy, precision, and recall are 70%, 60%, and 85%, respectively. The proposed model was compared in terms of the above criteria with CNN, Naive Bayes, and SVM algorithms and showed satisfying performance.

电子商务网站的用户会在评论区对产品的不同方面进行评论。本研究提出了一种销售系统中意见方面的提取与识别方法。我们使用了来自Digikala网站(www.Digikala.com)的用户意见,这是一家伊朗电子商务公司。在这项研究中,我们提出了一个独立于语言的框架,可以调整到其他语言。在这方面,经过必要的文本处理和准备步骤,使用深度学习算法确定意见中某个方面的存在。该模型结合了卷积神经网络(CNN)和长短期记忆(LSTM)深度学习方法。CNN是从数据中提取潜在特征的最佳算法之一。另一方面,由于LSTM的记忆能力和注意模型,它可以检测文本中不同单词之间的潜在时间关系。对该方法进行了六类意见方面的评估。实验结果表明,该模型的准确率为70%,精密度为60%,召回率为85%。将该模型与CNN、朴素贝叶斯和支持向量机算法在上述标准下进行了比较,结果令人满意。
{"title":"Deep aspect extraction and classification for opinion mining in e-commerce applications using convolutional neural network feature extraction followed by long short term memory attention model","authors":"Kamal Sharbatian,&nbsp;Mohammad Hossein Moattar","doi":"10.1002/ail2.86","DOIUrl":"10.1002/ail2.86","url":null,"abstract":"<p>Users of e-commerce websites review different aspects of a product in the comment section. In this research, an approach is proposed for opinion aspect extraction and recognition in selling systems. We have used the users' opinions from the Digikala website (www.Digikala.com), which is an Iranian e-commerce company. In this research, a language-independent framework is proposed that is adjustable to other languages. In this regard, after necessary text processing and preparation steps, the existence of an aspect in an opinion is determined using deep learning algorithms. The proposed model combines Convolutional Neural Network (CNN) and long-short-term memory (LSTM) deep learning approaches. CNN is one of the best algorithms for extracting latent features from data. On the other hand, LSTM can detect latent temporal relationships among different words in a text due to its memory ability and attention model. The approach is evaluated on six classes of opinion aspects. Based on the experiments, the proposed model's accuracy, precision, and recall are 70%, 60%, and 85%, respectively. The proposed model was compared in terms of the above criteria with CNN, Naive Bayes, and SVM algorithms and showed satisfying performance.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"4 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.86","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42279959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting mobile money transaction fraud using machine learning algorithms 使用机器学习算法预测移动货币交易欺诈
Pub Date : 2023-07-12 DOI: 10.1002/ail2.85
Mark E. Lokanan

The ease with which mobile money is used to facilitate cross-border payments presents a global threat to law enforcement in the fight against money laundering and terrorist financing. This paper aims to utilize machine learning classifiers to predict transactions flagged as a fraud in mobile money transfers. The data for this study were obtained from real-time transactions that simulate a well-known mobile transfer fraud scheme. Logistic regression is used as the baseline model and is compared with ensemble and gradient descent models. The results indicate that the logistic regression model still showed reasonable performance while not performing as well as the other models. Among all the measures, the random forest classifier exhibited outstanding performance. The amount of money transferred emerged as the top feature for predicting money laundering transactions in mobile money transfers. These findings suggest that further research is needed to enhance the logistic regression model, and the random forest classifier should be explored as a potential tool for law enforcement and financial institutions to detect money laundering activities in mobile money transfers.

使用移动货币便利跨境支付的便利性对打击洗钱和恐怖主义融资的执法部门构成了全球性威胁。本文旨在利用机器学习分类器来预测移动转账中被标记为欺诈的交易。本研究的数据是从模拟一个众所周知的移动转账欺诈方案的实时交易中获得的。采用Logistic回归作为基线模型,并与集合模型和梯度下降模型进行了比较。结果表明,逻辑回归模型虽然表现不如其他模型,但仍具有合理的性能。在所有度量中,随机森林分类器表现出优异的性能。在移动汇款中,汇款金额成为预测洗钱交易的首要特征。这些发现表明,需要进一步研究来增强逻辑回归模型,并应探索随机森林分类器作为执法和金融机构检测移动汇款中洗钱活动的潜在工具。
{"title":"Predicting mobile money transaction fraud using machine learning algorithms","authors":"Mark E. Lokanan","doi":"10.1002/ail2.85","DOIUrl":"10.1002/ail2.85","url":null,"abstract":"<p>The ease with which mobile money is used to facilitate cross-border payments presents a global threat to law enforcement in the fight against money laundering and terrorist financing. This paper aims to utilize machine learning classifiers to predict transactions flagged as a fraud in mobile money transfers. The data for this study were obtained from real-time transactions that simulate a well-known mobile transfer fraud scheme. Logistic regression is used as the baseline model and is compared with ensemble and gradient descent models. The results indicate that the logistic regression model still showed reasonable performance while not performing as well as the other models. Among all the measures, the random forest classifier exhibited outstanding performance. The amount of money transferred emerged as the top feature for predicting money laundering transactions in mobile money transfers. These findings suggest that further research is needed to enhance the logistic regression model, and the random forest classifier should be explored as a potential tool for law enforcement and financial institutions to detect money laundering activities in mobile money transfers.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"4 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.85","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46263497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated patent classification for crop protection via domain adaptation 通过领域适应的作物保护自动专利分类
Pub Date : 2023-02-15 DOI: 10.1002/ail2.80
Dimitrios Christofidellis, Marzena Maria Lehmann, Torsten Luksch, Marco Stenta, Matteo Manica

Patents show how technology evolves in most scientific fields over time. The best way to use this valuable knowledge base is to use efficient and effective information retrieval and searches for related prior art. Patent classification, that is, assigning a patent to one or more predefined categories, is a fundamental step towards synthesizing the information content of an invention. To this end, architectures based on Transformers, especially those derived from the BERT family have already been proposed in the literature and they have shown remarkable results by setting a new state-of-the-art performance for the classification task. Here, we study how domain adaptation can push the performance boundaries in patent classification by rigorously evaluating and implementing a collection of recent transfer learning techniques, for example, domain-adaptive pretraining and adapters. Our analysis shows how leveraging these advancements enables the development of state-of-the-art models with increased precision, recall, and F1-score. We base our evaluation on both standard patent classification datasets derived from patent offices-defined code hierarchies and more practical real-world use-case scenarios containing labels from the agrochemical industrial domain. The application of these domain adapted techniques to patent classification in a multilingual setting is also examined and evaluated.

专利显示了大多数科学领域的技术如何随着时间的推移而演变。使用这个有价值的知识库的最佳方法是使用高效和有效的信息检索和相关现有技术的搜索。专利分类,即将专利分配给一个或多个预定义的类别,是合成发明信息内容的基本步骤。为此,基于transformer的架构,特别是那些来自BERT家族的架构已经在文献中提出,并且通过为分类任务设置新的最先进的性能,它们已经显示出显着的结果。在这里,我们通过严格评估和实施一系列最新的迁移学习技术(例如,领域自适应预训练和适配器)来研究领域自适应如何在专利分类中突破性能界限。我们的分析显示了如何利用这些进步来开发具有更高精度、召回率和f1分数的最先进模型。我们的评估基于来自专利局定义的代码层次结构的标准专利分类数据集,以及包含农化工业领域标签的更实际的现实用例场景。这些领域适应技术在多语言环境下的专利分类应用也被检查和评估。
{"title":"Automated patent classification for crop protection via domain adaptation","authors":"Dimitrios Christofidellis,&nbsp;Marzena Maria Lehmann,&nbsp;Torsten Luksch,&nbsp;Marco Stenta,&nbsp;Matteo Manica","doi":"10.1002/ail2.80","DOIUrl":"10.1002/ail2.80","url":null,"abstract":"<p>Patents show how technology evolves in most scientific fields over time. The best way to use this valuable knowledge base is to use efficient and effective information retrieval and searches for related prior art. Patent classification, that is, assigning a patent to one or more predefined categories, is a fundamental step towards synthesizing the information content of an invention. To this end, architectures based on Transformers, especially those derived from the BERT family have already been proposed in the literature and they have shown remarkable results by setting a new state-of-the-art performance for the classification task. Here, we study how domain adaptation can push the performance boundaries in patent classification by rigorously evaluating and implementing a collection of recent transfer learning techniques, for example, domain-adaptive pretraining and adapters. Our analysis shows how leveraging these advancements enables the development of state-of-the-art models with increased precision, recall, and <i>F</i>1-score. We base our evaluation on both standard patent classification datasets derived from patent offices-defined code hierarchies and more practical real-world use-case scenarios containing labels from the agrochemical industrial domain. The application of these domain adapted techniques to patent classification in a multilingual setting is also examined and evaluated.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"4 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.80","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43702643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced recognition of adolescents with schizophrenia and a computational contrast of their neuroanatomy with healthy patients using brainwave signals 使用脑波信号增强对青少年精神分裂症患者的识别及其与健康患者神经解剖学的计算对比
Pub Date : 2023-01-12 DOI: 10.1002/ail2.79
Ejay Nsugbe

Schizophrenia is a psychiatric disorder which is prevalent in individuals around the world, where diagnosis methods for this disorder are done via a combination of interview style questioning of the patient alongside a review of their medical record; but these methods have been largely criticised for being subjective between psychiatrists and largely unreplicable. Schizophrenia also occurs in adolescent individuals who have been said to be even more challenging to diagnose largely due to delusions being mistaken for childhood fantasies, and established methods for adult patients being applied to diagnose adolescents. This work investigates the use of electroencephalography (EEG) signals acquired from adolescent patients in the age range of 10–14 years, alongside signal processing methods and machine learning modelling towards the diagnosis of adolescent schizophrenia. The results from the machine learning modelling showed that the linear discriminant analysis (LDA) and fine K-nearest neighbour (KNN) produced the best recognition results for models with easy and hard interpretability, respectively. Additionally, a computational method was applied towards contrasting the neuroanatomical activation patterns in the brain of the schizophrenic and normal adolescents, where it was seen that the neural activation patterns of the normal adolescents showed a greater consistency when compared with the schizophrenics.

精神分裂症是一种精神疾病,在世界各地的个体中普遍存在,这种疾病的诊断方法是通过对患者的访谈式提问和对其医疗记录的回顾相结合来完成的;但这些方法在很大程度上受到了批评,因为它们在精神科医生之间是主观的,而且基本上不可复制。精神分裂症也发生在青少年身上,据说他们更难诊断,很大程度上是因为错觉被误认为是童年的幻想,而成年患者的既定方法被用于诊断青少年。这项工作调查了从10-14岁的青少年患者中获得的脑电图(EEG)信号的使用,以及信号处理方法和机器学习建模对青少年精神分裂症的诊断。机器学习建模的结果表明,线性判别分析(LDA)和精细k近邻(KNN)分别对易解释性和难解释性的模型产生了最好的识别结果。此外,应用计算方法对比了精神分裂症患者和正常青少年的大脑神经解剖学激活模式,发现正常青少年的神经激活模式与精神分裂症患者相比表现出更大的一致性。
{"title":"Enhanced recognition of adolescents with schizophrenia and a computational contrast of their neuroanatomy with healthy patients using brainwave signals","authors":"Ejay Nsugbe","doi":"10.1002/ail2.79","DOIUrl":"10.1002/ail2.79","url":null,"abstract":"<p>Schizophrenia is a psychiatric disorder which is prevalent in individuals around the world, where diagnosis methods for this disorder are done via a combination of interview style questioning of the patient alongside a review of their medical record; but these methods have been largely criticised for being subjective between psychiatrists and largely unreplicable. Schizophrenia also occurs in adolescent individuals who have been said to be even more challenging to diagnose largely due to delusions being mistaken for childhood fantasies, and established methods for adult patients being applied to diagnose adolescents. This work investigates the use of electroencephalography (EEG) signals acquired from adolescent patients in the age range of 10–14 years, alongside signal processing methods and machine learning modelling towards the diagnosis of adolescent schizophrenia. The results from the machine learning modelling showed that the linear discriminant analysis (LDA) and fine K-nearest neighbour (KNN) produced the best recognition results for models with easy and hard interpretability, respectively. Additionally, a computational method was applied towards contrasting the neuroanatomical activation patterns in the brain of the schizophrenic and normal adolescents, where it was seen that the neural activation patterns of the normal adolescents showed a greater consistency when compared with the schizophrenics.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"4 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.79","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42576192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Twin neural network regression 双神经网络回归
Pub Date : 2022-10-04 DOI: 10.1002/ail2.78
Sebastian Johann Wetzel, Kevin Ryczko, Roger Gordon Melko, Isaac Tamblyn

We introduce twin neural network regression (TNNR). This method predicts differences between the target values of two different data points rather than the targets themselves. The solution of a traditional regression problem is then obtained by averaging over an ensemble of all predicted differences between the targets of an unseen data point and all training data points. Whereas ensembles are normally costly to produce, TNNR intrinsically creates an ensemble of predictions of twice the size of the training set while only training a single neural network. Since ensembles have been shown to be more accurate than single models this property naturally transfers to TNNR. We show that TNNs are able to compete or yield more accurate predictions for different data sets, compared with other state-of-the-art methods. Furthermore, TNNR is constrained by self-consistency conditions. We find that the violation of these conditions provides a signal for the prediction uncertainty.

我们介绍了孪生神经网络回归(TNNR)。这种方法预测的是两个不同数据点目标值之间的差异,而不是目标值本身。传统回归问题的解决方案是通过对未见数据点和所有训练数据点的目标之间的所有预测差异的集合进行平均来获得的。虽然集成通常是昂贵的,但TNNR本质上创造了一个两倍于训练集大小的预测集成,而只训练一个神经网络。由于综合模型已被证明比单一模型更精确,这种特性自然地转移到TNNR。我们表明,与其他最先进的方法相比,tnn能够在不同的数据集上竞争或产生更准确的预测。此外,TNNR受自洽条件的约束。我们发现这些条件的违背为预测的不确定性提供了一个信号。
{"title":"Twin neural network regression","authors":"Sebastian Johann Wetzel,&nbsp;Kevin Ryczko,&nbsp;Roger Gordon Melko,&nbsp;Isaac Tamblyn","doi":"10.1002/ail2.78","DOIUrl":"10.1002/ail2.78","url":null,"abstract":"<p>We introduce twin neural network regression (TNNR). This method predicts differences between the target values of two different data points rather than the targets themselves. The solution of a traditional regression problem is then obtained by averaging over an ensemble of all predicted differences between the targets of an unseen data point and all training data points. Whereas ensembles are normally costly to produce, TNNR intrinsically creates an ensemble of predictions of twice the size of the training set while only training a single neural network. Since ensembles have been shown to be more accurate than single models this property naturally transfers to TNNR. We show that TNNs are able to compete or yield more accurate predictions for different data sets, compared with other state-of-the-art methods. Furthermore, TNNR is constrained by self-consistency conditions. We find that the violation of these conditions provides a signal for the prediction uncertainty.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"3 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.78","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77494627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Evaluating perceptual and semantic interpretability of saliency methods: A case study of melanoma 评估显著性方法的感知和语义可解释性:黑色素瘤的案例研究
Pub Date : 2022-09-13 DOI: 10.1002/ail2.77
Harshit Bokadia, Scott Cheng-Hsin Yang, Zhaobin Li, Tomas Folke, Patrick Shafto

In order to be useful, XAI explanations have to be faithful to the AI system they seek to elucidate and also interpretable to the people that engage with them. There exist multiple algorithmic methods for assessing faithfulness, but this is not so for interpretability, which is typically only assessed through expensive user studies. Here we propose two complementary metrics to algorithmically evaluate the interpretability of saliency map explanations. One metric assesses perceptual interpretability by quantifying the visual coherence of the saliency map. The second metric assesses semantic interpretability by capturing the degree of overlap between the saliency map and textbook features—features human experts use to make a classification. We use a melanoma dataset and a deep-neural network classifier as a case-study to explore how our two interpretability metrics relate to each other and a faithfulness metric. Across six commonly used saliency methods, we find that none achieves high scores across all three metrics for all test images, but that different methods perform well in different regions of the data distribution. This variation between methods can be leveraged to consistently achieve high interpretability and faithfulness by using our metrics to inform saliency mask selection on a case-by-case basis. Our interpretability metrics provide a new way to evaluate saliency-based explanations and allow for the adaptive combination of saliency-based explanation methods.

为了发挥作用,XAI解释必须忠实于它们试图解释的AI系统,并且能够让参与其中的人理解。存在多种算法方法来评估忠实度,但对于可解释性而言并非如此,这通常只能通过昂贵的用户研究来评估。在这里,我们提出了两个互补的指标,以算法评估显著性地图解释的可解释性。一种度量通过量化显著性图的视觉一致性来评估感知可解释性。第二个指标通过捕捉显著性图和教科书特征之间的重叠程度来评估语义可解释性,这些特征是人类专家用来进行分类的。我们使用黑色素瘤数据集和深度神经网络分类器作为案例研究,探索我们的两个可解释性指标如何相互关联以及可信度指标。在六种常用的显著性方法中,我们发现没有一种方法能够在所有测试图像的所有三个度量中获得高分,但是不同的方法在数据分布的不同区域表现良好。可以利用方法之间的这种差异,通过使用我们的指标来根据具体情况通知显着掩码选择,从而始终如一地实现高可解释性和可靠性。我们的可解释性指标提供了一种新的方法来评估基于显著性的解释,并允许基于显著性的解释方法的自适应组合。
{"title":"Evaluating perceptual and semantic interpretability of saliency methods: A case study of melanoma","authors":"Harshit Bokadia,&nbsp;Scott Cheng-Hsin Yang,&nbsp;Zhaobin Li,&nbsp;Tomas Folke,&nbsp;Patrick Shafto","doi":"10.1002/ail2.77","DOIUrl":"10.1002/ail2.77","url":null,"abstract":"<p>In order to be useful, XAI explanations have to be faithful to the AI system they seek to elucidate and also interpretable to the people that engage with them. There exist multiple algorithmic methods for assessing faithfulness, but this is not so for interpretability, which is typically only assessed through expensive user studies. Here we propose two complementary metrics to algorithmically evaluate the interpretability of saliency map explanations. One metric assesses perceptual interpretability by quantifying the visual coherence of the saliency map. The second metric assesses semantic interpretability by capturing the degree of overlap between the saliency map and textbook features—features human experts use to make a classification. We use a melanoma dataset and a deep-neural network classifier as a case-study to explore how our two interpretability metrics relate to each other and a faithfulness metric. Across six commonly used saliency methods, we find that none achieves high scores across all three metrics for all test images, but that different methods perform well in different regions of the data distribution. This variation between methods can be leveraged to consistently achieve high interpretability and faithfulness by using our metrics to inform saliency mask selection on a case-by-case basis. Our interpretability metrics provide a new way to evaluate saliency-based explanations and allow for the adaptive combination of saliency-based explanation methods.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"3 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.77","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45070752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Applying machine learning for large scale field calibration of low-cost PM2.5 and PM10 air pollution sensors 将机器学习应用于低成本PM2.5和PM10空气污染传感器的大规模现场校准
Pub Date : 2022-07-31 DOI: 10.1002/ail2.76
Priscilla Adong, Engineer Bainomugisha, Deo Okure, Richard Sserunjogi

Low-cost air quality monitoring networks can potentially increase the availability of high-resolution monitoring to inform analytic and evidence-informed approaches to better manage air quality. This is particularly relevant in low and middle-income settings where access to traditional reference-grade monitoring networks remains a challenge. However, low-cost air quality sensors are impacted by ambient conditions which could lead to over- or underestimation of pollution concentrations and thus require field calibration to improve their accuracy and reliability. In this paper, we demonstrate the feasibility of using machine learning methods for large-scale calibration of AirQo sensors, low-cost PM sensors custom-designed for and deployed in Sub-Saharan urban settings. The performance of various machine learning methods is assessed by comparing model corrected PM using k-nearest neighbours, support vector regression, multivariate linear regression, ridge regression, lasso regression, elastic net regression, XGBoost, multilayer perceptron, random forest and gradient boosting with collocated reference PM concentrations from a Beta Attenuation Monitor (BAM). To this end, random forest and lasso regression models were superior for PM2.5 and PM10 calibration, respectively. Employing the random forest model decreased RMSE of raw data from 18.6 μg/m3 to 7.2 μg/m3 with an average BAM PM2.5 concentration of 37.8 μg/m3 while the lasso regression model decreased RMSE from 13.4 μg/m3 to 7.9 μg/m3 with an average BAM PM10 concentration of 51.1 μg/m3. We validate our models through cross-unit and cross-site validation, allowing analysis of AirQo devices' consistency. The resulting calibration models were deployed to the entire large-scale air quality monitoring network consisting of over 120 AirQo devices, which demonstrates the use of machine learning systems to address practical challenges in a developing world setting.

低成本空气质量监测网络有可能增加高分辨率监测的可用性,为更好地管理空气质量的分析和循证方法提供信息。这在低收入和中等收入环境中尤其重要,因为在这些环境中,使用传统的参考级监测网络仍然是一项挑战。然而,低成本的空气质量传感器受到环境条件的影响,可能导致对污染浓度的高估或低估,因此需要现场校准以提高其准确性和可靠性。在本文中,我们展示了使用机器学习方法大规模校准AirQo传感器的可行性,AirQo传感器是为撒哈拉以南城市环境定制并部署的低成本PM传感器。各种机器学习方法的性能通过比较模型校正PM来评估,使用k-近邻、支持向量回归、多元线性回归、脊回归、lasso回归、弹性网回归、XGBoost、多层感知器、随机森林和梯度增强,并使用来自Beta衰减监视器(BAM)的参考PM浓度。因此,随机森林模型和套索回归模型分别对PM2.5和PM10的校准具有优势。采用随机森林模型将原始数据的RMSE从18.6 μg/m3降低到7.2 μg/m3, BAM PM2.5平均浓度为37.8 μg/m3;套索回归模型将RMSE从13.4 μg/m3降低到7.9 μg/m3, BAM PM10平均浓度为51.1 μg/m3。我们通过跨单元和跨站点验证来验证我们的模型,从而分析AirQo设备的一致性。由此产生的校准模型被部署到由120多台AirQo设备组成的整个大规模空气质量监测网络中,这证明了机器学习系统在解决发展中国家环境中的实际挑战方面的应用。
{"title":"Applying machine learning for large scale field calibration of low-cost PM2.5 and PM10 air pollution sensors","authors":"Priscilla Adong,&nbsp;Engineer Bainomugisha,&nbsp;Deo Okure,&nbsp;Richard Sserunjogi","doi":"10.1002/ail2.76","DOIUrl":"10.1002/ail2.76","url":null,"abstract":"<p>Low-cost air quality monitoring networks can potentially increase the availability of high-resolution monitoring to inform analytic and evidence-informed approaches to better manage air quality. This is particularly relevant in low and middle-income settings where access to traditional reference-grade monitoring networks remains a challenge. However, low-cost air quality sensors are impacted by ambient conditions which could lead to over- or underestimation of pollution concentrations and thus require field calibration to improve their accuracy and reliability. In this paper, we demonstrate the feasibility of using machine learning methods for large-scale calibration of AirQo sensors, low-cost PM sensors custom-designed for and deployed in Sub-Saharan urban settings. The performance of various machine learning methods is assessed by comparing model corrected PM using <i>k</i>-nearest neighbours, support vector regression, multivariate linear regression, ridge regression, lasso regression, elastic net regression, XGBoost, multilayer perceptron, random forest and gradient boosting with collocated reference PM concentrations from a Beta Attenuation Monitor (BAM). To this end, random forest and lasso regression models were superior for PM<sub>2.5</sub> and PM<sub>10</sub> calibration, respectively. Employing the random forest model decreased RMSE of raw data from 18.6 μg/m<sup>3</sup> to 7.2 μg/m<sup>3</sup> with an average BAM PM<sub>2.5</sub> concentration of 37.8 μg/m<sup>3</sup> while the lasso regression model decreased RMSE from 13.4 μg/m<sup>3</sup> to 7.9 μg/m<sup>3</sup> with an average BAM PM<sub>10</sub> concentration of 51.1 μg/m<sup>3</sup>. We validate our models through cross-unit and cross-site validation, allowing analysis of AirQo devices' consistency. The resulting calibration models were deployed to the entire large-scale air quality monitoring network consisting of over 120 AirQo devices, which demonstrates the use of machine learning systems to address practical challenges in a developing world setting.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"3 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.76","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48427411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Deep learning to predict power output from respiratory inductive plethysmography data 深度学习预测呼吸感应脉搏波数据的功率输出
Pub Date : 2022-03-17 DOI: 10.1002/ail2.65
Erik Johannes B. L. G Husom, Pierre Bernabé, Sagar Sen

Power output is one of the most accurate methods for measuring exercise intensity during outdoor endurance sports, since it records the actual effect of the work performed by the muscles over time. However, power meters are expensive and are limited to activity forms where it is possible to embed sensors in the propulsion system such as in cycling. We investigate using breathing to estimate power output during exercise, in order to create a portable method for tracking physical effort that is universally applicable in many activity forms. Breathing can be quantified through respiratory inductive plethysmography (RIP), which entails recording the movement of the rib cage and abdomen caused by breathing, and it enables us to have a portable, non-invasive device for measuring breathing. RIP signals, heart rate and power output were recorded during a N-of-1 study of a person performing a set of workouts on a stationary bike. The recorded data were used to build predictive models through deep learning algorithms. A convolutional neural network (CNN) trained on features derived from RIP signals and heart rate obtained a mean absolute percentage error (MAPE) of 0.20 (ie, 20% average error). The model showed promising capability of estimating correct power levels and reactivity to changes in power output, but the accuracy is significantly lower than that of cycling power meters.

在户外耐力运动中,功率输出是测量运动强度最准确的方法之一,因为它记录了肌肉在一段时间内完成的工作的实际效果。然而,功率计是昂贵的,并且仅限于活动形式,在这些活动形式中可以将传感器嵌入推进系统,例如在自行车中。我们研究使用呼吸来估计运动过程中的能量输出,以便创建一种便携式方法来跟踪身体的努力,这是普遍适用于许多活动形式。呼吸可以通过呼吸感应容积描记术(RIP)来量化,它需要记录由呼吸引起的胸腔和腹部的运动,它使我们有一个便携式的,非侵入性的测量呼吸的设备。在一项N-of-1的研究中,研究人员记录了一个人在固定自行车上进行一系列锻炼时的RIP信号、心率和能量输出。记录的数据通过深度学习算法建立预测模型。卷积神经网络(CNN)对来自RIP信号和心率的特征进行训练,得到的平均绝对百分比误差(MAPE)为0.20(即平均误差为20%)。该模型在正确估计功率水平和对输出功率变化的反应性方面表现出良好的能力,但精度明显低于循环功率表。
{"title":"Deep learning to predict power output from respiratory inductive plethysmography data","authors":"Erik Johannes B. L. G Husom,&nbsp;Pierre Bernabé,&nbsp;Sagar Sen","doi":"10.1002/ail2.65","DOIUrl":"10.1002/ail2.65","url":null,"abstract":"<p>Power output is one of the most accurate methods for measuring exercise intensity during outdoor endurance sports, since it records the actual effect of the work performed by the muscles over time. However, power meters are expensive and are limited to activity forms where it is possible to embed sensors in the propulsion system such as in cycling. We investigate using breathing to estimate power output during exercise, in order to create a portable method for tracking physical effort that is universally applicable in many activity forms. Breathing can be quantified through respiratory inductive plethysmography (RIP), which entails recording the movement of the rib cage and abdomen caused by breathing, and it enables us to have a portable, non-invasive device for measuring breathing. RIP signals, heart rate and power output were recorded during a N-of-1 study of a person performing a set of workouts on a stationary bike. The recorded data were used to build predictive models through deep learning algorithms. A convolutional neural network (CNN) trained on features derived from RIP signals and heart rate obtained a mean absolute percentage error (MAPE) of 0.20 (ie, 20% average error). The model showed promising capability of estimating correct power levels and reactivity to changes in power output, but the accuracy is significantly lower than that of cycling power meters.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"3 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.65","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46623297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Issue Information 问题信息
Pub Date : 2022-02-01 DOI: 10.1002/ail2.25
{"title":"Issue Information","authors":"","doi":"10.1002/ail2.25","DOIUrl":"https://doi.org/10.1002/ail2.25","url":null,"abstract":"","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41467571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Qualitative Investigation in Explainable Artificial Intelligence: Further Insight from Social Science 可解释人工智能的定性研究——来自社会科学的进一步认识
Pub Date : 2022-01-17 DOI: 10.1002/ail2.64
Adam J. Johs, Denise E. Agosto, Rosina O. Weber
{"title":"Qualitative Investigation in Explainable Artificial Intelligence: Further Insight from Social Science","authors":"Adam J. Johs, Denise E. Agosto, Rosina O. Weber","doi":"10.1002/ail2.64","DOIUrl":"https://doi.org/10.1002/ail2.64","url":null,"abstract":"","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44715694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Applied AI letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1