首页 > 最新文献

Applied AI letters最新文献

英文 中文
Fine-Tuned Pretrained Transformer for Amharic News Headline Generation 针对阿姆哈拉语新闻标题生成的微调预训练变换器
Pub Date : 2024-07-19 DOI: 10.1002/ail2.98
Mizanu Zelalem Degu, Million Meshesha

Amharic is one of the under-resourced languages, making news headline generation particularly challenging due to the scarcity of high-quality linguistic datasets necessary for training effective natural language processing models. In this study, we fine-tuned the small check point of the T5v1.1 model (t5-small) to perform Amharic news headline generation with an Amharic dataset that is comprised of over 70k news articles along with their headline. Fine-tuning the model involves dataset collection from Amharic news websites, text cleaning, news article size optimization using the TF-IDF algorithm, and tokenization. In addition, a tokenizer model is developed using the byte pair encoding (BPE) algorithm prior to feeding the dataset for feature extraction and summarization. Metrics including Rouge-L, BLEU, and Meteor were used to evaluate the performance of the model and a score of 0.5, 0.24, and 0.71, respectively, was achieved on the test partition of the dataset that contains 7230 instances. The results were good relative to result of the t5 model without fine-tuning, which are 0.1, 0.03, and 0.14, respectively. A postprocessing technique using a rule-based approach was used for further improving summaries generated by the model. The addition of the postprocessing helped the system to achieve Rouge-L, BLEU, and Meteor scores of 0.72, 0.52, and 0.81, respectively. The result value is relatively better than the result achieved by the nonfine-tuned T5v1.1 model and the result of previous studies report on abstractive-based text summarization for Amharic language, which had a 0.27 Rouge-L score. This contributes a valuable insight for practical application and further improvement of the model in the future by increasing the article length, using more training data, using machine learning–based adaptive postprocessing techniques, and fine-tuning other available pretrained models for text summarization.

阿姆哈拉语是一种资源不足的语言,由于缺乏训练有效自然语言处理模型所需的高质量语言数据集,新闻标题的生成尤其具有挑战性。在本研究中,我们对 T5v1.1 模型(t5-small)的小型检查点进行了微调,以便使用由超过 70k 篇新闻文章及其标题组成的阿姆哈拉语数据集生成阿姆哈拉语新闻标题。对模型的微调包括从阿姆哈拉语新闻网站收集数据集、文本清理、使用 TF-IDF 算法优化新闻文章大小以及标记化。此外,在将数据集输入特征提取和摘要之前,还使用字节对编码(BPE)算法开发了一个标记化模型。该模型在包含 7230 个实例的数据集测试分区上分别获得了 0.5、0.24 和 0.71 分。与未进行微调的 t5 模型的结果(分别为 0.1、0.03 和 0.14)相比,结果很好。为进一步改进模型生成的摘要,使用了基于规则的后处理技术。增加后处理后,系统的 Rouge-L、BLEU 和 Meteor 分数分别达到 0.72、0.52 和 0.81。该结果值相对优于未经微调的 T5v1.1 模型所取得的结果,也优于之前关于基于抽象的阿姆哈拉语文本摘要的研究报告所取得的结果,后者的 Rouge-L 分数为 0.27。这为实际应用提供了宝贵的启示,并有助于今后通过增加文章长度、使用更多训练数据、使用基于机器学习的自适应后处理技术以及微调其他可用的文本摘要预训练模型来进一步改进该模型。
{"title":"Fine-Tuned Pretrained Transformer for Amharic News Headline Generation","authors":"Mizanu Zelalem Degu,&nbsp;Million Meshesha","doi":"10.1002/ail2.98","DOIUrl":"10.1002/ail2.98","url":null,"abstract":"<p>Amharic is one of the under-resourced languages, making news headline generation particularly challenging due to the scarcity of high-quality linguistic datasets necessary for training effective natural language processing models. In this study, we fine-tuned the small check point of the T5v1.1 model (t5-small) to perform Amharic news headline generation with an Amharic dataset that is comprised of over 70k news articles along with their headline. Fine-tuning the model involves dataset collection from Amharic news websites, text cleaning, news article size optimization using the TF-IDF algorithm, and tokenization. In addition, a tokenizer model is developed using the byte pair encoding (BPE) algorithm prior to feeding the dataset for feature extraction and summarization. Metrics including Rouge-L, BLEU, and Meteor were used to evaluate the performance of the model and a score of 0.5, 0.24, and 0.71, respectively, was achieved on the test partition of the dataset that contains 7230 instances. The results were good relative to result of the t5 model without fine-tuning, which are 0.1, 0.03, and 0.14, respectively. A postprocessing technique using a rule-based approach was used for further improving summaries generated by the model. The addition of the postprocessing helped the system to achieve Rouge-L, BLEU, and Meteor scores of 0.72, 0.52, and 0.81, respectively. The result value is relatively better than the result achieved by the nonfine-tuned T5v1.1 model and the result of previous studies report on abstractive-based text summarization for Amharic language, which had a 0.27 Rouge-L score. This contributes a valuable insight for practical application and further improvement of the model in the future by increasing the article length, using more training data, using machine learning–based adaptive postprocessing techniques, and fine-tuning other available pretrained models for text summarization.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.98","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141822582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TL-GNN: Android Malware Detection Using Transfer Learning TL-GNN:利用迁移学习检测安卓恶意软件
Pub Date : 2024-05-10 DOI: 10.1002/ail2.94
Ali Raza, Zahid Hussain Qaisar, Naeem Aslam, Muhammad Faheem, Muhammad Waqar Ashraf, Muhammad Naman Chaudhry

Malware growth has accelerated due to the widespread use of Android applications. Android smartphone attacks have increased due to the widespread use of these devices. While deep learning models offer high efficiency and accuracy, training them on large and complex datasets is computationally expensive. Hence, a method that effectively detects new malware variants at a low computational cost is required. A transfer learning method to detect Android malware is proposed in this research. Because of transferring known features from a source model that has been trained to a target model, the transfer learning approach reduces the need for new training data and minimizes the need for huge amounts of computational power. We performed many experiments on 1.2 million Android application samples for performance evaluation. In addition, we evaluated how well our framework performed in comparison with traditional deep learning and standard machine learning models. In comparison with state-of-the-art Android malware detection methods, the proposed framework offers improved classification accuracy of 98.87%, a precision of 99.55%, recall of 97.30%, F1-measure of 99.42%, and a quicker detection rate of 5.14 ms using the transfer learning strategy.

由于 Android 应用程序的广泛使用,恶意软件增长速度加快。由于安卓智能手机的广泛使用,这些设备受到的攻击也在增加。虽然深度学习模型具有很高的效率和准确性,但在大型复杂数据集上训练这些模型的计算成本很高。因此,需要一种能以低计算成本有效检测新恶意软件变种的方法。本研究提出了一种检测安卓恶意软件的迁移学习方法。由于将已知特征从已训练的源模型转移到目标模型,转移学习方法减少了对新训练数据的需求,并最大限度地降低了对大量计算能力的需求。我们在 120 万个安卓应用样本上进行了多次实验,以评估性能。此外,我们还评估了我们的框架与传统深度学习和标准机器学习模型的性能对比。与最先进的安卓恶意软件检测方法相比,拟议框架的分类准确率提高了 98.87%,精确度提高了 99.55%,召回率提高了 97.30%,F1-measure 提高了 99.42%,使用迁移学习策略的快速检测率提高了 5.14 毫秒。
{"title":"TL-GNN: Android Malware Detection Using Transfer Learning","authors":"Ali Raza,&nbsp;Zahid Hussain Qaisar,&nbsp;Naeem Aslam,&nbsp;Muhammad Faheem,&nbsp;Muhammad Waqar Ashraf,&nbsp;Muhammad Naman Chaudhry","doi":"10.1002/ail2.94","DOIUrl":"10.1002/ail2.94","url":null,"abstract":"<p>Malware growth has accelerated due to the widespread use of Android applications. Android smartphone attacks have increased due to the widespread use of these devices. While deep learning models offer high efficiency and accuracy, training them on large and complex datasets is computationally expensive. Hence, a method that effectively detects new malware variants at a low computational cost is required. A transfer learning method to detect Android malware is proposed in this research. Because of transferring known features from a source model that has been trained to a target model, the transfer learning approach reduces the need for new training data and minimizes the need for huge amounts of computational power. We performed many experiments on 1.2 million Android application samples for performance evaluation. In addition, we evaluated how well our framework performed in comparison with traditional deep learning and standard machine learning models. In comparison with state-of-the-art Android malware detection methods, the proposed framework offers improved classification accuracy of 98.87%, a precision of 99.55%, recall of 97.30%, <i>F</i>1-measure of 99.42%, and a quicker detection rate of 5.14 ms using the transfer learning strategy.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.94","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140990897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building Text and Speech Benchmark Datasets and Models for Low-Resourced East African Languages: Experiences and Lessons 为资源匮乏的东非语言建立文本和语音基准数据集和模型:经验与教训
Pub Date : 2024-03-26 DOI: 10.1002/ail2.92
Joyce Nakatumba-Nabende, Claire Babirye, Peter Nabende, Jeremy Francis Tusubira, Jonathan Mukiibi, Eric Peter Wairagala, Chodrine Mutebi, Tobius Saul Bateesa, Alvin Nahabwe, Hewitt Tusiime, Andrew Katumba

Africa has over 2000 languages; however, those languages are not well represented in the existing natural language processing ecosystem. African languages lack essential digital resources to effectively engage in advancing language technologies. There is a need to generate high-quality natural language processing resources for low-resourced African languages. Obtaining high-quality speech and text data is expensive and tedious because it can involve manual sourcing and verification of data sources. This paper discusses the process taken to curate and annotate text and speech datasets for five East African languages: Luganda, Runyankore-Rukiga, Acholi, Lumasaba, and Swahili. We also present results obtained from baseline models for machine translation, topic modeling and classification, sentiment classification, and automatic speech recognition tasks. Finally, we discuss the experiences, challenges, and lessons learned in creating the text and speech datasets.

非洲有 2000 多种语言,但这些语言在现有的自然语言处理生态系统中并没有得到很好的体现。非洲语言缺乏必要的数字资源,无法有效地参与先进的语言技术。有必要为资源匮乏的非洲语言生成高质量的自然语言处理资源。获取高质量的语音和文本数据既昂贵又繁琐,因为这可能涉及数据源的人工采购和验证。本文讨论了为五种东非语言整理和注释文本和语音数据集的过程:这五种东非语言是:卢干达语、Runyankore-Rukiga 语、阿乔利语、卢马萨巴语和斯瓦希里语。我们还介绍了基线模型在机器翻译、主题建模和分类、情感分类以及自动语音识别任务中取得的成果。最后,我们讨论了创建文本和语音数据集的经验、挑战和教训。
{"title":"Building Text and Speech Benchmark Datasets and Models for Low-Resourced East African Languages: Experiences and Lessons","authors":"Joyce Nakatumba-Nabende,&nbsp;Claire Babirye,&nbsp;Peter Nabende,&nbsp;Jeremy Francis Tusubira,&nbsp;Jonathan Mukiibi,&nbsp;Eric Peter Wairagala,&nbsp;Chodrine Mutebi,&nbsp;Tobius Saul Bateesa,&nbsp;Alvin Nahabwe,&nbsp;Hewitt Tusiime,&nbsp;Andrew Katumba","doi":"10.1002/ail2.92","DOIUrl":"10.1002/ail2.92","url":null,"abstract":"<p>Africa has over 2000 languages; however, those languages are not well represented in the existing natural language processing ecosystem. African languages lack essential digital resources to effectively engage in advancing language technologies. There is a need to generate high-quality natural language processing resources for low-resourced African languages. Obtaining high-quality speech and text data is expensive and tedious because it can involve manual sourcing and verification of data sources. This paper discusses the process taken to curate and annotate text and speech datasets for five East African languages: Luganda, Runyankore-Rukiga, Acholi, Lumasaba, and Swahili. We also present results obtained from baseline models for machine translation, topic modeling and classification, sentiment classification, and automatic speech recognition tasks. Finally, we discuss the experiences, challenges, and lessons learned in creating the text and speech datasets.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.92","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140378695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CMD + V for chemistry: Image to chemical structure conversion directly done in the clipboard CMD + V 用于化学:在剪贴板中直接完成图像到化学结构的转换
Pub Date : 2024-01-25 DOI: 10.1002/ail2.91
Oliver Tobias Schilter, Teodoro Laino, Philippe Schwaller

We present Clipboard-to-SMILES Converter (C2SC), a macOS application that directly converts molecular structures from the clipboard. The app focuses on seamlessly converting screenshots of molecules into a desired molecular representation. It supports a wide range of molecular representations, such as SMILES, SELFIES, InChI's, IUPAC names, RDKit Mol's, and CAS numbers, allowing effortless conversion between these formats within the clipboard. C2SC automatically saves converted molecules to a local history file and displays the last 10 entries for quick access. Additionally, it incorporates several SMILES operations, including canonicalization, augmentation, as well price-searching molecules on chemical vendors for the cost-effective purchasing option. Beyond the one-click conversion from clipboard to molecular structures, the app offers continuous monitoring of the clipboard which automatically converts any supported representations or images detected into SMILES. The convenient interface, directly in the status bar, as well as availability as macOS application, makes C2SC useful for a broad community not requiring any programming expertise. Most conversions are performed locally, notably the image-to-SMILES conversion, with internet access only necessary for specific tasks like price lookups. In summary, C2SC provides a user-friendly and efficient solution for converting molecular structures directly from the clipboard, offering seamless conversions between comprehensive chemical representations and can be directly downloaded from https://github.com/O-Schilter/Clipboard-to-SMILES-Converter.

我们推出的剪贴板到分子结构转换器(Clipboard-to-SMILES Converter,简称 C2SC)是一款 macOS 应用程序,可直接从剪贴板转换分子结构。该应用程序的重点是将分子截图无缝转换为所需的分子表示法。它支持多种分子表示法,如 SMILES、SELFIES、InChI、IUPAC 名称、RDKit Mol 和 CAS 号码,可在剪贴板内轻松实现这些格式之间的转换。C2SC 会自动将转换后的分子保存到本地历史文件中,并显示最近 10 个条目,以便快速访问。此外,它还集成了多种 SMILES 操作,包括规范化、扩增以及在化学供应商上搜索分子价格,以实现经济高效的购买选择。除了从剪贴板到分子结构的一键转换外,该程序还提供对剪贴板的持续监控,可自动将检测到的任何支持的表示法或图像转换为 SMILES。C2SC 的界面非常方便,可直接在状态栏中显示,还可作为 macOS 应用程序使用,这使得 C2SC 能够在不需要任何编程专业知识的情况下为广大用户所用。大多数转换都在本地完成,尤其是图像到 SMILES 的转换,只有在执行价格查询等特定任务时才需要访问互联网。总之,C2SC 为直接从剪贴板转换分子结构提供了一个用户友好的高效解决方案,可在全面的化学表示法之间进行无缝转换,并可直接从 https://github.com/O-Schilter/Clipboard-to-SMILES-Converter 下载。
{"title":"CMD + V for chemistry: Image to chemical structure conversion directly done in the clipboard","authors":"Oliver Tobias Schilter,&nbsp;Teodoro Laino,&nbsp;Philippe Schwaller","doi":"10.1002/ail2.91","DOIUrl":"10.1002/ail2.91","url":null,"abstract":"<p>We present Clipboard-to-SMILES Converter (C2SC), a macOS application that directly converts molecular structures from the clipboard. The app focuses on seamlessly converting screenshots of molecules into a desired molecular representation. It supports a wide range of molecular representations, such as SMILES, SELFIES, InChI's, IUPAC names, RDKit Mol's, and CAS numbers, allowing effortless conversion between these formats within the clipboard. C2SC automatically saves converted molecules to a local history file and displays the last 10 entries for quick access. Additionally, it incorporates several SMILES operations, including canonicalization, augmentation, as well price-searching molecules on chemical vendors for the cost-effective purchasing option. Beyond the one-click conversion from clipboard to molecular structures, the app offers continuous monitoring of the clipboard which automatically converts any supported representations or images detected into SMILES. The convenient interface, directly in the status bar, as well as availability as macOS application, makes C2SC useful for a broad community not requiring any programming expertise. Most conversions are performed locally, notably the image-to-SMILES conversion, with internet access only necessary for specific tasks like price lookups. In summary, C2SC provides a user-friendly and efficient solution for converting molecular structures directly from the clipboard, offering seamless conversions between comprehensive chemical representations and can be directly downloaded from https://github.com/O-Schilter/Clipboard-to-SMILES-Converter.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.91","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139597018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flood susceptibility mapping at the country scale using machine learning approaches 利用机器学习方法绘制国家级洪水易感性地图
Pub Date : 2023-12-15 DOI: 10.1002/ail2.88
Geoffrey Dawson, Junaid Butt, Anne Jones, Paolo Fraccaro
River (fluvial), surface water (pluvial) and coastal flooding pose a significant risk to the United Kingdom. Therefore, it is important to assess flood risk particularly as the impacts of flooding are projected to increase due to climate change. Here we present a high resolution combined fluvial and pluvial flood susceptibility map of England. This flood susceptibility model is created by using past flood events and a series of meaningful hydrological parameters to a training machine learning model. We tested the relative performance of different machine learning algorithms, including Classification and Regression Trees, Random Forest and XGBoost and found the XGBoost performed the best, with an area under the receiver operating characteristic ROC Curve (AUC) of 0.93. We also found the model performed well on unseen areas, and we discuss the possibility of extending to regions that has no information on past flood events. Additionally, to aid in understanding what factors may impact flood risk to a particular area, we used Shapley additive explanations which allowed us to investigate the sensitivity of the predicted flood probability to flood factors at a given location.
河水(河流)、地表水(冲积水)和沿海洪水给英国带来了巨大风险。因此,对洪水风险进行评估非常重要,尤其是预计气候变化将导致洪水影响加剧。在此,我们展示了一张高分辨率的英格兰河流和冲积洪水易发性综合地图。该洪水易发性模型是通过使用过去的洪水事件和一系列有意义的水文参数来训练机器学习模型而创建的。我们测试了不同机器学习算法的相对性能,包括分类树和回归树、随机森林和 XGBoost,发现 XGBoost 性能最佳,接收器工作特征 ROC 曲线下面积 (AUC) 为 0.93。我们还发现该模型在未见过的地区表现良好,并讨论了将其扩展到没有过去洪水事件信息的地区的可能性。此外,为了帮助了解哪些因素可能会影响特定地区的洪水风险,我们使用了夏普利加法解释,这使我们能够研究特定地点的预测洪水概率对洪水因素的敏感性。
{"title":"Flood susceptibility mapping at the country scale using machine learning approaches","authors":"Geoffrey Dawson, Junaid Butt, Anne Jones, Paolo Fraccaro","doi":"10.1002/ail2.88","DOIUrl":"https://doi.org/10.1002/ail2.88","url":null,"abstract":"River (fluvial), surface water (pluvial) and coastal flooding pose a significant risk to the United Kingdom. Therefore, it is important to assess flood risk particularly as the impacts of flooding are projected to increase due to climate change. Here we present a high resolution combined fluvial and pluvial flood susceptibility map of England. This flood susceptibility model is created by using past flood events and a series of meaningful hydrological parameters to a training machine learning model. We tested the relative performance of different machine learning algorithms, including Classification and Regression Trees, Random Forest and XGBoost and found the XGBoost performed the best, with an area under the receiver operating characteristic ROC Curve (AUC) of 0.93. We also found the model performed well on unseen areas, and we discuss the possibility of extending to regions that has no information on past flood events. Additionally, to aid in understanding what factors may impact flood risk to a particular area, we used Shapley additive explanations which allowed us to investigate the sensitivity of the predicted flood probability to flood factors at a given location.","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138997858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards bespoke optimizations of energy efficiency in HPC environments 在高性能计算环境中实现能源效率的定制优化
Pub Date : 2023-12-13 DOI: 10.1002/ail2.87
Robert Tracey, Vadim Elisseev, M. Smyrnakis, Lan Hoang, Mark Fellows, Michael Ackers, Andrew Laughton, Stephen Hill, Phillip Folkes, John Whittle
We present bespoke energy efficiency optimizations in high performance computing (HPC) environments using holistic approach to data collection, analysis and proactive management of resources and workloads. Our solution has three major components: (i) platform for collecting, storing and processing data from multiple sources across hardware and software stacks, (ii) collections of regression machine learning (ML) algorithms for making workloads classifications and energy usage predictions, (iii) agent‐based decision‐making framework for delivering control decisions to middle‐ware and infrastructure thus supporting real time or near real energy efficiency optimizations. We will provide some concrete examples of using our proposed approach in HPC environment.
我们采用数据收集、分析以及资源和工作负载主动管理的整体方法,为高性能计算(HPC)环境提供定制的能效优化方案。我们的解决方案由三个主要部分组成:(i) 用于收集、存储和处理来自硬件和软件堆栈的多源数据的平台;(ii) 用于进行工作负载分类和能源使用预测的回归机器学习(ML)算法集合;(iii) 基于代理的决策框架,用于向中间件和基础设施提供控制决策,从而支持实时或接近实时的能效优化。我们将提供一些在高性能计算环境中使用我们提出的方法的具体实例。
{"title":"Towards bespoke optimizations of energy efficiency in HPC environments","authors":"Robert Tracey, Vadim Elisseev, M. Smyrnakis, Lan Hoang, Mark Fellows, Michael Ackers, Andrew Laughton, Stephen Hill, Phillip Folkes, John Whittle","doi":"10.1002/ail2.87","DOIUrl":"https://doi.org/10.1002/ail2.87","url":null,"abstract":"We present bespoke energy efficiency optimizations in high performance computing (HPC) environments using holistic approach to data collection, analysis and proactive management of resources and workloads. Our solution has three major components: (i) platform for collecting, storing and processing data from multiple sources across hardware and software stacks, (ii) collections of regression machine learning (ML) algorithms for making workloads classifications and energy usage predictions, (iii) agent‐based decision‐making framework for delivering control decisions to middle‐ware and infrastructure thus supporting real time or near real energy efficiency optimizations. We will provide some concrete examples of using our proposed approach in HPC environment.","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139005506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On a quantum inspired approach to train machine learning models 关于训练机器学习模型的量子启发方法
Pub Date : 2023-12-13 DOI: 10.1002/ail2.89
Jean Michel Sellier
In this work, a novel technique to train machine learning models is introduced, which is based on digital simulations of certain types of quantum systems. This represents a drastic departure from the standard approach of quantum machine learning which, to this day, is based on the use of actual physical quantum systems. To provide a clear context, the field of quantum inspired machine learning is first provided. Then, we proceed with a detailed description of our proposed method. To conclude, some preliminary, yet compelling, results are presented and discussed. Although at a seminal stage, the author firmly believes that this approach could represent a valid and robust alternative to the way machine learning models are trained today.
在这项工作中,介绍了一种训练机器学习模型的新技术,它基于对某些类型量子系统的数字模拟。这与量子机器学习的标准方法大相径庭,后者至今仍基于实际物理量子系统的使用。为了提供一个清晰的背景,我们首先介绍了量子启发式机器学习领域。然后,我们将详细介绍我们提出的方法。最后,我们将介绍和讨论一些初步但令人信服的结果。尽管还处于开创性阶段,但作者坚信,这种方法可以成为当今机器学习模型训练方式的有效而稳健的替代方案。
{"title":"On a quantum inspired approach to train machine learning models","authors":"Jean Michel Sellier","doi":"10.1002/ail2.89","DOIUrl":"https://doi.org/10.1002/ail2.89","url":null,"abstract":"In this work, a novel technique to train machine learning models is introduced, which is based on digital simulations of certain types of quantum systems. This represents a drastic departure from the standard approach of quantum machine learning which, to this day, is based on the use of actual physical quantum systems. To provide a clear context, the field of quantum inspired machine learning is first provided. Then, we proceed with a detailed description of our proposed method. To conclude, some preliminary, yet compelling, results are presented and discussed. Although at a seminal stage, the author firmly believes that this approach could represent a valid and robust alternative to the way machine learning models are trained today.","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139003616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep aspect extraction and classification for opinion mining in e-commerce applications using convolutional neural network feature extraction followed by long short term memory attention model 基于卷积神经网络特征提取和长短期记忆注意模型的深度方面提取和分类在电子商务应用中的意见挖掘
Pub Date : 2023-08-09 DOI: 10.1002/ail2.86
Kamal Sharbatian, Mohammad Hossein Moattar

Users of e-commerce websites review different aspects of a product in the comment section. In this research, an approach is proposed for opinion aspect extraction and recognition in selling systems. We have used the users' opinions from the Digikala website (www.Digikala.com), which is an Iranian e-commerce company. In this research, a language-independent framework is proposed that is adjustable to other languages. In this regard, after necessary text processing and preparation steps, the existence of an aspect in an opinion is determined using deep learning algorithms. The proposed model combines Convolutional Neural Network (CNN) and long-short-term memory (LSTM) deep learning approaches. CNN is one of the best algorithms for extracting latent features from data. On the other hand, LSTM can detect latent temporal relationships among different words in a text due to its memory ability and attention model. The approach is evaluated on six classes of opinion aspects. Based on the experiments, the proposed model's accuracy, precision, and recall are 70%, 60%, and 85%, respectively. The proposed model was compared in terms of the above criteria with CNN, Naive Bayes, and SVM algorithms and showed satisfying performance.

电子商务网站的用户会在评论区对产品的不同方面进行评论。本研究提出了一种销售系统中意见方面的提取与识别方法。我们使用了来自Digikala网站(www.Digikala.com)的用户意见,这是一家伊朗电子商务公司。在这项研究中,我们提出了一个独立于语言的框架,可以调整到其他语言。在这方面,经过必要的文本处理和准备步骤,使用深度学习算法确定意见中某个方面的存在。该模型结合了卷积神经网络(CNN)和长短期记忆(LSTM)深度学习方法。CNN是从数据中提取潜在特征的最佳算法之一。另一方面,由于LSTM的记忆能力和注意模型,它可以检测文本中不同单词之间的潜在时间关系。对该方法进行了六类意见方面的评估。实验结果表明,该模型的准确率为70%,精密度为60%,召回率为85%。将该模型与CNN、朴素贝叶斯和支持向量机算法在上述标准下进行了比较,结果令人满意。
{"title":"Deep aspect extraction and classification for opinion mining in e-commerce applications using convolutional neural network feature extraction followed by long short term memory attention model","authors":"Kamal Sharbatian,&nbsp;Mohammad Hossein Moattar","doi":"10.1002/ail2.86","DOIUrl":"10.1002/ail2.86","url":null,"abstract":"<p>Users of e-commerce websites review different aspects of a product in the comment section. In this research, an approach is proposed for opinion aspect extraction and recognition in selling systems. We have used the users' opinions from the Digikala website (www.Digikala.com), which is an Iranian e-commerce company. In this research, a language-independent framework is proposed that is adjustable to other languages. In this regard, after necessary text processing and preparation steps, the existence of an aspect in an opinion is determined using deep learning algorithms. The proposed model combines Convolutional Neural Network (CNN) and long-short-term memory (LSTM) deep learning approaches. CNN is one of the best algorithms for extracting latent features from data. On the other hand, LSTM can detect latent temporal relationships among different words in a text due to its memory ability and attention model. The approach is evaluated on six classes of opinion aspects. Based on the experiments, the proposed model's accuracy, precision, and recall are 70%, 60%, and 85%, respectively. The proposed model was compared in terms of the above criteria with CNN, Naive Bayes, and SVM algorithms and showed satisfying performance.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.86","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42279959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting mobile money transaction fraud using machine learning algorithms 使用机器学习算法预测移动货币交易欺诈
Pub Date : 2023-07-12 DOI: 10.1002/ail2.85
Mark E. Lokanan

The ease with which mobile money is used to facilitate cross-border payments presents a global threat to law enforcement in the fight against money laundering and terrorist financing. This paper aims to utilize machine learning classifiers to predict transactions flagged as a fraud in mobile money transfers. The data for this study were obtained from real-time transactions that simulate a well-known mobile transfer fraud scheme. Logistic regression is used as the baseline model and is compared with ensemble and gradient descent models. The results indicate that the logistic regression model still showed reasonable performance while not performing as well as the other models. Among all the measures, the random forest classifier exhibited outstanding performance. The amount of money transferred emerged as the top feature for predicting money laundering transactions in mobile money transfers. These findings suggest that further research is needed to enhance the logistic regression model, and the random forest classifier should be explored as a potential tool for law enforcement and financial institutions to detect money laundering activities in mobile money transfers.

使用移动货币便利跨境支付的便利性对打击洗钱和恐怖主义融资的执法部门构成了全球性威胁。本文旨在利用机器学习分类器来预测移动转账中被标记为欺诈的交易。本研究的数据是从模拟一个众所周知的移动转账欺诈方案的实时交易中获得的。采用Logistic回归作为基线模型,并与集合模型和梯度下降模型进行了比较。结果表明,逻辑回归模型虽然表现不如其他模型,但仍具有合理的性能。在所有度量中,随机森林分类器表现出优异的性能。在移动汇款中,汇款金额成为预测洗钱交易的首要特征。这些发现表明,需要进一步研究来增强逻辑回归模型,并应探索随机森林分类器作为执法和金融机构检测移动汇款中洗钱活动的潜在工具。
{"title":"Predicting mobile money transaction fraud using machine learning algorithms","authors":"Mark E. Lokanan","doi":"10.1002/ail2.85","DOIUrl":"10.1002/ail2.85","url":null,"abstract":"<p>The ease with which mobile money is used to facilitate cross-border payments presents a global threat to law enforcement in the fight against money laundering and terrorist financing. This paper aims to utilize machine learning classifiers to predict transactions flagged as a fraud in mobile money transfers. The data for this study were obtained from real-time transactions that simulate a well-known mobile transfer fraud scheme. Logistic regression is used as the baseline model and is compared with ensemble and gradient descent models. The results indicate that the logistic regression model still showed reasonable performance while not performing as well as the other models. Among all the measures, the random forest classifier exhibited outstanding performance. The amount of money transferred emerged as the top feature for predicting money laundering transactions in mobile money transfers. These findings suggest that further research is needed to enhance the logistic regression model, and the random forest classifier should be explored as a potential tool for law enforcement and financial institutions to detect money laundering activities in mobile money transfers.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.85","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46263497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated patent classification for crop protection via domain adaptation 通过领域适应的作物保护自动专利分类
Pub Date : 2023-02-15 DOI: 10.1002/ail2.80
Dimitrios Christofidellis, Marzena Maria Lehmann, Torsten Luksch, Marco Stenta, Matteo Manica

Patents show how technology evolves in most scientific fields over time. The best way to use this valuable knowledge base is to use efficient and effective information retrieval and searches for related prior art. Patent classification, that is, assigning a patent to one or more predefined categories, is a fundamental step towards synthesizing the information content of an invention. To this end, architectures based on Transformers, especially those derived from the BERT family have already been proposed in the literature and they have shown remarkable results by setting a new state-of-the-art performance for the classification task. Here, we study how domain adaptation can push the performance boundaries in patent classification by rigorously evaluating and implementing a collection of recent transfer learning techniques, for example, domain-adaptive pretraining and adapters. Our analysis shows how leveraging these advancements enables the development of state-of-the-art models with increased precision, recall, and F1-score. We base our evaluation on both standard patent classification datasets derived from patent offices-defined code hierarchies and more practical real-world use-case scenarios containing labels from the agrochemical industrial domain. The application of these domain adapted techniques to patent classification in a multilingual setting is also examined and evaluated.

专利显示了大多数科学领域的技术如何随着时间的推移而演变。使用这个有价值的知识库的最佳方法是使用高效和有效的信息检索和相关现有技术的搜索。专利分类,即将专利分配给一个或多个预定义的类别,是合成发明信息内容的基本步骤。为此,基于transformer的架构,特别是那些来自BERT家族的架构已经在文献中提出,并且通过为分类任务设置新的最先进的性能,它们已经显示出显着的结果。在这里,我们通过严格评估和实施一系列最新的迁移学习技术(例如,领域自适应预训练和适配器)来研究领域自适应如何在专利分类中突破性能界限。我们的分析显示了如何利用这些进步来开发具有更高精度、召回率和f1分数的最先进模型。我们的评估基于来自专利局定义的代码层次结构的标准专利分类数据集,以及包含农化工业领域标签的更实际的现实用例场景。这些领域适应技术在多语言环境下的专利分类应用也被检查和评估。
{"title":"Automated patent classification for crop protection via domain adaptation","authors":"Dimitrios Christofidellis,&nbsp;Marzena Maria Lehmann,&nbsp;Torsten Luksch,&nbsp;Marco Stenta,&nbsp;Matteo Manica","doi":"10.1002/ail2.80","DOIUrl":"10.1002/ail2.80","url":null,"abstract":"<p>Patents show how technology evolves in most scientific fields over time. The best way to use this valuable knowledge base is to use efficient and effective information retrieval and searches for related prior art. Patent classification, that is, assigning a patent to one or more predefined categories, is a fundamental step towards synthesizing the information content of an invention. To this end, architectures based on Transformers, especially those derived from the BERT family have already been proposed in the literature and they have shown remarkable results by setting a new state-of-the-art performance for the classification task. Here, we study how domain adaptation can push the performance boundaries in patent classification by rigorously evaluating and implementing a collection of recent transfer learning techniques, for example, domain-adaptive pretraining and adapters. Our analysis shows how leveraging these advancements enables the development of state-of-the-art models with increased precision, recall, and <i>F</i>1-score. We base our evaluation on both standard patent classification datasets derived from patent offices-defined code hierarchies and more practical real-world use-case scenarios containing labels from the agrochemical industrial domain. The application of these domain adapted techniques to patent classification in a multilingual setting is also examined and evaluated.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.80","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43702643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Applied AI letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1