首页 > 最新文献

Machine learning and knowledge extraction最新文献

英文 中文
Capsule Network with Its Limitation, Modification, and Applications - A Survey 胶囊网络及其局限性、改进与应用综述
Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-08-02 DOI: 10.3390/make5030047
Mahmood Ul Haq, M. A. J. Sethi, A. Rehman
Numerous advancements in various fields, including pattern recognition and image classification, have been made thanks to modern computer vision and machine learning methods. The capsule network is one of the advanced machine learning algorithms that encodes features based on their hierarchical relationships. Basically, a capsule network is a type of neural network that performs inverse graphics to represent the object in different parts and view the existing relationship between these parts, unlike CNNs, which lose most of the evidence related to spatial location and requires lots of training data. So, we present a comparative review of various capsule network architectures used in various applications. The paper’s main contribution is that it summarizes and explains the significant current published capsule network architectures with their advantages, limitations, modifications, and applications.
由于现代计算机视觉和机器学习方法,在模式识别和图像分类等各个领域取得了许多进步。胶囊网络是一种先进的机器学习算法,它根据特征的层次关系对特征进行编码。基本上,胶囊网络是一种神经网络,它通过逆图形来表示物体在不同的部分,并查看这些部分之间存在的关系,而不像cnn会丢失大部分与空间位置相关的证据,并且需要大量的训练数据。因此,我们对各种应用中使用的各种胶囊网络架构进行了比较回顾。本文的主要贡献是总结和解释了当前发布的重要胶囊网络体系结构及其优点、局限性、修改和应用。
{"title":"Capsule Network with Its Limitation, Modification, and Applications - A Survey","authors":"Mahmood Ul Haq, M. A. J. Sethi, A. Rehman","doi":"10.3390/make5030047","DOIUrl":"https://doi.org/10.3390/make5030047","url":null,"abstract":"Numerous advancements in various fields, including pattern recognition and image classification, have been made thanks to modern computer vision and machine learning methods. The capsule network is one of the advanced machine learning algorithms that encodes features based on their hierarchical relationships. Basically, a capsule network is a type of neural network that performs inverse graphics to represent the object in different parts and view the existing relationship between these parts, unlike CNNs, which lose most of the evidence related to spatial location and requires lots of training data. So, we present a comparative review of various capsule network architectures used in various applications. The paper’s main contribution is that it summarizes and explains the significant current published capsule network architectures with their advantages, limitations, modifications, and applications.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"52 1","pages":"891-921"},"PeriodicalIF":0.0,"publicationDate":"2023-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90471447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Autoencoder Feature Residuals for Network Intrusion Detection: One-Class Pretraining for Improved Performance 网络入侵检测的自编码器特征残差:改进性能的单类预训练
Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-31 DOI: 10.3390/make5030046
B. Lewandowski, R. Paffenroth
The proliferation of novel attacks and growing amounts of data has caused practitioners in the field of network intrusion detection to constantly work towards keeping up with this evolving adversarial landscape. Researchers have been seeking to harness deep learning techniques in efforts to detect zero-day attacks and allow network intrusion detection systems to more efficiently alert network operators. The technique outlined in this work uses a one-class training process to shape autoencoder feature residuals for the effective detection of network attacks. Compared to an original set of input features, we show that autoencoder feature residuals are a suitable replacement, and often perform at least as well as the original feature set. This quality allows autoencoder feature residuals to prevent the need for extensive feature engineering without reducing classification performance. Additionally, it is found that without generating new data compared to an original feature set, using autoencoder feature residuals often improves classifier performance. Practical side effects from using autoencoder feature residuals emerge by analyzing the potential data compression benefits they provide.
新型攻击的激增和不断增长的数据量使得网络入侵检测领域的从业者不断努力跟上这种不断发展的对抗环境。研究人员一直在寻求利用深度学习技术来检测零日攻击,并允许网络入侵检测系统更有效地提醒网络运营商。这项工作中概述的技术使用一类训练过程来塑造自动编码器特征残差,以有效检测网络攻击。与原始输入特征集相比,我们表明自编码器特征残差是一个合适的替代品,并且通常表现至少与原始特征集一样好。这种质量允许自动编码器特征残差,以防止需要大量的特征工程,而不会降低分类性能。此外,与原始特征集相比,在不生成新数据的情况下,使用自编码器特征残差通常可以提高分类器的性能。通过分析自编码器特征残差提供的潜在数据压缩优势,可以得出使用自编码器特征残差的实际副作用。
{"title":"Autoencoder Feature Residuals for Network Intrusion Detection: One-Class Pretraining for Improved Performance","authors":"B. Lewandowski, R. Paffenroth","doi":"10.3390/make5030046","DOIUrl":"https://doi.org/10.3390/make5030046","url":null,"abstract":"The proliferation of novel attacks and growing amounts of data has caused practitioners in the field of network intrusion detection to constantly work towards keeping up with this evolving adversarial landscape. Researchers have been seeking to harness deep learning techniques in efforts to detect zero-day attacks and allow network intrusion detection systems to more efficiently alert network operators. The technique outlined in this work uses a one-class training process to shape autoencoder feature residuals for the effective detection of network attacks. Compared to an original set of input features, we show that autoencoder feature residuals are a suitable replacement, and often perform at least as well as the original feature set. This quality allows autoencoder feature residuals to prevent the need for extensive feature engineering without reducing classification performance. Additionally, it is found that without generating new data compared to an original feature set, using autoencoder feature residuals often improves classifier performance. Practical side effects from using autoencoder feature residuals emerge by analyzing the potential data compression benefits they provide.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"21 1","pages":"868-890"},"PeriodicalIF":0.0,"publicationDate":"2023-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80102408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Latent Space Compression for Lightning-Fast Fine-Tuning and Inference of Transformer-Based Models 基于变压器模型的快速微调和推理的有效潜在空间压缩
Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-30 DOI: 10.3390/make5030045
Ala Alam Falaki, R. Gras
This paper presents a technique to reduce the number of parameters in a transformer-based encoder–decoder architecture by incorporating autoencoders. To discover the optimal compression, we trained different autoencoders on the embedding space (encoder’s output) of several pre-trained models. The experiments reveal that reducing the embedding size has the potential to dramatically decrease the GPU memory usage while speeding up the inference process. The proposed architecture was included in the BART model and tested for summarization, translation, and classification tasks. The summarization results show that a 60% decoder size reduction (from 96 M to 40 M parameters) will make the inference twice as fast and use less than half of GPU memory during fine-tuning process with only a 4.5% drop in R-1 score. The same trend is visible for translation and partially for classification tasks. Our approach reduces the GPU memory usage and processing time of large-scale sequence-to-sequence models for fine-tuning and inference. The implementation and checkpoints are available on GitHub.
本文提出了一种通过加入自编码器来减少基于变压器的编解码器结构中参数数量的技术。为了发现最优压缩,我们在几个预训练模型的嵌入空间(编码器的输出)上训练不同的自编码器。实验表明,减小嵌入尺寸有可能显著降低GPU内存的使用,同时加快推理过程。提议的体系结构包含在BART模型中,并测试了摘要、翻译和分类任务。总结结果表明,解码器尺寸减少60%(从96 M到40 M参数)将使推理速度提高一倍,并且在微调过程中使用不到一半的GPU内存,R-1分数仅下降4.5%。同样的趋势在翻译任务和部分分类任务中也很明显。我们的方法减少了GPU内存的使用和用于微调和推理的大规模序列到序列模型的处理时间。实现和检查点可以在GitHub上获得。
{"title":"Efficient Latent Space Compression for Lightning-Fast Fine-Tuning and Inference of Transformer-Based Models","authors":"Ala Alam Falaki, R. Gras","doi":"10.3390/make5030045","DOIUrl":"https://doi.org/10.3390/make5030045","url":null,"abstract":"This paper presents a technique to reduce the number of parameters in a transformer-based encoder–decoder architecture by incorporating autoencoders. To discover the optimal compression, we trained different autoencoders on the embedding space (encoder’s output) of several pre-trained models. The experiments reveal that reducing the embedding size has the potential to dramatically decrease the GPU memory usage while speeding up the inference process. The proposed architecture was included in the BART model and tested for summarization, translation, and classification tasks. The summarization results show that a 60% decoder size reduction (from 96 M to 40 M parameters) will make the inference twice as fast and use less than half of GPU memory during fine-tuning process with only a 4.5% drop in R-1 score. The same trend is visible for translation and partially for classification tasks. Our approach reduces the GPU memory usage and processing time of large-scale sequence-to-sequence models for fine-tuning and inference. The implementation and checkpoints are available on GitHub.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"477 1","pages":"847-867"},"PeriodicalIF":0.0,"publicationDate":"2023-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80796528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low Cost Evolutionary Neural Architecture Search (LENAS) Applied to Traffic Forecasting 低成本进化神经结构搜索(LENAS)在交通预测中的应用
Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-28 DOI: 10.3390/make5030044
Daniel Klosa, C. Büskens
Traffic forecasting is an important task for transportation engineering as it helps authorities to plan and control traffic flow, detect congestion, and reduce environmental impact. In recent times, the application of deep learning techniques to handle large and complex datasets has become prevalent. However, these methods necessitate a proficiency in neural architecture engineering, a skill set that many decision makers in traffic management centers may not possess. Neural architecture search (NAS) methods have gained popularity for alleviating the problem of neural architecture engineering by discovering customized neural architectures for various tasks. Their application to traffic prediction has only recently been explored. Performance estimation of neural architectures, a sub-problem of NAS and often the bottleneck in terms of computation time, hinders the adaptation of research to real-world applications. Recently, zero-cost (ZC) proxies have emerged as a cost-effective means of evaluating network architectures without requiring training, circumventing the bottleneck at the expense of accuracy. This work extends previous research on evolutionary NAS (ENAS) by evaluating the utility of ZC proxies for the task of traffic prediction. We answer research questions related to the stability of zero-cost proxies and their correlation with validation losses on real-world datasets. When used in the ENAS framework, we show that ZC proxies can speed up the search process by two orders of magnitude without greatly affecting the accuracy of the prediction model. Traffic forecasting is an important task for transportation engineering as it helps authorities to plan and control traffic flow, detect congestion, and reduce environmental impact. Deep learning techniques have gained traction in handling such complex datasets, but require expertise in neural architecture engineering, often beyond the scope of traffic management decision-makers. Our study aims to address this challenge by using neural architecture search (NAS) methods. These methods, which simplify neural architecture engineering by discovering task-specific neural architectures, are only recently applied to traffic prediction. We specifically focus on the performance estimation of neural architectures, a computationally demanding sub-problem of NAS, that often hinders the real-world application of these methods. Extending prior work on evolutionary NAS (ENAS), our work evaluates the utility of zero-cost (ZC) proxies, recently emerged cost-effective evaluators of network architectures. These proxies operate without necessitating training, thereby circumventing the computational bottleneck, albeit at a slight cost to accuracy. Our findings indicate that, when integrated into the ENAS framework, ZC proxies can accelerate the search process by two orders of magnitude at a small cost of accuracy. These results establish the viability of ZC proxies as a practical solution to accelerate NAS methods wh
交通预测是交通工程中的一项重要任务,它可以帮助当局规划和控制交通流量,检测拥堵,减少对环境的影响。近年来,深度学习技术在处理大型复杂数据集方面的应用已经变得非常普遍。然而,这些方法需要熟练掌握神经结构工程,这是交通管理中心的许多决策者可能不具备的技能。神经架构搜索(NAS)方法通过为各种任务发现定制的神经架构来缓解神经架构工程的问题,从而获得了广泛的应用。直到最近才开始探索它们在交通预测中的应用。神经网络架构的性能评估是NAS的一个子问题,通常是计算时间方面的瓶颈,阻碍了研究适应现实世界的应用。最近,零成本(ZC)代理已经成为一种不需要训练就能评估网络架构的经济有效的方法,以牺牲准确性为代价规避了瓶颈。这项工作通过评估ZC代理在流量预测任务中的效用,扩展了先前对进化NAS (ENAS)的研究。我们回答了与零成本代理的稳定性及其与现实世界数据集验证损失的相关性相关的研究问题。当在ENAS框架中使用时,我们发现ZC代理可以将搜索过程加快两个数量级,而不会对预测模型的准确性产生很大影响。交通预测是交通工程中的一项重要任务,它可以帮助当局规划和控制交通流量,检测拥堵,减少对环境的影响。深度学习技术在处理如此复杂的数据集方面获得了牵引力,但需要神经架构工程方面的专业知识,这通常超出了交通管理决策者的范围。我们的研究旨在通过使用神经结构搜索(NAS)方法来解决这一挑战。这些方法通过发现特定任务的神经结构来简化神经结构工程,直到最近才应用于交通预测。我们特别关注神经架构的性能评估,这是NAS的一个计算要求很高的子问题,经常阻碍这些方法在现实世界中的应用。我们的工作扩展了先前在渐进式NAS (ENAS)上的工作,评估了零成本(ZC)代理的效用,这是最近出现的网络架构的成本效益评估器。这些代理不需要训练就可以运行,从而绕过了计算瓶颈,尽管准确性会有轻微的损失。我们的研究结果表明,当整合到ENAS框架中时,ZC代理可以在很小的准确性代价下将搜索过程加快两个数量级。这些结果确立了ZC代理作为加速NAS方法同时保持模型准确性的实用解决方案的可行性。我们的研究通过展示ZC代理如何增强用于流量预测的NAS方法的可访问性和可用性来为该领域做出贡献,尽管在神经结构工程专业知识方面存在潜在的局限性。这种新颖的方法极大地帮助了深度学习技术在现实世界交通管理场景中的有效应用。
{"title":"Low Cost Evolutionary Neural Architecture Search (LENAS) Applied to Traffic Forecasting","authors":"Daniel Klosa, C. Büskens","doi":"10.3390/make5030044","DOIUrl":"https://doi.org/10.3390/make5030044","url":null,"abstract":"Traffic forecasting is an important task for transportation engineering as it helps authorities to plan and control traffic flow, detect congestion, and reduce environmental impact. In recent times, the application of deep learning techniques to handle large and complex datasets has become prevalent. However, these methods necessitate a proficiency in neural architecture engineering, a skill set that many decision makers in traffic management centers may not possess. Neural architecture search (NAS) methods have gained popularity for alleviating the problem of neural architecture engineering by discovering customized neural architectures for various tasks. Their application to traffic prediction has only recently been explored. Performance estimation of neural architectures, a sub-problem of NAS and often the bottleneck in terms of computation time, hinders the adaptation of research to real-world applications. Recently, zero-cost (ZC) proxies have emerged as a cost-effective means of evaluating network architectures without requiring training, circumventing the bottleneck at the expense of accuracy. This work extends previous research on evolutionary NAS (ENAS) by evaluating the utility of ZC proxies for the task of traffic prediction. We answer research questions related to the stability of zero-cost proxies and their correlation with validation losses on real-world datasets. When used in the ENAS framework, we show that ZC proxies can speed up the search process by two orders of magnitude without greatly affecting the accuracy of the prediction model. Traffic forecasting is an important task for transportation engineering as it helps authorities to plan and control traffic flow, detect congestion, and reduce environmental impact. Deep learning techniques have gained traction in handling such complex datasets, but require expertise in neural architecture engineering, often beyond the scope of traffic management decision-makers. Our study aims to address this challenge by using neural architecture search (NAS) methods. These methods, which simplify neural architecture engineering by discovering task-specific neural architectures, are only recently applied to traffic prediction. We specifically focus on the performance estimation of neural architectures, a computationally demanding sub-problem of NAS, that often hinders the real-world application of these methods. Extending prior work on evolutionary NAS (ENAS), our work evaluates the utility of zero-cost (ZC) proxies, recently emerged cost-effective evaluators of network architectures. These proxies operate without necessitating training, thereby circumventing the computational bottleneck, albeit at a slight cost to accuracy. Our findings indicate that, when integrated into the ENAS framework, ZC proxies can accelerate the search process by two orders of magnitude at a small cost of accuracy. These results establish the viability of ZC proxies as a practical solution to accelerate NAS methods wh","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"1 1","pages":"830-846"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90233389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification Confidence in Exploratory Learning: A User's Guide 探索性学习中的分类信心:用户指南
Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-21 DOI: 10.3390/make5030043
P. Salamon, David Salamon, V. A. Cantu, Michelle An, Tyler Perry, Robert A. Edwards, A. Segall
This paper investigates the post-hoc calibration of confidence for “exploratory” machine learning classification problems. The difficulty in these problems stems from the continuing desire to push the boundaries of which categories have enough examples to generalize from when curating datasets, and confusion regarding the validity of those categories. We argue that for such problems the “one-versus-all” approach (top-label calibration) must be used rather than the “calibrate-the-full-response-matrix” approach advocated elsewhere in the literature. We introduce and test four new algorithms designed to handle the idiosyncrasies of category-specific confidence estimation using only the test set and the final model. Chief among these methods is the use of kernel density ratios for confidence calibration including a novel algorithm for choosing the bandwidth. We test our claims and explore the limits of calibration on a bioinformatics application (PhANNs) as well as the classic MNIST benchmark. Finally, our analysis argues that post-hoc calibration should always be performed, may be performed using only the test dataset, and should be sanity-checked visually.
本文研究了“探索性”机器学习分类问题的置信度事后校准。这些问题的困难在于,在管理数据集时,人们一直希望突破哪些类别有足够的例子可以概括的界限,以及对这些类别的有效性感到困惑。我们认为,对于此类问题,必须使用“一个对所有”的方法(顶级标签校准),而不是文献中其他地方提倡的“校准全响应矩阵”方法。我们介绍并测试了四种新的算法,这些算法设计用于仅使用测试集和最终模型来处理特定类别置信估计的特性。这些方法中最主要的是使用核密度比进行置信度校准,其中包括一种选择带宽的新算法。我们测试了我们的主张,并探索了生物信息学应用程序(PhANNs)以及经典MNIST基准的校准限制。最后,我们的分析认为,事后校准应该始终执行,可以只使用测试数据集执行,并且应该进行视觉上的安全性检查。
{"title":"Classification Confidence in Exploratory Learning: A User's Guide","authors":"P. Salamon, David Salamon, V. A. Cantu, Michelle An, Tyler Perry, Robert A. Edwards, A. Segall","doi":"10.3390/make5030043","DOIUrl":"https://doi.org/10.3390/make5030043","url":null,"abstract":"This paper investigates the post-hoc calibration of confidence for “exploratory” machine learning classification problems. The difficulty in these problems stems from the continuing desire to push the boundaries of which categories have enough examples to generalize from when curating datasets, and confusion regarding the validity of those categories. We argue that for such problems the “one-versus-all” approach (top-label calibration) must be used rather than the “calibrate-the-full-response-matrix” approach advocated elsewhere in the literature. We introduce and test four new algorithms designed to handle the idiosyncrasies of category-specific confidence estimation using only the test set and the final model. Chief among these methods is the use of kernel density ratios for confidence calibration including a novel algorithm for choosing the bandwidth. We test our claims and explore the limits of calibration on a bioinformatics application (PhANNs) as well as the classic MNIST benchmark. Finally, our analysis argues that post-hoc calibration should always be performed, may be performed using only the test dataset, and should be sanity-checked visually.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"33 1","pages":"803-829"},"PeriodicalIF":0.0,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83946288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning and Autonomous Vehicles: Strategic Themes, Applications, and Research Agenda Using SciMAT and Content-Centric Analysis, a Systematic Review 深度学习和自动驾驶汽车:战略主题、应用和研究议程,使用SciMAT和以内容为中心的分析,系统综述
Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-13 DOI: 10.3390/make5030041
Fábio Eid Morooka, Adalberto Manoel Junior, T. Sigahi, Jefferson de Souza Pinto, I. Rampasso, R. Anholon
Applications of deep learning (DL) in autonomous vehicle (AV) projects have gained increasing interest from both researchers and companies. This has caused a rapid expansion of scientific production on DL-AV in recent years, encouraging researchers to conduct systematic literature reviews (SLRs) to organize knowledge on the topic. However, a critical analysis of the existing SLRs on DL-AV reveals some methodological gaps, particularly regarding the use of bibliometric software, which are powerful tools for analyzing large amounts of data and for providing a holistic understanding on the structure of knowledge of a particular field. This study aims to identify the strategic themes and trends in DL-AV research using the Science Mapping Analysis Tool (SciMAT) and content analysis. Strategic diagrams and cluster networks were developed using SciMAT, allowing the identification of motor themes and research opportunities. The content analysis allowed categorization of the contribution of the academic literature on DL applications in AV project design; neural networks and AI models used in AVs; and transdisciplinary themes in DL-AV research, including energy, legislation, ethics, and cybersecurity. Potential research avenues are discussed for each of these categories. The findings presented in this study can benefit both experienced scholars who can gain access to condensed information about the literature on DL-AV and new researchers who may be attracted to topics related to technological development and other issues with social and environmental impacts.
深度学习(DL)在自动驾驶汽车(AV)项目中的应用已经引起了研究人员和公司越来越多的兴趣。这导致近年来DL-AV的科学研究迅速增加,鼓励研究人员进行系统的文献综述(slr)来组织有关该主题的知识。然而,对DL-AV上现有单反的批判性分析揭示了一些方法上的差距,特别是关于文献计量软件的使用,这些软件是分析大量数据和提供对特定领域知识结构的整体理解的强大工具。本研究旨在利用科学制图分析工具(SciMAT)和内容分析,确定DL-AV研究的战略主题和趋势。使用SciMAT开发了战略图表和集群网络,从而确定了运动主题和研究机会。内容分析允许对数字数据在AV项目设计中应用的学术文献的贡献进行分类;用于自动驾驶汽车的神经网络和人工智能模型;以及DL-AV研究的跨学科主题,包括能源、立法、伦理和网络安全。对每一个类别的潜在研究途径进行了讨论。本研究的发现既有利于有经验的学者,他们可以获得关于DL-AV文献的浓缩信息,也有利于新研究人员,他们可能会被与技术发展和其他社会和环境影响问题相关的主题所吸引。
{"title":"Deep Learning and Autonomous Vehicles: Strategic Themes, Applications, and Research Agenda Using SciMAT and Content-Centric Analysis, a Systematic Review","authors":"Fábio Eid Morooka, Adalberto Manoel Junior, T. Sigahi, Jefferson de Souza Pinto, I. Rampasso, R. Anholon","doi":"10.3390/make5030041","DOIUrl":"https://doi.org/10.3390/make5030041","url":null,"abstract":"Applications of deep learning (DL) in autonomous vehicle (AV) projects have gained increasing interest from both researchers and companies. This has caused a rapid expansion of scientific production on DL-AV in recent years, encouraging researchers to conduct systematic literature reviews (SLRs) to organize knowledge on the topic. However, a critical analysis of the existing SLRs on DL-AV reveals some methodological gaps, particularly regarding the use of bibliometric software, which are powerful tools for analyzing large amounts of data and for providing a holistic understanding on the structure of knowledge of a particular field. This study aims to identify the strategic themes and trends in DL-AV research using the Science Mapping Analysis Tool (SciMAT) and content analysis. Strategic diagrams and cluster networks were developed using SciMAT, allowing the identification of motor themes and research opportunities. The content analysis allowed categorization of the contribution of the academic literature on DL applications in AV project design; neural networks and AI models used in AVs; and transdisciplinary themes in DL-AV research, including energy, legislation, ethics, and cybersecurity. Potential research avenues are discussed for each of these categories. The findings presented in this study can benefit both experienced scholars who can gain access to condensed information about the literature on DL-AV and new researchers who may be attracted to topics related to technological development and other issues with social and environmental impacts.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"66 1","pages":"763-781"},"PeriodicalIF":0.0,"publicationDate":"2023-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75884660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
The Value of Numbers in Clinical Text Classification 数字在临床文本分类中的价值
Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-07-07 DOI: 10.3390/make5030040
Kristian Miok, P. Corcoran, Irena Spasic
Clinical text often includes numbers of various types and formats. However, most current text classification approaches do not take advantage of these numbers. This study aims to demonstrate that using numbers as features can significantly improve the performance of text classification models. This study also demonstrates the feasibility of extracting such features from clinical text. Unsupervised learning was used to identify patterns of number usage in clinical text. These patterns were analyzed manually and converted into pattern-matching rules. Information extraction was used to incorporate numbers as features into a document representation model. We evaluated text classification models trained on such representation. Our experiments were performed with two document representation models (vector space model and word embedding model) and two classification models (support vector machines and neural networks). The results showed that even a handful of numerical features can significantly improve text classification performance. We conclude that commonly used document representations do not represent numbers in a way that machine learning algorithms can effectively utilize them as features. Although we demonstrated that traditional information extraction can be effective in converting numbers into features, further community-wide research is required to systematically incorporate number representation into the word embedding process.
临床文本通常包括各种类型和格式的数字。然而,大多数当前的文本分类方法都没有利用这些数字。本研究旨在证明使用数字作为特征可以显著提高文本分类模型的性能。本研究也证明了从临床文本中提取这些特征的可行性。使用无监督学习来识别临床文本中数字使用的模式。手动分析这些模式并将其转换为模式匹配规则。信息提取用于将数字作为特征纳入文档表示模型。我们评估了在这种表示上训练的文本分类模型。我们的实验使用了两种文档表示模型(向量空间模型和词嵌入模型)和两种分类模型(支持向量机和神经网络)。结果表明,即使少量的数字特征也能显著提高文本分类性能。我们得出的结论是,常用的文档表示方式不能以机器学习算法可以有效地利用它们作为特征的方式表示数字。虽然我们证明了传统的信息提取可以有效地将数字转换为特征,但需要进一步的社区研究来系统地将数字表示纳入词嵌入过程。
{"title":"The Value of Numbers in Clinical Text Classification","authors":"Kristian Miok, P. Corcoran, Irena Spasic","doi":"10.3390/make5030040","DOIUrl":"https://doi.org/10.3390/make5030040","url":null,"abstract":"Clinical text often includes numbers of various types and formats. However, most current text classification approaches do not take advantage of these numbers. This study aims to demonstrate that using numbers as features can significantly improve the performance of text classification models. This study also demonstrates the feasibility of extracting such features from clinical text. Unsupervised learning was used to identify patterns of number usage in clinical text. These patterns were analyzed manually and converted into pattern-matching rules. Information extraction was used to incorporate numbers as features into a document representation model. We evaluated text classification models trained on such representation. Our experiments were performed with two document representation models (vector space model and word embedding model) and two classification models (support vector machines and neural networks). The results showed that even a handful of numerical features can significantly improve text classification performance. We conclude that commonly used document representations do not represent numbers in a way that machine learning algorithms can effectively utilize them as features. Although we demonstrated that traditional information extraction can be effective in converting numbers into features, further community-wide research is required to systematically incorporate number representation into the word embedding process.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"30 1","pages":"746-762"},"PeriodicalIF":0.0,"publicationDate":"2023-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83600174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Forest Fire Detection Algorithm Based on Improved YOLOv5 基于改进YOLOv5的森林火灾检测算法研究
Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-06-28 DOI: 10.3390/make5030039
Jianfeng Li, Xiao-Feng Lian
Forest fires are one of the world’s deadliest natural disasters. Early detection of forest fires can help minimize the damage to ecosystems and forest life. In this paper, we propose an improved fire detection method YOLOv5-IFFDM for YOLOv5. Firstly, the fire and smoke detection accuracy and the network perception accuracy of small targets are improved by adding an attention mechanism to the backbone network. Secondly, the loss function is improved and the SoftPool pyramid pooling structure is used to improve the regression accuracy and detection performance of the model and the robustness of the model. In addition, a random mosaic augmentation technique is used to enhance the data to increase the generalization ability of the model, and re-clustering of flame and smoke detection a priori frames are used to improve the accuracy and speed. Finally, the parameters of the convolutional and normalization layers of the trained model are homogeneously merged to further reduce the model processing load and to improve the detection speed. Experimental results on self-built forest-fire and smoke datasets show that this algorithm has high detection accuracy and fast detection speed, with average accuracy of fire up to 90.5% and smoke up to 84.3%, and detection speed up to 75 FPS (frames per second transmission), which can meet the requirements of real-time and efficient fire detection.
森林火灾是世界上最致命的自然灾害之一。早期发现森林火灾有助于将对生态系统和森林生物的损害降到最低。本文针对YOLOv5提出了一种改进的火灾探测方法YOLOv5- iffdm。首先,通过在骨干网中加入关注机制,提高了对小目标的火灾、烟雾探测精度和网络感知精度;其次,对损失函数进行改进,利用SoftPool金字塔池结构提高模型的回归精度和检测性能,增强模型的鲁棒性;此外,采用随机拼接增强技术对数据进行增强以提高模型的泛化能力,并采用火焰和烟雾检测先验帧重新聚类以提高模型的精度和速度。最后,对训练模型的卷积层和归一化层参数进行均匀合并,进一步降低模型处理负荷,提高检测速度。在自建森林火灾和烟雾数据集上的实验结果表明,该算法检测精度高,检测速度快,对火灾的平均检测精度可达90.5%,对烟雾的平均检测精度可达84.3%,检测速度可达75 FPS(帧/秒传输),能够满足实时、高效的火灾检测要求。
{"title":"Research on Forest Fire Detection Algorithm Based on Improved YOLOv5","authors":"Jianfeng Li, Xiao-Feng Lian","doi":"10.3390/make5030039","DOIUrl":"https://doi.org/10.3390/make5030039","url":null,"abstract":"Forest fires are one of the world’s deadliest natural disasters. Early detection of forest fires can help minimize the damage to ecosystems and forest life. In this paper, we propose an improved fire detection method YOLOv5-IFFDM for YOLOv5. Firstly, the fire and smoke detection accuracy and the network perception accuracy of small targets are improved by adding an attention mechanism to the backbone network. Secondly, the loss function is improved and the SoftPool pyramid pooling structure is used to improve the regression accuracy and detection performance of the model and the robustness of the model. In addition, a random mosaic augmentation technique is used to enhance the data to increase the generalization ability of the model, and re-clustering of flame and smoke detection a priori frames are used to improve the accuracy and speed. Finally, the parameters of the convolutional and normalization layers of the trained model are homogeneously merged to further reduce the model processing load and to improve the detection speed. Experimental results on self-built forest-fire and smoke datasets show that this algorithm has high detection accuracy and fast detection speed, with average accuracy of fire up to 90.5% and smoke up to 84.3%, and detection speed up to 75 FPS (frames per second transmission), which can meet the requirements of real-time and efficient fire detection.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"10 1","pages":"725-745"},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79529380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Machine Learning with Eye-Tracking Data to Predict if a Recruiter Will Approve a Resume 使用机器学习和眼球追踪数据来预测招聘人员是否会批准一份简历
Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-06-28 DOI: 10.3390/make5030038
Angel Pina, Corbin Petersheim, Josh Cherian, J. Lahey, Gerianne Alexander, T. Hammond
When job seekers are unsuccessful in getting a position, they often do not get feedback to inform them on how to develop a better application in the future. Therefore, there is a critical need to understand what qualifications recruiters value in order to help applicants. To address this need, we utilized eye-trackers to measure and record visual data of recruiters screening resumes to gain insight into which Areas of Interest (AOIs) influenced recruiters’ decisions the most. Using just this eye-tracking data, we trained a machine learning classifier to predict whether or not a recruiter would move a resume on to the next level of the hiring process with an AUC of 0.767. We found that features associated with recruiters looking outside the content of a resume were most predictive of their decision as well as total time viewing the resume and time spent on the Experience and Education sections. We hypothesize that this behavior is indicative of the recruiter reflecting on the content of the resume. These initial results show that applicants should focus on designing clear and concise resumes that are easy for recruiters to absorb and think about, with additional attention given to the Experience and Education sections.
当求职者找不到工作的时候,他们通常不会得到反馈,告诉他们如何在未来更好地申请工作。因此,为了帮助求职者,我们非常有必要了解招聘人员看重哪些资质。为了满足这一需求,我们利用眼动仪来测量和记录招聘人员筛选简历的视觉数据,以深入了解哪些兴趣领域(AOIs)对招聘人员的决定影响最大。仅使用这些眼球追踪数据,我们训练了一个机器学习分类器来预测招聘人员是否会将简历转移到招聘流程的下一个阶段,AUC为0.767。我们发现,与招聘人员看简历内容之外的特征有关的特征,以及看简历的总时间和花在经历和教育方面的时间,最能预测他们的决定。我们假设这种行为表明招聘人员对简历的内容进行了反思。这些初步结果表明,求职者应该专注于设计清晰简洁的简历,以便招聘人员容易吸收和思考,同时还要注意工作经历和教育背景部分。
{"title":"Using Machine Learning with Eye-Tracking Data to Predict if a Recruiter Will Approve a Resume","authors":"Angel Pina, Corbin Petersheim, Josh Cherian, J. Lahey, Gerianne Alexander, T. Hammond","doi":"10.3390/make5030038","DOIUrl":"https://doi.org/10.3390/make5030038","url":null,"abstract":"When job seekers are unsuccessful in getting a position, they often do not get feedback to inform them on how to develop a better application in the future. Therefore, there is a critical need to understand what qualifications recruiters value in order to help applicants. To address this need, we utilized eye-trackers to measure and record visual data of recruiters screening resumes to gain insight into which Areas of Interest (AOIs) influenced recruiters’ decisions the most. Using just this eye-tracking data, we trained a machine learning classifier to predict whether or not a recruiter would move a resume on to the next level of the hiring process with an AUC of 0.767. We found that features associated with recruiters looking outside the content of a resume were most predictive of their decision as well as total time viewing the resume and time spent on the Experience and Education sections. We hypothesize that this behavior is indicative of the recruiter reflecting on the content of the resume. These initial results show that applicants should focus on designing clear and concise resumes that are easy for recruiters to absorb and think about, with additional attention given to the Experience and Education sections.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"25 1","pages":"713-724"},"PeriodicalIF":0.0,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83289449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Drug-Drug Interaction Extraction from Biomedical Text Using Relation BioBERT with BLSTM 基于BioBERT和BLSTM的生物医学文本药物-药物相互作用提取
Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-06-10 DOI: 10.3390/make5020036
Maryam KafiKang, Abdeltawab Hendawi
In the context of pharmaceuticals, drug-drug interactions (DDIs) occur when two or more drugs interact, potentially altering the intended effects of the drugs and resulting in adverse patient health outcomes. Therefore, it is essential to identify and comprehend these interactions. In recent years, an increasing number of novel compounds have been discovered, resulting in the discovery of numerous new DDIs. There is a need for effective methods to extract and analyze DDIs, as the majority of this information is still predominantly located in biomedical articles and sources. Despite the development of various techniques, accurately predicting DDIs remains a significant challenge. This paper proposes a novel solution to this problem by leveraging the power of Relation BioBERT (R-BioBERT) to detect and classify DDIs and the Bidirectional Long Short-Term Memory (BLSTM) to improve the accuracy of predictions. In addition to determining whether two drugs interact, the proposed method also identifies the specific types of interactions between them. Results show that the use of BLSTM leads to significantly higher F-scores compared to our baseline model, as demonstrated on three well-known DDI extraction datasets that includes SemEval 2013, TAC 2018, and TAC 2019.
在药物领域,当两种或多种药物相互作用时,就会发生药物-药物相互作用(ddi),可能会改变药物的预期效果,并导致不良的患者健康结果。因此,识别和理解这些相互作用是至关重要的。近年来,越来越多的新化合物被发现,从而发现了许多新的ddi。需要有效的方法来提取和分析ddi,因为大多数此类信息仍然主要位于生物医学文章和来源中。尽管发展了各种技术,但准确预测ddi仍然是一个重大挑战。本文提出了一种新的解决方案,利用关系生物记忆(R-BioBERT)对ddi进行检测和分类,并利用双向长短期记忆(BLSTM)来提高预测的准确性。除了确定两种药物是否相互作用外,该方法还确定了它们之间相互作用的具体类型。结果表明,与我们的基线模型相比,使用BLSTM可以显著提高f分数,这在三个著名的DDI提取数据集(包括SemEval 2013、TAC 2018和TAC 2019)上得到了证明。
{"title":"Drug-Drug Interaction Extraction from Biomedical Text Using Relation BioBERT with BLSTM","authors":"Maryam KafiKang, Abdeltawab Hendawi","doi":"10.3390/make5020036","DOIUrl":"https://doi.org/10.3390/make5020036","url":null,"abstract":"In the context of pharmaceuticals, drug-drug interactions (DDIs) occur when two or more drugs interact, potentially altering the intended effects of the drugs and resulting in adverse patient health outcomes. Therefore, it is essential to identify and comprehend these interactions. In recent years, an increasing number of novel compounds have been discovered, resulting in the discovery of numerous new DDIs. There is a need for effective methods to extract and analyze DDIs, as the majority of this information is still predominantly located in biomedical articles and sources. Despite the development of various techniques, accurately predicting DDIs remains a significant challenge. This paper proposes a novel solution to this problem by leveraging the power of Relation BioBERT (R-BioBERT) to detect and classify DDIs and the Bidirectional Long Short-Term Memory (BLSTM) to improve the accuracy of predictions. In addition to determining whether two drugs interact, the proposed method also identifies the specific types of interactions between them. Results show that the use of BLSTM leads to significantly higher F-scores compared to our baseline model, as demonstrated on three well-known DDI extraction datasets that includes SemEval 2013, TAC 2018, and TAC 2019.","PeriodicalId":93033,"journal":{"name":"Machine learning and knowledge extraction","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135006472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Machine learning and knowledge extraction
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1