Performance Analysis of Rough Set–Based Hybrid Classification Systems in the Case of Missing Values
Pub Date: 2021-10-01 | DOI: 10.2478/jaiscr-2021-0018
R. Nowicki, R. Seliga, Dariusz Żelasko, Y. Hayashi
Abstract The paper presents a performance analysis of selected rough set-based classification systems. They are hybrid solutions designed to process information with missing values. Rough set-based classification systems combine various classification methods, such as support vector machines, k-nearest neighbours, fuzzy systems, and neural networks, with rough set theory. When all input values are available and take the form of real numbers, the structure of the classifier reduces to its non-rough set version. The performance of four such systems has been analysed based on classification results obtained for benchmark databases downloaded from the University of California at Irvine (UCI) Machine Learning Repository.
{"title":"Performance Analysis of Rough Set–Based Hybrid Classification Systems in the Case of Missing Values","authors":"R. Nowicki, R. Seliga, Dariusz Żelasko, Y. Hayashi","doi":"10.2478/jaiscr-2021-0018","DOIUrl":"https://doi.org/10.2478/jaiscr-2021-0018","url":null,"abstract":"Abstract The paper presents a performance analysis of a selected few rough set–based classification systems. They are hybrid solutions designed to process information with missing values. Rough set-–based classification systems combine various classification methods, such as support vector machines, k–nearest neighbour, fuzzy systems, and neural networks with the rough set theory. When all input values take the form of real numbers, and they are available, the structure of the classifier returns to a non–rough set version. The performance of the four systems has been analysed based on the classification results obtained for benchmark databases downloaded from the machine learning repository of the University of California at Irvine.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"11 1","pages":"307 - 318"},"PeriodicalIF":2.8,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41855870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Handwritten Word Recognition Using Fuzzy Matching Degrees
Pub Date: 2021-05-29 | DOI: 10.2478/jaiscr-2021-0014
Michal R. Wróbel, Janusz T. Starczewski, J. Fijałkowska, A. Siwocha, Christian Napoli
Abstract Handwritten text recognition systems interpret scanned script images as text composed of letters. In this paper, efficient offline methods using fuzzy degrees, as well as interval type-2 fuzzy degrees, are proposed to recognize letters that have first been decomposed into strokes. For such strokes, first-stage methods are used to create a set of hypotheses as to whether a group of strokes matches letter or digit patterns. Subsequently, second-stage methods are employed to select the most promising set of hypotheses with the use of fuzzy degrees. In the primary version of the second-stage system, standard fuzzy memberships are used to measure compatibility between strokes and character patterns. As an extension of the system thus created, interval type-2 fuzzy degrees are employed to select hypotheses that fit multiple handwriting typefaces.
{"title":"Handwritten Word Recognition Using Fuzzy Matching Degrees","authors":"Michal R. Wróbel, Janusz T. Starczewski, J. Fijałkowska, A. Siwocha, Christian Napoli","doi":"10.2478/jaiscr-2021-0014","DOIUrl":"https://doi.org/10.2478/jaiscr-2021-0014","url":null,"abstract":"Abstract Handwritten text recognition systems interpret the scanned script images as text composed of letters. In this paper, efficient offline methods using fuzzy degrees, as well as interval fuzzy degrees of type-2, are proposed to recognize letters beforehand decomposed into strokes. For such strokes, the first stage methods are used to create a set of hypotheses as to whether a group of strokes matches letter or digit patterns. Subsequently, the second-stage methods are employed to select the most promising set of hypotheses with the use of fuzzy degrees. In a primary version of the second-stage system, standard fuzzy memberships are used to measure compatibility between strokes and character patterns. As an extension of the system thus created, interval type-2 fuzzy degrees are employed to perform a selection of hypotheses that fit multiple handwriting typefaces.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"11 1","pages":"229 - 242"},"PeriodicalIF":2.8,"publicationDate":"2021-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45700583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Learning Novelty Detection Outside a Class of Random Curves with Application to COVID-19 Growth
Pub Date: 2021-05-29 | DOI: 10.2478/jaiscr-2021-0012
Wojciech Rafajłowicz
Abstract Let a class of proper curves be specified by positive examples only. We aim to propose a learning novelty detection algorithm that decides whether a new curve is outside this class or not. In contrast to the majority of the literature, two sources of curve variability are present, namely the variability inherent to curves from the proper class and observation errors. Therefore, a decision function is first trained on historical data, and then descriptors of each curve to be classified are learned from noisy observations. When the intrinsic variability is Gaussian, a decision threshold can be established from the Hotelling T² distribution and tuned to more general cases. Expansion coefficients in a selected orthogonal series are taken as descriptors, and an algorithm for learning them is proposed that follows nonparametric curve fitting approaches. Its fast version is derived for descriptors based on the cosine series. Additionally, the asymptotic normality of the learned descriptors and a bound for the probability of their large deviations are proved. The influence of this bound on the decision threshold is also discussed. The proposed approach covers curves described as functional data projected onto a finite-dimensional subspace of a Hilbert space, as well as a shape-sensitive description of curves known as the square-root velocity (SRV) representation. It was tested both on synthetic data and on real-life observations of COVID-19 growth curves.
{"title":"Learning Novelty Detection Outside a Class of Random Curves with Application to COVID-19 Growth","authors":"Wojciech Rafajłowicz","doi":"10.2478/jaiscr-2021-0012","DOIUrl":"https://doi.org/10.2478/jaiscr-2021-0012","url":null,"abstract":"Abstract Let a class of proper curves is specified by positive examples only. We aim to propose a learning novelty detection algorithm that decides whether a new curve is outside this class or not. In opposite to the majority of the literature, two sources of a curve variability are present, namely, the one inherent to curves from the proper class and observations errors’. Therefore, firstly a decision function is trained on historical data, and then, descriptors of each curve to be classified are learned from noisy observations.When the intrinsic variability is Gaussian, a decision threshold can be established from T 2 Hotelling distribution and tuned to more general cases. Expansion coefficients in a selected orthogonal series are taken as descriptors and an algorithm for their learning is proposed that follows nonparametric curve fitting approaches. Its fast version is derived for descriptors that are based on the cosine series. Additionally, the asymptotic normality of learned descriptors and the bound for the probability of their large deviations are proved. The influence of this bound on the decision threshold is also discussed.The proposed approach covers curves described as functional data projected onto a finite-dimensional subspace of a Hilbert space as well a shape sensitive description of curves, known as square-root velocity (SRV). It was tested both on synthetic data and on real-life observations of the COVID-19 growth curves.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"11 1","pages":"195 - 215"},"PeriodicalIF":2.8,"publicationDate":"2021-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44979290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A New Approach to Detection of Changes in Multidimensional Patterns - Part II
Pub Date: 2021-05-29 | DOI: 10.2478/jaiscr-2021-0013
T. Gałkowski, A. Krzyżak, Zofia Patora-Wysocka, Z. Filutowicz, Lipo Wang
Abstract In the paper we develop an algorithm based on the Parzen kernel estimate for the detection of sudden changes in 3-dimensional shapes that occur along edge curves. Such problems commonly arise in various areas of computer vision, e.g., in edge detection, bioinformatics and the processing of satellite imagery. In many engineering problems abrupt change detection may help in fault protection, e.g., the detection of jumps in functions describing the static and dynamic properties of objects in mechanical systems. The developed algorithm is nonparametric in nature and utilizes Parzen regression estimates of multivariate functions and their derivatives. In tests we apply this method, particularly but not exclusively, to functions of two variables.
{"title":"A New Approach to Detection of Changes in Multidimensional Patterns - Part II","authors":"T. Gałkowski, A. Krzyżak, Zofia Patora-Wysocka, Z. Filutowicz, Lipo Wang","doi":"10.2478/jaiscr-2021-0013","DOIUrl":"https://doi.org/10.2478/jaiscr-2021-0013","url":null,"abstract":"Abstract In the paper we develop an algorithm based on the Parzen kernel estimate for detection of sudden changes in 3-dimensional shapes which happen along the edge curves. Such problems commonly arise in various areas of computer vision, e.g., in edge detection, bioinformatics and processing of satellite imagery. In many engineering problems abrupt change detection may help in fault protection e.g. the jump detection in functions describing the static and dynamic properties of the objects in mechanical systems. We developed an algorithm for detecting abrupt changes which is nonparametric in nature and utilizes Parzen regression estimates of multivariate functions and their derivatives. In tests we apply this method, particularly but not exclusively, to the functions of two variables.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"11 1","pages":"217 - 227"},"PeriodicalIF":2.8,"publicationDate":"2021-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41727224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hardware Implementation of a Takagi-Sugeno Neuro-Fuzzy System Optimized by a Population Algorithm
Pub Date: 2021-05-29 | DOI: 10.2478/jaiscr-2021-0015
P. Dziwiński, A. Przybył, P. Trippner, J. Paszkowski, Y. Hayashi
Abstract Over the last several decades, neuro-fuzzy systems (NFS) have been widely analyzed and described in the literature because of their many advantages. They can model the uncertainty characteristic of human reasoning and offer universal approximation capability. These properties allow, for example, for the implementation of nonlinear control and modeling systems of better quality than would be possible with classical methods. However, according to the authors, the number of NFS applications deployed so far is not large enough. This is because the implementation of NFS on typical digital platforms, such as microcontrollers, has not led to sufficiently high performance. On the other hand, the world literature describes many cases of NFS hardware implementations in field-programmable gate arrays (FPGAs) offering sufficiently high performance. Unfortunately, the complexity and cost of such systems were so high that the solutions were not very successful. This paper proposes a method for the hardware implementation of MRBF-TS systems. Such systems are created by modifying a subclass of Takagi-Sugeno (TS) fuzzy-neural structures, i.e., the group of NFS functionally equivalent to networks with radial basis functions (RBF). The structure of the MRBF-TS is designed to be well suited to implementation on an FPGA. Thanks to this, it is possible to obtain both very high computing efficiency and high accuracy with relatively low consumption of hardware resources. This paper describes both the method of implementing MRBF-TS structures on an FPGA and the method of designing such structures based on a population algorithm. The described solution allows for the implementation of control or modeling systems whose realization has so far been impossible for technical or economic reasons.
{"title":"Hardware Implementation of a Takagi-Sugeno Neuro-Fuzzy System Optimized by a Population Algorithm","authors":"P. Dziwiński, A. Przybył, P. Trippner, J. Paszkowski, Y. Hayashi","doi":"10.2478/jaiscr-2021-0015","DOIUrl":"https://doi.org/10.2478/jaiscr-2021-0015","url":null,"abstract":"Abstract Over the last several decades, neuro-fuzzy systems (NFS) have been widely analyzed and described in the literature because of their many advantages. They can model the uncertainty characteristic of human reasoning and the possibility of a universal approximation. These properties allow, for example, for the implementation of nonlinear control and modeling systems of better quality than would be possible with the use of classical methods. However, according to the authors, the number of NFS applications deployed so far is not large enough. This is because the implementation of NFS on typical digital platforms, such as, for example, microcontrollers, has not led to sufficiently high performance. On the other hand, the world literature describes many cases of NFS hardware implementation in programmable gate arrays (FPGAs) offering sufficiently high performance. Unfortunately, the complexity and cost of such systems were so high that the solutions were not very successful. This paper proposes a method of the hardware implementation of MRBF-TS systems. Such systems are created by modifying a subclass of Takagi-Sugeno (TS) fuzzy-neural structures, i.e. the NFS group functionally equivalent to networks with radial basis functions (RBF). The structure of the MRBF-TS is designed to be well suited to the implementation on an FPGA. Thanks to this, it is possible to obtain both very high computing efficiency and high accuracy with relatively low consumption of hardware resources. This paper describes both, the method of implementing MRBFTS type structures on the FPGA and the method of designing such structures based on the population algorithm. The described solution allows for the implementation of control or modeling systems, the implementation of which was impossible so far due to technical or economic reasons.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"11 1","pages":"243 - 266"},"PeriodicalIF":2.8,"publicationDate":"2021-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42812454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Bandwidth Selection for Kernel Generalized Regression Neural Networks in Identification of Hammerstein Systems
Pub Date: 2021-05-29 | DOI: 10.2478/jaiscr-2021-0011
Jiaqing Lv, M. Pawlak
Abstract This paper addresses the issue of data-driven smoothing parameter (bandwidth) selection in the context of nonparametric identification of dynamic systems. In particular, we examine the identification problem of the block-oriented Hammerstein cascade system. A class of kernel-type Generalized Regression Neural Networks (GRNN) is employed as the identification algorithm. The statistical accuracy of the kernel GRNN estimate is critically influenced by the choice of the bandwidth. Given the need for data-driven bandwidth specification, we propose several automatic selection methods, which are compared by means of simulation studies. Our experiments reveal that the method referred to as the partitioned cross-validation algorithm can be recommended as a practical procedure for choosing the bandwidth of the kernel GRNN estimate, in terms of both its statistical accuracy and its implementation aspects.
{"title":"Bandwidth Selection for Kernel Generalized Regression Neural Networks in Identification of Hammerstein Systems","authors":"Jiaqing Lv, M. Pawlak","doi":"10.2478/jaiscr-2021-0011","DOIUrl":"https://doi.org/10.2478/jaiscr-2021-0011","url":null,"abstract":"Abstract This paper addresses the issue of data-driven smoothing parameter (bandwidth) selection in the context of nonparametric system identification of dynamic systems. In particular, we examine the identification problem of the block-oriented Hammerstein cascade system. A class of kernel-type Generalized Regression Neural Networks (GRNN) is employed as the identification algorithm. The statistical accuracy of the kernel GRNN estimate is critically influenced by the choice of the bandwidth. Given the need of data-driven bandwidth specification we propose several automatic selection methods that are compared by means of simulation studies. Our experiments reveal that the method referred to as the partitioned cross-validation algorithm can be recommended as the practical procedure for the bandwidth choice for the kernel GRNN estimate in terms of its statistical accuracy and implementation aspects.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"11 1","pages":"181 - 194"},"PeriodicalIF":2.8,"publicationDate":"2021-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42312435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Machine Learning and Traditional Econometric Models: A Systematic Mapping Study
Pub Date: 2021-04-01 | DOI: 10.2478/jaiscr-2022-0006
M. Pérez-Pons, Javier Parra-Domínguez, S. Omatu, E. Herrera-Viedma, J. Corchado
Abstract Context: Machine Learning (ML) is a disruptive concept that has given rise to, and generated interest in, different applications in many fields of study. The purpose of Machine Learning is to solve real-life problems by automatically learning from and improving with experience, without being explicitly programmed for a specific problem but rather for a generic type of problem. This article reviews the different applications of ML in a series of econometric methods. Objective: The objective of this research is to identify the latest applications and conduct a comparative study of the performance of econometric and ML models. The study aimed to find empirical evidence on whether the performance of ML algorithms is superior to that of traditional econometric models. The methodology of systematic mapping of the literature has been followed to carry out this research, according to the guidelines established by [39] and [58], which facilitate the identification of studies published on this subject. Results: The results show that in most cases ML outperforms econometric models, while in other cases the best performance has been achieved by combining traditional methods and ML applications. Conclusion: Inclusion and exclusion criteria have been applied, and 52 closely related articles have been reviewed. The conclusion drawn from this research is that this is a growing field, and that there is no certainty that the performance of ML is always superior to that of econometric models.
{"title":"Machine Learning and Traditional Econometric Models: A Systematic Mapping Study","authors":"M. Pérez-Pons, Javier Parra-Domínguez, S. Omatu, E. Herrera-Viedma, J. Corchado","doi":"10.2478/jaiscr-2022-0006","DOIUrl":"https://doi.org/10.2478/jaiscr-2022-0006","url":null,"abstract":"Abstract Context: Machine Learning (ML) is a disruptive concept that has given rise to and generated interest in different applications in many fields of study. The purpose of Machine Learning is to solve real-life problems by automatically learning and improving from experience without being explicitly programmed for a specific problem, but for a generic type of problem. This article approaches the different applications of ML in a series of econometric methods. Objective: The objective of this research is to identify the latest applications and do a comparative study of the performance of econometric and ML models. The study aimed to find empirical evidence for the performance of ML algorithms being superior to traditional econometric models. The Methodology of systematic mapping of literature has been followed to carry out this research, according to the guidelines established by [39], and [58] that facilitate the identification of studies published about this subject. Results: The results show, that in most cases ML outperforms econometric models, while in other cases the best performance has been achieved by combining traditional methods and ML applications. Conclusion: inclusion and exclusions criteria have been applied and 52 articles closely related articles have been reviewed. The conclusion drawn from this research is that it is a field that is growing, which is something that is well known nowadays and that there is no certainty as to the performance of ML being always superior to that of econometric models.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"12 1","pages":"79 - 100"},"PeriodicalIF":2.8,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45119459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A Progressive and Cross-Domain Deep Transfer Learning Framework for Wrist Fracture Detection
Pub Date: 2021-04-01 | DOI: 10.2478/jaiscr-2022-0007
Christophe Karam, Julia El Zini, M. Awad, C. Saade, L. Naffaa, Mohammad Ali K. El Amine
Abstract There has been an amplified focus on, and benefit from, the adoption of artificial intelligence (AI) in medical imaging applications. However, deep learning approaches involve training with massive amounts of annotated data in order to guarantee generalization and achieve high accuracies. Gathering and annotating large sets of training images require expertise that is both expensive and time-consuming, especially in the medical field. Furthermore, in health care systems where mistakes can have catastrophic consequences, there is a general mistrust of the black-box aspect of AI models. In this work, we focus on improving the performance of medical imaging applications when limited data is available, while also addressing the interpretability of the proposed AI model. This is achieved by employing a novel transfer learning framework, progressive transfer learning, an automated annotation technique, and a correlation analysis experiment on the learned representations. Progressive transfer learning helps jump-start the training of deep neural networks while improving performance by gradually transferring knowledge from two source tasks into the target task. It is empirically tested on the wrist fracture detection application by first training a general radiology network, RadiNet, and using its weights to initialize RadiNet_wrist, which is then trained on wrist images to detect fractures. Experiments show that RadiNet_wrist achieves an accuracy of 87% and a ROC AUC of 94%, as opposed to 83% and 92% when it is pre-trained on the ImageNet dataset. This improvement in performance is investigated within an explainable AI framework. More concretely, the learned deep representations of RadiNet_wrist are compared to those learned by the baseline model by conducting a correlation analysis experiment. The results show that, when transfer learning is applied gradually, some features are learned earlier in the network. Moreover, the deep layers in the progressive transfer learning framework are shown to encode features that are not encountered when traditional transfer learning techniques are applied. In addition to the empirical results, a clinical study is conducted in which the performance of RadiNet_wrist is compared to that of an expert radiologist. We found that RadiNet_wrist exhibited performance similar to that of radiologists with more than 20 years of experience. This motivates follow-up research to train on more data in order to feasibly surpass radiologists' performance, and to investigate the interpretability of AI models in the healthcare domain, where the decision-making process needs to be credible and transparent.
{"title":"A Progressive and Cross-Domain Deep Transfer Learning Framework for Wrist Fracture Detection","authors":"Christophe Karam, Julia El Zini, M. Awad, C. Saade, L. Naffaa, Mohammad Ali K. El Amine","doi":"10.2478/jaiscr-2022-0007","DOIUrl":"https://doi.org/10.2478/jaiscr-2022-0007","url":null,"abstract":"Abstract There has been an amplified focus on and benefit from the adoption of artificial intelligence (AI) in medical imaging applications. However, deep learning approaches involve training with massive amounts of annotated data in order to guarantee generalization and achieve high accuracies. Gathering and annotating large sets of training images require expertise which is both expensive and time-consuming, especially in the medical field. Furthermore, in health care systems where mistakes can have catastrophic consequences, there is a general mistrust in the black-box aspect of AI models. In this work, we focus on improving the performance of medical imaging applications when limited data is available while focusing on the interpretability aspect of the proposed AI model. This is achieved by employing a novel transfer learning framework, progressive transfer learning, an automated annotation technique and a correlation analysis experiment on the learned representations. Progressive transfer learning helps jump-start the training of deep neural networks while improving the performance by gradually transferring knowledge from two source tasks into the target task. It is empirically tested on the wrist fracture detection application by first training a general radiology network RadiNet and using its weights to initialize RadiNetwrist, that is trained on wrist images to detect fractures. Experiments show that RadiNetwrist achieves an accuracy of 87% and an AUC ROC of 94% as opposed to 83% and 92% when it is pre-trained on the ImageNet dataset. This improvement in performance is investigated within an explainable AI framework. More concretely, the learned deep representations of RadiNetwrist are compared to those learned by the baseline model by conducting a correlation analysis experiment. The results show that, when transfer learning is gradually applied, some features are learned earlier in the network. Moreover, the deep layers in the progressive transfer learning framework are shown to encode features that are not encountered when traditional transfer learning techniques are applied. In addition to the empirical results, a clinical study is conducted and the performance of RadiNetwrist is compared to that of an expert radiologist. We found that RadiNetwrist exhibited similar performance to that of radiologists with more than 20 years of experience. 
This motivates follow-up research to train on more data to feasibly surpass radiologists’ performance, and investigate the interpretability of AI models in the healthcare domain where the decision-making process needs to be credible and transparent.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"12 1","pages":"101 - 120"},"PeriodicalIF":2.8,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43988768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
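A schematic sketch of the two-stage transfer described above, written with torch; the tiny `make_radinet` architecture and the training stubs are stand-ins, not the paper's network.

```python
import torch
import torch.nn as nn

def make_radinet(num_classes):
    # a toy CNN standing in for the RadiNet backbone + classification head
    return nn.Sequential(
        nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.AdaptiveAvgPool2d(1),
        nn.Flatten(), nn.Linear(32, num_classes),
    )

# stage 1: a general radiology network trained on a broad source task
radinet = make_radinet(num_classes=14)
# ... train radinet on the large source dataset here ...

# stage 2: initialize RadiNet_wrist from radinet's backbone, re-create only the
# head (binary fracture / no-fracture), then fine-tune on wrist images
radinet_wrist = make_radinet(num_classes=2)
backbone = {k: v for k, v in radinet.state_dict().items() if not k.startswith("7.")}
radinet_wrist.load_state_dict(backbone, strict=False)  # copy shared weights only

opt = torch.optim.Adam(radinet_wrist.parameters(), lr=1e-4)
x, y = torch.randn(8, 1, 64, 64), torch.randint(0, 2, (8,))  # dummy wrist batch
loss = nn.functional.cross_entropy(radinet_wrist(x), y)
loss.backward(); opt.step()
```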
Position-Encoding Convolutional Network to Solving Connected Text Captcha
Pub Date: 2021-04-01 | DOI: 10.2478/jaiscr-2022-0008
Ke Qing, Rongsheng Zhang
Abstract Text-based CAPTCHA is a convenient and effective safety mechanism that has been widely deployed across websites. Efficient end-to-end models of scene text recognition, consisting of a CNN and an attention-based RNN, show limited performance in solving text-based CAPTCHAs. In contrast with street-view images and documents, the character sequence in a CAPTCHA is non-semantic. The RNN thus loses its ability to learn the semantic context and only implicitly encodes the relative position of the extracted features. Meanwhile, the security features, which prevent characters from being segmented and recognized, greatly increase the complexity of CAPTCHAs. The performance of such a model is sensitive to different CAPTCHA schemes. In this paper, we analyze the properties of text-based CAPTCHAs and accordingly treat solving them as a highly position-relative character sequence recognition task. We propose a network named PosConv to leverage the position information in the character sequence without an RNN. PosConv uses a novel padding strategy and a modified convolution, explicitly encoding the relative position into the local features of characters. This mechanism makes the features extracted from CAPTCHAs more informative and robust. We validate PosConv on six text-based CAPTCHA schemes, and it achieves state-of-the-art or competitive recognition accuracy with significantly fewer parameters and faster convergence.
{"title":"Position-Encoding Convolutional Network to Solving Connected Text Captcha","authors":"Ke Qing, Rongsheng Zhang","doi":"10.2478/jaiscr-2022-0008","DOIUrl":"https://doi.org/10.2478/jaiscr-2022-0008","url":null,"abstract":"Abstract Text-based CAPTCHA is a convenient and effective safety mechanism that has been widely deployed across websites. The efficient end-to-end models of scene text recognition consisting of CNN and attention-based RNN show limited performance in solving text-based CAPTCHAs. In contrast with the street view image and document, the character sequence in CAPTCHA is non-semantic. The RNN loses its ability to learn the semantic context and only implicitly encodes the relative position of extracted features. Meanwhile, the security features, which prevent characters from segmentation and recognition, extensively increase the complexity of CAPTCHAs. The performance of this model is sensitive to different CAPTCHA schemes. In this paper, we analyze the properties of the text-based CAPTCHA and accordingly consider solving it as a highly position-relative character sequence recognition task. We propose a network named PosConv to leverage the position information in the character sequence without RNN. PosConv uses a novel padding strategy and modified convolution, explicitly encoding the relative position into the local features of characters. This mechanism of PosConv makes the extracted features from CAPTCHAs more informative and robust. We validate PosConv on six text-based CAPTCHA schemes, and it achieves state-of-the-art or competitive recognition accuracy with significantly fewer parameters and faster convergence speed.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"12 1","pages":"121 - 133"},"PeriodicalIF":2.8,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42874156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Handling Realistic Noise in Multi-Agent Systems with Self-Supervised Learning and Curiosity
Pub Date: 2021-04-01 | DOI: 10.2478/jaiscr-2022-0009
Marton Szemenyei, Patrik Reizinger
Abstract Most reinforcement learning benchmarks – especially in multi-agent tasks – do not go beyond observations with simple noise; nonetheless, real scenarios induce more elaborate vision pipeline failures: false sightings, misclassifications or occlusion. In this work, we propose a lightweight, 2D environment for robot soccer and autonomous driving that can emulate the above discrepancies. Besides establishing a benchmark for accessible multi-agent reinforcement learning research, our work addresses the challenges the simulator imposes. For handling realistic noise, we use self-supervised learning to enhance scene reconstruction and extend curiosity-driven learning to model longer horizons. Our extensive experiments show that the proposed methods achieve state-of-the-art performance compared against actor-critic methods, ICM, and PPO.
{"title":"Handling Realistic Noise in Multi-Agent Systems with Self-Supervised Learning and Curiosity","authors":"Marton Szemenyei, Patrik Reizinger","doi":"10.2478/jaiscr-2022-0009","DOIUrl":"https://doi.org/10.2478/jaiscr-2022-0009","url":null,"abstract":"Abstract 1Most reinforcement learning benchmarks – especially in multi-agent tasks – do not go beyond observations with simple noise; nonetheless, real scenarios induce more elaborate vision pipeline failures: false sightings, misclassifications or occlusion. In this work, we propose a lightweight, 2D environment for robot soccer and autonomous driving that can emulate the above discrepancies. Besides establishing a benchmark for accessible multi-agent reinforcement learning research, our work addresses the challenges the simulator imposes. For handling realistic noise, we use self-supervised learning to enhance scene reconstruction and extend curiosity-driven learning to model longer horizons. Our extensive experiments show that the proposed methods achieve state-of-the-art performance, compared against actor-critic methods, ICM, and PPO.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"12 1","pages":"135 - 148"},"PeriodicalIF":2.8,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43654733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}