首页 > 最新文献

Journal of Artificial Intelligence and Soft Computing Research最新文献

英文 中文
Towards Effective Music Therapy for Mental Health Care Using Machine Learning Tools: Human Affective Reasoning and Music Genres 使用机器学习工具实现心理健康护理的有效音乐治疗:人类情感推理和音乐流派
IF 2.8 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2020-12-03 DOI: 10.2478/jaiscr-2021-0001
J. Rahman, Tom Gedeon, Sabrina Caldwell, Richard Jones, Zi Jin
Abstract Music has the ability to evoke different emotions in people, which is reflected in their physiological signals. Advances in affective computing have introduced computational methods to analyse these signals and understand the relationship between music and emotion in greater detail. We analyse Electrodermal Activity (EDA), Blood Volume Pulse (BVP), Skin Temperature (ST) and Pupil Dilation (PD) collected from 24 participants while they listen to 12 pieces from 3 different genres of music. A set of 34 features were extracted from each signal and 6 different feature selection methods were applied to identify useful features. Empirical analysis shows that a neural network (NN) with a set of features extracted from the physiological signals can achieve 99.2% accuracy in differentiating among the 3 music genres. The model also reaches 98.5% accuracy in classification based on participants’ subjective rating of emotion. The paper also identifies some useful features to improve accuracy of the classification models. Furthermore, we introduce a new technique called ’Gingerbread Animation’ to visualise the physiological signals we record as a video, and to make these signals more comprehensible to the human eye, and also appropriate for computer vision techniques such as Convolutional Neural Networks (CNNs). Our results overall provide a strong motivation to investigate the relationship between physiological signals and music, which can lead to improvements in music therapy for mental health care and musicogenic epilepsy reduction (our long term goal).
摘要音乐能够唤起人们不同的情绪,这反映在人们的生理信号中。情感计算的进步引入了计算方法来分析这些信号,并更详细地理解音乐和情感之间的关系。我们分析了24名参与者在听3种不同音乐类型的12首作品时收集的皮肤电活动(EDA)、血容量脉冲(BVP)、皮肤温度(ST)和瞳孔扩张(PD)。从每个信号中提取一组34个特征,并应用6种不同的特征选择方法来识别有用的特征。经验分析表明,具有从生理信号中提取的一组特征的神经网络(NN)在区分三种音乐流派方面可以达到99.2%的准确率。基于参与者对情绪的主观评价,该模型的分类准确率也达到98.5%。本文还确定了一些有用的特征,以提高分类模型的准确性。此外,我们引入了一种名为“姜饼动画”的新技术,将我们录制的生理信号可视化为视频,使这些信号更易于人眼理解,也适用于卷积神经网络(CNNs)等计算机视觉技术。总体而言,我们的研究结果为研究生理信号与音乐之间的关系提供了强有力的动力,这可以改善音乐治疗的心理健康护理和减少音乐性癫痫(我们的长期目标)。
{"title":"Towards Effective Music Therapy for Mental Health Care Using Machine Learning Tools: Human Affective Reasoning and Music Genres","authors":"J. Rahman, Tom Gedeon, Sabrina Caldwell, Richard Jones, Zi Jin","doi":"10.2478/jaiscr-2021-0001","DOIUrl":"https://doi.org/10.2478/jaiscr-2021-0001","url":null,"abstract":"Abstract Music has the ability to evoke different emotions in people, which is reflected in their physiological signals. Advances in affective computing have introduced computational methods to analyse these signals and understand the relationship between music and emotion in greater detail. We analyse Electrodermal Activity (EDA), Blood Volume Pulse (BVP), Skin Temperature (ST) and Pupil Dilation (PD) collected from 24 participants while they listen to 12 pieces from 3 different genres of music. A set of 34 features were extracted from each signal and 6 different feature selection methods were applied to identify useful features. Empirical analysis shows that a neural network (NN) with a set of features extracted from the physiological signals can achieve 99.2% accuracy in differentiating among the 3 music genres. The model also reaches 98.5% accuracy in classification based on participants’ subjective rating of emotion. The paper also identifies some useful features to improve accuracy of the classification models. Furthermore, we introduce a new technique called ’Gingerbread Animation’ to visualise the physiological signals we record as a video, and to make these signals more comprehensible to the human eye, and also appropriate for computer vision techniques such as Convolutional Neural Networks (CNNs). Our results overall provide a strong motivation to investigate the relationship between physiological signals and music, which can lead to improvements in music therapy for mental health care and musicogenic epilepsy reduction (our long term goal).","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"11 1","pages":"5 - 20"},"PeriodicalIF":2.8,"publicationDate":"2020-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43287354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
An Optimized Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks 非迭代训练递归神经网络的优化并行实现
IF 2.8 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2020-12-03 DOI: 10.2478/jaiscr-2021-0003
Julia El Zini, Yara Rizk, M. Awad
Abstract Recurrent neural networks (RNN) have been successfully applied to various sequential decision-making tasks, natural language processing applications, and time-series predictions. Such networks are usually trained through back-propagation through time (BPTT) which is prohibitively expensive, especially when the length of the time dependencies and the number of hidden neurons increase. To reduce the training time, extreme learning machines (ELMs) have been recently applied to RNN training, reaching a 99% speedup on some applications. Due to its non-iterative nature, ELM training, when parallelized, has the potential to reach higher speedups than BPTT. In this work, we present Opt-PR-ELM, an optimized parallel RNN training algorithm based on ELM that takes advantage of the GPU shared memory and of parallel QR factorization algorithms to efficiently reach optimal solutions. The theoretical analysis of the proposed algorithm is presented on six RNN architectures, including LSTM and GRU, and its performance is empirically tested on ten time-series prediction applications. Opt-PR-ELM is shown to reach up to 461 times speedup over its sequential counterpart and to require up to 20x less time to train than parallel BPTT. Such high speedups over new generation CPUs are extremely crucial in real-time applications and IoT environments.
递归神经网络(RNN)已经成功地应用于各种顺序决策任务、自然语言处理应用和时间序列预测。这种网络通常通过时间反向传播(BPTT)进行训练,这是非常昂贵的,特别是当时间依赖关系的长度和隐藏神经元的数量增加时。为了减少训练时间,极限学习机(elm)最近被应用于RNN训练,在一些应用中达到了99%的加速。由于ELM训练的非迭代性质,当并行化时,它有可能达到比BPTT更高的速度。在这项工作中,我们提出了Opt-PR-ELM,一种基于ELM的优化并行RNN训练算法,它利用GPU共享内存和并行QR分解算法来有效地获得最优解。在LSTM和GRU等6种RNN体系结构上对该算法进行了理论分析,并在10个时间序列预测应用中对其性能进行了实证检验。与并行BPTT相比,Opt-PR-ELM的速度提高了461倍,所需的训练时间减少了20倍。在实时应用和物联网环境中,新一代cpu的高速度至关重要。
{"title":"An Optimized Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks","authors":"Julia El Zini, Yara Rizk, M. Awad","doi":"10.2478/jaiscr-2021-0003","DOIUrl":"https://doi.org/10.2478/jaiscr-2021-0003","url":null,"abstract":"Abstract Recurrent neural networks (RNN) have been successfully applied to various sequential decision-making tasks, natural language processing applications, and time-series predictions. Such networks are usually trained through back-propagation through time (BPTT) which is prohibitively expensive, especially when the length of the time dependencies and the number of hidden neurons increase. To reduce the training time, extreme learning machines (ELMs) have been recently applied to RNN training, reaching a 99% speedup on some applications. Due to its non-iterative nature, ELM training, when parallelized, has the potential to reach higher speedups than BPTT. In this work, we present Opt-PR-ELM, an optimized parallel RNN training algorithm based on ELM that takes advantage of the GPU shared memory and of parallel QR factorization algorithms to efficiently reach optimal solutions. The theoretical analysis of the proposed algorithm is presented on six RNN architectures, including LSTM and GRU, and its performance is empirically tested on ten time-series prediction applications. Opt-PR-ELM is shown to reach up to 461 times speedup over its sequential counterpart and to require up to 20x less time to train than parallel BPTT. Such high speedups over new generation CPUs are extremely crucial in real-time applications and IoT environments.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"11 1","pages":"33 - 50"},"PeriodicalIF":2.8,"publicationDate":"2020-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47022911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Data-Driven Temporal-Spatial Model for the Prediction of AQI in Nanjing 数据驱动的南京空气质量指数时空预测模型
IF 2.8 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2020-06-15 DOI: 10.2478/jaiscr-2020-0017
Xuan Zhao, Meichen Song, Anqi Liu, Yiming Wang, Tong Wang, Jinde Cao
Abstract Air quality data prediction in urban area is of great significance to control air pollution and protect the public health. The prediction of the air quality in the monitoring station is well studied in existing researches. However, air-quality-monitor stations are insufficient in most cities and the air quality varies from one place to another dramatically due to complex factors. A novel model is established in this paper to estimate and predict the Air Quality Index (AQI) of the areas without monitoring stations in Nanjing. The proposed model predicts AQI in a non-monitoring area both in temporal dimension and in spatial dimension respectively. The temporal dimension model is presented at first based on the enhanced k-Nearest Neighbor (KNN) algorithm to predict the AQI values among monitoring stations, the acceptability of the results achieves 92% for one-hour prediction. Meanwhile, in order to forecast the evolution of air quality in the spatial dimension, the method is utilized with the help of Back Propagation neural network (BP), which considers geographical distance. Furthermore, to improve the accuracy and adaptability of the spatial model, the similarity of topological structure is introduced. Especially, the temporal-spatial model is built and its adaptability is tested on a specific non-monitoring site, Jiulonghu Campus of Southeast University. The result demonstrates that the acceptability achieves 73.8% on average. The current paper provides strong evidence suggesting that the proposed non-parametric and data-driven approach for air quality forecasting provides promising results.
摘要城市空气质量数据预测对控制空气污染、保护公众健康具有重要意义。现有的研究对监测站的空气质量预测进行了深入的研究。然而,大多数城市的空气质量监测站不足,由于复杂的因素,各地的空气质量差异很大。本文建立了一个新的模型来估计和预测南京市无监测站地区的空气质量指数。所提出的模型分别在时间维度和空间维度上预测非监测区域的AQI。首先提出了基于增强的k近邻(KNN)算法的时间维模型来预测监测站之间的AQI值,一小时预测结果的可接受性达到92%。同时,为了在空间维度上预测空气质量的演变,该方法借助于考虑地理距离的反向传播神经网络(BP)。此外,为了提高空间模型的准确性和适应性,引入了拓扑结构的相似性。特别是在东南大学九龙湖校区非监测点建立了时空模型,并对其适应性进行了测试。结果表明,可接受性平均达到73.8%。目前的论文提供了强有力的证据,表明所提出的空气质量预测的非参数和数据驱动方法提供了有希望的结果。
{"title":"Data-Driven Temporal-Spatial Model for the Prediction of AQI in Nanjing","authors":"Xuan Zhao, Meichen Song, Anqi Liu, Yiming Wang, Tong Wang, Jinde Cao","doi":"10.2478/jaiscr-2020-0017","DOIUrl":"https://doi.org/10.2478/jaiscr-2020-0017","url":null,"abstract":"Abstract Air quality data prediction in urban area is of great significance to control air pollution and protect the public health. The prediction of the air quality in the monitoring station is well studied in existing researches. However, air-quality-monitor stations are insufficient in most cities and the air quality varies from one place to another dramatically due to complex factors. A novel model is established in this paper to estimate and predict the Air Quality Index (AQI) of the areas without monitoring stations in Nanjing. The proposed model predicts AQI in a non-monitoring area both in temporal dimension and in spatial dimension respectively. The temporal dimension model is presented at first based on the enhanced k-Nearest Neighbor (KNN) algorithm to predict the AQI values among monitoring stations, the acceptability of the results achieves 92% for one-hour prediction. Meanwhile, in order to forecast the evolution of air quality in the spatial dimension, the method is utilized with the help of Back Propagation neural network (BP), which considers geographical distance. Furthermore, to improve the accuracy and adaptability of the spatial model, the similarity of topological structure is introduced. Especially, the temporal-spatial model is built and its adaptability is tested on a specific non-monitoring site, Jiulonghu Campus of Southeast University. The result demonstrates that the acceptability achieves 73.8% on average. The current paper provides strong evidence suggesting that the proposed non-parametric and data-driven approach for air quality forecasting provides promising results.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"10 1","pages":"255 - 270"},"PeriodicalIF":2.8,"publicationDate":"2020-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49397652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Browser Fingerprint Coding Methods Increasing the Effectiveness of User Identification in the Web Traffic 浏览器指纹编码方法提高了Web流量中用户识别的有效性
IF 2.8 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2020-06-15 DOI: 10.2478/jaiscr-2020-0016
Marcin Gabryel, K. Grzanek, Y. Hayashi
Abstract Web-based browser fingerprint (or device fingerprint) is a tool used to identify and track user activity in web traffic. It is also used to identify computers that are abusing online advertising and also to prevent credit card fraud. A device fingerprint is created by extracting multiple parameter values from a browser API (e.g. operating system type or browser version). The acquired parameter values are then used to create a hash using the hash function. The disadvantage of using this method is too high susceptibility to small, normally occurring changes (e.g. when changing the browser version number or screen resolution). Minor changes in the input values generate a completely different fingerprint hash, making it impossible to find similar ones in the database. On the other hand, omitting these unstable values when creating a hash, significantly limits the ability of the fingerprint to distinguish between devices. This weak point is commonly exploited by fraudsters who knowingly evade this form of protection by deliberately changing the value of device parameters. The paper presents methods that significantly limit this type of activity. New algorithms for coding and comparing fingerprints are presented, in which the values of parameters with low stability and low entropy are especially taken into account. The fingerprint generation methods are based on popular Minhash, the LSH, and autoencoder methods. The effectiveness of coding and comparing each of the presented methods was also examined in comparison with the currently used hash generation method. Authentic data of the devices and browsers of users visiting 186 different websites were collected for the research.
摘要基于Web的浏览器指纹(或设备指纹)是一种用于识别和跟踪网络流量中用户活动的工具。它还用于识别滥用在线广告的计算机,并防止信用卡欺诈。通过从浏览器API提取多个参数值(例如,操作系统类型或浏览器版本)来创建设备指纹。然后,所获取的参数值用于使用哈希函数创建哈希。使用这种方法的缺点是对通常发生的小变化(例如,在更改浏览器版本号或屏幕分辨率时)的敏感性太高。输入值的微小更改会生成完全不同的指纹哈希,因此无法在数据库中找到类似的指纹哈希。另一方面,在创建哈希时忽略这些不稳定的值,极大地限制了指纹在设备之间进行区分的能力。欺诈者通常会利用这一弱点,故意改变设备参数的值来逃避这种形式的保护。本文提出了显著限制这类活动的方法。提出了一种新的指纹编码和比较算法,其中特别考虑了低稳定性和低熵的参数值。指纹生成方法基于流行的Minhash、LSH和自动编码器方法。与目前使用的哈希生成方法相比,还检查了编码和比较每种方法的有效性。本研究收集了访问186个不同网站的用户的设备和浏览器的真实数据。
{"title":"Browser Fingerprint Coding Methods Increasing the Effectiveness of User Identification in the Web Traffic","authors":"Marcin Gabryel, K. Grzanek, Y. Hayashi","doi":"10.2478/jaiscr-2020-0016","DOIUrl":"https://doi.org/10.2478/jaiscr-2020-0016","url":null,"abstract":"Abstract Web-based browser fingerprint (or device fingerprint) is a tool used to identify and track user activity in web traffic. It is also used to identify computers that are abusing online advertising and also to prevent credit card fraud. A device fingerprint is created by extracting multiple parameter values from a browser API (e.g. operating system type or browser version). The acquired parameter values are then used to create a hash using the hash function. The disadvantage of using this method is too high susceptibility to small, normally occurring changes (e.g. when changing the browser version number or screen resolution). Minor changes in the input values generate a completely different fingerprint hash, making it impossible to find similar ones in the database. On the other hand, omitting these unstable values when creating a hash, significantly limits the ability of the fingerprint to distinguish between devices. This weak point is commonly exploited by fraudsters who knowingly evade this form of protection by deliberately changing the value of device parameters. The paper presents methods that significantly limit this type of activity. New algorithms for coding and comparing fingerprints are presented, in which the values of parameters with low stability and low entropy are especially taken into account. The fingerprint generation methods are based on popular Minhash, the LSH, and autoencoder methods. The effectiveness of coding and comparing each of the presented methods was also examined in comparison with the currently used hash generation method. Authentic data of the devices and browsers of users visiting 186 different websites were collected for the research.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"10 1","pages":"243 - 253"},"PeriodicalIF":2.8,"publicationDate":"2020-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45328315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Local Levenberg-Marquardt Algorithm for Learning Feedforwad Neural Networks 前馈神经网络学习的局部Levenberg-Marquardt算法
IF 2.8 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2020-06-15 DOI: 10.2478/jaiscr-2020-0020
J. Bilski, Bartosz Kowalczyk, A. Marchlewska, J. Zurada
Abstract This paper presents a local modification of the Levenberg-Marquardt algorithm (LM). First, the mathematical basics of the classic LM method are shown. The classic LM algorithm is very efficient for learning small neural networks. For bigger neural networks, whose computational complexity grows significantly, it makes this method practically inefficient. In order to overcome this limitation, local modification of the LM is introduced in this paper. The main goal of this paper is to develop a more complexity efficient modification of the LM method by using a local computation. The introduced modification has been tested on the following benchmarks: the function approximation and classification problems. The obtained results have been compared to the classic LM method performance. The paper shows that the local modification of the LM method significantly improves the algorithm’s performance for bigger networks. Several possible proposals for future works are suggested.
摘要本文对Levenberg-Marquardt算法(LM)进行了局部修改。首先,介绍了经典LM方法的数学基础。经典的LM算法对于学习小型神经网络是非常有效的。对于计算复杂度显著增长的较大神经网络,这使得这种方法实际上效率低下。为了克服这一限制,本文对LM进行了局部修改。本文的主要目标是通过使用局部计算对LM方法进行更复杂高效的修改。引入的修改已经在以下基准上进行了测试:函数近似和分类问题。将所获得的结果与经典LM方法的性能进行了比较。论文表明,LM方法的局部修改显著提高了算法在较大网络中的性能。对未来的工作提出了一些可能的建议。
{"title":"Local Levenberg-Marquardt Algorithm for Learning Feedforwad Neural Networks","authors":"J. Bilski, Bartosz Kowalczyk, A. Marchlewska, J. Zurada","doi":"10.2478/jaiscr-2020-0020","DOIUrl":"https://doi.org/10.2478/jaiscr-2020-0020","url":null,"abstract":"Abstract This paper presents a local modification of the Levenberg-Marquardt algorithm (LM). First, the mathematical basics of the classic LM method are shown. The classic LM algorithm is very efficient for learning small neural networks. For bigger neural networks, whose computational complexity grows significantly, it makes this method practically inefficient. In order to overcome this limitation, local modification of the LM is introduced in this paper. The main goal of this paper is to develop a more complexity efficient modification of the LM method by using a local computation. The introduced modification has been tested on the following benchmarks: the function approximation and classification problems. The obtained results have been compared to the classic LM method performance. The paper shows that the local modification of the LM method significantly improves the algorithm’s performance for bigger networks. Several possible proposals for future works are suggested.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"10 1","pages":"299 - 316"},"PeriodicalIF":2.8,"publicationDate":"2020-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48966110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 51
A Novel Drift Detection Algorithm Based on Features’ Importance Analysis in a Data Streams Environment 数据流环境下一种基于特征重要性分析的漂移检测算法
IF 2.8 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2020-06-15 DOI: 10.2478/jaiscr-2020-0019
P. Duda, K. Przybyszewski, Lipo Wang
Abstract The training set consists of many features that influence the classifier in different degrees. Choosing the most important features and rejecting those that do not carry relevant information is of great importance to the operating of the learned model. In the case of data streams, the importance of the features may additionally change over time. Such changes affect the performance of the classifier but can also be an important indicator of occurring concept-drift. In this work, we propose a new algorithm for data streams classification, called Random Forest with Features Importance (RFFI), which uses the measure of features importance as a drift detector. The RFFT algorithm implements solutions inspired by the Random Forest algorithm to the data stream scenarios. The proposed algorithm combines the ability of ensemble methods for handling slow changes in a data stream with a new method for detecting concept drift occurrence. The work contains an experimental analysis of the proposed algorithm, carried out on synthetic and real data.
摘要训练集由许多不同程度影响分类器的特征组成。选择最重要的特征并拒绝那些不携带相关信息的特征对于学习模型的运行非常重要。在数据流的情况下,特性的重要性可能会随着时间的推移而改变。这种变化会影响分类器的性能,但也可能是发生概念漂移的重要指标。在这项工作中,我们提出了一种新的数据流分类算法,称为具有特征重要性的随机森林(RFFI),它使用特征重要性的度量作为漂移检测器。RFFT算法将受随机森林算法启发的解决方案实现到数据流场景中。该算法将集成方法处理数据流中缓慢变化的能力与检测概念漂移发生的新方法相结合。该工作包括对所提出算法的实验分析,在合成数据和实际数据上进行。
{"title":"A Novel Drift Detection Algorithm Based on Features’ Importance Analysis in a Data Streams Environment","authors":"P. Duda, K. Przybyszewski, Lipo Wang","doi":"10.2478/jaiscr-2020-0019","DOIUrl":"https://doi.org/10.2478/jaiscr-2020-0019","url":null,"abstract":"Abstract The training set consists of many features that influence the classifier in different degrees. Choosing the most important features and rejecting those that do not carry relevant information is of great importance to the operating of the learned model. In the case of data streams, the importance of the features may additionally change over time. Such changes affect the performance of the classifier but can also be an important indicator of occurring concept-drift. In this work, we propose a new algorithm for data streams classification, called Random Forest with Features Importance (RFFI), which uses the measure of features importance as a drift detector. The RFFT algorithm implements solutions inspired by the Random Forest algorithm to the data stream scenarios. The proposed algorithm combines the ability of ensemble methods for handling slow changes in a data stream with a new method for detecting concept drift occurrence. The work contains an experimental analysis of the proposed algorithm, carried out on synthetic and real data.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"10 1","pages":"287 - 298"},"PeriodicalIF":2.8,"publicationDate":"2020-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41853974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Triangular Fuzzy-Rough Set Based Fuzzification of Fuzzy Rule-Based Systems 基于三角模糊粗糙集的模糊规则系统模糊化
IF 2.8 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2020-06-15 DOI: 10.2478/jaiscr-2020-0018
Janusz T. Starczewski, P. Goetzen, Christian Napoli
Abstract In real-world approximation problems, precise input data are economically expensive. Therefore, fuzzy methods devoted to uncertain data are in the focus of current research. Consequently, a method based on fuzzy-rough sets for fuzzification of inputs in a rule-based fuzzy system is discussed in this paper. A triangular membership function is applied to describe the nature of imprecision in data. Firstly, triangular fuzzy partitions are introduced to approximate common antecedent fuzzy rule sets. As a consequence of the proposed method, we obtain a structure of a general (non-interval) type-2 fuzzy logic system in which secondary membership functions are cropped triangular. Then, the possibility of applying so-called regular triangular norms is discussed. Finally, an experimental system constructed on precise data, which is then transformed and verified for uncertain data, is provided to demonstrate its basic properties.
摘要在现实世界的近似问题中,精确的输入数据在经济上是昂贵的。因此,致力于不确定数据的模糊方法是当前研究的重点。因此,本文讨论了一种基于模糊粗糙集的方法,用于基于规则的模糊系统中输入的模糊化。应用三角隶属函数来描述数据中不精确性的性质。首先,引入三角模糊划分来逼近常见的先行模糊规则集。作为所提出方法的结果,我们获得了一个一般(非区间)2型模糊逻辑系统的结构,其中二阶隶属函数被裁剪为三角形。然后,讨论了应用所谓的正三角规范的可能性。最后,提供了一个基于精确数据构建的实验系统,然后对不确定数据进行转换和验证,以证明其基本性质。
{"title":"Triangular Fuzzy-Rough Set Based Fuzzification of Fuzzy Rule-Based Systems","authors":"Janusz T. Starczewski, P. Goetzen, Christian Napoli","doi":"10.2478/jaiscr-2020-0018","DOIUrl":"https://doi.org/10.2478/jaiscr-2020-0018","url":null,"abstract":"Abstract In real-world approximation problems, precise input data are economically expensive. Therefore, fuzzy methods devoted to uncertain data are in the focus of current research. Consequently, a method based on fuzzy-rough sets for fuzzification of inputs in a rule-based fuzzy system is discussed in this paper. A triangular membership function is applied to describe the nature of imprecision in data. Firstly, triangular fuzzy partitions are introduced to approximate common antecedent fuzzy rule sets. As a consequence of the proposed method, we obtain a structure of a general (non-interval) type-2 fuzzy logic system in which secondary membership functions are cropped triangular. Then, the possibility of applying so-called regular triangular norms is discussed. Finally, an experimental system constructed on precise data, which is then transformed and verified for uncertain data, is provided to demonstrate its basic properties.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"10 1","pages":"271 - 285"},"PeriodicalIF":2.8,"publicationDate":"2020-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42745671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
An Algorithm for the Evolutionary-Fuzzy Generation of on-Line Signature Hybrid Descriptors 在线签名混合描述符的进化-模糊生成算法
IF 2.8 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2020-05-23 DOI: 10.2478/jaiscr-2020-0012
Marcin Zalasiński, K. Cpałka, Lukasz Laskowski, D. Wunsch, K. Przybyszewski
Abstract In biometrics, methods which are able to precisely adapt to the biometric features of users are much sought after. They use various methods of artificial intelligence, in particular methods from the group of soft computing. In this paper, we focus on on-line signature verification. Such signatures are complex objects described not only by the shape but also by the dynamics of the signing process. In standard devices used for signature acquisition (with an LCD touch screen) this dynamics may include pen velocity, but sometimes other types of signals are also available, e.g. pen pressure on the screen surface (e.g. in graphic tablets), the angle between the pen and the screen surface, etc. The precision of the on-line signature dynamics processing has been a motivational springboard for developing methods that use signature partitioning. Partitioning uses a well-known principle of decomposing the problem into smaller ones. In this paper, we propose a new partitioning algorithm that uses capabilities of the algorithms based on populations and fuzzy systems. Evolutionary-fuzzy partitioning eliminates the need to average dynamic waveforms in created partitions because it replaces them. Evolutionary separation of partitions results in a better matching of partitions with reference signatures, eliminates dispro-portions between the number of points describing dynamics in partitions, eliminates the impact of random values, separates partitions related to the signing stage and its dynamics (e.g. high and low velocity of signing, where high and low are imprecise-fuzzy concepts). The operation of the presented algorithm has been tested using the well-known BioSecure DS2 database of real dynamic signatures.
摘要在生物识别中,能够精确适应用户生物特征的方法备受追捧。他们使用各种人工智能方法,特别是软计算组的方法。本文主要研究在线签名验证。这样的签名是复杂的对象,不仅通过签名过程的形状来描述,还通过签名过程中的动力学来描述。在用于签名采集的标准设备(具有LCD触摸屏)中,这种动态可以包括笔速度,但有时也可以使用其他类型的信号,例如,屏幕表面上的笔压力(例如,在图形平板电脑中)、笔和屏幕表面之间的角度等。在线签名动态处理的精度一直是开发使用签名划分的方法的动力跳板。分区使用了一个众所周知的原则,即把问题分解成更小的问题。在本文中,我们提出了一种新的划分算法,该算法利用了基于种群和模糊系统的算法的能力。进化模糊划分消除了在创建的分区中对动态波形进行平均的需要,因为它取代了它们。分区的进化分离导致分区与参考签名的更好匹配,消除了分区中描述动态的点的数量之间的离散部分,消除了随机值的影响,分离了与签名阶段及其动态相关的分区(例如,签名的高和低速度,其中高和低是不精确的模糊概念)。所提出的算法的操作已经使用众所周知的真实动态签名的BioSecure DS2数据库进行了测试。
{"title":"An Algorithm for the Evolutionary-Fuzzy Generation of on-Line Signature Hybrid Descriptors","authors":"Marcin Zalasiński, K. Cpałka, Lukasz Laskowski, D. Wunsch, K. Przybyszewski","doi":"10.2478/jaiscr-2020-0012","DOIUrl":"https://doi.org/10.2478/jaiscr-2020-0012","url":null,"abstract":"Abstract In biometrics, methods which are able to precisely adapt to the biometric features of users are much sought after. They use various methods of artificial intelligence, in particular methods from the group of soft computing. In this paper, we focus on on-line signature verification. Such signatures are complex objects described not only by the shape but also by the dynamics of the signing process. In standard devices used for signature acquisition (with an LCD touch screen) this dynamics may include pen velocity, but sometimes other types of signals are also available, e.g. pen pressure on the screen surface (e.g. in graphic tablets), the angle between the pen and the screen surface, etc. The precision of the on-line signature dynamics processing has been a motivational springboard for developing methods that use signature partitioning. Partitioning uses a well-known principle of decomposing the problem into smaller ones. In this paper, we propose a new partitioning algorithm that uses capabilities of the algorithms based on populations and fuzzy systems. Evolutionary-fuzzy partitioning eliminates the need to average dynamic waveforms in created partitions because it replaces them. Evolutionary separation of partitions results in a better matching of partitions with reference signatures, eliminates dispro-portions between the number of points describing dynamics in partitions, eliminates the impact of random values, separates partitions related to the signing stage and its dynamics (e.g. high and low velocity of signing, where high and low are imprecise-fuzzy concepts). The operation of the presented algorithm has been tested using the well-known BioSecure DS2 database of real dynamic signatures.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"10 1","pages":"173 - 187"},"PeriodicalIF":2.8,"publicationDate":"2020-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44718088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A New Method for Automatic Determining of the DBSCAN Parameters 一种自动确定DBSCAN参数的新方法
IF 2.8 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2020-05-23 DOI: 10.2478/jaiscr-2020-0014
Artur Starczewski, P. Goetzen, M. Er
Abstract Clustering is an attractive technique used in many fields in order to deal with large scale data. Many clustering algorithms have been proposed so far. The most popular algorithms include density-based approaches. These kinds of algorithms can identify clusters of arbitrary shapes in datasets. The most common of them is the Density-Based Spatial Clustering of Applications with Noise (DBSCAN). The original DBSCAN algorithm has been widely applied in various applications and has many different modifications. However, there is a fundamental issue of the right choice of its two input parameters, i.e the eps radius and the MinPts density threshold. The choice of these parameters is especially difficult when the density variation within clusters is significant. In this paper, a new method that determines the right values of the parameters for different kinds of clusters is proposed. This method uses detection of sharp distance increases generated by a function which computes a distance between each element of a dataset and its k-th nearest neighbor. Experimental results have been obtained for several different datasets and they confirm a very good performance of the newly proposed method.
聚类是一种有吸引力的技术,用于许多领域,以处理大规模数据。目前已经提出了许多聚类算法。最流行的算法包括基于密度的方法。这类算法可以识别数据集中任意形状的簇。其中最常见的是基于密度的带噪声应用空间聚类(DBSCAN)。原始的DBSCAN算法在各种应用中得到了广泛的应用,并进行了许多不同的修改。然而,有一个基本的问题是正确选择它的两个输入参数,即eps半径和MinPts密度阈值。当集群内密度变化显著时,这些参数的选择尤其困难。本文提出了一种确定不同类型聚类的参数值的新方法。该方法使用由一个函数生成的急剧距离增加检测,该函数计算数据集的每个元素与其第k个最近邻居之间的距离。在多个不同的数据集上进行了实验,结果证实了该方法的良好性能。
{"title":"A New Method for Automatic Determining of the DBSCAN Parameters","authors":"Artur Starczewski, P. Goetzen, M. Er","doi":"10.2478/jaiscr-2020-0014","DOIUrl":"https://doi.org/10.2478/jaiscr-2020-0014","url":null,"abstract":"Abstract Clustering is an attractive technique used in many fields in order to deal with large scale data. Many clustering algorithms have been proposed so far. The most popular algorithms include density-based approaches. These kinds of algorithms can identify clusters of arbitrary shapes in datasets. The most common of them is the Density-Based Spatial Clustering of Applications with Noise (DBSCAN). The original DBSCAN algorithm has been widely applied in various applications and has many different modifications. However, there is a fundamental issue of the right choice of its two input parameters, i.e the eps radius and the MinPts density threshold. The choice of these parameters is especially difficult when the density variation within clusters is significant. In this paper, a new method that determines the right values of the parameters for different kinds of clusters is proposed. This method uses detection of sharp distance increases generated by a function which computes a distance between each element of a dataset and its k-th nearest neighbor. Experimental results have been obtained for several different datasets and they confirm a very good performance of the newly proposed method.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"10 1","pages":"209 - 221"},"PeriodicalIF":2.8,"publicationDate":"2020-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47729037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Detecting Visual Objects by Edge Crawling 边缘爬行检测视觉对象
IF 2.8 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2020-05-23 DOI: 10.2478/jaiscr-2020-0015
Rafał Grycuk, A. Wojciechowski, Wei Wei, A. Siwocha
Abstract Content-based image retrieval methods develop rapidly with a growing scale of image repositories. They are usually based on comparing and indexing some image features. We developed a new algorithm for finding objects in images by traversing their edges. Moreover, we describe the objects by histograms of local features and angles. We use such a description to retrieve similar images fast. We performed extensive experiments on three established image datasets proving the effectiveness of the proposed method.
摘要基于内容的图像检索方法随着图像库规模的不断扩大而迅速发展。它们通常基于对一些图像特征进行比较和索引。我们开发了一种新的算法,通过遍历图像中的对象边缘来找到它们。此外,我们通过局部特征和角度的直方图来描述对象。我们使用这样的描述来快速检索相似的图像。我们在三个已建立的图像数据集上进行了大量实验,证明了所提出方法的有效性。
{"title":"Detecting Visual Objects by Edge Crawling","authors":"Rafał Grycuk, A. Wojciechowski, Wei Wei, A. Siwocha","doi":"10.2478/jaiscr-2020-0015","DOIUrl":"https://doi.org/10.2478/jaiscr-2020-0015","url":null,"abstract":"Abstract Content-based image retrieval methods develop rapidly with a growing scale of image repositories. They are usually based on comparing and indexing some image features. We developed a new algorithm for finding objects in images by traversing their edges. Moreover, we describe the objects by histograms of local features and angles. We use such a description to retrieve similar images fast. We performed extensive experiments on three established image datasets proving the effectiveness of the proposed method.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"10 1","pages":"223 - 237"},"PeriodicalIF":2.8,"publicationDate":"2020-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47958028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
Journal of Artificial Intelligence and Soft Computing Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1