首页 > 最新文献

Journal of Artificial Intelligence and Data Mining最新文献

英文 中文
A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features 一种基于条件随机场的波斯特定特征命名实体识别新方法
Pub Date : 2020-04-01 DOI: 10.22044/JADM.2019.8430.1980
L. Tafreshi, F. Soltanzadeh
Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performance in Conditional Random Field-based Persian Named Entity Recognition, a several syntactic features based on dependency grammar along with some morphological and language-independent features have been designed in order to extract suitable features for the learning phase. In this implementation, designed features have been applied to Conditional Random Field to build our model. To evaluate our system, the Persian syntactic dependency Treebank with about 30,000 sentences, prepared in NOOR Islamic science computer research center, has been implemented. This Treebank has Named-Entity tags, such as Person, Organization and location. The result of this study showed that our approach achieved 86.86% precision, 80.29% recall and 83.44% F-measure which are relatively higher than those values reported for other Persian NER methods.
命名实体识别是一种识别文本中名称实体的信息提取技术。通常使用的三种流行方法是:基于规则的、基于机器学习的和它们的混合,以从文本中提取命名实体。基于机器学习的方法如果经过良好的特征训练,在波斯语中会有很好的表现。为了在基于条件随机场的波斯语命名实体识别中获得良好的性能,设计了一些基于依赖语法的句法特征以及一些与形态学和语言无关的特征,以便提取适合学习阶段的特征。在这个实现中,设计的特征已经应用于条件随机场来构建我们的模型。为了对我们的系统进行评价,在NOOR伊斯兰科学计算机研究中心编写的大约3万个波斯语句法依存关系树库中进行了实现。这个树库具有命名实体标签,例如Person、Organization和location。研究结果表明,该方法的准确率为86.86%,召回率为80.29%,F-measure值为83.44%,相对于其他波斯NER方法的报道值。
{"title":"A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features","authors":"L. Tafreshi, F. Soltanzadeh","doi":"10.22044/JADM.2019.8430.1980","DOIUrl":"https://doi.org/10.22044/JADM.2019.8430.1980","url":null,"abstract":"Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performance in Conditional Random Field-based Persian Named Entity Recognition, a several syntactic features based on dependency grammar along with some morphological and language-independent features have been designed in order to extract suitable features for the learning phase. In this implementation, designed features have been applied to Conditional Random Field to build our model. To evaluate our system, the Persian syntactic dependency Treebank with about 30,000 sentences, prepared in NOOR Islamic science computer research center, has been implemented. This Treebank has Named-Entity tags, such as Person, Organization and location. The result of this study showed that our approach achieved 86.86% precision, 80.29% recall and 83.44% F-measure which are relatively higher than those values reported for other Persian NER methods.","PeriodicalId":32592,"journal":{"name":"Journal of Artificial Intelligence and Data Mining","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43280074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A New Reliable Controller Placement Model for Software-Defined WANs 一种新的软件定义广域网可靠控制器布局模型
Pub Date : 2020-04-01 DOI: 10.22044/JADM.2019.6319.1745
Ahmad Jalili, Manijeh Keshtgari
Software-Defined Network (SDNs) is a decoupled architecture that enables administrators to build a customizable and manageable network. Although the decoupled control plane provides flexible management and facilitates the task of operating the network, it is the vulnerable point of failure in SDN. To achieve a reliable control plane, multiple controller are often needed so that each switch must be assigned to more than one controller. In this paper, a Reliable Controller Placement Problem Model (RCPPM) is proposed to solve such a problem, so as to maximize the reliability of software defined networks. Unlike previous works that only consider latencies parameters, the new model takes into account the load of control traffic and reliability metrics as well. Furthermore, a near-optimal algorithm is proposed to solve the NP-hard RCPPM in a heuristic manner. Finally, through extensive simulation, a comprehensive analysis of the RCPPM is presented for various topologies extracted from Internet Topology Zoo. Our performance evaluations show the efficiency of the proposed framework.
软件定义网络(SDN)是一种解耦的体系结构,使管理员能够构建一个可定制和可管理的网络。尽管解耦的控制平面提供了灵活的管理并方便了操作网络的任务,但它是SDN中的薄弱点。为了实现可靠的控制平面,通常需要多个控制器,因此每个交换机必须分配给多个控制器。本文提出了一个可靠控制器布局问题模型(RCPPM)来解决这一问题,以最大限度地提高软件定义网络的可靠性。与以前只考虑延迟参数的工作不同,新模型还考虑了控制流量的负载和可靠性指标。此外,提出了一种近似最优算法,以启发式的方式求解NP难的RCPPM。最后,通过广泛的仿真,对从互联网拓扑动物园中提取的各种拓扑进行了RCPPM的综合分析。我们的绩效评估显示了拟议框架的效率。
{"title":"A New Reliable Controller Placement Model for Software-Defined WANs","authors":"Ahmad Jalili, Manijeh Keshtgari","doi":"10.22044/JADM.2019.6319.1745","DOIUrl":"https://doi.org/10.22044/JADM.2019.6319.1745","url":null,"abstract":"Software-Defined Network (SDNs) is a decoupled architecture that enables administrators to build a customizable and manageable network. Although the decoupled control plane provides flexible management and facilitates the task of operating the network, it is the vulnerable point of failure in SDN. To achieve a reliable control plane, multiple controller are often needed so that each switch must be assigned to more than one controller. In this paper, a Reliable Controller Placement Problem Model (RCPPM) is proposed to solve such a problem, so as to maximize the reliability of software defined networks. Unlike previous works that only consider latencies parameters, the new model takes into account the load of control traffic and reliability metrics as well. Furthermore, a near-optimal algorithm is proposed to solve the NP-hard RCPPM in a heuristic manner. Finally, through extensive simulation, a comprehensive analysis of the RCPPM is presented for various topologies extracted from Internet Topology Zoo. Our performance evaluations show the efficiency of the proposed framework.","PeriodicalId":32592,"journal":{"name":"Journal of Artificial Intelligence and Data Mining","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48157824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A Hybrid Business Success Versus Failure Classification Prediction Model: A Case of Iranian Accelerated Start-ups 混合型企业成功与失败分类预测模型:以伊朗加速创业为例
Pub Date : 2020-04-01 DOI: 10.22044/JADM.2020.8248.1963
S. Sadatrasoul, O. Ebadati, R. Saedi
The purpose of this study is to reduce the uncertainty of early stage startups success prediction and filling the gap of previous studies in the field, by identifying and evaluating the success variables and developing a novel business success failure (S/F) data mining classification prediction model for Iranian start-ups. For this purpose, the paper is seeking to extend Bill Gross and Robert Lussier S/F prediction model variables and algorithms in a new context of Iranian start-ups which starts from accelerators in order to build a new S/F prediction model. A sample of 161 Iranian start-ups which are based in accelerators from 2013 to 2018 is applied and 39 variables are extracted from the literature and organized in five groups. Then the sample is fed into six well-known classification algorithms. Two staged stacking as a classification model is the best performer among all other six classification based S/F prediction models and it can predict binary dependent variable of success or failure with accuracy of 89% on average. Also finding shows that “starting from Accelerators”, “creativity and problem solving ability of founders”, “fist mover advantage” and “amount of seed investment” are the four most important variables which affects the start-ups success and the other 15 variables are less important.
本研究的目的是通过识别和评估成功变量,建立一种新的伊朗初创企业成功失败(S/F)数据挖掘分类预测模型,以减少早期创业成功预测的不确定性,填补该领域前人研究的空白。为此,本文试图将Bill Gross和Robert Lussier的S/F预测模型变量和算法扩展到从加速器开始的伊朗初创企业的新背景下,以建立新的S/F预测模型。本文选取了2013年至2018年在加速器中成立的161家伊朗初创企业作为样本,从文献中提取了39个变量,并将其分为五组。然后将样本输入六种著名的分类算法。在其他6种基于分类的S/F预测模型中,两阶段叠加作为分类模型表现最好,它可以预测成功或失败的二元因变量,平均准确率为89%。研究还发现,“从加速器开始”、“创业者的创造力和解决问题的能力”、“先发优势”和“种子投资金额”是影响创业成功的4个最重要的变量,其他15个变量的重要性较低。
{"title":"A Hybrid Business Success Versus Failure Classification Prediction Model: A Case of Iranian Accelerated Start-ups","authors":"S. Sadatrasoul, O. Ebadati, R. Saedi","doi":"10.22044/JADM.2020.8248.1963","DOIUrl":"https://doi.org/10.22044/JADM.2020.8248.1963","url":null,"abstract":"The purpose of this study is to reduce the uncertainty of early stage startups success prediction and filling the gap of previous studies in the field, by identifying and evaluating the success variables and developing a novel business success failure (S/F) data mining classification prediction model for Iranian start-ups. For this purpose, the paper is seeking to extend Bill Gross and Robert Lussier S/F prediction model variables and algorithms in a new context of Iranian start-ups which starts from accelerators in order to build a new S/F prediction model. A sample of 161 Iranian start-ups which are based in accelerators from 2013 to 2018 is applied and 39 variables are extracted from the literature and organized in five groups. Then the sample is fed into six well-known classification algorithms. Two staged stacking as a classification model is the best performer among all other six classification based S/F prediction models and it can predict binary dependent variable of success or failure with accuracy of 89% on average. Also finding shows that “starting from Accelerators”, “creativity and problem solving ability of founders”, “fist mover advantage” and “amount of seed investment” are the four most important variables which affects the start-ups success and the other 15 variables are less important.","PeriodicalId":32592,"journal":{"name":"Journal of Artificial Intelligence and Data Mining","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47555808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
H-BwoaSvm: A Hybrid Model for Classification and Feature Selection of Mammography Screening Behavior Data H-BwoaSvm:用于乳腺x线筛查行为数据分类和特征选择的混合模型
Pub Date : 2020-04-01 DOI: 10.22044/JADM.2020.8105.1945
E. Enayati, Z. Hassani, M. Moodi
Breast cancer is one of the most common cancer in the world. Early detection of cancers cause significantly reduce in morbidity rate and treatment costs. Mammography is a known effective diagnosis method of breast cancer. A way for mammography screening behavior identification is women's awareness evaluation for participating in mammography screening programs. Todays, intelligence systems could identify main factors on specific incident. These could help to the experts in the wide range of areas specially health scopes such as prevention, diagnosis and treatment. In this paper we use a hybrid model called H-BwoaSvm which BWOA is used for detecting effective factors on mammography screening behavior and SVM for classification. Our model is applied on a data set which collected from a segmental analytical descriptive study on 2256 women. Proposed model is operated on data set with 82.27 and 98.89 percent accuracy and select effective features on mammography screening behavior.
癌症是世界上最常见的癌症之一。癌症的早期发现可显著降低发病率和治疗成本。乳腺造影术是目前已知的癌症的有效诊断方法。乳房X光检查筛查行为识别的一种方法是对女性参与乳房X光筛查项目的意识进行评估。如今,情报系统可以识别特定事件的主要因素。这些可以帮助广泛领域的专家,特别是预防、诊断和治疗等健康领域的专家。在本文中,我们使用了一种称为H-BwoaSvm的混合模型,BWOA用于检测乳腺X光筛查行为的有效因素,SVM用于分类。我们的模型应用于从2256名女性的分段分析描述性研究中收集的数据集。所提出的模型在数据集上以82.27%和98.89%的准确率进行了操作,并选择了乳房X光检查筛查行为的有效特征。
{"title":"H-BwoaSvm: A Hybrid Model for Classification and Feature Selection of Mammography Screening Behavior Data","authors":"E. Enayati, Z. Hassani, M. Moodi","doi":"10.22044/JADM.2020.8105.1945","DOIUrl":"https://doi.org/10.22044/JADM.2020.8105.1945","url":null,"abstract":"Breast cancer is one of the most common cancer in the world. Early detection of cancers cause significantly reduce in morbidity rate and treatment costs. Mammography is a known effective diagnosis method of breast cancer. A way for mammography screening behavior identification is women's awareness evaluation for participating in mammography screening programs. Todays, intelligence systems could identify main factors on specific incident. These could help to the experts in the wide range of areas specially health scopes such as prevention, diagnosis and treatment. In this paper we use a hybrid model called H-BwoaSvm which BWOA is used for detecting effective factors on mammography screening behavior and SVM for classification. Our model is applied on a data set which collected from a segmental analytical descriptive study on 2256 women. Proposed model is operated on data set with 82.27 and 98.89 percent accuracy and select effective features on mammography screening behavior.","PeriodicalId":32592,"journal":{"name":"Journal of Artificial Intelligence and Data Mining","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43041475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Credit Card Fraud Detection using Data mining and Statistical Methods 使用数据挖掘和统计方法检测信用卡欺诈
Pub Date : 2020-04-01 DOI: 10.22044/JADM.2019.7506.1894
S. Beigi, M. Amin-Naseri
Due to today’s advancement in technology and businesses, fraud detection has become a critical component of financial transactions. Considering vast amounts of data in large datasets, it becomes more difficult to detect fraud transactions manually. In this research, we propose a combined method using both data mining and statistical tasks, utilizing feature selection, resampling and cost-sensitive learning for credit card fraud detection. In the first step, useful features are identified using genetic algorithm. Next, the optimal resampling strategy is determined based on the design of experiments (DOE) and response surface methodologies. Finally, the cost sensitive C4.5 algorithm is used as the base learner in the Adaboost algorithm. Using a real-time data set, results show that applying the proposed method significantly reduces the misclassification cost by at least 14% compared with Decision tree, Naive bayes, Bayesian Network, Neural network and Artificial immune system.
由于当今技术和商业的进步,欺诈检测已成为金融交易的关键组成部分。考虑到大型数据集中的大量数据,手动检测欺诈交易变得更加困难。在这项研究中,我们提出了一种结合数据挖掘和统计任务的方法,利用特征选择、重采样和成本敏感学习来检测信用卡欺诈。在第一步中,使用遗传算法识别有用的特征。接下来,基于实验设计(DOE)和响应面方法确定最优重采样策略。最后,使用成本敏感的C4.5算法作为Adaboost算法的基础学习器。使用实时数据集,结果表明,与决策树、朴素贝叶斯、贝叶斯网络、神经网络和人工免疫系统相比,应用该方法可以显著降低至少14%的误分类成本。
{"title":"Credit Card Fraud Detection using Data mining and Statistical Methods","authors":"S. Beigi, M. Amin-Naseri","doi":"10.22044/JADM.2019.7506.1894","DOIUrl":"https://doi.org/10.22044/JADM.2019.7506.1894","url":null,"abstract":"Due to today’s advancement in technology and businesses, fraud detection has become a critical component of financial transactions. Considering vast amounts of data in large datasets, it becomes more difficult to detect fraud transactions manually. In this research, we propose a combined method using both data mining and statistical tasks, utilizing feature selection, resampling and cost-sensitive learning for credit card fraud detection. In the first step, useful features are identified using genetic algorithm. Next, the optimal resampling strategy is determined based on the design of experiments (DOE) and response surface methodologies. Finally, the cost sensitive C4.5 algorithm is used as the base learner in the Adaboost algorithm. Using a real-time data set, results show that applying the proposed method significantly reduces the misclassification cost by at least 14% compared with Decision tree, Naive bayes, Bayesian Network, Neural network and Artificial immune system.","PeriodicalId":32592,"journal":{"name":"Journal of Artificial Intelligence and Data Mining","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46775009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Statistical Wavelet-based Image Denoising using Scale Mixture of Normal Distributions with Adaptive Parameter Estimation 基于统计小波的自适应参数估计正态分布尺度混合图像去噪
Pub Date : 2020-04-01 DOI: 10.22044/JADM.2020.7797.1920
Mansoore Saeedzarandi, Hossien Nezamabadi-pour, S. Saryazdi
Removing noise from images is a challenging problem in digital image processing. This paper presents an image denoising method based on a maximum a posteriori (MAP) density function estimator, which is implemented in the wavelet domain because of its energy compaction property. The performance of the MAP estimator depends on the proposed model for noise-free wavelet coefficients. Thus in the wavelet based image denoising, selecting a proper model for wavelet coefficients is very important. In this paper, we model wavelet coefficients in each sub-band by heavy-tail distributions that are from scale mixture of normal distribution family. The parameters of distributions are estimated adaptively to model the correlation between the coefficient amplitudes, so the intra-scale dependency of wavelet coefficients is also considered. The denoising results confirm the effectiveness of the proposed method.
从图像中去除噪声是数字图像处理中的一个具有挑战性的问题。本文提出了一种基于最大后验密度函数估计器的图像去噪方法,该方法由于其能量压缩特性而在小波域中实现。MAP估计器的性能取决于所提出的无噪声小波系数模型。因此,在基于小波的图像去噪中,选择合适的小波系数模型是非常重要的。本文利用正态分布族尺度混合的重尾分布对每个子带的小波系数进行建模。自适应地估计分布的参数以对系数幅度之间的相关性进行建模,因此还考虑了小波系数的尺度内相关性。去噪结果证实了该方法的有效性。
{"title":"Statistical Wavelet-based Image Denoising using Scale Mixture of Normal Distributions with Adaptive Parameter Estimation","authors":"Mansoore Saeedzarandi, Hossien Nezamabadi-pour, S. Saryazdi","doi":"10.22044/JADM.2020.7797.1920","DOIUrl":"https://doi.org/10.22044/JADM.2020.7797.1920","url":null,"abstract":"Removing noise from images is a challenging problem in digital image processing. This paper presents an image denoising method based on a maximum a posteriori (MAP) density function estimator, which is implemented in the wavelet domain because of its energy compaction property. The performance of the MAP estimator depends on the proposed model for noise-free wavelet coefficients. Thus in the wavelet based image denoising, selecting a proper model for wavelet coefficients is very important. In this paper, we model wavelet coefficients in each sub-band by heavy-tail distributions that are from scale mixture of normal distribution family. The parameters of distributions are estimated adaptively to model the correlation between the coefficient amplitudes, so the intra-scale dependency of wavelet coefficients is also considered. The denoising results confirm the effectiveness of the proposed method.","PeriodicalId":32592,"journal":{"name":"Journal of Artificial Intelligence and Data Mining","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45648669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Hybrid Meta-heuristic Approach to Cope with State Space Explosion in Model Checking Technique for Deadlock Freeness 死锁自由度模型检验技术中状态空间爆炸的混合元启发式方法
Pub Date : 2020-04-01 DOI: 10.22044/JADM.2019.7564.1900
N. Rezaee, H. Momeni
Model checking is an automatic technique for software verification through which all reachable states are generated from an initial state to finding errors and desirable patterns. In the model checking approach, the behavior and structure of system should be modeled. Graph transformation system is a graphical formal modeling language to specify and model the system. However, modeling of large systems with the graph transformation system suffers from the state space explosion problem which usually requires huge amounts of computational resources. In this paper, we propose a hybrid meta-heuristic approach to deal with this searching problem in the graph transformation system because meta-heuristic algorithms are efficient solutions to traverse the graph of large systems. Our approach, using Artificial Bee Colony and Simulated Annealing, replaces a full state space generation, only by producing part of it checking the safety, and finding errors (e.g., deadlock). The experimental results show that our proposed approach is more efficient and accurate compared to other approaches.
模型检查是一种用于软件验证的自动技术,通过它可以从初始状态生成所有可到达的状态,以查找错误和所需的模式。在模型检验方法中,需要对系统的行为和结构进行建模。图转换系统是一种对系统进行形式化描述和建模的图形化形式化建模语言。然而,利用图变换系统对大型系统建模存在状态空间爆炸问题,通常需要大量的计算资源。在本文中,我们提出了一种混合元启发式方法来处理图变换系统中的搜索问题,因为元启发式算法是遍历大系统图的有效解决方案。我们的方法,使用人工蜂群和模拟退火,取代了一个完整的状态空间生成,只有通过产生它的一部分,检查安全性,并发现错误(例如,死锁)。实验结果表明,与其他方法相比,该方法具有更高的效率和准确性。
{"title":"A Hybrid Meta-heuristic Approach to Cope with State Space Explosion in Model Checking Technique for Deadlock Freeness","authors":"N. Rezaee, H. Momeni","doi":"10.22044/JADM.2019.7564.1900","DOIUrl":"https://doi.org/10.22044/JADM.2019.7564.1900","url":null,"abstract":"Model checking is an automatic technique for software verification through which all reachable states are generated from an initial state to finding errors and desirable patterns. In the model checking approach, the behavior and structure of system should be modeled. Graph transformation system is a graphical formal modeling language to specify and model the system. However, modeling of large systems with the graph transformation system suffers from the state space explosion problem which usually requires huge amounts of computational resources. In this paper, we propose a hybrid meta-heuristic approach to deal with this searching problem in the graph transformation system because meta-heuristic algorithms are efficient solutions to traverse the graph of large systems. Our approach, using Artificial Bee Colony and Simulated Annealing, replaces a full state space generation, only by producing part of it checking the safety, and finding errors (e.g., deadlock). The experimental results show that our proposed approach is more efficient and accurate compared to other approaches.","PeriodicalId":32592,"journal":{"name":"Journal of Artificial Intelligence and Data Mining","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45814349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Community Detection using a New Node Scoring and Synchronous Label Updating of Boundary Nodes in Social Networks 基于节点评分和边界节点同步标签更新的社交网络社区检测
Pub Date : 2020-04-01 DOI: 10.22044/JADM.2019.8768.2011
M. Zarezade, E. Nourani, Asgarali Bouyer
Community structure is vital to discover the important structures and potential property of complex networks. In recent years, the increasing quality of local community detection approaches has become a hot spot in the study of complex network due to the advantages of linear time complexity and applicable for large-scale networks. However, there are many shortcomings in these methods such as instability, low accuracy, randomness, etc. The G-CN algorithm is one of local methods that uses the same label propagation as the LPA method, but unlike the LPA, only the labels of boundary nodes are updated at each iteration that reduces its execution time. However, it has resolution limit and low accuracy problem. To overcome these problems, this paper proposes an improved community detection method called SD-GCN which uses a hybrid node scoring and synchronous label updating of boundary nodes, along with disabling random label updating in initial updates. In the first phase, it updates the label of boundary nodes in a synchronous manner using the obtained score based on degree centrality and common neighbor measures. In addition, we defined a new method for merging communities in second phase which is faster than modularity-based methods. Extensive set of experiments are conducted to evaluate performance of the SD-GCN on small and large-scale real-world networks and artificial networks. These experiments verify significant improvement in the accuracy and stability of community detection approaches in parallel with shorter execution time in a linear time complexity.
群落结构对于发现复杂网络的重要结构和潜在性质至关重要。近年来,由于线性时间复杂性和适用于大规模网络的优势,本地社区检测方法的质量不断提高,已成为复杂网络研究的热点。然而,这些方法存在许多缺点,如不稳定、精度低、随机性等。G-CN算法是一种与LPA方法使用相同标签传播的局部方法,但与LPA不同的是,每次迭代只更新边界节点的标签,从而减少了执行时间。然而,它存在分辨率限制和精度低的问题。为了克服这些问题,本文提出了一种改进的社区检测方法,称为SD-GCN,该方法使用混合节点评分和边界节点的同步标签更新,并在初始更新中禁用随机标签更新。在第一阶段,它使用基于度中心性和公共邻居度量获得的分数,以同步的方式更新边界节点的标签。此外,我们在第二阶段定义了一种新的社区合并方法,该方法比基于模块化的方法更快。进行了大量的实验来评估SD-GCN在小型和大型真实世界网络和人工网络上的性能。这些实验验证了社区检测方法的准确性和稳定性的显著提高,同时在线性时间复杂性中缩短了执行时间。
{"title":"Community Detection using a New Node Scoring and Synchronous Label Updating of Boundary Nodes in Social Networks","authors":"M. Zarezade, E. Nourani, Asgarali Bouyer","doi":"10.22044/JADM.2019.8768.2011","DOIUrl":"https://doi.org/10.22044/JADM.2019.8768.2011","url":null,"abstract":"Community structure is vital to discover the important structures and potential property of complex networks. In recent years, the increasing quality of local community detection approaches has become a hot spot in the study of complex network due to the advantages of linear time complexity and applicable for large-scale networks. However, there are many shortcomings in these methods such as instability, low accuracy, randomness, etc. The G-CN algorithm is one of local methods that uses the same label propagation as the LPA method, but unlike the LPA, only the labels of boundary nodes are updated at each iteration that reduces its execution time. However, it has resolution limit and low accuracy problem. To overcome these problems, this paper proposes an improved community detection method called SD-GCN which uses a hybrid node scoring and synchronous label updating of boundary nodes, along with disabling random label updating in initial updates. In the first phase, it updates the label of boundary nodes in a synchronous manner using the obtained score based on degree centrality and common neighbor measures. In addition, we defined a new method for merging communities in second phase which is faster than modularity-based methods. Extensive set of experiments are conducted to evaluate performance of the SD-GCN on small and large-scale real-world networks and artificial networks. These experiments verify significant improvement in the accuracy and stability of community detection approaches in parallel with shorter execution time in a linear time complexity.","PeriodicalId":32592,"journal":{"name":"Journal of Artificial Intelligence and Data Mining","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47031432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Solving Traveling Salesman Problem based on Biogeography-based Optimization and Edge Assembly Cross-over 基于生物地理学的优化和边缘装配交叉求解旅行商问题
Pub Date : 2020-03-10 DOI: 10.22044/JADM.2020.7835.1922
Abbas Salehi, B. Masoumi
Biogeography-Based Optimization (BBO) algorithm has recently been of great interest to researchers for simplicity of implementation, efficiency, and the low number of parameters. The BBO Algorithm in optimization problems is one of the new algorithms which have been developed based on the biogeography concept. This algorithm uses the idea of animal migration to find suitable habitats for solving optimization problems. The BBO algorithm has three principal operators called migration, mutation and elite selection. The migration operator plays a very important role in sharing information among the candidate habitats. The original BBO algorithm, due to its poor exploration and exploitation, sometimes does not perform desirable results. On the other hand, the Edge Assembly Crossover (EAX) has been one of the high power crossovers for acquiring offspring and it increased the diversity of the population. The combination of biogeography-based optimization algorithm and EAX can provide high efficiency in solving optimization problems, including the traveling salesman problem (TSP). This paper proposed a combination of those approaches to solve traveling salesman problem. The new hybrid approach was examined with standard datasets for TSP in TSPLIB. In the experiments, the performance of the proposed approach was better than the original BBO and four others widely used metaheuristics algorithms.
基于生物地理的优化(BBO)算法由于其实现简单、高效和参数数量少而引起了研究人员的极大兴趣。优化问题中的BBO算法是基于生物地理学概念发展起来的一种新算法。该算法利用动物迁徙的思想来寻找合适的栖息地来解决优化问题。BBO算法有三个主要算子,称为迁移、变异和精英选择。迁移操作员在候选栖息地之间共享信息方面发挥着非常重要的作用。最初的BBO算法,由于其探索和开发不力,有时无法获得理想的结果。另一方面,边缘组装杂交(EAX)是获得后代的高功率杂交之一,它增加了种群的多样性。基于生物地理学的优化算法和EAX的结合可以在求解优化问题方面提供高效率,包括旅行商问题(TSP)。本文提出了将这些方法相结合的方法来解决旅行推销员问题。用TSPLIB中TSP的标准数据集对新的混合方法进行了检验。在实验中,所提出的方法的性能优于原始的BBO和其他四种广泛使用的元启发式算法。
{"title":"Solving Traveling Salesman Problem based on Biogeography-based Optimization and Edge Assembly Cross-over","authors":"Abbas Salehi, B. Masoumi","doi":"10.22044/JADM.2020.7835.1922","DOIUrl":"https://doi.org/10.22044/JADM.2020.7835.1922","url":null,"abstract":"Biogeography-Based Optimization (BBO) algorithm has recently been of great interest to researchers for simplicity of implementation, efficiency, and the low number of parameters. The BBO Algorithm in optimization problems is one of the new algorithms which have been developed based on the biogeography concept. This algorithm uses the idea of animal migration to find suitable habitats for solving optimization problems. The BBO algorithm has three principal operators called migration, mutation and elite selection. The migration operator plays a very important role in sharing information among the candidate habitats. The original BBO algorithm, due to its poor exploration and exploitation, sometimes does not perform desirable results. On the other hand, the Edge Assembly Crossover (EAX) has been one of the high power crossovers for acquiring offspring and it increased the diversity of the population. The combination of biogeography-based optimization algorithm and EAX can provide high efficiency in solving optimization problems, including the traveling salesman problem (TSP). This paper proposed a combination of those approaches to solve traveling salesman problem. The new hybrid approach was examined with standard datasets for TSP in TSPLIB. In the experiments, the performance of the proposed approach was better than the original BBO and four others widely used metaheuristics algorithms.","PeriodicalId":32592,"journal":{"name":"Journal of Artificial Intelligence and Data Mining","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47370856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Shuffled Frog-Leaping Programming for Solving Regression Problems 求解回归问题的无序蛙跳规划
Pub Date : 2020-03-10 DOI: 10.22044/JADM.2020.7847.1924
M. Abdollahi, M. A. Shoorehdeli
There are various automatic programming models inspired by evolutionary computation techniques. Due to the importance of devising an automatic mechanism to explore the complicated search space of mathematical problems where numerical methods fails, evolutionary computations are widely studied and applied to solve real world problems. One of the famous algorithm in optimization problem is shuffled frog leaping algorithm (SFLA) which is inspired by behaviour of frogs to find the highest quantity of available food by searching their environment both locally and globally. The results of SFLA prove that it is competitively effective to solve problems. In this paper, Shuffled Frog Leaping Programming (SFLP) inspired by SFLA is proposed as a novel type of automatic programming model to solve symbolic regression problems based on tree representation. Also, in SFLP, a new mechanism for improving constant numbers in the tree structure is proposed. In this way, different domains of mathematical problems can be addressed with the use of proposed method. To find out about the performance of generated solutions by SFLP, various experiments were conducted using a number of benchmark functions. The results were also compared with other evolutionary programming algorithms like BBP, GSP, GP and many variants of GP.
受进化计算技术的启发,有各种各样的自动编程模型。由于设计一种自动机制来探索数值方法无法解决的数学问题的复杂搜索空间的重要性,进化计算被广泛研究并应用于解决现实世界的问题。优化问题中最著名的算法之一是shuffle frog leapalgorithm (SFLA),该算法的灵感来自青蛙的行为,通过局部和全局搜索它们的环境来寻找最高数量的可用食物。结果表明,该方法具有较好的解决问题的竞争力。本文提出了一种新的基于树表示的求解符号回归问题的自动规划模型——shuffle Frog leapprogramming (SFLP)。此外,在SFLP中,提出了一种改进树结构常数数的新机制。通过这种方式,不同领域的数学问题可以用所提出的方法来解决。为了了解SFLP生成的解决方案的性能,我们使用了许多基准函数进行了各种实验。结果还与BBP、GSP、GP等进化规划算法以及GP的多种变体进行了比较。
{"title":"Shuffled Frog-Leaping Programming for Solving Regression Problems","authors":"M. Abdollahi, M. A. Shoorehdeli","doi":"10.22044/JADM.2020.7847.1924","DOIUrl":"https://doi.org/10.22044/JADM.2020.7847.1924","url":null,"abstract":"There are various automatic programming models inspired by evolutionary computation techniques. Due to the importance of devising an automatic mechanism to explore the complicated search space of mathematical problems where numerical methods fails, evolutionary computations are widely studied and applied to solve real world problems. One of the famous algorithm in optimization problem is shuffled frog leaping algorithm (SFLA) which is inspired by behaviour of frogs to find the highest quantity of available food by searching their environment both locally and globally. The results of SFLA prove that it is competitively effective to solve problems. In this paper, Shuffled Frog Leaping Programming (SFLP) inspired by SFLA is proposed as a novel type of automatic programming model to solve symbolic regression problems based on tree representation. Also, in SFLP, a new mechanism for improving constant numbers in the tree structure is proposed. In this way, different domains of mathematical problems can be addressed with the use of proposed method. To find out about the performance of generated solutions by SFLP, various experiments were conducted using a number of benchmark functions. The results were also compared with other evolutionary programming algorithms like BBP, GSP, GP and many variants of GP.","PeriodicalId":32592,"journal":{"name":"Journal of Artificial Intelligence and Data Mining","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48011292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Artificial Intelligence and Data Mining
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1