首页 > 最新文献

2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)最新文献

英文 中文
Suffix Tree Construction based Mapreduce 基于Mapreduce的后缀树构建
Sihem Klai Soukehal, Karima Chibane, M. Khadir
The genome sequence indexing is a primary step in order to facilitate other further treatments such as patterns search or assembly with a reference genome etc. And the suffix tree is one of the most used data structures for indexing the genome sequence. However, the memory required for running the suffix tree construction algorithms may exceed the amount of available main memory. Despite the efforts made by the researchers, the construction of suffix tree remains very expensive with the use of data centres to ensure optimal parallelization of treatments and reduce the execution time without forgetting the risks of breakdown and the problems that it breeds. The parallelization performed by Hadoop and MapReduce gives solutions to storage and data processing capacity limitations as well as fault tolerance, all that at reasonable costs. The emergence of Hadoop, a framework related to big data and the paradigm MapReduce that allows to model parallel and distributed processing, is investigating many domains of science in order to effectively parallel their treatments. PWOTD (Partition and Write Only Top Down) algorithm, is chosen here as it has proven itself in textual algorithms for genome sequencing. In this paper, an approach to model the parallel construction of the suffix tree using the MapReduce paradigm is designed for implementation in Hadoop with a java API.
基因组序列索引是促进其他进一步治疗如模式搜索或与参考基因组组装等的首要步骤。后缀树是基因组序列索引中最常用的数据结构之一。但是,运行后缀树构造算法所需的内存可能会超过可用的主内存。尽管研究人员做出了努力,但后缀树的构建仍然非常昂贵,需要使用数据中心来确保处理的最佳并行化,减少执行时间,同时还要考虑崩溃的风险和由此产生的问题。Hadoop和MapReduce执行的并行化解决了存储和数据处理容量限制以及容错问题,所有这些都是在合理的成本下完成的。Hadoop的出现,一个与大数据相关的框架,以及允许并行和分布式处理建模的范式MapReduce,正在研究许多科学领域,以便有效地并行它们的处理。这里选择PWOTD (Partition and Write Only Top Down)算法,因为它已经在基因组测序的文本算法中证明了自己。本文设计了一种使用MapReduce范式对后缀树的并行构建建模的方法,并通过java API在Hadoop中实现。
{"title":"Suffix Tree Construction based Mapreduce","authors":"Sihem Klai Soukehal, Karima Chibane, M. Khadir","doi":"10.1109/ICTAACS48474.2019.8988123","DOIUrl":"https://doi.org/10.1109/ICTAACS48474.2019.8988123","url":null,"abstract":"The genome sequence indexing is a primary step in order to facilitate other further treatments such as patterns search or assembly with a reference genome etc. And the suffix tree is one of the most used data structures for indexing the genome sequence. However, the memory required for running the suffix tree construction algorithms may exceed the amount of available main memory. Despite the efforts made by the researchers, the construction of suffix tree remains very expensive with the use of data centres to ensure optimal parallelization of treatments and reduce the execution time without forgetting the risks of breakdown and the problems that it breeds. The parallelization performed by Hadoop and MapReduce gives solutions to storage and data processing capacity limitations as well as fault tolerance, all that at reasonable costs. The emergence of Hadoop, a framework related to big data and the paradigm MapReduce that allows to model parallel and distributed processing, is investigating many domains of science in order to effectively parallel their treatments. PWOTD (Partition and Write Only Top Down) algorithm, is chosen here as it has proven itself in textual algorithms for genome sequencing. In this paper, an approach to model the parallel construction of the suffix tree using the MapReduce paradigm is designed for implementation in Hadoop with a java API.","PeriodicalId":406766,"journal":{"name":"2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114233763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Binary Firefly Algorithm for Feature Selection in Classification 分类中特征选择的二元萤火虫算法
Sofiane Maza, Djaafar Zouache
The large dimension of datasets influences on the classification performances and computational time. For that, feature selection is among process that avoid of those problems by selecting the relevance and eliminate the redundancy features.In this paper, we propose a new algorithm for feature selection FAFS (Firefly Algorithm for Feature Selection) which is based on the firefly algorithm. FAFS uses two objectives, which are Accuracy Rate and Reduction Rate. We propose a new formula to calculate the distance r and attractive A in Firefly algorithm.The experimental results show the capability of the proposed algorithm with three classifiers (KNN, NB, and LDA) and their outperformance against PSO-FS (Particle Swarm Optimization for feature selection).
数据集的大维度影响分类性能和计算时间。特征选择是通过选择相关特征和消除冗余特征来避免这些问题的过程之一。本文在萤火虫算法的基础上,提出了一种新的特征选择算法FAFS (Firefly algorithm for feature selection)。FAFS使用两个目标,即正确率和还原率。我们提出了一个新的计算萤火虫算法中距离r和吸引a的公式。实验结果表明,该算法具有KNN、NB和LDA三种分类器,并且优于PSO-FS (Particle Swarm Optimization for feature selection)。
{"title":"Binary Firefly Algorithm for Feature Selection in Classification","authors":"Sofiane Maza, Djaafar Zouache","doi":"10.1109/ICTAACS48474.2019.8988137","DOIUrl":"https://doi.org/10.1109/ICTAACS48474.2019.8988137","url":null,"abstract":"The large dimension of datasets influences on the classification performances and computational time. For that, feature selection is among process that avoid of those problems by selecting the relevance and eliminate the redundancy features.In this paper, we propose a new algorithm for feature selection FAFS (Firefly Algorithm for Feature Selection) which is based on the firefly algorithm. FAFS uses two objectives, which are Accuracy Rate and Reduction Rate. We propose a new formula to calculate the distance r and attractive A in Firefly algorithm.The experimental results show the capability of the proposed algorithm with three classifiers (KNN, NB, and LDA) and their outperformance against PSO-FS (Particle Swarm Optimization for feature selection).","PeriodicalId":406766,"journal":{"name":"2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121390976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A Local Patterns Weighting Approach for Optimizing Content-Based Image Retrieval Using a Differential Evolution Algorithm 差分进化算法优化基于内容的图像检索的局部模式加权方法
Rahima Boukerma, Salah Bougueroua, Bachir Boucheham
In Content Based-Image Retrieval (CBIR), low-level visual characteristics like color, texture and shape are used to search for relevant images. However, the result images returned to the user are generally not satisfactory to his expectations. This is due to the gap between the low-level features of the image and the semantic (high-level) concepts given by the user to the same image. To overcome this challenge, we propose in this paper a mechanism that improves CBIR performance and consequently reduce the semantic gap. In that regard, our work involves the optimization of CBIR using a specific mechanism for weighting the extracted textural characteristics of the image. The extraction of the latter is carried out by some local patterns methods. Then, the generation of the weights associated with the local patterns, is realized using the Differential Evolution algorithm. To evaluate our approach, we tested it on Wang’s database (Corel-1K). In addition, we adopted the precision as performance evaluation measure and we used Manhattan and Euclidean distances for comparing the local patterns histograms. The results of the carried-out experiments show that the obtained precisions by the weighted local patterns methods are better than those of the conventional methods.
在基于内容的图像检索(CBIR)中,使用颜色、纹理和形状等低级视觉特征来搜索相关图像。然而,返回给用户的结果图像通常不能满足用户的期望。这是由于图像的低级特征与用户对同一图像给出的语义(高级)概念之间存在差距。为了克服这一挑战,我们在本文中提出了一种提高CBIR性能的机制,从而减少语义差距。在这方面,我们的工作涉及使用特定机制对提取的图像纹理特征进行加权来优化CBIR。后者的提取是通过一些局部模式方法来实现的。然后,利用差分进化算法实现了与局部模式相关联的权值的生成。为了评估我们的方法,我们在Wang的数据库(Corel-1K)上进行了测试。此外,我们采用精度作为性能评价指标,并使用曼哈顿距离和欧几里得距离来比较局部模式直方图。实验结果表明,加权局部模式方法的精度优于传统方法。
{"title":"A Local Patterns Weighting Approach for Optimizing Content-Based Image Retrieval Using a Differential Evolution Algorithm","authors":"Rahima Boukerma, Salah Bougueroua, Bachir Boucheham","doi":"10.1109/ICTAACS48474.2019.8988120","DOIUrl":"https://doi.org/10.1109/ICTAACS48474.2019.8988120","url":null,"abstract":"In Content Based-Image Retrieval (CBIR), low-level visual characteristics like color, texture and shape are used to search for relevant images. However, the result images returned to the user are generally not satisfactory to his expectations. This is due to the gap between the low-level features of the image and the semantic (high-level) concepts given by the user to the same image. To overcome this challenge, we propose in this paper a mechanism that improves CBIR performance and consequently reduce the semantic gap. In that regard, our work involves the optimization of CBIR using a specific mechanism for weighting the extracted textural characteristics of the image. The extraction of the latter is carried out by some local patterns methods. Then, the generation of the weights associated with the local patterns, is realized using the Differential Evolution algorithm. To evaluate our approach, we tested it on Wang’s database (Corel-1K). In addition, we adopted the precision as performance evaluation measure and we used Manhattan and Euclidean distances for comparing the local patterns histograms. The results of the carried-out experiments show that the obtained precisions by the weighted local patterns methods are better than those of the conventional methods.","PeriodicalId":406766,"journal":{"name":"2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129039511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Solving the Cubic Cell Formation Problem Using Simulated Annealing Algorithm to Develop Multiple Configurations 用模拟退火算法求解立方胞的形成问题,发展多种构型
Hamida Bouaziz, Ali Lemouari
The Cubic Cell Formation Problem (CCFP) in cellular manufacturing systems consists in decomposing a production system into a set of manufacturing cells, and assigning workers to cells besides parts and machines. The objective is to obtain manageable cells by minimising the inter-cell moves of parts and workers and the heterogeneity in cells. In this paper, we provide a solution methodology based on a modified simulated annealing heuristic with a proposed neighbourhood search procedure. The proposed methodology allows to build multiple configurations by giving to the designer the ability to control some parameters. Experimental results show that the proposed algorithm gives a promising performance for the most problem instances found in the literature.
单元制造系统中的立方单元形成问题(CCFP)包括将生产系统分解为一组制造单元,并将工人分配到部件和机器之外的单元中。目标是通过最大限度地减少细胞间部分和工人的移动以及细胞的异质性来获得可管理的细胞。在本文中,我们提供了一种基于改进的模拟退火启发式方法和提出的邻域搜索过程的求解方法。所提出的方法允许通过赋予设计者控制某些参数的能力来构建多种配置。实验结果表明,该算法对文献中发现的大多数问题实例都具有良好的性能。
{"title":"Solving the Cubic Cell Formation Problem Using Simulated Annealing Algorithm to Develop Multiple Configurations","authors":"Hamida Bouaziz, Ali Lemouari","doi":"10.1109/ICTAACS48474.2019.8988125","DOIUrl":"https://doi.org/10.1109/ICTAACS48474.2019.8988125","url":null,"abstract":"The Cubic Cell Formation Problem (CCFP) in cellular manufacturing systems consists in decomposing a production system into a set of manufacturing cells, and assigning workers to cells besides parts and machines. The objective is to obtain manageable cells by minimising the inter-cell moves of parts and workers and the heterogeneity in cells. In this paper, we provide a solution methodology based on a modified simulated annealing heuristic with a proposed neighbourhood search procedure. The proposed methodology allows to build multiple configurations by giving to the designer the ability to control some parameters. Experimental results show that the proposed algorithm gives a promising performance for the most problem instances found in the literature.","PeriodicalId":406766,"journal":{"name":"2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121963746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Tracking diffusion pattern based on Salient Tweets 基于显著推文的扩散模式跟踪
Ahlem Drif, S. Giordano
Social networks, as user-generated content platforms, enable users to express their own opinions timely. To detect and predict user opinions, several works have been proposed. This paper aims to solve it as a time-series prediction problem: understanding the most important messages spreading, and predicting the evolution of the information spreading in future. We proposed a solution based on the Long Short-Term Memory (LSTM) model to capture such evolution. LSTM model was applied to features extracted from the corpus of tweets with the aim of capturing the impact of the salient tweets on information spreading process. By detecting the behaviour of the past tweets, we predict tweet diffusion in the next time period. The results with a real data-set of tweets show the strength of the proposed solution to predict the subtopic diffusion with high accuracy.
社交网络作为用户生成内容的平台,可以让用户及时表达自己的观点。为了检测和预测用户意见,已经提出了几个工作。本文旨在将其作为一个时间序列预测问题来解决:了解最重要的信息传播,并预测未来信息传播的演变。我们提出了一种基于长短期记忆(LSTM)模型的解决方案来捕捉这种进化。将LSTM模型应用于从推文语料库中提取的特征,目的是捕捉显著推文对信息传播过程的影响。通过检测过去推文的行为,我们预测推文在下一个时间段的扩散。实际推文数据集的结果表明,该方法具有较高的预测子主题扩散的准确性。
{"title":"Tracking diffusion pattern based on Salient Tweets","authors":"Ahlem Drif, S. Giordano","doi":"10.1109/ICTAACS48474.2019.8988130","DOIUrl":"https://doi.org/10.1109/ICTAACS48474.2019.8988130","url":null,"abstract":"Social networks, as user-generated content platforms, enable users to express their own opinions timely. To detect and predict user opinions, several works have been proposed. This paper aims to solve it as a time-series prediction problem: understanding the most important messages spreading, and predicting the evolution of the information spreading in future. We proposed a solution based on the Long Short-Term Memory (LSTM) model to capture such evolution. LSTM model was applied to features extracted from the corpus of tweets with the aim of capturing the impact of the salient tweets on information spreading process. By detecting the behaviour of the past tweets, we predict tweet diffusion in the next time period. The results with a real data-set of tweets show the strength of the proposed solution to predict the subtopic diffusion with high accuracy.","PeriodicalId":406766,"journal":{"name":"2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126495286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-criteria-based relay election for Data Dissemination in urban VANET 基于多准则的城市VANET数据分发中继选举
Amira Ichrak Tei, Z. Doukha, Youcef Zafoune
Data dissemination in Vehicular Ad-Hoc Networks (VANETs) is used as a tool for safety applications. In the real world, urban environment can be faced to some inherent VANET characteristics such as highly dynamic topology, diverse network densities, the changing speed of the vehicles, the noise, the obstacles which considerably affect the communication quality and make data dissemination fail in spreading urgent information over the network. To this effect, this work proposes a Multi-Criteria based Relay Election Protocol for Data Dissemination in urban VANETs (MCRE-DDP) where the most relevant relay node is elected based on several parameters like Signal to Noise Ratio (SNR), vehicle speed, distance between sender and receiver to determine the nodes quality and its ability to successfully relay the dissemination message because relevant relays prevent the communication system from multiple sending and guaranty the suitable data relaying as far as the boundaries of the area of interest. Simulation results show that MCRE-DDP performs data dissemination more efficiently than its peers in terms of Dissemination Speed, link load and redundancy ratio.
车辆自组织网络(VANETs)中的数据传播被用作安全应用的工具。在现实世界中,城市环境会面临VANET固有的一些特性,如拓扑结构的高度动态性、网络密度的多样性、车辆速度的变化、噪声和障碍物等,这些特性极大地影响了通信质量,使数据传播无法在网络上传播紧急信息。为此,本工作提出了一种基于多标准的城市vanet数据传播中继选举协议(MCRE-DDP),其中根据信噪比(SNR)、车速、发送方和接收方之间的距离决定了节点的质量及其成功中继传播消息的能力,因为相关的中继可以防止通信系统进行多次发送,并保证在感兴趣区域的边界内进行适当的数据中继。仿真结果表明,MCRE-DDP在数据传播速度、链路负载和冗余率等方面都优于同类算法。
{"title":"Multi-criteria-based relay election for Data Dissemination in urban VANET","authors":"Amira Ichrak Tei, Z. Doukha, Youcef Zafoune","doi":"10.1109/ICTAACS48474.2019.8988131","DOIUrl":"https://doi.org/10.1109/ICTAACS48474.2019.8988131","url":null,"abstract":"Data dissemination in Vehicular Ad-Hoc Networks (VANETs) is used as a tool for safety applications. In the real world, urban environment can be faced to some inherent VANET characteristics such as highly dynamic topology, diverse network densities, the changing speed of the vehicles, the noise, the obstacles which considerably affect the communication quality and make data dissemination fail in spreading urgent information over the network. To this effect, this work proposes a Multi-Criteria based Relay Election Protocol for Data Dissemination in urban VANETs (MCRE-DDP) where the most relevant relay node is elected based on several parameters like Signal to Noise Ratio (SNR), vehicle speed, distance between sender and receiver to determine the nodes quality and its ability to successfully relay the dissemination message because relevant relays prevent the communication system from multiple sending and guaranty the suitable data relaying as far as the boundaries of the area of interest. Simulation results show that MCRE-DDP performs data dissemination more efficiently than its peers in terms of Dissemination Speed, link load and redundancy ratio.","PeriodicalId":406766,"journal":{"name":"2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114675253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Towards a Smart Data Transmission Strategy for IoT Monitoring Systems: Application to Air Quality Monitoring 面向物联网监测系统的智能数据传输策略:在空气质量监测中的应用
Razika Lounas, Dhai Eddine Salhi, Hocine Mokrani, Rachid Djerbi, M. Bennai
In the modern digital area, Internet of Things (IoT) is increasingly gathering attention for the implementation of applications in several aspects of everyday activities, intending to make our cities smarter and more comfortable. Therefore, the implementation of these IoT applications raises several challenges to overcome. One of these challenges is the efficient use of resources at each stage of the application, such as acquisition, storage, processing, and networking. In smart cities, many IoT monitoring systems continuously generate large amounts of data. These data volumes, before they can be processed and responded, must first be transmitted through the city’s networks (Wifi, Bluetooth, LTE). To deal with this considerable amount of continually transmitted data and to reduce the load on networks, we propose an approach based on the efficient use of data compression in IoT systems. This approach uses a data compression smart strategy to reduce the transmitted data during the acquisition process and thus minimize the use of network resources while providing the user with relevant information in real-time using a prioritization mechanism. In order to show the efficiency of our proposal, we conducted experiments on a case study of an air quality monitoring system.
在现代数字领域,物联网(IoT)越来越受到人们的关注,因为它在日常活动的几个方面实施了应用,旨在使我们的城市更智能、更舒适。因此,这些物联网应用的实施提出了几个需要克服的挑战。这些挑战之一是在应用程序的每个阶段(例如获取、存储、处理和网络)有效地使用资源。在智慧城市中,许多物联网监控系统不断产生大量数据。这些数据量在被处理和响应之前,必须首先通过城市的网络(Wifi、蓝牙、LTE)传输。为了处理大量连续传输的数据并减少网络负载,我们提出了一种基于物联网系统中有效使用数据压缩的方法。该方法采用数据压缩智能策略,在采集过程中减少传输数据,从而最大限度地减少网络资源的使用,同时通过优先级机制实时为用户提供相关信息。为了证明我们建议的有效性,我们以空气质素监测系统为个案进行实验研究。
{"title":"Towards a Smart Data Transmission Strategy for IoT Monitoring Systems: Application to Air Quality Monitoring","authors":"Razika Lounas, Dhai Eddine Salhi, Hocine Mokrani, Rachid Djerbi, M. Bennai","doi":"10.1109/ICTAACS48474.2019.8988119","DOIUrl":"https://doi.org/10.1109/ICTAACS48474.2019.8988119","url":null,"abstract":"In the modern digital area, Internet of Things (IoT) is increasingly gathering attention for the implementation of applications in several aspects of everyday activities, intending to make our cities smarter and more comfortable. Therefore, the implementation of these IoT applications raises several challenges to overcome. One of these challenges is the efficient use of resources at each stage of the application, such as acquisition, storage, processing, and networking. In smart cities, many IoT monitoring systems continuously generate large amounts of data. These data volumes, before they can be processed and responded, must first be transmitted through the city’s networks (Wifi, Bluetooth, LTE). To deal with this considerable amount of continually transmitted data and to reduce the load on networks, we propose an approach based on the efficient use of data compression in IoT systems. This approach uses a data compression smart strategy to reduce the transmitted data during the acquisition process and thus minimize the use of network resources while providing the user with relevant information in real-time using a prioritization mechanism. In order to show the efficiency of our proposal, we conducted experiments on a case study of an air quality monitoring system.","PeriodicalId":406766,"journal":{"name":"2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115819098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Machine learning technique dedicated for biological data 一种专门用于生物数据的机器学习技术
M. N. Boufenara, M. Boufaida, M. Berkane
In a data-driven world, semi-supervised learning methods are motivated by the availability of large unlabeled datasets than a small amount of labeled data. However, incorporating unlabeled data into learning does not guarantee an improvement in classification performance. In this paper, we present an approach based on a deep learning system to predict missing classes by integrating a model of semi-supervised learning which is the self-training. In order to evaluate its performance, we used a set of diabetes data and four performance measures: Precision, Recall, F-Measure and Area Under the ROC Curve (AUC).
在数据驱动的世界中,半监督学习方法的动机是大量未标记数据集的可用性,而不是少量标记数据。然而,将未标记数据纳入学习并不能保证分类性能的提高。在本文中,我们提出了一种基于深度学习系统的方法,通过集成半监督学习模型(即自训练)来预测缺课。为了评估其性能,我们使用了一组糖尿病数据和四个性能指标:Precision, Recall, F-Measure和Area Under the ROC Curve (AUC)。
{"title":"A Machine learning technique dedicated for biological data","authors":"M. N. Boufenara, M. Boufaida, M. Berkane","doi":"10.1109/ICTAACS48474.2019.8988116","DOIUrl":"https://doi.org/10.1109/ICTAACS48474.2019.8988116","url":null,"abstract":"In a data-driven world, semi-supervised learning methods are motivated by the availability of large unlabeled datasets than a small amount of labeled data. However, incorporating unlabeled data into learning does not guarantee an improvement in classification performance. In this paper, we present an approach based on a deep learning system to predict missing classes by integrating a model of semi-supervised learning which is the self-training. In order to evaluate its performance, we used a set of diabetes data and four performance measures: Precision, Recall, F-Measure and Area Under the ROC Curve (AUC).","PeriodicalId":406766,"journal":{"name":"2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)","volume":"210 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115972586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Comparing the Effectiveness of the Improved ARLSTem Algorithm with Existing Arabic Light Stemmers 改进的ARLSTem算法与现有阿拉伯光系统的有效性比较
K. Abainia, Hamza Rebbani
As known in the literature, light stemmers produce more under-stemming errors, while root stemmers produce more over-stemming errors. In this investigation, we deal with the Arabic light stemming problem, where we propose an improvement to ARLSTem algorithm (i.e. ARLSTem v1.1). In particular, we introduce new rules to correct some under-stemming errors produced by ARLSTem. In addition, we compare the new version of ARLSTem with five existing stemming algorithms using ARASTEM corpus. The latter has been corrected, where we have found some errors in seven samples. The experimental results showed that ARLSTem v1.1 outperforms the other existing algorithms in terms of under-stemming and over-stemming errors. Moreover, it presents interesting performances in the Arabic text categorization task.
据文献所知,轻茎产生更多的干化不足错误,而根茎产生更多的干化过度错误。在这项研究中,我们处理阿拉伯光词干问题,我们提出了一个改进的ARLSTem算法(即ARLSTem v1.1)。特别地,我们引入了新的规则来纠正ARLSTem产生的一些词干不足错误。此外,我们还将新版本的ARLSTem与现有的五种基于arasstem语料库的词干提取算法进行了比较。后者已经更正,我们在其中发现了7个样品中的一些错误。实验结果表明,ARLSTem v1.1在词干不足和词干过度错误方面优于其他现有算法。此外,它在阿拉伯语文本分类任务中也表现出有趣的性能。
{"title":"Comparing the Effectiveness of the Improved ARLSTem Algorithm with Existing Arabic Light Stemmers","authors":"K. Abainia, Hamza Rebbani","doi":"10.1109/ICTAACS48474.2019.8988118","DOIUrl":"https://doi.org/10.1109/ICTAACS48474.2019.8988118","url":null,"abstract":"As known in the literature, light stemmers produce more under-stemming errors, while root stemmers produce more over-stemming errors. In this investigation, we deal with the Arabic light stemming problem, where we propose an improvement to ARLSTem algorithm (i.e. ARLSTem v1.1). In particular, we introduce new rules to correct some under-stemming errors produced by ARLSTem. In addition, we compare the new version of ARLSTem with five existing stemming algorithms using ARASTEM corpus. The latter has been corrected, where we have found some errors in seven samples. The experimental results showed that ARLSTem v1.1 outperforms the other existing algorithms in terms of under-stemming and over-stemming errors. Moreover, it presents interesting performances in the Arabic text categorization task.","PeriodicalId":406766,"journal":{"name":"2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129957738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Natural inclusion of Motives inside BDI agents 在BDI特工中自然包含动机
Adel Saadi, R. Maamri, Z. Sahnoun
The Belief-Desire-Intention (BDI) model is amongst the most popular approaches to design agents with flexible behaviors. It is based essentially on the processing of beliefs and goals. On the other hand, it is well accepted that the motive is another relevant concept for the agent’s reasoning and its flexibility. As the BDI model does not include this important concept, several BDI agent’ s extensions, with a new and a special component expressing the motive, were proposed in the literature. In this paper, we show that it is not necessary to add this new component to express a motive, as a motive can be expressed by a particular type of goal. This allows including and taking advantage from the reasoning about motives without additional components.
信念-欲望-意图(BDI)模型是设计具有灵活行为的智能体最流行的方法之一。它本质上是基于对信念和目标的处理。另一方面,人们普遍认为动机是与主体推理及其灵活性相关的另一个概念。由于BDI模型不包括这一重要概念,因此文献中提出了几种BDI代理的扩展,其中包括一个新的和特殊的表达动机的组件。在本文中,我们证明没有必要添加这个新的成分来表达动机,因为动机可以通过特定类型的目标来表达。这允许在没有额外组件的情况下包含和利用关于动机的推理。
{"title":"A Natural inclusion of Motives inside BDI agents","authors":"Adel Saadi, R. Maamri, Z. Sahnoun","doi":"10.1109/ICTAACS48474.2019.8988135","DOIUrl":"https://doi.org/10.1109/ICTAACS48474.2019.8988135","url":null,"abstract":"The Belief-Desire-Intention (BDI) model is amongst the most popular approaches to design agents with flexible behaviors. It is based essentially on the processing of beliefs and goals. On the other hand, it is well accepted that the motive is another relevant concept for the agent’s reasoning and its flexibility. As the BDI model does not include this important concept, several BDI agent’ s extensions, with a new and a special component expressing the motive, were proposed in the literature. In this paper, we show that it is not necessary to add this new component to express a motive, as a motive can be expressed by a particular type of goal. This allows including and taking advantage from the reasoning about motives without additional components.","PeriodicalId":406766,"journal":{"name":"2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130832182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2019 International Conference on Theoretical and Applicative Aspects of Computer Science (ICTAACS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1