首页 > 最新文献

2019 1st International Conference on Smart Systems and Data Science (ICSSD)最新文献

英文 中文
Enhancement of supply chain management by integrating Blockchain technology 整合区块链技术,加强供应链管理
Pub Date : 2019-10-01 DOI: 10.1109/ICSSD47982.2019.9002771
S. Nasih, Sara Arezki, T. Gadi
Due to its wide involvement in different fields: industry, maritime industry, trade, supply chain management has been greatly expanded to cover a large and complex network of stakeholders in the production and distribution process. The multitude of intermediaries in this process leads difficulties in communication, control, time saving… In this paper, we propose the Blockchain technology as a solution for decentralization and disintermediation of operations in the supply chain, and its effect on the maritime industry.
由于其广泛涉及不同的领域:工业,海运业,贸易,供应链管理已经大大扩展到涵盖生产和分销过程中利益相关者的庞大而复杂的网络。在这一过程中,众多的中介机构导致了沟通、控制和节省时间的困难……在本文中,我们提出区块链技术作为供应链中操作的去中心化和非中介化的解决方案,以及它对海运业的影响。
{"title":"Enhancement of supply chain management by integrating Blockchain technology","authors":"S. Nasih, Sara Arezki, T. Gadi","doi":"10.1109/ICSSD47982.2019.9002771","DOIUrl":"https://doi.org/10.1109/ICSSD47982.2019.9002771","url":null,"abstract":"Due to its wide involvement in different fields: industry, maritime industry, trade, supply chain management has been greatly expanded to cover a large and complex network of stakeholders in the production and distribution process. The multitude of intermediaries in this process leads difficulties in communication, control, time saving… In this paper, we propose the Blockchain technology as a solution for decentralization and disintermediation of operations in the supply chain, and its effect on the maritime industry.","PeriodicalId":342806,"journal":{"name":"2019 1st International Conference on Smart Systems and Data Science (ICSSD)","volume":"193 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115729331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Decomposition and Visualization of High-Dimensional Data in a Two Dimensional Interface 二维界面中高维数据的分解与可视化
Pub Date : 2019-10-01 DOI: 10.1109/ICSSD47982.2019.9002846
Mimoun Lamrini, M. Chkouri
Data visualization has a crucial role in understanding and processing voluminous data (i.e., Big Data) and subsequently has become more important with the coincidence of the exponential growth of data analysis need.The problem of high-dimensional data visualization in a data processing software interface cannot be entirely displayed, in consideration that once the data size exceeds two-dimension, it cannot be projected into a two-dimension interface. Furthermore, the rough analysis and evaluation of high-dimensional data become considerably ambiguous, thus, making a precise decision on that data cannot be achieved. In order to overcome this anomaly, resorting to data dimensionality reduction is a plausible solution.In this paper, the integration of Principal Component Analysis (PCA) combined with the Matrix by Block Decomposition (MBD) method(A.K.A block segmentation). According to the literature, the MBD method turned out quite efficient in data segmentation, wherein a huge data can be divided into regular blocks. By doing so, it becomes easier to access and visualize a given part of data. In order to further enhance the visualization understanding, K-means segmentation has been integrated in our proposed algorithm.In our study, we took into account other data dimensionality reduction techniques such as Linear Discriminant Analysis (LDA), Multi-Dimensional Scaling(MDS).
数据可视化在理解和处理海量数据(即大数据)方面发挥着至关重要的作用,随着数据分析需求的指数级增长,数据可视化也变得越来越重要。数据处理软件界面中的高维数据可视化问题无法完全展现出来,因为一旦数据规模超过二维,就无法投影到二维界面中。此外,对高维数据的粗略分析和评价变得相当模糊,因此无法对该数据做出精确的决策。为了克服这种异常,采用数据降维是一种可行的解决方案。本文采用分块分解(MBD)方法,将主成分分析(PCA)与矩阵相结合块分割)。从文献来看,MBD方法在数据分割方面是非常高效的,可以将庞大的数据分割成规则的块。通过这样做,可以更容易地访问和可视化给定的数据部分。为了进一步增强可视化理解,我们提出的算法中集成了k均值分割。在我们的研究中,我们考虑了其他数据降维技术,如线性判别分析(LDA),多维尺度(MDS)。
{"title":"Decomposition and Visualization of High-Dimensional Data in a Two Dimensional Interface","authors":"Mimoun Lamrini, M. Chkouri","doi":"10.1109/ICSSD47982.2019.9002846","DOIUrl":"https://doi.org/10.1109/ICSSD47982.2019.9002846","url":null,"abstract":"Data visualization has a crucial role in understanding and processing voluminous data (i.e., Big Data) and subsequently has become more important with the coincidence of the exponential growth of data analysis need.The problem of high-dimensional data visualization in a data processing software interface cannot be entirely displayed, in consideration that once the data size exceeds two-dimension, it cannot be projected into a two-dimension interface. Furthermore, the rough analysis and evaluation of high-dimensional data become considerably ambiguous, thus, making a precise decision on that data cannot be achieved. In order to overcome this anomaly, resorting to data dimensionality reduction is a plausible solution.In this paper, the integration of Principal Component Analysis (PCA) combined with the Matrix by Block Decomposition (MBD) method(A.K.A block segmentation). According to the literature, the MBD method turned out quite efficient in data segmentation, wherein a huge data can be divided into regular blocks. By doing so, it becomes easier to access and visualize a given part of data. In order to further enhance the visualization understanding, K-means segmentation has been integrated in our proposed algorithm.In our study, we took into account other data dimensionality reduction techniques such as Linear Discriminant Analysis (LDA), Multi-Dimensional Scaling(MDS).","PeriodicalId":342806,"journal":{"name":"2019 1st International Conference on Smart Systems and Data Science (ICSSD)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114846805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Multi-Agent Model for Network Intrusion Detection 网络入侵检测的多智能体模型
Pub Date : 2019-10-01 DOI: 10.1109/ICSSD47982.2019.9003119
Said Ouiazzane, M. Addou, Fatimazahra Barramou
The objective of this paper is to propose a distributed intrusion detection model based on a multi agent system. Mutli Agent Systems (MAS) are very suitable for intrusion detection systems as they meet the characteristics required by the networks and Big Data issues. The MAS agents cooperate and communicate with each other to ensure the effective detection of network intrusions without the intervention of an expert as used to be in the classical intrusion detection systems relying on signature matching to detect known attacks. The proposed model helped to detect known and unknown attacks within big computer infrastructure by responding to the network requirements in terms of distribution, autonomy, responsiveness and communication. The proposed model is capable of achieving a good and a real time intrusion detection using multi-agents paradigm and Hadoop Distributed File System (HDFS).
本文的目的是提出一种基于多智能体系统的分布式入侵检测模型。多代理系统(multi Agent Systems, MAS)满足了网络和大数据问题对入侵检测系统的要求,非常适用于入侵检测系统。传统的入侵检测系统依赖于特征匹配来检测已知的攻击,而传统的入侵检测系统依赖于特征匹配来检测已知的攻击,MAS agent之间相互协作和通信,保证了在没有专家干预的情况下有效检测网络入侵。该模型通过响应网络在分布、自治、响应和通信方面的需求,帮助检测大型计算机基础设施中的已知和未知攻击。该模型利用多代理模型和Hadoop分布式文件系统(HDFS)实现了良好的实时入侵检测。
{"title":"A Multi-Agent Model for Network Intrusion Detection","authors":"Said Ouiazzane, M. Addou, Fatimazahra Barramou","doi":"10.1109/ICSSD47982.2019.9003119","DOIUrl":"https://doi.org/10.1109/ICSSD47982.2019.9003119","url":null,"abstract":"The objective of this paper is to propose a distributed intrusion detection model based on a multi agent system. Mutli Agent Systems (MAS) are very suitable for intrusion detection systems as they meet the characteristics required by the networks and Big Data issues. The MAS agents cooperate and communicate with each other to ensure the effective detection of network intrusions without the intervention of an expert as used to be in the classical intrusion detection systems relying on signature matching to detect known attacks. The proposed model helped to detect known and unknown attacks within big computer infrastructure by responding to the network requirements in terms of distribution, autonomy, responsiveness and communication. The proposed model is capable of achieving a good and a real time intrusion detection using multi-agents paradigm and Hadoop Distributed File System (HDFS).","PeriodicalId":342806,"journal":{"name":"2019 1st International Conference on Smart Systems and Data Science (ICSSD)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129650613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
An Empirical Study of Deep Neural Networks Models for Sentiment Classification on Movie Reviews 电影评论情感分类的深度神经网络模型实证研究
Pub Date : 2019-10-01 DOI: 10.1109/ICSSD47982.2019.9003171
Oumaima Hourrane, Nouhaila Idrissi, E. Benlahmar
Sentiment classification is one of the new absorbing parts appeared in natural language processing with the emergence of community sites on the web. Taking advantage of the amount of information now available, research and industry have been seeking ways to automatically analyze the sentiments expressed in texts. The challenge for this task is the human language ambiguity, and also the lack of labeled data. In order to solve this issue, Deep learning models appeared to be effective due to their automatic learning capability. In this paper, we provide a comparative study on IMDB movie review dataset, we compare word embeddings methods and further deep learning models on sentiment analysis and give broad empirical outcomes for those keen on taking advantage of deep learning for sentiment analysis in real-world settings.
情感分类是随着网络上社区站点的出现,自然语言处理中出现的一个新的吸收部分。利用现有的大量信息,研究和行业一直在寻找自动分析文本中表达的情感的方法。这项任务面临的挑战是人类语言的模糊性,以及缺乏标记数据。为了解决这个问题,深度学习模型由于具有自动学习能力而显得很有效。在本文中,我们对IMDB电影评论数据集进行了比较研究,我们比较了词嵌入方法和进一步的深度学习模型在情感分析方面的作用,并为那些热衷于在现实世界中利用深度学习进行情感分析的人提供了广泛的经验结果。
{"title":"An Empirical Study of Deep Neural Networks Models for Sentiment Classification on Movie Reviews","authors":"Oumaima Hourrane, Nouhaila Idrissi, E. Benlahmar","doi":"10.1109/ICSSD47982.2019.9003171","DOIUrl":"https://doi.org/10.1109/ICSSD47982.2019.9003171","url":null,"abstract":"Sentiment classification is one of the new absorbing parts appeared in natural language processing with the emergence of community sites on the web. Taking advantage of the amount of information now available, research and industry have been seeking ways to automatically analyze the sentiments expressed in texts. The challenge for this task is the human language ambiguity, and also the lack of labeled data. In order to solve this issue, Deep learning models appeared to be effective due to their automatic learning capability. In this paper, we provide a comparative study on IMDB movie review dataset, we compare word embeddings methods and further deep learning models on sentiment analysis and give broad empirical outcomes for those keen on taking advantage of deep learning for sentiment analysis in real-world settings.","PeriodicalId":342806,"journal":{"name":"2019 1st International Conference on Smart Systems and Data Science (ICSSD)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125325322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A lightweight risk analysis of a critical infrastructure based ICSs 基于集成电路的关键基础设施的轻量级风险分析
Pub Date : 2019-10-01 DOI: 10.1109/ICSSD47982.2019.9002902
O. El Idrissi, Abdellatif Mezrioui, A. Belmekki
Industrial Control Systems (ICS) are currently integrated into critical infrastructures and are designed to support industrial processes, monitor and control in real time a large number of processes and operations such as gas and electricity distribution (conventional and nuclear), water treatment, etc. ICSs have evolved significantly in recent years and have embraced new technologies such as IoT and have been made accessible through the Internet to allow remote access to administrators, service providers, etc. The aim of this paper is to illustrate, through the use of EBIOS risk analysis method that such infrastructures are subject to vulnerabilities, overwhelming threats and potentials risks. The paper also proposes a number of recommendations and organizational and technological security measures to reduce these risks to an acceptable level and to decrease both their impacts and potentialities.
工业控制系统(ICS)目前集成到关键基础设施中,旨在支持工业过程,实时监测和控制大量过程和操作,如天然气和电力分配(常规和核),水处理等。近年来,国际集成系统发生了重大变化,采用了物联网等新技术,并可通过互联网访问,允许管理员、服务提供商等进行远程访问。本文的目的是通过使用EBIOS风险分析方法来说明此类基础设施存在脆弱性、压倒性威胁和潜在风险。本文还提出了一些建议以及组织和技术安全措施,以将这些风险降低到可接受的水平,并减少其影响和潜力。
{"title":"A lightweight risk analysis of a critical infrastructure based ICSs","authors":"O. El Idrissi, Abdellatif Mezrioui, A. Belmekki","doi":"10.1109/ICSSD47982.2019.9002902","DOIUrl":"https://doi.org/10.1109/ICSSD47982.2019.9002902","url":null,"abstract":"Industrial Control Systems (ICS) are currently integrated into critical infrastructures and are designed to support industrial processes, monitor and control in real time a large number of processes and operations such as gas and electricity distribution (conventional and nuclear), water treatment, etc. ICSs have evolved significantly in recent years and have embraced new technologies such as IoT and have been made accessible through the Internet to allow remote access to administrators, service providers, etc. The aim of this paper is to illustrate, through the use of EBIOS risk analysis method that such infrastructures are subject to vulnerabilities, overwhelming threats and potentials risks. The paper also proposes a number of recommendations and organizational and technological security measures to reduce these risks to an acceptable level and to decrease both their impacts and potentialities.","PeriodicalId":342806,"journal":{"name":"2019 1st International Conference on Smart Systems and Data Science (ICSSD)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116116527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Big Data Dependability Opportunities & Challenges 大数据可靠性的机遇与挑战
Pub Date : 2019-10-01 DOI: 10.1109/ICSSD47982.2019.9002676
Mdarbi Fatima Ezzahra, Afifi Nadia, Hilal Imane
Big Data is a very large data set, its analysis exceeds the capabilities of traditional database management systems. Big Data is linked to the need for large computing and storage capacity.Big Data dependability is one of the major concerns of organizations. It reflects the confidence that can be placed in these data. Nowadays, companies find a major interest in Big Data, but dependability challenge remains a major obstacle.In this article, we present different works that have addressed Big Data dependability aspects. This study highlights new opportunities in this field as well as different challenges.
大数据是一个非常庞大的数据集,其分析能力超出了传统数据库管理系统的能力。大数据与对大型计算和存储容量的需求有关。大数据的可靠性是组织的主要关注点之一。它反映了对这些数据的信心。如今,企业对大数据非常感兴趣,但可靠性挑战仍然是一个主要障碍。在本文中,我们介绍了解决大数据可靠性方面的不同工作。这项研究突出了该领域的新机遇以及不同的挑战。
{"title":"Big Data Dependability Opportunities & Challenges","authors":"Mdarbi Fatima Ezzahra, Afifi Nadia, Hilal Imane","doi":"10.1109/ICSSD47982.2019.9002676","DOIUrl":"https://doi.org/10.1109/ICSSD47982.2019.9002676","url":null,"abstract":"Big Data is a very large data set, its analysis exceeds the capabilities of traditional database management systems. Big Data is linked to the need for large computing and storage capacity.Big Data dependability is one of the major concerns of organizations. It reflects the confidence that can be placed in these data. Nowadays, companies find a major interest in Big Data, but dependability challenge remains a major obstacle.In this article, we present different works that have addressed Big Data dependability aspects. This study highlights new opportunities in this field as well as different challenges.","PeriodicalId":342806,"journal":{"name":"2019 1st International Conference on Smart Systems and Data Science (ICSSD)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132734198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Towards a new educational search engine based on hybrid searching and indexing techniques 基于混合检索和索引技术的新型教育搜索引擎
Pub Date : 2019-10-01 DOI: 10.1109/ICSSD47982.2019.9002729
Kamal El Guemmat, Sara Ouahabi
Today, search engines play an important role in retrieving documents from a large database. The key success factors of these engines are their indexing and searching techniques.These engines have touched many areas to help their users to find the desired resources in a fast and accurate way. The field of teaching and research take advantage of these engines to offer them to the interested actors (students, teacher, staff, etc.) in order to find the desired learning objects.There are several prominent educational search engines in the field implementing the techniques of indexing and searching either classic, semantic, metadata. However, most engines do not mix all of these to achieve important results.The engine which will be presented in what follows benefits from the best techniques of the literature, and offers a more relevant searching.
如今,搜索引擎在从大型数据库中检索文档方面发挥着重要作用。这些引擎的关键成功因素是它们的索引和搜索技术。这些引擎已经触及了许多领域,以帮助它们的用户以快速和准确的方式找到所需的资源。教学和研究领域利用这些引擎将它们提供给感兴趣的参与者(学生、教师、员工等),以找到所需的学习对象。在这个领域有几个突出的教育搜索引擎实现了索引和搜索经典元数据、语义元数据的技术。然而,大多数引擎并没有混合所有这些来获得重要的结果。该引擎将在下文中介绍,受益于文献中的最佳技术,并提供更相关的搜索。
{"title":"Towards a new educational search engine based on hybrid searching and indexing techniques","authors":"Kamal El Guemmat, Sara Ouahabi","doi":"10.1109/ICSSD47982.2019.9002729","DOIUrl":"https://doi.org/10.1109/ICSSD47982.2019.9002729","url":null,"abstract":"Today, search engines play an important role in retrieving documents from a large database. The key success factors of these engines are their indexing and searching techniques.These engines have touched many areas to help their users to find the desired resources in a fast and accurate way. The field of teaching and research take advantage of these engines to offer them to the interested actors (students, teacher, staff, etc.) in order to find the desired learning objects.There are several prominent educational search engines in the field implementing the techniques of indexing and searching either classic, semantic, metadata. However, most engines do not mix all of these to achieve important results.The engine which will be presented in what follows benefits from the best techniques of the literature, and offers a more relevant searching.","PeriodicalId":342806,"journal":{"name":"2019 1st International Conference on Smart Systems and Data Science (ICSSD)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132027768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A survey of methods and tools used for interpreting Random Forest 用于解释随机森林的方法和工具的调查
Pub Date : 2019-10-01 DOI: 10.1109/ICSSD47982.2019.9002770
Maissae Haddouchi, A. Berrado
Interpretability of highly performant Machine Learning [ML] methods, such as Random Forest [RF], is a key tool that attracts a great interest in datamining research. In the state of the art, RF is well-known as an efficient ensemble learning (in terms of predictive accuracy, flexibility and straightforwardness). Moreover, it is recognized as an intuitive and intelligible approach regarding to its building process. However it is also regarded as a Black Box model because of its hundreds of deep decision trees. This can be crucial for several fields of study, such as healthcare, biology and security, where the lack of interpretability could be a real disadvantage. Indeed, the interpretability of the RF models is, generally, necessary in such fields of applications because of different motivations. In fact, the more the ML users grasp what is going on inside a ML system (process and resulting model), the more they can trust it and take actions based on the knowledge extracted from it. Furthermore, ML models are increasingly constrained by new laws that require regulation and interpretation of the knowledge they provide.Several papers have tackled the interpretation of RF resulting models. It had been associated with different aspects depending on the specificity of the issue studied as well as the users concerned with explanations. Therefore, this paper aims to provide a survey of tools and methods used in literature in order to uncover insights in the RF resulting models. These tools are classified depending on different aspects characterizing the interpretability. This should guide, in practice, in the choice of the most useful tools for interpretation and deep analysis of the RF model depending on the interpretability aspect sought. This should also be valuable for researchers who aim to focus their work on the interpretability of RF, or ML in general.
高性能机器学习[ML]方法的可解释性,如随机森林[RF],是吸引数据挖掘研究极大兴趣的关键工具。在目前的技术状态中,RF被认为是一种高效的集成学习(在预测准确性、灵活性和直观性方面)。此外,它被认为是一种直观和可理解的方法,关于其建设过程。然而,它也被认为是一个黑盒模型,因为它有数百个深度决策树。这对于医疗保健、生物学和安全等几个研究领域至关重要,在这些领域,缺乏可解释性可能是一个真正的劣势。实际上,由于动机不同,RF模型的可解释性通常在这些应用领域是必要的。事实上,机器学习用户对机器学习系统(过程和结果模型)内部发生的事情了解得越多,他们就越能信任它,并根据从中提取的知识采取行动。此外,ML模型越来越多地受到新法律的约束,这些法律要求对它们提供的知识进行监管和解释。有几篇论文讨论了射频结果模型的解释。根据所研究问题的特殊性以及与解释有关的用户,它与不同方面有关。因此,本文旨在提供文献中使用的工具和方法的调查,以揭示RF结果模型中的见解。这些工具根据描述可解释性的不同方面进行分类。在实践中,这应该指导根据所寻求的可解释性方面选择最有用的工具来解释和深入分析RF模型。对于那些致力于研究RF或ML的可解释性的研究人员来说,这也应该是有价值的。
{"title":"A survey of methods and tools used for interpreting Random Forest","authors":"Maissae Haddouchi, A. Berrado","doi":"10.1109/ICSSD47982.2019.9002770","DOIUrl":"https://doi.org/10.1109/ICSSD47982.2019.9002770","url":null,"abstract":"Interpretability of highly performant Machine Learning [ML] methods, such as Random Forest [RF], is a key tool that attracts a great interest in datamining research. In the state of the art, RF is well-known as an efficient ensemble learning (in terms of predictive accuracy, flexibility and straightforwardness). Moreover, it is recognized as an intuitive and intelligible approach regarding to its building process. However it is also regarded as a Black Box model because of its hundreds of deep decision trees. This can be crucial for several fields of study, such as healthcare, biology and security, where the lack of interpretability could be a real disadvantage. Indeed, the interpretability of the RF models is, generally, necessary in such fields of applications because of different motivations. In fact, the more the ML users grasp what is going on inside a ML system (process and resulting model), the more they can trust it and take actions based on the knowledge extracted from it. Furthermore, ML models are increasingly constrained by new laws that require regulation and interpretation of the knowledge they provide.Several papers have tackled the interpretation of RF resulting models. It had been associated with different aspects depending on the specificity of the issue studied as well as the users concerned with explanations. Therefore, this paper aims to provide a survey of tools and methods used in literature in order to uncover insights in the RF resulting models. These tools are classified depending on different aspects characterizing the interpretability. This should guide, in practice, in the choice of the most useful tools for interpretation and deep analysis of the RF model depending on the interpretability aspect sought. This should also be valuable for researchers who aim to focus their work on the interpretability of RF, or ML in general.","PeriodicalId":342806,"journal":{"name":"2019 1st International Conference on Smart Systems and Data Science (ICSSD)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133247705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Artificial Neural Networks Optimized with Unsupervised Clustering for IDS Classification 基于无监督聚类优化的人工神经网络IDS分类
Pub Date : 2019-10-01 DOI: 10.1109/ICSSD47982.2019.9002827
I. Lafram, N. Berbiche, Jamila El Alami
Information systems are becoming more and more complex and closely linked. These systems are encountering an enormous amount of nefarious traffic while ensuring real - time connectivity. Therefore, a defense method needs to be in place. One of the commonly used tools for network security is intrusion detection systems (IDS). An IDS tries to identify fraudulent activity using predetermined signatures or pre-established user misbehavior while monitoring incoming traffic. Intrusion detection systems based on signature and behavior cannot detect new attacks and fall when small behavior deviations occur. Many researchers have proposed various approaches to intrusion detection using machine learning techniques as a new and promising tool to remedy this problem. In this paper, the authors present a combination of two machine learning methods, unsupervised clustering followed by a supervised classification framework as a Fast, highly scalable and precise packets classification system. This model’s performance is assessed on the new proposed dataset by the Canadian Institute for Cyber security and the University of New Brunswick (CICIDS2017). The overall process was fast, showing high accuracy classification results.
信息系统变得越来越复杂,联系越来越紧密。在确保实时连接的同时,这些系统正在遭遇大量的恶意流量。因此,需要有一种防御方法。入侵检测系统(IDS)是网络安全中常用的工具之一。IDS试图在监视传入流量时使用预定签名或预先建立的用户错误行为来识别欺诈活动。基于特征和行为的入侵检测系统无法检测到新的攻击,当行为发生微小偏差时,系统就会崩溃。许多研究人员提出了各种入侵检测方法,使用机器学习技术作为一种新的有前途的工具来解决这个问题。在本文中,作者提出了两种机器学习方法的组合,即无监督聚类和监督分类框架,作为一种快速、高度可扩展和精确的数据包分类系统。该模型的性能由加拿大网络安全研究所和新不伦瑞克大学(CICIDS2017)在新提出的数据集上进行评估。整个过程速度快,分类结果准确率高。
{"title":"Artificial Neural Networks Optimized with Unsupervised Clustering for IDS Classification","authors":"I. Lafram, N. Berbiche, Jamila El Alami","doi":"10.1109/ICSSD47982.2019.9002827","DOIUrl":"https://doi.org/10.1109/ICSSD47982.2019.9002827","url":null,"abstract":"Information systems are becoming more and more complex and closely linked. These systems are encountering an enormous amount of nefarious traffic while ensuring real - time connectivity. Therefore, a defense method needs to be in place. One of the commonly used tools for network security is intrusion detection systems (IDS). An IDS tries to identify fraudulent activity using predetermined signatures or pre-established user misbehavior while monitoring incoming traffic. Intrusion detection systems based on signature and behavior cannot detect new attacks and fall when small behavior deviations occur. Many researchers have proposed various approaches to intrusion detection using machine learning techniques as a new and promising tool to remedy this problem. In this paper, the authors present a combination of two machine learning methods, unsupervised clustering followed by a supervised classification framework as a Fast, highly scalable and precise packets classification system. This model’s performance is assessed on the new proposed dataset by the Canadian Institute for Cyber security and the University of New Brunswick (CICIDS2017). The overall process was fast, showing high accuracy classification results.","PeriodicalId":342806,"journal":{"name":"2019 1st International Conference on Smart Systems and Data Science (ICSSD)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116441158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ICSSD 2019 Committees ICSSD 2019委员会
Pub Date : 2019-10-01 DOI: 10.1109/icssd47982.2019.9002832
{"title":"ICSSD 2019 Committees","authors":"","doi":"10.1109/icssd47982.2019.9002832","DOIUrl":"https://doi.org/10.1109/icssd47982.2019.9002832","url":null,"abstract":"","PeriodicalId":342806,"journal":{"name":"2019 1st International Conference on Smart Systems and Data Science (ICSSD)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121838923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2019 1st International Conference on Smart Systems and Data Science (ICSSD)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1