Pub Date: 2018-12-01 | DOI: 10.1109/ICOSST.2018.8632179
Tahira Mahboob, A. Ijaz, Amber Shahzad, Muqadas Kalsoom
Handling missing values in large datasets has become a difficult task for researchers and industry practitioners. In the field of medicine in particular, datasets contain missing values due to human error or the non-availability of data. If these datasets are to be utilized for inference or predictive studies, the results are not reliable. Discarding such instances is an option, but it affects overall accuracy, so it is preferable to apply a replacement or imputation technique. Imputation techniques estimate the missing values in a dataset by applying various algorithms. In this paper, we therefore present a framework that assists in imputing missing values in a large Chronic Kidney Disease (CKD) dataset. We use three machine learning algorithms, i.e., K-Nearest Neighbors (KNN), K-Means, and K-Medoids clustering, to impute the missing values. The performance of the proposed technique is evaluated by applying the Decision Tree and Random Forest algorithms. Experimental results demonstrate that the KNN algorithm provides the most accurate results compared with the K-Means and K-Medoids clustering algorithms: KNN achieves an accuracy of 86.67% with the Decision Tree algorithm and 75.25% with the Random Forest algorithm. Additionally, it yields lower relative, absolute, and root mean squared errors. Consequently, the KNN-imputed datasets are used in our research for future predictions.
Title: Handling Missing Values in Chronic Kidney Disease Datasets Using KNN, K-Means and K-Medoids Algorithms
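The paper does not reproduce its imputation code, but the KNN-based scheme it describes can be sketched as follows: for each incomplete record, find the k complete records closest in Euclidean distance over the observed features, then fill each gap with the mean of the neighbours' values. This is a minimal, hypothetical illustration (the data and function name are invented, and the authors' actual distance metric and k may differ):

```python
import math

def knn_impute(rows, k=2):
    """Fill None entries: for each incomplete row, find the k complete rows
    closest in Euclidean distance over the observed features, then replace
    each missing feature with the mean of the neighbours' values there."""
    complete = [r for r in rows if None not in r]
    imputed = []
    for row in rows:
        if None not in row:
            imputed.append(list(row))
            continue
        obs = [i for i, v in enumerate(row) if v is not None]
        dist = lambda c: math.sqrt(sum((row[i] - c[i]) ** 2 for i in obs))
        neighbours = sorted(complete, key=dist)[:k]
        imputed.append([
            v if v is not None
            else sum(n[i] for n in neighbours) / len(neighbours)
            for i, v in enumerate(row)
        ])
    return imputed

data = [[1.0, 2.0], [1.2, 2.2], [10.0, 20.0], [1.1, None]]
print(knn_impute(data, k=2)[3])  # gap filled from the two nearest complete rows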
Pub Date: 2018-12-01 | DOI: 10.1109/ICOSST.2018.8632190
Natalia Chaudhry, M. Yousaf
Blockchain is a distributed ledger that has gained widespread attention in many areas, and many industries have started to implement blockchain solutions for their applications and services. It is important to know the key components, functional characteristics, and architecture of blockchain to understand its impact on, and applicability to, various applications. The best-known use case of blockchain is Bitcoin, a cryptocurrency. Because a blockchain is a distributed ledger, a consensus mechanism is needed among the peer nodes of the network to ensure its proper working. Many consensus algorithms have been proposed in the literature, each with its own performance and security characteristics; no single consensus algorithm can serve the requirements of every application. It is therefore vital to compare the available consensus algorithms technically to highlight their strengths, weaknesses, and use cases. We identify and discuss parameters related to the performance and security of consensus in blockchain, and analyze and compare the consensus algorithms with respect to these parameters. We also present the research gap regarding the design of an efficient consensus algorithm and the evaluation of existing algorithms. This paper will act as a guide for developers and researchers to evaluate and design a consensus algorithm.
Title: Consensus Algorithms in Blockchain: Comparative Analysis, Challenges and Opportunities
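The parameter-driven comparison the paper describes can be sketched as a small lookup table plus a filter. The table below is a hypothetical illustration, not the paper's actual comparison matrix; it uses only widely known properties of three consensus families (e.g., PBFT tolerates f < n/3 Byzantine nodes and gives immediate finality, while PoW finality is probabilistic):

```python
# Hypothetical parameter table for three well-known consensus families.
ALGORITHMS = {
    "PoW":  {"fault_model": "< 50% of hash power adversarial",
             "finality": "probabilistic", "permissioned": False},
    "PoS":  {"fault_model": "< majority of stake adversarial",
             "finality": "probabilistic", "permissioned": False},
    "PBFT": {"fault_model": "f < n/3 Byzantine nodes",
             "finality": "immediate", "permissioned": True},
}

def shortlist(permissioned, need_immediate_finality):
    """Filter candidate algorithms by two deployment constraints."""
    return [name for name, p in ALGORITHMS.items()
            if p["permissioned"] == permissioned
            and (p["finality"] == "immediate" or not need_immediate_finality)]

print(shortlist(permissioned=True, need_immediate_finality=True))    # ['PBFT']
print(shortlist(permissioned=False, need_immediate_finality=False))  # ['PoW', 'PoS']
```

A real evaluation would add the throughput, latency, and scalability parameters the paper compares; the point of the sketch is that no single row satisfies every column, which is why one algorithm cannot serve every application.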
Pub Date: 2018-12-01 | DOI: 10.1109/ICOSST.2018.8632205
Asad Ali, C. Gravino
A Systematic Literature Review (SLR) allows us to combine and analyze data from multiple (published and unpublished) studies. Although it provides comprehensive empirical evidence for an area of interest, the results we usually get from the data-synthesis phase of an SLR comprise huge tables and graphs, so extracting the required results is a tedious and time-consuming job for users. In this work, we propose to semi-automate some of the steps for fetching information from an SLR, beyond the traditional tables, graphs, and plots. The automation is performed using Semantic Web technologies such as ontologies, the Jena API, and SPARQL queries. The Semantic Web, also called Web 3.0, provides a common framework that allows data to be shared and reused across applications and enterprises; it can be used to integrate, extract, and infer the most relevant data required by users, which is hidden behind the huge amount of information on the Web. We also provide an easy-to-use interface that allows users to perform different searches and find the SLR results they need easily and quickly. Finally, we present the results of a preliminary user study that analyzes how much time users need to extract their required information, both via the SLR tables and via our proposal. The results reveal that with our system users obtain the required information in less time than with the manual approach.
Title: An Ontology-Based Approach to Semi-Automate Systematic Literature Reviews
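The core idea of the pipeline (SLR results stored as subject-predicate-object triples and retrieved by pattern matching) can be illustrated without Jena. The sketch below is hypothetical: the triples and predicate names are invented, and the hand-rolled matcher only mimics what a single SPARQL basic graph pattern does in Jena or rdflib:

```python
# Toy triple store: SLR data as (subject, predicate, object) triples,
# in the spirit of the ontology + SPARQL pipeline the paper describes.
TRIPLES = [
    ("study:S1", "usesMetric", "MMRE"),
    ("study:S1", "estimationMethod", "CaseBasedReasoning"),
    ("study:S2", "usesMetric", "Pred(25)"),
    ("study:S2", "estimationMethod", "Regression"),
]

def match(pattern):
    """Match one (s, p, o) pattern; None plays the role of a SPARQL variable."""
    return [t for t in TRIPLES
            if all(q is None or q == v for q, v in zip(pattern, t))]

# Roughly equivalent to: SELECT ?s WHERE { ?s :usesMetric "MMRE" }
studies = [s for s, _, _ in match((None, "usesMetric", "MMRE"))]
print(studies)  # ['study:S1']
```

In the actual system, queries like this run over an OWL ontology via the Jena API, so a user asking "which studies use metric MMRE" gets a direct answer instead of scanning the synthesis tables by hand.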
Pub Date: 2018-11-16 | DOI: 10.1109/ICOSST.2018.8632183
A. Qayyum, Z. Gilani, S. Latif, Junaid Qadir, J. Singh
Media outlets and political campaigners recognise social media as a means for widely disseminating news and opinions. In particular, Twitter is used by political groups all over the world to spread political messages, engage their supporters, drive election campaigns, and challenge their critics. Further, news agencies, many of which aim to give an impression of balance, are often of a particular political persuasion which is reflected in the content they produce. Driven by the potential for political and media organisations to influence public opinion, our aim is to quantify the nature of political discourse by these organisations through their use of social media. In this study, we analyse the sentiments, toxicity, and bias exhibited by the most prominent Pakistani and Indian political parties and media houses, and the pattern by which these political parties utilise Twitter. We found that media bias and toxicity exist in the political discourse of these two developing nations.
Title: Exploring Media Bias and Toxicity in South Asian Political Discourse
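One common building block for tweet-level sentiment analysis of the kind this study performs is a lexicon-based polarity score. The sketch below is purely illustrative: the word lists are invented, and the authors' actual tooling is not specified in the abstract:

```python
# Minimal lexicon-based polarity scorer (hypothetical word lists).
POSITIVE = {"progress", "support", "welcome", "win"}
NEGATIVE = {"corrupt", "failure", "attack", "shame"}

def polarity(tweet):
    """Score in [-1, 1]: (positive - negative) over matched sentiment words;
    0.0 when no sentiment-bearing word is found."""
    words = tweet.lower().split()
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    return 0.0 if pos + neg == 0 else (pos - neg) / (pos + neg)

print(polarity("we welcome this progress"))       # 1.0
print(polarity("a corrupt failure and a shame"))  # -1.0
```

Aggregating such scores per account makes it possible to compare sentiment and toxicity patterns across parties and media houses, as the study does at scale.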