首页 > 最新文献

2019 International Conference on Computational Science and Computational Intelligence (CSCI)最新文献

英文 中文
Functional Annotations of Novel Cancer-Associated lncRNAs Identified Using Machine Learning Algorithms 使用机器学习算法鉴定的新型癌症相关lncrna的功能注释
Luis Diego Mora-Jimenez, Oscar Azofeifa-Segura, J. Guevara-Coto
Cancer consists of a set of diseases that result from deregulated cell growth and invasion of adjacent tissues. Due to an increase in research, more information has become available regarding the potential causes for cancer, including non-coding elements such as lncRNAs. This new knowledge can be discovered through machine learning methods that can extract new information from data such as gene expression profiles and identify new cancer-associated genes. For this work we use two different machine learning algorithms, random forests and support vector machines. The models were trained and we tested fine-tuning methods including: balancing and feature selection. The predictors with the highest metrics were: balanced RF with Boruta (AUC-ROC: 0.9696) and the balanced SVM with recursive feature elimination (AUC-ROC: 0.9710). These models were used to identify new potential lncRNA driver-like genes from protein coding expression data. The predicted candidates were then functionally annotated using disease ontologies and molecular function ontologies to determine their enrichment in cancer related processes. These processes included prostate cancer and glycosaminglycan binding, a potential tumor therapeutic target.
癌症由一系列疾病组成,这些疾病是由于细胞生长失控和邻近组织的侵入而引起的。由于研究的增加,关于癌症的潜在原因的信息越来越多,包括lncrna等非编码元件。这种新知识可以通过机器学习方法发现,机器学习方法可以从基因表达谱等数据中提取新信息,并识别新的癌症相关基因。在这项工作中,我们使用了两种不同的机器学习算法,随机森林和支持向量机。我们对模型进行了训练,并测试了包括平衡和特征选择在内的微调方法。预测指标最高的分别是:Boruta的平衡SVM (AUC-ROC: 0.9696)和递归特征消除的平衡SVM (AUC-ROC: 0.9710)。这些模型用于从蛋白质编码表达数据中鉴定新的潜在的lncRNA驱动样基因。然后使用疾病本体论和分子功能本体论对预测的候选物进行功能注释,以确定它们在癌症相关过程中的富集程度。这些过程包括前列腺癌和糖saminglycan结合,一个潜在的肿瘤治疗靶点。
{"title":"Functional Annotations of Novel Cancer-Associated lncRNAs Identified Using Machine Learning Algorithms","authors":"Luis Diego Mora-Jimenez, Oscar Azofeifa-Segura, J. Guevara-Coto","doi":"10.1109/CSCI49370.2019.00274","DOIUrl":"https://doi.org/10.1109/CSCI49370.2019.00274","url":null,"abstract":"Cancer consists of a set of diseases that result from deregulated cell growth and invasion of adjacent tissues. Due to an increase in research, more information has become available regarding the potential causes for cancer, including non-coding elements such as lncRNAs. This new knowledge can be discovered through machine learning methods that can extract new information from data such as gene expression profiles and identify new cancer-associated genes. For this work we use two different machine learning algorithms, random forests and support vector machines. The models were trained and we tested fine-tuning methods including: balancing and feature selection. The predictors with the highest metrics were: balanced RF with Boruta (AUC-ROC: 0.9696) and the balanced SVM with recursive feature elimination (AUC-ROC: 0.9710). These models were used to identify new potential lncRNA driver-like genes from protein coding expression data. The predicted candidates were then functionally annotated using disease ontologies and molecular function ontologies to determine their enrichment in cancer related processes. These processes included prostate cancer and glycosaminglycan binding, a potential tumor therapeutic target.","PeriodicalId":103662,"journal":{"name":"2019 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122418280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Attention-Based Surgical Phase Boundaries Detection in Laparoscopic Videos 基于注意力的腹腔镜手术相边界检测
Babak Namazi, G. Sankaranarayanan, V. Devarajan
A new deep learning-based method is proposed for identifying the boundaries of all surgical phases in a laparoscopic video. The model is designed based on the sequence-to-sequence architecture with an attention mechanism, to map the extracted visual features to the frame numbers of the beginning and the ending of each phase. The main novelty is that the alignment vectors for each phase are taken as the outputs, and are trained directly to select the indices. We evaluated our model using a large publicly available dataset of laparoscopic cholecystectomy procedure and obtained the Mean Absolute Error (MAE) of 48 seconds.
提出了一种新的基于深度学习的方法来识别腹腔镜视频中所有手术阶段的边界。该模型基于序列到序列的结构,采用注意机制,将提取的视觉特征映射到每个阶段开始和结束的帧数上。主要的新颖之处在于将每个阶段的对齐向量作为输出,并直接训练以选择指标。我们使用大型公开可用的腹腔镜胆囊切除术数据集评估我们的模型,并获得48秒的平均绝对误差(MAE)。
{"title":"Attention-Based Surgical Phase Boundaries Detection in Laparoscopic Videos","authors":"Babak Namazi, G. Sankaranarayanan, V. Devarajan","doi":"10.1109/CSCI49370.2019.00109","DOIUrl":"https://doi.org/10.1109/CSCI49370.2019.00109","url":null,"abstract":"A new deep learning-based method is proposed for identifying the boundaries of all surgical phases in a laparoscopic video. The model is designed based on the sequence-to-sequence architecture with an attention mechanism, to map the extracted visual features to the frame numbers of the beginning and the ending of each phase. The main novelty is that the alignment vectors for each phase are taken as the outputs, and are trained directly to select the indices. We evaluated our model using a large publicly available dataset of laparoscopic cholecystectomy procedure and obtained the Mean Absolute Error (MAE) of 48 seconds.","PeriodicalId":103662,"journal":{"name":"2019 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131394057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Unsupervised Multi-Label Document Classification for Large Taxonomies Using Word Embeddings 使用词嵌入的大型分类法无监督多标签文档分类
Stefan Hirschmeier, J. Melsbach, D. Schoder, Sven Stahlmann
More and more businesses are in need for metadata for their documents. However, automatic generation for metadata is not easy, as for supervised document classification, a significant amount of labelled training data is needed, which is not always present in the desired amount or quality. Often, documents need to be tagged with a predefined set of company specific keywords that are organized in a taxonomy. We present an unsupervised approach to perform multi-label document classification for large taxonomies using word embeddings and evaluate it with a dataset of a public broadcaster. We point out strengths of the approach compared to supervised classification and statistical approaches like tf-idf.
越来越多的企业需要其文档的元数据。然而,元数据的自动生成并不容易,因为对于监督文档分类,需要大量标记的训练数据,这些数据并不总是以期望的数量或质量存在。通常,文档需要使用一组预定义的公司特定关键字进行标记,这些关键字按照分类法组织。我们提出了一种无监督的方法,使用词嵌入对大型分类法进行多标签文档分类,并使用公共广播公司的数据集对其进行评估。我们指出了该方法与监督分类和统计方法(如tf-idf)相比的优势。
{"title":"Unsupervised Multi-Label Document Classification for Large Taxonomies Using Word Embeddings","authors":"Stefan Hirschmeier, J. Melsbach, D. Schoder, Sven Stahlmann","doi":"10.1109/CSCI49370.2019.00241","DOIUrl":"https://doi.org/10.1109/CSCI49370.2019.00241","url":null,"abstract":"More and more businesses are in need for metadata for their documents. However, automatic generation for metadata is not easy, as for supervised document classification, a significant amount of labelled training data is needed, which is not always present in the desired amount or quality. Often, documents need to be tagged with a predefined set of company specific keywords that are organized in a taxonomy. We present an unsupervised approach to perform multi-label document classification for large taxonomies using word embeddings and evaluate it with a dataset of a public broadcaster. We point out strengths of the approach compared to supervised classification and statistical approaches like tf-idf.","PeriodicalId":103662,"journal":{"name":"2019 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121854339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The Five Levels of Data Destruction: A Paradigm for Introducing Data Recovery in a Computer Science Course 数据破坏的五个层次:在计算机科学课程中介绍数据恢复的范例
Gary Cantrell, Joan Runs Through
Digital forensics has become a fundamental piece of many cyber security programs across the US, and data recovery is an integral building block of digital forensics. Data recovery can be a difficult topic to cover without a system or organization to the different methods of recovery. The following manuscript offers a structure for introducing data recovery in a digital forensics or information technology course and a method for evaluating the admissibility of recovered files as court evidence based on how the data were recovered. This offers both a framework for teaching data recovery and a way for discussing evidence admissibility. The five levels of destruction paradigm is a result of over a decade of teaching digital forensics in vocational and academic environments in a computer science program. The authors offer up this paradigm in hopes it will be useful to other computer science and digital forensics educators.
数字取证已经成为美国许多网络安全项目的基本组成部分,而数据恢复是数字取证不可或缺的组成部分。如果没有系统或组织提供不同的恢复方法,数据恢复可能是一个难以涵盖的主题。以下手稿提供了在数字取证或信息技术课程中引入数据恢复的结构,以及基于数据恢复方式评估恢复文件作为法庭证据的可采性的方法。这既提供了一个教学数据恢复的框架,也提供了一种讨论证据可采性的方法。销毁范式的五个层次是十多年来在计算机科学项目的职业和学术环境中教授数字取证的结果。作者提供了这个范例,希望它能对其他计算机科学和数字取证教育者有用。
{"title":"The Five Levels of Data Destruction: A Paradigm for Introducing Data Recovery in a Computer Science Course","authors":"Gary Cantrell, Joan Runs Through","doi":"10.1109/CSCI49370.2019.00029","DOIUrl":"https://doi.org/10.1109/CSCI49370.2019.00029","url":null,"abstract":"Digital forensics has become a fundamental piece of many cyber security programs across the US, and data recovery is an integral building block of digital forensics. Data recovery can be a difficult topic to cover without a system or organization to the different methods of recovery. The following manuscript offers a structure for introducing data recovery in a digital forensics or information technology course and a method for evaluating the admissibility of recovered files as court evidence based on how the data were recovered. This offers both a framework for teaching data recovery and a way for discussing evidence admissibility. The five levels of destruction paradigm is a result of over a decade of teaching digital forensics in vocational and academic environments in a computer science program. The authors offer up this paradigm in hopes it will be useful to other computer science and digital forensics educators.","PeriodicalId":103662,"journal":{"name":"2019 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"79 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132531931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Radial Basis Function Network: Its Robustness and Ability to Mitigate Adversarial Examples 径向基函数网络:鲁棒性和抗敌对实例能力
Jules Chenou, G. Hsieh, Tonya Fields
this work is a continuation of an ongoing effort to increase the robustness of the deep neural network, and thus mitigate possible adversarial examples. In our previous work, the emphasis was placed on denoising the input dataset by adding colored noise before processing. In that work, the evaluation made with the empirical robustness score, resulted in a 1% improvement on average for individual noise and a 3.74% improvement on average for ensemble noise. The aim of this paper is to demonstrate the effective robustness of a well-designed radial basis function neural network in tackling adversarial examples. With the empirical robustness as a metric, the results show a 72.5% increase with Fast Gradient Sign Method (FGSM) attack on the MNIST dataset in comparison to a simple deep network and a 6.4 % increase with FGSM on the CIFAR10 dataset.
这项工作是正在进行的努力的延续,以增加深度神经网络的鲁棒性,从而减少可能的对抗性示例。在我们之前的工作中,重点放在通过在处理前添加彩色噪声来对输入数据集进行去噪。在这项工作中,使用经验稳健性评分进行的评估导致单个噪声平均改善1%,整体噪声平均改善3.74%。本文的目的是证明一个设计良好的径向基函数神经网络在处理对抗性实例时的有效鲁棒性。以经验鲁棒性为衡量标准,结果显示,与简单的深度网络相比,快速梯度符号方法(FGSM)攻击在MNIST数据集上的效率提高了72.5%,在CIFAR10数据集上使用FGSM攻击的效率提高了6.4%。
{"title":"Radial Basis Function Network: Its Robustness and Ability to Mitigate Adversarial Examples","authors":"Jules Chenou, G. Hsieh, Tonya Fields","doi":"10.1109/CSCI49370.2019.00024","DOIUrl":"https://doi.org/10.1109/CSCI49370.2019.00024","url":null,"abstract":"this work is a continuation of an ongoing effort to increase the robustness of the deep neural network, and thus mitigate possible adversarial examples. In our previous work, the emphasis was placed on denoising the input dataset by adding colored noise before processing. In that work, the evaluation made with the empirical robustness score, resulted in a 1% improvement on average for individual noise and a 3.74% improvement on average for ensemble noise. The aim of this paper is to demonstrate the effective robustness of a well-designed radial basis function neural network in tackling adversarial examples. With the empirical robustness as a metric, the results show a 72.5% increase with Fast Gradient Sign Method (FGSM) attack on the MNIST dataset in comparison to a simple deep network and a 6.4 % increase with FGSM on the CIFAR10 dataset.","PeriodicalId":103662,"journal":{"name":"2019 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132818978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Analysis of Recent Trends in Automatic Object Identification 自动目标识别的最新发展趋势分析
Xavier Williams, N. Mahapatra
Automatic object identification (auto-ID) involves techniques for automatically identifying objects using visual features or tags with unique identification codes. These auto-ID systems then transfer the collected identification information to computer systems for further data management. In this paper, we analyze the existing auto-ID techniques for physically tagged objects.
自动对象识别(auto-ID)涉及使用具有唯一识别码的视觉特征或标签自动识别对象的技术。然后,这些自动识别系统将收集到的身份信息传输到计算机系统,以便进行进一步的数据管理。在本文中,我们分析了现有的自动识别技术的物理标记的对象。
{"title":"Analysis of Recent Trends in Automatic Object Identification","authors":"Xavier Williams, N. Mahapatra","doi":"10.1109/CSCI49370.2019.00083","DOIUrl":"https://doi.org/10.1109/CSCI49370.2019.00083","url":null,"abstract":"Automatic object identification (auto-ID) involves techniques for automatically identifying objects using visual features or tags with unique identification codes. These auto-ID systems then transfer the collected identification information to computer systems for further data management. In this paper, we analyze the existing auto-ID techniques for physically tagged objects.","PeriodicalId":103662,"journal":{"name":"2019 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"156 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131818469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Case Study of Technology Use and Information Flow at a Danish E-Clinic 丹麦一家电子诊所的技术使用和信息流案例研究
B. Smaradottir, R. Fensli
There is an urgent call from health organizations, health professionals and health authorities to re-design care delivery for patients with chronic conditions and multi-morbidities. The research project 3P-Patients and Professionals in Productive Teams aims to study health care services that are run with different patient-centered teamwork models. In this context, a case study was made of an E-clinic in Denmark, with a focus on the technology use and information flow in a patient-centered clinical care context. Qualitative methods were applied with observations and interviews with key informants. The results showed that information flow worked well in a patient-centered care perspective, even though the technology was a standalone system for the E-clinic and with limited interoperability with other health care providers.
卫生组织、卫生专业人员和卫生当局紧急呼吁为慢性病和多种疾病患者重新设计保健服务。研究项目3p -生产团队中的患者和专业人员旨在研究以不同的以患者为中心的团队模式运行的医疗保健服务。在此背景下,对丹麦的一家电子诊所进行了案例研究,重点关注以患者为中心的临床护理环境中的技术使用和信息流。定性方法应用于观察和访谈关键举报人。结果表明,信息流在以患者为中心的护理角度下运行良好,尽管该技术是电子诊所的独立系统,并且与其他医疗保健提供者的互操作性有限。
{"title":"A Case Study of Technology Use and Information Flow at a Danish E-Clinic","authors":"B. Smaradottir, R. Fensli","doi":"10.1109/CSCI49370.2019.00182","DOIUrl":"https://doi.org/10.1109/CSCI49370.2019.00182","url":null,"abstract":"There is an urgent call from health organizations, health professionals and health authorities to re-design care delivery for patients with chronic conditions and multi-morbidities. The research project 3P-Patients and Professionals in Productive Teams aims to study health care services that are run with different patient-centered teamwork models. In this context, a case study was made of an E-clinic in Denmark, with a focus on the technology use and information flow in a patient-centered clinical care context. Qualitative methods were applied with observations and interviews with key informants. The results showed that information flow worked well in a patient-centered care perspective, even though the technology was a standalone system for the E-clinic and with limited interoperability with other health care providers.","PeriodicalId":103662,"journal":{"name":"2019 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134506576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Uncovering Los Angeles Tourists' Patterns Using Geospatial Analysis and Supervised Machine Learning with Random Forest Predictors 利用地理空间分析和随机森林预测器的监督机器学习揭示洛杉矶游客的模式
Yuan-Yuan Lee, Y. Chang
Consumer behavior analytics is at the epicenter of a Big Data revolution. In this paper we propose to analyze intra-regional spatial patterns mining tourists' behaviors and characteristics based on traveling group size with data collected from Airbnb open source focused on Los Angeles neighborhood in 2016. Random Forest Classification (RF) technique, an ensemble approach, is applied to identify the key drivers according to relevant traveler groups and presented patterns using Hotspot Analysis on Geographic Information System (GIS). Our empirical result highlights driving factors within Airbnb listings, providing valuable insights to better plan, monitor and manage tourism activity.
消费者行为分析是大数据革命的中心。本文以2016年洛杉矶社区为研究对象,利用Airbnb开源数据,分析基于旅游群体规模挖掘游客行为和特征的区域内空间格局。基于地理信息系统(GIS)的热点分析,将随机森林分类技术(RF)应用于关键驱动因素的识别。我们的实证结果突出了Airbnb房源中的驱动因素,为更好地规划、监控和管理旅游活动提供了有价值的见解。
{"title":"Uncovering Los Angeles Tourists' Patterns Using Geospatial Analysis and Supervised Machine Learning with Random Forest Predictors","authors":"Yuan-Yuan Lee, Y. Chang","doi":"10.1109/CSCI49370.2019.00239","DOIUrl":"https://doi.org/10.1109/CSCI49370.2019.00239","url":null,"abstract":"Consumer behavior analytics is at the epicenter of a Big Data revolution. In this paper we propose to analyze intra-regional spatial patterns mining tourists' behaviors and characteristics based on traveling group size with data collected from Airbnb open source focused on Los Angeles neighborhood in 2016. Random Forest Classification (RF) technique, an ensemble approach, is applied to identify the key drivers according to relevant traveler groups and presented patterns using Hotspot Analysis on Geographic Information System (GIS). Our empirical result highlights driving factors within Airbnb listings, providing valuable insights to better plan, monitor and manage tourism activity.","PeriodicalId":103662,"journal":{"name":"2019 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"18 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114031365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Incorporating Association Patterns into Manifold Clustering for Enabling Predictive Analytics 将关联模式整合到流形聚类中以实现预测分析
B. Sy, Jin Chen, Rebecca Horowitz
The goal of this research is to develop a predictive analytics technique based on manifold clustering of mixed data type. In this research, we explore the concept of statistically significant association patterns to induce an initial partition on data for deriving manifolds. Manifolds are hyperplanes embedded in low dimensions. The advantage of this novel technique is a bootstrap on data clusters that reveals statistical associations from the information-theoretic perspective. As an illustration, the proposed technique is applied to a real data set of diabetes patients. An assessment on the proposed technique is performed to investigate the effect of bootstrap based on association patterns. Results of the preliminary study demonstrate the feasibility of applying the proposed technique to real-world data.
本研究的目标是开发一种基于混合数据类型的流形聚类的预测分析技术。在本研究中,我们探索了统计显著关联模式的概念,以诱导数据上的初始划分来推导流形。流形是嵌入在低维中的超平面。这种新技术的优点是对数据簇进行了自举,从信息论的角度揭示了统计关联。作为一个例子,所提出的技术应用于糖尿病患者的真实数据集。对所提出的技术进行了评估,以研究基于关联模式的自举的效果。初步研究的结果证明了将所提出的技术应用于实际数据的可行性。
{"title":"Incorporating Association Patterns into Manifold Clustering for Enabling Predictive Analytics","authors":"B. Sy, Jin Chen, Rebecca Horowitz","doi":"10.1109/CSCI49370.2019.00243","DOIUrl":"https://doi.org/10.1109/CSCI49370.2019.00243","url":null,"abstract":"The goal of this research is to develop a predictive analytics technique based on manifold clustering of mixed data type. In this research, we explore the concept of statistically significant association patterns to induce an initial partition on data for deriving manifolds. Manifolds are hyperplanes embedded in low dimensions. The advantage of this novel technique is a bootstrap on data clusters that reveals statistical associations from the information-theoretic perspective. As an illustration, the proposed technique is applied to a real data set of diabetes patients. An assessment on the proposed technique is performed to investigate the effect of bootstrap based on association patterns. Results of the preliminary study demonstrate the feasibility of applying the proposed technique to real-world data.","PeriodicalId":103662,"journal":{"name":"2019 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115025086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Load Balancing in Cloud Computing Using Genetic Algorithm and Fuzzy Logic 基于遗传算法和模糊逻辑的云计算负载均衡
Ali Saadat, E. Masehian
Cloud computing systems play a vital role in the digital age. A critical bottleneck in most scenarios in cloud computing is the high degree of unpredictability with respect to resource availability and network bandwidth, which may lead to low Quality of Service (like low response times), which can be improved by Load Balancing. Load balancing concerns with efficiently distributing incoming network traffic across a group of servers. This ensures no single server bears too much demand, and thus the availability of applications and websites for users is increased. Due to the huge state-space of such a problem, implementing task scheduling algorithms in load balancing can be very effective. In this paper, we propose a hybrid intelligent approach to load balancing: a Genetic Algorithm module arranges the jobs randomly, and a fuzzy logic module builds the objective function for determining busy states of servers according to their RAM and CPU task queues. The fuzzy input variables include the satisfaction degree and the start and end times of the service, and the fuzzy output is service availability. Computational experiments showed that the best solution was obtained within half of the planned execution time, which leads to higher user satisfaction degree.
云计算系统在数字时代扮演着至关重要的角色。云计算中大多数场景中的一个关键瓶颈是资源可用性和网络带宽方面的高度不可预测性,这可能导致低服务质量(如低响应时间),这可以通过负载平衡来改善。负载平衡关注的是在一组服务器之间有效地分配传入的网络流量。这确保了没有单个服务器承担过多的需求,从而增加了用户的应用程序和网站的可用性。由于这类问题的状态空间非常大,因此在负载均衡中实现任务调度算法是非常有效的。在本文中,我们提出了一种混合智能的负载均衡方法:遗传算法模块随机安排任务,模糊逻辑模块根据服务器的RAM和CPU任务队列构建目标函数来确定服务器的繁忙状态。模糊输入变量为服务满意度、服务开始时间和服务结束时间,模糊输出变量为服务可用性。计算实验表明,在计划执行时间的一半内获得了最优解,用户满意度较高。
{"title":"Load Balancing in Cloud Computing Using Genetic Algorithm and Fuzzy Logic","authors":"Ali Saadat, E. Masehian","doi":"10.1109/CSCI49370.2019.00268","DOIUrl":"https://doi.org/10.1109/CSCI49370.2019.00268","url":null,"abstract":"Cloud computing systems play a vital role in the digital age. A critical bottleneck in most scenarios in cloud computing is the high degree of unpredictability with respect to resource availability and network bandwidth, which may lead to low Quality of Service (like low response times), which can be improved by Load Balancing. Load balancing concerns with efficiently distributing incoming network traffic across a group of servers. This ensures no single server bears too much demand, and thus the availability of applications and websites for users is increased. Due to the huge state-space of such a problem, implementing task scheduling algorithms in load balancing can be very effective. In this paper, we propose a hybrid intelligent approach to load balancing: a Genetic Algorithm module arranges the jobs randomly, and a fuzzy logic module builds the objective function for determining busy states of servers according to their RAM and CPU task queues. The fuzzy input variables include the satisfaction degree and the start and end times of the service, and the fuzzy output is service availability. Computational experiments showed that the best solution was obtained within half of the planned execution time, which leads to higher user satisfaction degree.","PeriodicalId":103662,"journal":{"name":"2019 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115342550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
期刊
2019 International Conference on Computational Science and Computational Intelligence (CSCI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1