首页 > 最新文献

Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02最新文献

英文 中文
On the building of the next generation integrated environment for Beowulf clusters 论贝奥武夫集群下一代集成环境的构建
P. Uthayopas, T. Angskun, Jullawadee Maneesilp
Since their beginning in a NASA project around 1994, Beowulf clusters have become one of the most important classes of high performance computing systems. Although this platform has been widely used for several years, it is still difficult to build a large Beowulf cluster. The reason for this is the lack of tools that allow people to effectively maintain the system and develop applications. This problem significantly reduces the benefit gained from a Beowulf system. In this paper, the design of the next generation integrated environment for Beowulf clusters, called SCE (Scalable Cluster Environment), is presented. The goal of the SCE project is to build a powerful, scalable, portable, and easy to use cluster environment. The paper discusses the important issues in designing a good cluster environment and proposes a solution to these issues. SCE design has been used to demonstrate the ideas. Finally, the experimental results are presented to illustrated that while it provides a flexible and portable environment, SCE can also provide a scalable, fast, and efficient cluster environment for users.
自从1994年左右在NASA的一个项目中出现以来,Beowulf集群已经成为高性能计算系统中最重要的一类。虽然这个平台已经被广泛使用了好几年,但是构建一个大型的Beowulf集群仍然很困难。其原因是缺乏允许人们有效地维护系统和开发应用程序的工具。这个问题大大降低了从Beowulf系统中获得的好处。本文介绍了Beowulf集群的下一代集成环境SCE(可扩展集群环境)的设计。SCE项目的目标是构建一个功能强大、可伸缩、可移植且易于使用的集群环境。本文讨论了设计一个良好的集群环境的重要问题,并提出了解决这些问题的方法。SCE设计已被用于演示这些想法。最后,给出了实验结果,说明SCE在提供灵活、可移植环境的同时,还可以为用户提供可扩展、快速、高效的集群环境。
{"title":"On the building of the next generation integrated environment for Beowulf clusters","authors":"P. Uthayopas, T. Angskun, Jullawadee Maneesilp","doi":"10.1109/ISPAN.2002.1004276","DOIUrl":"https://doi.org/10.1109/ISPAN.2002.1004276","url":null,"abstract":"Since their beginning in a NASA project around 1994, Beowulf clusters have become one of the most important classes of high performance computing systems. Although this platform has been widely used for several years, it is still difficult to build a large Beowulf cluster. The reason for this is the lack of tools that allow people to effectively maintain the system and develop applications. This problem significantly reduces the benefit gained from a Beowulf system. In this paper, the design of the next generation integrated environment for Beowulf clusters, called SCE (Scalable Cluster Environment), is presented. The goal of the SCE project is to build a powerful, scalable, portable, and easy to use cluster environment. The paper discusses the important issues in designing a good cluster environment and proposes a solution to these issues. SCE design has been used to demonstrate the ideas. Finally, the experimental results are presented to illustrated that while it provides a flexible and portable environment, SCE can also provide a scalable, fast, and efficient cluster environment for users.","PeriodicalId":255069,"journal":{"name":"Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121269735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Advanced Baseline: a new MIN with fault-tolerance characteristic 高级基线:一种具有容错特性的新MIN
Tao Liu, Qiuyang Li, Yulu Yang
On the basis of the idea of the Delta network and the Baseline network, a new type of MIN, Advanced Baseline network, which using 4/spl times/4 switches, is proposed. In this paper it is proved that 4/spl times/4 switches have the better performance/cost ratios than 2/spl times/2 switches based on the current level of the VLSI technology. This paper expounds its topological properties and routing techniques and makes performance/ratios comparisons. And furthermore a fault-tolerance Advanced Baseline network is discussed in detail, which has a good reliability in complex parallel systems.
在Delta网络和基线网络思想的基础上,提出了一种采用4/ sp1次/4交换机的新型MIN——高级基线网络。基于目前的VLSI技术水平,证明了4/spl倍/4开关比2/spl倍/2开关具有更好的性能/成本比。本文阐述了它的拓扑特性和路由技术,并进行了性能比比较。在此基础上详细讨论了一种容错高级基线网络,该网络在复杂并联系统中具有良好的可靠性。
{"title":"Advanced Baseline: a new MIN with fault-tolerance characteristic","authors":"Tao Liu, Qiuyang Li, Yulu Yang","doi":"10.1109/ISPAN.2002.1004294","DOIUrl":"https://doi.org/10.1109/ISPAN.2002.1004294","url":null,"abstract":"On the basis of the idea of the Delta network and the Baseline network, a new type of MIN, Advanced Baseline network, which using 4/spl times/4 switches, is proposed. In this paper it is proved that 4/spl times/4 switches have the better performance/cost ratios than 2/spl times/2 switches based on the current level of the VLSI technology. This paper expounds its topological properties and routing techniques and makes performance/ratios comparisons. And furthermore a fault-tolerance Advanced Baseline network is discussed in detail, which has a good reliability in complex parallel systems.","PeriodicalId":255069,"journal":{"name":"Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126942431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Computational aspects of distributed sensor networks 分布式传感器网络的计算方面
S. Iyengar, C. Wu
In recent years, the increasing sophistication of detection and tracking systems with multiple sensors has generated a great deal of interest in the development of new computational structures and strategies. The design of such spatially distributed systems involves the integration of solutions to the problems in data-association, hypothesis testing, data fusion, etc. More importantly, the design of efficient algorithms for these problems becomes all important issue as numerous distributed computational framework models are developed. In this paper, we present a survey of the recent research work conducted on the distributed sensor networks (DSNs) ranging from multi-sensor fusion methods, through target location, complexities of sensor deployment, probabilistic optimal surveillance to mobile agent based DSN (MADSN).
近年来,越来越复杂的多传感器检测和跟踪系统引起了人们对开发新的计算结构和策略的极大兴趣。这种空间分布式系统的设计涉及对数据关联、假设检验、数据融合等问题的解决方案的集成。更重要的是,随着众多分布式计算框架模型的发展,为这些问题设计有效的算法变得非常重要。本文对分布式传感器网络(DSN)的最新研究工作进行了综述,从多传感器融合方法、目标定位、传感器部署的复杂性、概率最优监控到基于移动代理的DSN (MADSN)。
{"title":"Computational aspects of distributed sensor networks","authors":"S. Iyengar, C. Wu","doi":"10.1109/ISPAN.2002.1004256","DOIUrl":"https://doi.org/10.1109/ISPAN.2002.1004256","url":null,"abstract":"In recent years, the increasing sophistication of detection and tracking systems with multiple sensors has generated a great deal of interest in the development of new computational structures and strategies. The design of such spatially distributed systems involves the integration of solutions to the problems in data-association, hypothesis testing, data fusion, etc. More importantly, the design of efficient algorithms for these problems becomes all important issue as numerous distributed computational framework models are developed. In this paper, we present a survey of the recent research work conducted on the distributed sensor networks (DSNs) ranging from multi-sensor fusion methods, through target location, complexities of sensor deployment, probabilistic optimal surveillance to mobile agent based DSN (MADSN).","PeriodicalId":255069,"journal":{"name":"Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125505322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Optimal multicast tree routing for cluster computing in hypercube interconnection networks 超立方体互联网络中集群计算的最优组播树路由
W. Jia, B. Han, Pui-on Au, Yonggang He, Wanlei Zhou
This paper proposes a set of novel efficient multicast routing algorithms on hypercube interconnections of cluster computers using a multicast shared-tree approach. We present some new algorithms for selecting an optimal core (root) for constructing the shared tree so as to minimize the average delay for multicast messages. Simulation results indicate that our algorithms are efficient in the sense of short end-to-end average delay, load balance and lower resource utilization over hypercube cluster connections.
本文提出了一套基于组播共享树的高效组播路由算法。为了使多播消息的平均延迟最小化,我们提出了一些选择最优核心(根)构建共享树的新算法。仿真结果表明,在超立方体集群连接上,我们的算法在端到端平均延迟短、负载均衡和资源利用率低等方面是有效的。
{"title":"Optimal multicast tree routing for cluster computing in hypercube interconnection networks","authors":"W. Jia, B. Han, Pui-on Au, Yonggang He, Wanlei Zhou","doi":"10.1109/ISPAN.2002.1004297","DOIUrl":"https://doi.org/10.1109/ISPAN.2002.1004297","url":null,"abstract":"This paper proposes a set of novel efficient multicast routing algorithms on hypercube interconnections of cluster computers using a multicast shared-tree approach. We present some new algorithms for selecting an optimal core (root) for constructing the shared tree so as to minimize the average delay for multicast messages. Simulation results indicate that our algorithms are efficient in the sense of short end-to-end average delay, load balance and lower resource utilization over hypercube cluster connections.","PeriodicalId":255069,"journal":{"name":"Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114888555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Parallel processing of multi-join expansion-aggregate data cube query in high performance database systems 高性能数据库系统中多连接扩展聚合数据立方体查询的并行处理
D. Taniar, Rebecca Boon-Noi Tan
Data-cube queries containing aggregate functions often combine multiple tables through join operations. We can extend this to "multi-join expansion-aggregate" data-cube queries by using more than one aggregate function in a "SELECT" statement in conjunction with relational operators. In parallel processing for such queries, it must be decided which attribute to use as a partitioning attribute, in particular the join attribute or "cube-by". Based on the partitioning attribute, we introduce three parallel multi-join expansion-aggregate data-cube query methods, namely the multi-join partition method (MPM), the expansion partition method (EPM) and the "early expansion partition with replication" method (EPRM). All three methods use the join attribute and "cube-by" as the partitioning attribute. A performance evaluation of the three parallel processing methods is also carried out and presented.
包含聚合函数的数据多维数据集查询通常通过连接操作组合多个表。通过在“SELECT”语句中使用多个聚合函数并结合关系操作符,我们可以将其扩展到“多连接扩展-聚合”数据多维数据集查询。在对此类查询进行并行处理时,必须决定使用哪个属性作为分区属性,特别是join属性或“cube-by”。基于分区属性,介绍了三种并行的多连接扩展聚合数据立方体查询方法,即多连接分区法(MPM)、扩展分区法(EPM)和“带复制的早期扩展分区法”(EPRM)。这三种方法都使用join属性和“cube-by”作为分区属性。对三种并行处理方法进行了性能评价。
{"title":"Parallel processing of multi-join expansion-aggregate data cube query in high performance database systems","authors":"D. Taniar, Rebecca Boon-Noi Tan","doi":"10.1109/ISPAN.2002.1004260","DOIUrl":"https://doi.org/10.1109/ISPAN.2002.1004260","url":null,"abstract":"Data-cube queries containing aggregate functions often combine multiple tables through join operations. We can extend this to \"multi-join expansion-aggregate\" data-cube queries by using more than one aggregate function in a \"SELECT\" statement in conjunction with relational operators. In parallel processing for such queries, it must be decided which attribute to use as a partitioning attribute, in particular the join attribute or \"cube-by\". Based on the partitioning attribute, we introduce three parallel multi-join expansion-aggregate data-cube query methods, namely the multi-join partition method (MPM), the expansion partition method (EPM) and the \"early expansion partition with replication\" method (EPRM). All three methods use the join attribute and \"cube-by\" as the partitioning attribute. A performance evaluation of the three parallel processing methods is also carried out and presented.","PeriodicalId":255069,"journal":{"name":"Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132939726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Online real-time job scheduling with rate of progress guarantees 具有进度保证的在线实时作业调度
M. Palis
Investigates the job scheduling problem within the context of reservation-based real-time systems that provide quality of service (QoS) guarantees. In such a system, each incoming job specifies a rate of progress requirement on the job execution that must be met by the system in order for computation to be deemed usable. A new metric, called granularity, is introduced that quantifies both the maximum slowdown and the variance in execution rate that the job allows. This metric generalizes the stretch metric used in recent research on job scheduling. An online pre-emptive scheduling algorithm is presented that is shown achieve a competitive ratio of g(1-r) for every set of jobs with maximum rate r and granularity g. This result generalizes a previous result based on the stretch metric that showed that a competitive ratio of 1-r is achievable for the case when g=1.
在提供服务质量(QoS)保证的基于预约的实时系统环境中研究作业调度问题。在这样的系统中,每个传入的作业指定了作业执行的进度要求,系统必须满足该要求,才能认为计算是可用的。引入了一个称为粒度的新度量,它量化了作业允许的最大减速和执行速率的变化。该度量推广了最近在作业调度研究中使用的拉伸度量。提出了一种在线抢先调度算法,该算法对每组最大速率r和粒度g的作业实现了g(1-r)的竞争比。该结果推广了先前基于拉伸度量的结果,该结果表明,当g=1时,竞争比为1-r。
{"title":"Online real-time job scheduling with rate of progress guarantees","authors":"M. Palis","doi":"10.1109/ISPAN.2002.1004262","DOIUrl":"https://doi.org/10.1109/ISPAN.2002.1004262","url":null,"abstract":"Investigates the job scheduling problem within the context of reservation-based real-time systems that provide quality of service (QoS) guarantees. In such a system, each incoming job specifies a rate of progress requirement on the job execution that must be met by the system in order for computation to be deemed usable. A new metric, called granularity, is introduced that quantifies both the maximum slowdown and the variance in execution rate that the job allows. This metric generalizes the stretch metric used in recent research on job scheduling. An online pre-emptive scheduling algorithm is presented that is shown achieve a competitive ratio of g(1-r) for every set of jobs with maximum rate r and granularity g. This result generalizes a previous result based on the stretch metric that showed that a competitive ratio of 1-r is achievable for the case when g=1.","PeriodicalId":255069,"journal":{"name":"Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115851145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Data reservoir: a new approach to data-intensive scientific computation 数据存储:一种数据密集型科学计算的新方法
K. Hiraki, M. Inaba, Junji Tamatsukuri, Ryutaro Kurusu, Yukichi Ikuta, Hisashi Koga, Akira Zinzaki
We propose a very fast file sharing facility "data reservoir", which can transfer huge data files between far distant places, and can be used as an ordinary NFS server in a local area network. We use a low-level protocol and a hierarchical disk striping technique. We show the experimental results of a prototype model, which transfers a single huge data file with nearly 4 Gbps speed.
我们提出了一种非常快速的文件共享设施“数据储存库”,它可以在遥远的地方之间传输巨大的数据文件,可以作为局域网中的普通NFS服务器使用。我们使用低级协议和分层磁盘条带化技术。我们展示了一个原型模型的实验结果,该模型以接近4 Gbps的速度传输单个大数据文件。
{"title":"Data reservoir: a new approach to data-intensive scientific computation","authors":"K. Hiraki, M. Inaba, Junji Tamatsukuri, Ryutaro Kurusu, Yukichi Ikuta, Hisashi Koga, Akira Zinzaki","doi":"10.1109/ISPAN.2002.1004293","DOIUrl":"https://doi.org/10.1109/ISPAN.2002.1004293","url":null,"abstract":"We propose a very fast file sharing facility \"data reservoir\", which can transfer huge data files between far distant places, and can be used as an ordinary NFS server in a local area network. We use a low-level protocol and a hierarchical disk striping technique. We show the experimental results of a prototype model, which transfers a single huge data file with nearly 4 Gbps speed.","PeriodicalId":255069,"journal":{"name":"Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133195415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Parallel selection query processing involving index in parallel database systems 并行数据库系统中涉及索引的并行选择查询处理
J. Rahayu, D. Taniar
Index is an important element in databases, and the existence of index is unavoidable. When an index has been built on a particular attribute, database operations (e.g. selection, join) on this attribute will become more efficient by utilizing the index. We focus on parallel algorithms for selection queries involving index that is data searching on indexed attributes. We propose two categories of parallel selection queries using index: parallel exact match and range selections; depending on the type of selection conditions. As parallel algorithms for these selection queries are very much influenced by indexing schemes, we also describe various index partitioning methods for parallel databases, and discuss their efficiency in supporting parallel selection query processing.
索引是数据库的重要组成部分,索引的存在是不可避免的。当在特定属性上建立索引时,通过利用索引,对该属性的数据库操作(例如选择、连接)将变得更加高效。我们将重点讨论涉及索引的选择查询的并行算法,即对索引属性进行数据搜索。我们提出了两类使用索引的并行选择查询:并行精确匹配和范围选择;取决于选择条件的类型。由于这些选择查询的并行算法受索引方案的影响很大,我们还描述了并行数据库的各种索引划分方法,并讨论了它们在支持并行选择查询处理方面的效率。
{"title":"Parallel selection query processing involving index in parallel database systems","authors":"J. Rahayu, D. Taniar","doi":"10.1109/ISPAN.2002.1004299","DOIUrl":"https://doi.org/10.1109/ISPAN.2002.1004299","url":null,"abstract":"Index is an important element in databases, and the existence of index is unavoidable. When an index has been built on a particular attribute, database operations (e.g. selection, join) on this attribute will become more efficient by utilizing the index. We focus on parallel algorithms for selection queries involving index that is data searching on indexed attributes. We propose two categories of parallel selection queries using index: parallel exact match and range selections; depending on the type of selection conditions. As parallel algorithms for these selection queries are very much influenced by indexing schemes, we also describe various index partitioning methods for parallel databases, and discuss their efficiency in supporting parallel selection query processing.","PeriodicalId":255069,"journal":{"name":"Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132298473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A distributed simulation management engine for ordinary differential equations 常微分方程的分布式仿真管理引擎
S. See
The Internet is quickly evolving into a global computing platform. Internetworking of computers means that computing resources available for personal use need not be confined to the users' local environment. Instead, the Internet provides a delivery channel where remote computing resources are easily accessible. Computer simulations often require High Performance Computers (HPC) for fast number crunching. However, HPCs remain inaccessible to ordinary users due to the high cost involved. The aim of this project is to develop a Web-based simulation architecture where users can input a simulation model, based on the European Space Agency Simulation Language (ESL), through a Java graphical front-end application, and letting backend HPCs to perform intensive computation on behalf of the user. We have successfully implemented this system within an intranet environment, with the possibility of actual implementation on the World Wide Web. In this report, we discuss the architectural merits of this framework, our design philosophy and proposals for improvements and future development.
互联网正在迅速发展成为一个全球性的计算平台。计算机的互联意味着可供个人使用的计算资源不必局限于用户的本地环境。相反,Internet提供了一个传输通道,可以方便地访问远程计算资源。计算机模拟通常需要高性能计算机(HPC)来快速处理数字。然而,由于涉及的高成本,普通用户仍然无法使用高性能计算机。该项目的目的是开发一个基于web的仿真体系结构,用户可以在其中输入仿真模型,基于欧洲空间局仿真语言(ESL),通过Java图形前端应用程序,并让后端hpc代表用户执行密集计算。我们已经在一个内部网环境中成功地实现了这个系统,并有可能在万维网上实际实现。在本报告中,我们将讨论该框架的架构优点、我们的设计理念以及改进和未来发展的建议。
{"title":"A distributed simulation management engine for ordinary differential equations","authors":"S. See","doi":"10.1109/ISPAN.2002.1004292","DOIUrl":"https://doi.org/10.1109/ISPAN.2002.1004292","url":null,"abstract":"The Internet is quickly evolving into a global computing platform. Internetworking of computers means that computing resources available for personal use need not be confined to the users' local environment. Instead, the Internet provides a delivery channel where remote computing resources are easily accessible. Computer simulations often require High Performance Computers (HPC) for fast number crunching. However, HPCs remain inaccessible to ordinary users due to the high cost involved. The aim of this project is to develop a Web-based simulation architecture where users can input a simulation model, based on the European Space Agency Simulation Language (ESL), through a Java graphical front-end application, and letting backend HPCs to perform intensive computation on behalf of the user. We have successfully implemented this system within an intranet environment, with the possibility of actual implementation on the World Wide Web. In this report, we discuss the architectural merits of this framework, our design philosophy and proposals for improvements and future development.","PeriodicalId":255069,"journal":{"name":"Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133902314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A new data clustering approach for data mining in large databases 大型数据库数据挖掘的一种新的数据聚类方法
Cheng-Fa Tsai, Hang-Chang Wu, Chun-Wei Tsai
Clustering is the unsupervised classification of patterns (data item, feature vectors, or observations) into groups (clusters). Clustering in data mining is very useful to discover distribution patterns in the underlying data. Clustering algorithms usually employ a distance metric-based similarity measure in order to partition the database such that data points in the same partition are more similar than points in different partitions. In this paper, we present a new data clustering method for data mining in large databases. Our simulation results show that the proposed novel clustering method performs better than a fast self-organizing map (FSOM) combined with the k-means approach (FSOM+k-means) and the genetic k-means algorithm (GKA). In addition, in all the cases we studied, our method produces much smaller errors than both the FSOM+k-means approach and GKA.
聚类是将模式(数据项、特征向量或观察值)分成组(聚类)的无监督分类。数据挖掘中的聚类对于发现底层数据中的分布模式非常有用。聚类算法通常采用基于距离度量的相似性度量来划分数据库,使同一分区中的数据点比不同分区中的数据点更相似。本文提出了一种新的用于大型数据库数据挖掘的数据聚类方法。仿真结果表明,本文提出的聚类方法优于快速自组织映射(FSOM)结合k-means方法(FSOM+k-means)和遗传k-means算法(GKA)。此外,在我们研究的所有情况下,我们的方法比FSOM+k-means方法和GKA方法产生的误差要小得多。
{"title":"A new data clustering approach for data mining in large databases","authors":"Cheng-Fa Tsai, Hang-Chang Wu, Chun-Wei Tsai","doi":"10.1109/ISPAN.2002.1004300","DOIUrl":"https://doi.org/10.1109/ISPAN.2002.1004300","url":null,"abstract":"Clustering is the unsupervised classification of patterns (data item, feature vectors, or observations) into groups (clusters). Clustering in data mining is very useful to discover distribution patterns in the underlying data. Clustering algorithms usually employ a distance metric-based similarity measure in order to partition the database such that data points in the same partition are more similar than points in different partitions. In this paper, we present a new data clustering method for data mining in large databases. Our simulation results show that the proposed novel clustering method performs better than a fast self-organizing map (FSOM) combined with the k-means approach (FSOM+k-means) and the genetic k-means algorithm (GKA). In addition, in all the cases we studied, our method produces much smaller errors than both the FSOM+k-means approach and GKA.","PeriodicalId":255069,"journal":{"name":"Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133221205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 64
期刊
Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1