首页 > 最新文献

Proceedings of the 2006 ACM/IEEE conference on Supercomputing最新文献

英文 中文
So What's innovative and exotic about star-P for MATLAB and other clients? 那么,star-P对MATLAB和其他客户来说有什么创新和新奇之处呢?
Pub Date : 2006-11-11 DOI: 10.1145/1188455.1188746
A. Edelman
Star-P is a unique technology offered by Interactive Supercomputing after nurturing at MIT. Star-P through its clever abstractions is solving the ease of use problem that has plagued supercomputing. Given that there have been around 30 parallel MATLABs including three other major offerings, Star-P must be way ahead to compete in the marketplace.Some of the innovative features of Star-P are the ability to program in MATLAB, hook in task parallel codes written using a processor free abstraction, hook in existing parallel codes, and obtain the performance that represents the HPC promise. All this is through a client/server interface. Other clients such as Python or R could be possible. The MATLAB, Python, or R becomes the "browser." If we make it look easy, it is because decades of parallel computing experience has taught us that it is not.This talk demonstrates the abstractions and innovations that make this possible.
Star-P是Interactive Supercomputing在麻省理工学院(MIT)培育后提供的独特技术。Star-P通过其巧妙的抽象解决了困扰超级计算的易用性问题。考虑到已经有大约30个并行matlab,包括其他三个主要产品,Star-P必须在市场竞争中遥遥领先。Star-P的一些创新特性是能够在MATLAB中编程,挂钩使用无处理器抽象编写的任务并行代码,挂钩现有并行代码,并获得代表HPC承诺的性能。所有这些都是通过客户机/服务器接口实现的。其他客户端(如Python或R)也是可能的。MATLAB、Python或R成为“浏览器”。如果我们让它看起来很简单,那是因为几十年的并行计算经验告诉我们事实并非如此。这次演讲展示了使这成为可能的抽象和创新。
{"title":"So What's innovative and exotic about star-P for MATLAB and other clients?","authors":"A. Edelman","doi":"10.1145/1188455.1188746","DOIUrl":"https://doi.org/10.1145/1188455.1188746","url":null,"abstract":"Star-P is a unique technology offered by Interactive Supercomputing after nurturing at MIT. Star-P through its clever abstractions is solving the ease of use problem that has plagued supercomputing. Given that there have been around 30 parallel MATLABs including three other major offerings, Star-P must be way ahead to compete in the marketplace.Some of the innovative features of Star-P are the ability to program in MATLAB, hook in task parallel codes written using a processor free abstraction, hook in existing parallel codes, and obtain the performance that represents the HPC promise. All this is through a client/server interface. Other clients such as Python or R could be possible. The MATLAB, Python, or R becomes the \"browser.\" If we make it look easy, it is because decades of parallel computing experience has taught us that it is not.This talk demonstrates the abstractions and innovations that make this possible.","PeriodicalId":115940,"journal":{"name":"Proceedings of the 2006 ACM/IEEE conference on Supercomputing","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123039183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GPGPU: general-purpose computation on graphics hardware GPGPU:图形硬件上的通用计算
Pub Date : 2006-11-11 DOI: 10.1145/1188455.1188672
D. Luebke, Mark J. Harris, N. Govindaraju, A. Lefohn, M. Houston, John Douglas Owens, Mark E. Segal, Matthew Papakipos, I. Buck
The graphics processor (GPU) on today's commodity video cards has evolved into an extremely powerful and flexible processor. Modern graphics architectures provide tremendous memory bandwidth and computational horsepower, with dozens of fully programmable shading units that support vector operations and IEEE floating point precision. High-level languages have emerged for graphics hardware, making this computational power accessible. GPGPU stands for "General-Purpose Computation on GPUs". GPGPU researchers have achieved over an order of magnitude speedup over modern CPUs on some non-graphics problems.This course provides detailed coverage of general-purpose computation on graphics hardware. We emphasize core computational building blocks, ranging from linear algebra to database queries, and review the tools, perils, and strategies in GPU programming. We present analysis of GPU performance characteristics, and use this analysis to provide insight into how to build efficient GPGPU algorithms. Finally we present a set of case studies on general-purpose applications of graphics hardware.
今天的商品视频卡上的图形处理器(GPU)已经发展成为一个极其强大和灵活的处理器。现代图形架构提供了巨大的内存带宽和计算能力,具有数十个完全可编程的着色单元,支持矢量操作和IEEE浮点精度。针对图形硬件的高级语言已经出现,使这种计算能力成为可能。GPGPU代表“gpu上的通用计算”。在一些非图形问题上,GPGPU研究人员已经实现了比现代cpu超过一个数量级的加速。本课程详细介绍了图形硬件上的通用计算。我们强调核心计算构建块,从线性代数到数据库查询,并回顾GPU编程中的工具,风险和策略。我们对GPU的性能特征进行了分析,并利用这一分析来深入了解如何构建高效的GPGPU算法。最后,我们提出了一组图形硬件通用应用的案例研究。
{"title":"GPGPU: general-purpose computation on graphics hardware","authors":"D. Luebke, Mark J. Harris, N. Govindaraju, A. Lefohn, M. Houston, John Douglas Owens, Mark E. Segal, Matthew Papakipos, I. Buck","doi":"10.1145/1188455.1188672","DOIUrl":"https://doi.org/10.1145/1188455.1188672","url":null,"abstract":"The graphics processor (GPU) on today's commodity video cards has evolved into an extremely powerful and flexible processor. Modern graphics architectures provide tremendous memory bandwidth and computational horsepower, with dozens of fully programmable shading units that support vector operations and IEEE floating point precision. High-level languages have emerged for graphics hardware, making this computational power accessible. GPGPU stands for \"General-Purpose Computation on GPUs\". GPGPU researchers have achieved over an order of magnitude speedup over modern CPUs on some non-graphics problems.This course provides detailed coverage of general-purpose computation on graphics hardware. We emphasize core computational building blocks, ranging from linear algebra to database queries, and review the tools, perils, and strategies in GPU programming. We present analysis of GPU performance characteristics, and use this analysis to provide insight into how to build efficient GPGPU algorithms. Finally we present a set of case studies on general-purpose applications of graphics hardware.","PeriodicalId":115940,"journal":{"name":"Proceedings of the 2006 ACM/IEEE conference on Supercomputing","volume":"309 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123142733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 212
The coming merger of biological and non biological intelligence 即将到来的生物智能和非生物智能的融合
Pub Date : 2006-11-11 DOI: 10.1145/1188455.1188658
R. Kurzweil
The paradigm shift rate is now doubling every decade, so the twenty-first century will see 20,000 years of progress at today's rate. Computation, communication, biological technologies (for example, DNA sequencing), brain scanning, knowledge of the human brain, and human knowledge in general are all accelerating at an even faster pace, generally doubling price-performance, capacity, and bandwidth every year. The well-known Moore's Law is only one example of many of this inherent acceleration. The size of the key features of technology is also shrinking, at a rate of about 4 per linear dimension per decade. Three-dimensional molecular computing will provide the hardware for human-level "strong" AI well before 2030. The more important software insights will be gained in part from the reverse-engineering of the human brain, a process well under way.We are rapidly learning the software programs called genes that underlie biology. We are understanding disease and aging processes as information processes, and are gaining the tools to reprogram them. RNA interference, for example, allows us to turn selected genes off, and new forms of gene therapy are enabling us to effectively add new genes. Within one to two decades, we will be in a position to stop and reverse the progression of disease and aging resulting in dramatic gains in health and longevity.The fraction of value of products and services comprised by software and related forms of information is rapidly asymptoting to 100 percent The deflation rate for information technologies, both hardware and software, is about 50 percent per year, providing a powerful deflationary force in the economy. The portion of the economy comprised of information technology is itself growing exponentially and within a couple of decades, the bulk of the economy will be dominated by information and software.Once nonbiological intelligence matches the range and subtlety of human intelligence, it will necessarily soar past it because of the continuing acceleration of information-based technologies, as well as the ability of machines to instantly share their knowledge. Intelligent nanorobots will be deeply integrated in the environment, our bodies and our brains, providing vastly extended longevity, full-immersion virtual reality incorporating all of the senses, experience "beaming," and enhanced human intelligence. The implication will be an intimate merger between the technology-creating species and the evolutionary process it spawned.
范式转换的速度现在每十年翻一番,所以按照今天的速度,21世纪将会看到2万年的进步。计算、通信、生物技术(例如DNA测序)、大脑扫描、对人类大脑的了解,以及人类的总体知识,都在以更快的速度加速发展,通常价格、性能、容量和带宽每年都会翻一番。众所周知的摩尔定律只是众多这种内在加速的一个例子。技术的关键特征的尺寸也在缩小,以每十年每线性维度缩小4个的速度缩小。三维分子计算将在2030年之前为人类水平的“强”人工智能提供硬件。更重要的软件见解将部分地从人类大脑的逆向工程中获得,这一过程正在进行中。我们正在快速地学习作为生物学基础的称为基因的软件程序。我们正在将疾病和衰老过程理解为信息过程,并正在获得重新编程它们的工具。例如,RNA干扰使我们能够关闭选定的基因,而新形式的基因治疗使我们能够有效地添加新基因。在10到20年内,我们将能够阻止和逆转疾病和衰老的进程,从而显著提高健康和寿命。由软件和相关形式的信息构成的产品和服务价值的比例正在迅速接近100%,信息技术(包括硬件和软件)的通货紧缩率每年约为50%,为经济提供了强大的通货紧缩力量。由信息技术组成的经济部分本身正呈指数级增长,在几十年内,经济的大部分将由信息和软件主导。一旦非生物智能与人类智能的范围和微妙程度相匹配,它必然会超越人类智能,因为基于信息的技术的持续加速发展,以及机器即时分享知识的能力。智能纳米机器人将与环境、我们的身体和大脑深度融合,提供极大延长的寿命、融合所有感官的全沉浸式虚拟现实、体验“喜气”和增强的人类智能。这意味着创造技术的物种和它所产生的进化过程之间的亲密融合。
{"title":"The coming merger of biological and non biological intelligence","authors":"R. Kurzweil","doi":"10.1145/1188455.1188658","DOIUrl":"https://doi.org/10.1145/1188455.1188658","url":null,"abstract":"The paradigm shift rate is now doubling every decade, so the twenty-first century will see 20,000 years of progress at today's rate. Computation, communication, biological technologies (for example, DNA sequencing), brain scanning, knowledge of the human brain, and human knowledge in general are all accelerating at an even faster pace, generally doubling price-performance, capacity, and bandwidth every year. The well-known Moore's Law is only one example of many of this inherent acceleration. The size of the key features of technology is also shrinking, at a rate of about 4 per linear dimension per decade. Three-dimensional molecular computing will provide the hardware for human-level \"strong\" AI well before 2030. The more important software insights will be gained in part from the reverse-engineering of the human brain, a process well under way.We are rapidly learning the software programs called genes that underlie biology. We are understanding disease and aging processes as information processes, and are gaining the tools to reprogram them. RNA interference, for example, allows us to turn selected genes off, and new forms of gene therapy are enabling us to effectively add new genes. Within one to two decades, we will be in a position to stop and reverse the progression of disease and aging resulting in dramatic gains in health and longevity.The fraction of value of products and services comprised by software and related forms of information is rapidly asymptoting to 100 percent The deflation rate for information technologies, both hardware and software, is about 50 percent per year, providing a powerful deflationary force in the economy. The portion of the economy comprised of information technology is itself growing exponentially and within a couple of decades, the bulk of the economy will be dominated by information and software.Once nonbiological intelligence matches the range and subtlety of human intelligence, it will necessarily soar past it because of the continuing acceleration of information-based technologies, as well as the ability of machines to instantly share their knowledge. Intelligent nanorobots will be deeply integrated in the environment, our bodies and our brains, providing vastly extended longevity, full-immersion virtual reality incorporating all of the senses, experience \"beaming,\" and enhanced human intelligence. The implication will be an intimate merger between the technology-creating species and the evolutionary process it spawned.","PeriodicalId":115940,"journal":{"name":"Proceedings of the 2006 ACM/IEEE conference on Supercomputing","volume":"17 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121001000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
SPEC HPG benchmarks 规格HPG基准
Pub Date : 2006-11-11 DOI: 10.1145/1188455.1188473
K. Kalyanasundaram
SPEC HPG has been creating benchmarks for comparing HPC systems. Currently they have a benchmark suite based on OpenMP (SPEC OMP) and another based on real applications supporting multiple parallel paradigms (SPEC HPC2002). SPEC is now in the process of developing a benhcmark based on MPI applications. This new suite is expected to measure MPI, CPU, compiler, memory bandwidth and interconnect performance. The BOF will focus on SPEC HPG benchmarks and will introduce the new SPEC MPI2006 benchmark.
SPEC HPG一直在为比较HPC系统创建基准。目前,他们有一个基于OpenMP的基准测试套件(SPEC OMP)和另一个基于支持多个并行范式的实际应用程序的基准测试套件(SPEC HPC2002)。SPEC目前正在开发一个基于MPI应用程序的基准测试。这个新套件有望测量MPI、CPU、编译器、内存带宽和互连性能。BOF将专注于SPEC HPG基准测试,并将引入新的SPEC MPI2006基准测试。
{"title":"SPEC HPG benchmarks","authors":"K. Kalyanasundaram","doi":"10.1145/1188455.1188473","DOIUrl":"https://doi.org/10.1145/1188455.1188473","url":null,"abstract":"SPEC HPG has been creating benchmarks for comparing HPC systems. Currently they have a benchmark suite based on OpenMP (SPEC OMP) and another based on real applications supporting multiple parallel paradigms (SPEC HPC2002). SPEC is now in the process of developing a benhcmark based on MPI applications. This new suite is expected to measure MPI, CPU, compiler, memory bandwidth and interconnect performance. The BOF will focus on SPEC HPG benchmarks and will introduce the new SPEC MPI2006 benchmark.","PeriodicalId":115940,"journal":{"name":"Proceedings of the 2006 ACM/IEEE conference on Supercomputing","volume":"121 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126048410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
The Canadian design research network's prototype design grid 加拿大设计研究网络的原型设计网格
Pub Date : 2006-11-11 DOI: 10.1145/1188455.1188788
J. Danahy, West Suhanic
The Canadian Design Research Network-(CDRN) is setting up a prototype Design Grid. The Design Grid has several goals including:1)enabling national and international design reviews in areas ranging from urban planning, landscape architecture, architect and aerospace engineering. 2)allowing interactive discussions including video/audio streaming and shared powerpoints between highly distributed individuals and institutions where individuals may only have access to very modest technical resources. 3)allowing the public to access visualizations of buildings and see the impact of those buildings. 4)allow the sharing of design courses across universities.
加拿大设计研究网络(CDRN)正在建立一个原型设计网格。设计网格有几个目标,包括:1)在城市规划、景观建筑、建筑和航空航天工程等领域进行国内和国际设计审查。2)允许在高度分散的个人和机构之间进行交互式讨论,包括视频/音频流和共享幻灯片,而个人可能只能获得非常有限的技术资源。3)允许公众访问建筑物的可视化图像,并看到这些建筑物的影响。4)允许跨大学共享设计课程。
{"title":"The Canadian design research network's prototype design grid","authors":"J. Danahy, West Suhanic","doi":"10.1145/1188455.1188788","DOIUrl":"https://doi.org/10.1145/1188455.1188788","url":null,"abstract":"The Canadian Design Research Network-(CDRN) is setting up a prototype Design Grid. The Design Grid has several goals including:1)enabling national and international design reviews in areas ranging from urban planning, landscape architecture, architect and aerospace engineering. 2)allowing interactive discussions including video/audio streaming and shared powerpoints between highly distributed individuals and institutions where individuals may only have access to very modest technical resources. 3)allowing the public to access visualizations of buildings and see the impact of those buildings. 4)allow the sharing of design courses across universities.","PeriodicalId":115940,"journal":{"name":"Proceedings of the 2006 ACM/IEEE conference on Supercomputing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125507725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The cosmic simulator 宇宙模拟器
Pub Date : 2006-11-11 DOI: 10.1145/1188455.1188509
M. Norman
In the coming decade, the visible universe will be surveyed to unprecedented breadth, depth, and spectral coverage to explore its structures, past history, and future fate. Data volumes of many petabytes are expected. Numerical simulations of cosmic structure formation and evolution must keep pace in scale and complexity if they are to assist in the interpretation of such data, as well as to make predictions. We describe the Cosmic Simulator, a software facility under development at UCSD for performing, analyzing, and archiving cosmological simulations of unprecedented scale and physical realism. Its major components are the adaptive mesh refinement hydrodynamic cosmology simulation code ENZO, a data analysis pipeline, and a Storage Resource Broker (SRB)-managed data archive housed at SDSC. We show scientific results from the Cosmic Simulator implemented on a data grid connecting LLNL and SDSC computational resources, and discuss future plans.
在未来的十年里,人们将对可见宇宙进行前所未有的广度、深度和光谱覆盖,以探索它的结构、过去的历史和未来的命运。预计数据量将达到许多pb。宇宙结构形成和演化的数值模拟如果要帮助解释这些数据并做出预测,就必须在规模和复杂性上保持同步。我们描述了宇宙模拟器,这是加州大学圣地亚哥分校正在开发的一个软件设施,用于执行、分析和存档前所未有的规模和物理真实感的宇宙模拟。它的主要组成部分是自适应网格细化流体动力学宇宙学模拟代码ENZO,数据分析管道和存储资源代理(SRB)管理的数据存档。我们展示了在连接LLNL和SDSC计算资源的数据网格上实现的宇宙模拟器的科学结果,并讨论了未来的计划。
{"title":"The cosmic simulator","authors":"M. Norman","doi":"10.1145/1188455.1188509","DOIUrl":"https://doi.org/10.1145/1188455.1188509","url":null,"abstract":"In the coming decade, the visible universe will be surveyed to unprecedented breadth, depth, and spectral coverage to explore its structures, past history, and future fate. Data volumes of many petabytes are expected. Numerical simulations of cosmic structure formation and evolution must keep pace in scale and complexity if they are to assist in the interpretation of such data, as well as to make predictions. We describe the Cosmic Simulator, a software facility under development at UCSD for performing, analyzing, and archiving cosmological simulations of unprecedented scale and physical realism. Its major components are the adaptive mesh refinement hydrodynamic cosmology simulation code ENZO, a data analysis pipeline, and a Storage Resource Broker (SRB)-managed data archive housed at SDSC. We show scientific results from the Cosmic Simulator implemented on a data grid connecting LLNL and SDSC computational resources, and discuss future plans.","PeriodicalId":115940,"journal":{"name":"Proceedings of the 2006 ACM/IEEE conference on Supercomputing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114296402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
gridMathematica: overview and new developments gridMathematica:概述和新发展
Pub Date : 2006-11-11 DOI: 10.1145/1188455.1188720
Roman Maeder
Parallel Computing Toolkit is a Mathematica package that allows users to develop and run parallel computations. It provides a high-level language for expressing parallel algorithms in a machine-independent way supporting many standard parallel programming paradigms. gridMathematica takes full advantage of Parallel Computing Toolkit for providing an easy-to-use environment for parallel computations on multi-processor machines, clusters, or grids of compute nodes using the scale and power of Mathematica. Among the new features under development is an extension of the Wolfram Workbench (which is based on Eclipse) that allows development, debugging and profiling of parallel Mathematica applications. The presentation discusses the features of gridMathematica and demonstrates examples of real world applications currently being used today. Attendees are encouraged to bring their problems to the workshop for evaluation and suggestion.
并行计算工具包是一个Mathematica软件包,允许用户开发和运行并行计算。它提供了一种高级语言,以独立于机器的方式表达并行算法,支持许多标准的并行编程范例。gridMathematica充分利用了并行计算工具包,为使用Mathematica的规模和能力在多处理器机器、集群或计算节点网格上进行并行计算提供了一个易于使用的环境。正在开发的新特性包括Wolfram Workbench(基于Eclipse)的扩展,它允许开发、调试和分析并行的Mathematica应用程序。该演讲讨论了gridMathematica的特性,并展示了目前正在使用的实际应用示例。我们鼓励与会者把他们的问题带到研讨会上进行评估和建议。
{"title":"gridMathematica: overview and new developments","authors":"Roman Maeder","doi":"10.1145/1188455.1188720","DOIUrl":"https://doi.org/10.1145/1188455.1188720","url":null,"abstract":"Parallel Computing Toolkit is a Mathematica package that allows users to develop and run parallel computations. It provides a high-level language for expressing parallel algorithms in a machine-independent way supporting many standard parallel programming paradigms. gridMathematica takes full advantage of Parallel Computing Toolkit for providing an easy-to-use environment for parallel computations on multi-processor machines, clusters, or grids of compute nodes using the scale and power of Mathematica. Among the new features under development is an extension of the Wolfram Workbench (which is based on Eclipse) that allows development, debugging and profiling of parallel Mathematica applications. The presentation discusses the features of gridMathematica and demonstrates examples of real world applications currently being used today. Attendees are encouraged to bring their problems to the workshop for evaluation and suggestion.","PeriodicalId":115940,"journal":{"name":"Proceedings of the 2006 ACM/IEEE conference on Supercomputing","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117299685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High performance networks for the future 未来的高性能网络
Pub Date : 2006-11-11 DOI: 10.1145/1188455.1188728
D. Roweth, M. McLaren
QsNetIII and 10 Gbit/s Ethernet, two networks for high perfomance computing. While the proprietary QsNet will continue to provide supercomputing funcionalities to clusters of commodity based servers, the second will establish itself as preferred choice in capacity class systems.Quadrics, who is actively developing products based on the two technologies, will present early performance results and comparisons.
QsNetIII和10gbit /s以太网,两种高性能计算网络。虽然专有的QsNet将继续为基于商品的服务器集群提供超级计算功能,但后者将成为容量级系统中的首选。目前正在积极开发基于这两种技术的产品的Quadrics将在此次会议上发表早期的性能结果和比较。
{"title":"High performance networks for the future","authors":"D. Roweth, M. McLaren","doi":"10.1145/1188455.1188728","DOIUrl":"https://doi.org/10.1145/1188455.1188728","url":null,"abstract":"QsNetIII and 10 Gbit/s Ethernet, two networks for high perfomance computing. While the proprietary QsNet will continue to provide supercomputing funcionalities to clusters of commodity based servers, the second will establish itself as preferred choice in capacity class systems.Quadrics, who is actively developing products based on the two technologies, will present early performance results and comparisons.","PeriodicalId":115940,"journal":{"name":"Proceedings of the 2006 ACM/IEEE conference on Supercomputing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129411889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Targeting multi-core architectures for linear algebra applications 针对线性代数应用的多核架构
Pub Date : 2006-11-11 DOI: 10.1145/1188455.1188623
A. Buttari, J. Kurzak, J. Dongarra
We are on the verge of a paradigm shift with our software for the new multicore architectures and there is no free lunch for conventional software. Power consumption and heat dissipation issues are pushing the microprocessor industry towards multicore design patterns. With the number of cores on multicore chips expected to reach tens to perhaps hundreds in a few years, efficient implementations of numerical libraries using shared memory programming models is of high interest. The current message passing paradigm used in ScaLAPACK and elsewhere introduces unnecessary memory overhead and memory copy operations, which degrade performance, along with making it harder to schedule operations that could be done in parallel. Limiting the use of shared memory to fork-join parallelism (perhaps with OpenMP) or to focusing the parallelism within the BLAS does not address all these issues.
我们的软件正处于新的多核架构的范式转变的边缘,传统软件没有免费的午餐。功耗和散热问题正在推动微处理器行业向多核设计模式发展。随着多核芯片上的内核数量预计在几年内达到数十到数百个,使用共享内存编程模型的数字库的有效实现非常有趣。当前在ScaLAPACK和其他地方使用的消息传递范式引入了不必要的内存开销和内存复制操作,这会降低性能,并且使可以并行完成的操作更难调度。将共享内存的使用限制为fork-join并行性(可能使用OpenMP)或将并行性集中在BLAS内并不能解决所有这些问题。
{"title":"Targeting multi-core architectures for linear algebra applications","authors":"A. Buttari, J. Kurzak, J. Dongarra","doi":"10.1145/1188455.1188623","DOIUrl":"https://doi.org/10.1145/1188455.1188623","url":null,"abstract":"We are on the verge of a paradigm shift with our software for the new multicore architectures and there is no free lunch for conventional software. Power consumption and heat dissipation issues are pushing the microprocessor industry towards multicore design patterns. With the number of cores on multicore chips expected to reach tens to perhaps hundreds in a few years, efficient implementations of numerical libraries using shared memory programming models is of high interest. The current message passing paradigm used in ScaLAPACK and elsewhere introduces unnecessary memory overhead and memory copy operations, which degrade performance, along with making it harder to schedule operations that could be done in parallel. Limiting the use of shared memory to fork-join parallelism (perhaps with OpenMP) or to focusing the parallelism within the BLAS does not address all these issues.","PeriodicalId":115940,"journal":{"name":"Proceedings of the 2006 ACM/IEEE conference on Supercomputing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129140176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Engineering the 100 terabyte turbulence database (or how to track particles at home) 设计100tb的湍流数据库(或者如何在家跟踪粒子)
Pub Date : 2006-11-11 DOI: 10.1145/1188455.1188625
E. Perlman, R. Burns
We describe a new environment for large-scale turbulence simulations that uses a cluster of database nodes to store the complete space-time history of fluid velocities. This allows for rapid access to high resolution data that were traditionally too large to store and too computationally expensive to produce on demand.We perform the actual experimental analysis inside the database nodes, which allows for data-intensive computations to be performed across a large number of nodes with relatively little network traffic.We currently have a limited-scale prototype system running actual turbulence simulations and are in the process of establishing a production cluster with high-resolution data. We will discuss our design choices and initial results with load balancing a data-intensive, migratory workload.
我们描述了一个大规模湍流模拟的新环境,它使用一组数据库节点来存储流体速度的完整时空历史。这允许快速访问高分辨率数据,这些数据传统上太大而无法存储,并且计算成本太高而无法按需生成。我们在数据库节点内执行实际的实验分析,这允许在相对较少的网络流量下跨大量节点执行数据密集型计算。我们目前有一个有限规模的原型系统,可以运行实际的湍流模拟,并正在建立一个具有高分辨率数据的生产集群。我们将讨论我们的设计选择和对数据密集型迁移工作负载进行负载平衡的初步结果。
{"title":"Engineering the 100 terabyte turbulence database (or how to track particles at home)","authors":"E. Perlman, R. Burns","doi":"10.1145/1188455.1188625","DOIUrl":"https://doi.org/10.1145/1188455.1188625","url":null,"abstract":"We describe a new environment for large-scale turbulence simulations that uses a cluster of database nodes to store the complete space-time history of fluid velocities. This allows for rapid access to high resolution data that were traditionally too large to store and too computationally expensive to produce on demand.We perform the actual experimental analysis inside the database nodes, which allows for data-intensive computations to be performed across a large number of nodes with relatively little network traffic.We currently have a limited-scale prototype system running actual turbulence simulations and are in the process of establishing a production cluster with high-resolution data. We will discuss our design choices and initial results with load balancing a data-intensive, migratory workload.","PeriodicalId":115940,"journal":{"name":"Proceedings of the 2006 ACM/IEEE conference on Supercomputing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129916728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1