首页 > 最新文献

Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)最新文献

英文 中文
Clash of the Timelines: Lessons Learned from the Front Lines of CS Education 时间线的冲突:从计算机科学教育的前线吸取的教训
Diane A. Baxter, B. Simon
Public education in the United States is challenged by the rapidly changing skills and knowledge required by the global workforce. Technology innovation drives the evolution of workforce requirements at a pace that is roughly tied to Moore's law, with significant changes every few months to years. In contrast, significant public K-12 school reform has historically required decades. The conflict between the two evolutionary rates, orders of magnitude apart, creates a crisis for sustaining the US economy requiring urgent, innovative, and sustainable solutions. Insightful, strategic thinkers representing a coalition of national agencies and organizations launched a project designed to infuse innovative computer science curriculum into pre-college public schools through the most expeditious pathway available: creation of a new advanced placement (AP) course. The new AP exam is based on a course taught with many titles at the college level that focuses on learning objectives within computer science (CS) principles. If taught with pedagogy that includes and supports traditionally underrepresented students, the course provides diverse students with foundational understanding of the underlying logic, grammar, communication skills, and problem-solving approaches of computational thinking-- essential skills and knowledge for becoming contributors to the country's economic survival. Pilot high school courses based on college level courses that meet the above learning objectives are being introduced into high schools through a national teacher professional development initiative. The goal of this initiative, the CS 10K project, is to train ten thousand teachers to teach CS Principles in ten thousand secondary schools by the time the new AP exam unrolls in 2016-17. Several NSF projects supporting this initiative are undergoing careful evaluation. This paper describes one of these projects and its teacher, student, and district-level outcomes to date. It also considers ways that the positive outcomes might be scaled and sustained, addressing the larger challenge posed above of creating a sustainable strategy for accelerating the pace of educational adaptation to technology's more rapid global transformation.
美国的公共教育受到全球劳动力所需的快速变化的技能和知识的挑战。技术创新推动劳动力需求的演变,其速度与摩尔定律大致相关,每隔几个月到几年就会发生重大变化。相比之下,重大的公立K-12学校改革历史上需要几十年的时间。两种进化速度之间的冲突(数量级之差)给维持美国经济带来了危机,需要紧急、创新和可持续的解决方案。代表国家机构和组织联盟的富有洞察力的战略思想家发起了一个项目,旨在通过最快捷的途径将创新的计算机科学课程注入公立学校的大学预科课程:创建一门新的大学先修课程(AP)。新AP考试的基础是一门大学水平的课程,其中有许多题目,重点是计算机科学(CS)原理中的学习目标。如果采用包括和支持传统上代表性不足的学生的教学方法,该课程将为不同的学生提供对基本逻辑、语法、沟通技巧和计算思维解决问题方法的基本理解——这些都是为国家经济生存做出贡献的基本技能和知识。在满足上述学习目标的大学水平课程的基础上,正在通过一项国家教师专业发展倡议将试点高中课程引入高中。CS 10K项目的目标是在2016-17年新的AP考试开始之前,培训1万名教师在1万所中学教授CS原则。支持这一倡议的几个国家科学基金会项目正在进行仔细的评估。本文介绍了其中一个项目及其迄今为止在教师、学生和地区层面取得的成果。它还考虑了扩大和维持积极成果的方式,以应对上述提出的更大挑战,即制定可持续战略,加快教育适应技术更快的全球转型的步伐。
{"title":"Clash of the Timelines: Lessons Learned from the Front Lines of CS Education","authors":"Diane A. Baxter, B. Simon","doi":"10.1145/2616498.2616574","DOIUrl":"https://doi.org/10.1145/2616498.2616574","url":null,"abstract":"Public education in the United States is challenged by the rapidly changing skills and knowledge required by the global workforce. Technology innovation drives the evolution of workforce requirements at a pace that is roughly tied to Moore's law, with significant changes every few months to years. In contrast, significant public K-12 school reform has historically required decades. The conflict between the two evolutionary rates, orders of magnitude apart, creates a crisis for sustaining the US economy requiring urgent, innovative, and sustainable solutions.\u0000 Insightful, strategic thinkers representing a coalition of national agencies and organizations launched a project designed to infuse innovative computer science curriculum into pre-college public schools through the most expeditious pathway available: creation of a new advanced placement (AP) course.\u0000 The new AP exam is based on a course taught with many titles at the college level that focuses on learning objectives within computer science (CS) principles. If taught with pedagogy that includes and supports traditionally underrepresented students, the course provides diverse students with foundational understanding of the underlying logic, grammar, communication skills, and problem-solving approaches of computational thinking-- essential skills and knowledge for becoming contributors to the country's economic survival.\u0000 Pilot high school courses based on college level courses that meet the above learning objectives are being introduced into high schools through a national teacher professional development initiative. The goal of this initiative, the CS 10K project, is to train ten thousand teachers to teach CS Principles in ten thousand secondary schools by the time the new AP exam unrolls in 2016-17. Several NSF projects supporting this initiative are undergoing careful evaluation.\u0000 This paper describes one of these projects and its teacher, student, and district-level outcomes to date. It also considers ways that the positive outcomes might be scaled and sustained, addressing the larger challenge posed above of creating a sustainable strategy for accelerating the pace of educational adaptation to technology's more rapid global transformation.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"5 1","pages":"69:1-69:5"},"PeriodicalIF":0.0,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78193999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
LittleFe and BCCD as a successful on-ramp to HPC little lefe和BCCD作为高性能计算的成功入口
Ivan Babic, Aaron Weeden, Mobeen Ludin, S. Thompson, Charles Peck, Kristin Muterspaw, Andrew Fitz Gibbon, Jennifer Houchins, Tom Murphy
The LittleFe/BCCD Project is a combination of hardware, software, and curriculum to facilitate the inclusion of high-performance computing (HPC), data-enabled science (DES), and computational science (CS) education in high school, undergraduate, and graduate settings. The hardware is a six-node computational cluster whose parts cost $2800, easily assembled by students in a day, and small enough to be checked on an airline. The software is a Linux distribution called Bootable Cluster CD (BCCD) that will configure a HPC cluster in under five minutes. BCCD was created in conjunction with LittleFe, and continues to be maintained alongside LittleFe. The curriculum modules are written by faculty who use LittleFe in their classrooms and cover a wide range of topics. The combination of these three has proven to be very effective in delivering HPC, DES, and CS education in this environment.
litefe /BCCD项目是硬件、软件和课程的结合,以促进高性能计算(HPC)、数据支持科学(DES)和计算科学(CS)教育在高中、本科和研究生的设置。硬件是一个六节点计算集群,其部件价值2800美元,学生一天就能轻松组装好,而且体积小到可以在飞机上检查。该软件是一个名为Bootable Cluster CD (BCCD)的Linux发行版,可以在五分钟内配置一个HPC集群。BCCD是与little lefe一起创建的,并将继续与little lefe一起维护。课程模块由在课堂上使用littleffe的教师编写,涵盖了广泛的主题。事实证明,在这种环境下,这三者的结合在提供HPC、DES和CS教育方面非常有效。
{"title":"LittleFe and BCCD as a successful on-ramp to HPC","authors":"Ivan Babic, Aaron Weeden, Mobeen Ludin, S. Thompson, Charles Peck, Kristin Muterspaw, Andrew Fitz Gibbon, Jennifer Houchins, Tom Murphy","doi":"10.1145/2616498.2616569","DOIUrl":"https://doi.org/10.1145/2616498.2616569","url":null,"abstract":"The LittleFe/BCCD Project is a combination of hardware, software, and curriculum to facilitate the inclusion of high-performance computing (HPC), data-enabled science (DES), and computational science (CS) education in high school, undergraduate, and graduate settings. The hardware is a six-node computational cluster whose parts cost $2800, easily assembled by students in a day, and small enough to be checked on an airline. The software is a Linux distribution called Bootable Cluster CD (BCCD) that will configure a HPC cluster in under five minutes. BCCD was created in conjunction with LittleFe, and continues to be maintained alongside LittleFe. The curriculum modules are written by faculty who use LittleFe in their classrooms and cover a wide range of topics. The combination of these three has proven to be very effective in delivering HPC, DES, and CS education in this environment.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"32 1","pages":"73:1-73:7"},"PeriodicalIF":0.0,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90399368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
XSEDE Campus Bridging Pilot Case Study XSEDE校园桥接试点案例研究
B. Hallock, R. Knepper, J. Ferguson, C. Stewart
The major goals of the XSEDE Campus Bridging pilot were to simplify the transition between resources local to the researcher and those at the national scale, as well as those resources intermediary to them; to put in place software and other resources that facilitate diverse researcher workflows; and to begin resolving programming and usability issues with the software selected for these purposes. In this paper, we situate the pilot within the domain of existing research cyberinfrastructure (and in the context of campus bridging) and examine the process by which the pilot program was completed and evaluated. We then present a status update for the selected software packages and explore further advancements to be made in this realm.
XSEDE校园桥接试点的主要目标是简化研究人员当地资源与国家规模资源之间的过渡,以及这些资源的中介;部署软件和其他资源,以促进不同的研究人员工作流程;并开始解决为这些目的而选择的软件的编程和可用性问题。在本文中,我们将试点置于现有研究网络基础设施领域(以及校园桥接的背景下),并检查试点计划完成和评估的过程。然后,我们为所选软件包提供一个状态更新,并探索在这个领域要做的进一步进展。
{"title":"XSEDE Campus Bridging Pilot Case Study","authors":"B. Hallock, R. Knepper, J. Ferguson, C. Stewart","doi":"10.1145/2616498.2616570","DOIUrl":"https://doi.org/10.1145/2616498.2616570","url":null,"abstract":"The major goals of the XSEDE Campus Bridging pilot were to simplify the transition between resources local to the researcher and those at the national scale, as well as those resources intermediary to them; to put in place software and other resources that facilitate diverse researcher workflows; and to begin resolving programming and usability issues with the software selected for these purposes. In this paper, we situate the pilot within the domain of existing research cyberinfrastructure (and in the context of campus bridging) and examine the process by which the pilot program was completed and evaluated. We then present a status update for the selected software packages and explore further advancements to be made in this realm.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"11 1","pages":"77:1-77:5"},"PeriodicalIF":0.0,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90767429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Minimum Time, Maximum Effect: Introducing Parallel Computing in CS0 and STEM Outreach Activities Using Scratch 最短时间,最大效果:在CS0和STEM推广活动中使用Scratch引入并行计算
Russell Feldhausen, R. Bell, Daniel Andresen
This paper presents our experiences and outcomes using Scratch to teach parallel computing concepts to students just learning about computer science. We presented versions of this material to middle school and high school girls during a STEM workshop and then to undergraduate university students enrolled in an introductory computer science course. Using the Scratch development environment, students are able to build, modify and observe the changes in the performance of applications which utilize multi-threaded, concurrent, operations. This includes scenarios which involve more advanced topics such as race conditions and mutex locks. Developing these materials has allowed us to introduce these concepts in a programming environment much earlier than we have previously, giving instructors in down-stream courses the ability to build upon this early exposure. Survey results show that this approach resulted in a significant increase in both of these areas. For example, the number of students in our CS0 course who felt they could apply parallel programming to other problems using Scratch more than doubled, rising from 25 to 55 (out of 61 students that responded to both surveys). Likewise, the number of students who felt they understood what parallel programming means rose from 27 to 56. These results were achieved after just one class period. Similarly, 27 of the 37 girls responding to the workshop survey felt that they were capable of learning to write computer programs and 22 of 41 indicated they had an interest in a job using HPC to solve problems.
本文介绍了我们使用Scratch向刚学习计算机科学的学生教授并行计算概念的经验和结果。我们在STEM研讨会上向初中和高中女生展示了这些材料的版本,然后向参加计算机科学入门课程的大学生展示了这些材料的版本。使用Scratch开发环境,学生能够构建、修改和观察使用多线程、并发操作的应用程序的性能变化。这包括涉及更高级主题的场景,如竞争条件和互斥锁。开发这些材料使我们能够比以前更早地在编程环境中介绍这些概念,使下游课程的讲师能够在这些早期接触的基础上进行构建。调查结果表明,这一方法在这两个方面都取得了显著的增长。例如,在我们的CS0课程中,认为他们可以使用Scratch将并行编程应用于其他问题的学生人数增加了一倍多,从25人增加到55人(在61名学生中,两项调查都有回应)。同样,认为自己理解并行编程含义的学生人数从27人增加到56人。这些结果仅仅是在一节课之后取得的。同样,37名参与研讨会调查的女孩中有27人认为她们有能力学习编写计算机程序,41名女孩中有22人表示她们对使用高性能计算解决问题的工作感兴趣。
{"title":"Minimum Time, Maximum Effect: Introducing Parallel Computing in CS0 and STEM Outreach Activities Using Scratch","authors":"Russell Feldhausen, R. Bell, Daniel Andresen","doi":"10.1145/2616498.2616568","DOIUrl":"https://doi.org/10.1145/2616498.2616568","url":null,"abstract":"This paper presents our experiences and outcomes using Scratch to teach parallel computing concepts to students just learning about computer science. We presented versions of this material to middle school and high school girls during a STEM workshop and then to undergraduate university students enrolled in an introductory computer science course. Using the Scratch development environment, students are able to build, modify and observe the changes in the performance of applications which utilize multi-threaded, concurrent, operations. This includes scenarios which involve more advanced topics such as race conditions and mutex locks.\u0000 Developing these materials has allowed us to introduce these concepts in a programming environment much earlier than we have previously, giving instructors in down-stream courses the ability to build upon this early exposure. Survey results show that this approach resulted in a significant increase in both of these areas. For example, the number of students in our CS0 course who felt they could apply parallel programming to other problems using Scratch more than doubled, rising from 25 to 55 (out of 61 students that responded to both surveys). Likewise, the number of students who felt they understood what parallel programming means rose from 27 to 56. These results were achieved after just one class period. Similarly, 27 of the 37 girls responding to the workshop survey felt that they were capable of learning to write computer programs and 22 of 41 indicated they had an interest in a job using HPC to solve problems.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"396 1","pages":"75:1-75:7"},"PeriodicalIF":0.0,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76916665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
An Analysis of Node Sharing on HPC Clusters using XDMoD/TACC_Stats 基于XDMoD/TACC_Stats的HPC集群节点共享分析
Joseph P. White, R. L. Deleon, T. Furlani, S. Gallo, Matthew D. Jones, Amin Ghadersohi, Cynthia D. Cornelius, A. Patra, J. Browne, W. Barth, John L. Hammond
When a user requests less than a full node for a job on XSEDE's large resources - Stampede and Lonestar4 -, that is less than 16 cores on Stampede or 12 cores on Lonestar4, they are assigned a full node by policy. Although the actual CPU hours consumed by these jobs is small when compared to the total CPU hours delivered by these resources, they do represent a substantial fraction of the total number of jobs (~18% for Stampede and ~15% for Lonestar4 between January and February 2014). Academic HPC centers, such as the Center for Computational Research (CCR) at the University at Buffalo, SUNY typically have a much larger proportion of small jobs than the large XSEDE systems. For CCR's production cluster, Rush, the decision was made to allow the allocation of simultaneous jobs on the same node. This greatly increases the overall throughput but also raises questions whether the jobs that share the same node will interfere with one another. We present here an analysis that explores this issue using data from Rush, Stampede and Lonestar4. Analysis of usage data indicates little interference.
当用户为XSEDE的大型资源(Stampede和Lonestar4)上的作业请求少于一个完整节点时,即在Stampede上少于16个内核,在Lonestar4上少于12个内核,则根据策略为它们分配一个完整节点。尽管与这些资源交付的总CPU时间相比,这些作业实际消耗的CPU时间很少,但它们确实占作业总数的很大一部分(2014年1月至2月期间Stampede约占18%,Lonestar4约占15%)。学术高性能计算中心,如纽约州立大学布法罗分校的计算研究中心(CCR),通常比大型XSEDE系统拥有更大比例的小型作业。对于CCR的生产集群Rush,决定允许在同一节点上分配并发作业。这极大地提高了总体吞吐量,但也提出了共享同一节点的作业是否会相互干扰的问题。我们在此使用《Rush》、《Stampede》和《Lonestar4》的数据分析了这一问题。对使用数据的分析表明干扰很小。
{"title":"An Analysis of Node Sharing on HPC Clusters using XDMoD/TACC_Stats","authors":"Joseph P. White, R. L. Deleon, T. Furlani, S. Gallo, Matthew D. Jones, Amin Ghadersohi, Cynthia D. Cornelius, A. Patra, J. Browne, W. Barth, John L. Hammond","doi":"10.1145/2616498.2616533","DOIUrl":"https://doi.org/10.1145/2616498.2616533","url":null,"abstract":"When a user requests less than a full node for a job on XSEDE's large resources - Stampede and Lonestar4 -, that is less than 16 cores on Stampede or 12 cores on Lonestar4, they are assigned a full node by policy. Although the actual CPU hours consumed by these jobs is small when compared to the total CPU hours delivered by these resources, they do represent a substantial fraction of the total number of jobs (~18% for Stampede and ~15% for Lonestar4 between January and February 2014). Academic HPC centers, such as the Center for Computational Research (CCR) at the University at Buffalo, SUNY typically have a much larger proportion of small jobs than the large XSEDE systems. For CCR's production cluster, Rush, the decision was made to allow the allocation of simultaneous jobs on the same node. This greatly increases the overall throughput but also raises questions whether the jobs that share the same node will interfere with one another. We present here an analysis that explores this issue using data from Rush, Stampede and Lonestar4. Analysis of usage data indicates little interference.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"13 1","pages":"31:1-31:8"},"PeriodicalIF":0.0,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85051787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
SR-IOV: Performance Benefits for Virtualized Interconnects SR-IOV:虚拟化互联的性能优势
Glenn K. Lockwood, M. Tatineni, R. Wagner
The demand for virtualization within high-performance computing is rapidly growing as new communities, driven by both new application stacks and new computing modalities, continue to grow and expand. While virtualization has traditionally come with significant penalties in I/O performance that have precluded its use in mainstream large-scale computing environments, new standards such as Single Root I/O Virtualization (SR-IOV) are emerging that promise to diminish the performance gap and make high-performance virtualization possible. To this end, we have evaluated SR-IOV in the context of both virtualized InfiniBand and virtualized 10 gigabit Ethernet (GbE) using micro-benchmarks and real-world applications. We compare the performance of these interconnects on non-virtualized environments, Amazon's SR-IOV-enabled C3 instances, and our own SR-IOV-enabled InfiniBand cluster and show that SR-IOV significantly reduces the performance losses caused by virtualization. InfiniBand demonstrates less than 2% loss of bandwidth and less than 10% increase in latency when virtualized with SR-IOV. Ethernet also benefits, although less dramatically, when SR-IOV is enabled on Amazon's cloud.
在新的应用程序栈和新的计算模式的推动下,随着新社区的不断发展和扩展,高性能计算中对虚拟化的需求正在迅速增长。虽然传统上虚拟化在I/O性能上有很大的缺点,这使得它无法在主流的大规模计算环境中使用,但新的标准,如Single Root I/O虚拟化(SR-IOV)正在出现,它们承诺缩小性能差距,使高性能虚拟化成为可能。为此,我们使用微基准测试和实际应用,在虚拟化InfiniBand和虚拟化10gb以太网(GbE)的背景下评估了SR-IOV。我们比较了这些互连在非虚拟化环境、Amazon支持SR-IOV的C3实例和我们自己支持SR-IOV的InfiniBand集群上的性能,结果表明SR-IOV显著降低了虚拟化造成的性能损失。InfiniBand在使用SR-IOV虚拟化时,带宽损失不到2%,延迟增加不到10%。当在亚马逊的云上启用SR-IOV时,以太网也会受益,尽管没有那么明显。
{"title":"SR-IOV: Performance Benefits for Virtualized Interconnects","authors":"Glenn K. Lockwood, M. Tatineni, R. Wagner","doi":"10.1145/2616498.2616537","DOIUrl":"https://doi.org/10.1145/2616498.2616537","url":null,"abstract":"The demand for virtualization within high-performance computing is rapidly growing as new communities, driven by both new application stacks and new computing modalities, continue to grow and expand. While virtualization has traditionally come with significant penalties in I/O performance that have precluded its use in mainstream large-scale computing environments, new standards such as Single Root I/O Virtualization (SR-IOV) are emerging that promise to diminish the performance gap and make high-performance virtualization possible.\u0000 To this end, we have evaluated SR-IOV in the context of both virtualized InfiniBand and virtualized 10 gigabit Ethernet (GbE) using micro-benchmarks and real-world applications. We compare the performance of these interconnects on non-virtualized environments, Amazon's SR-IOV-enabled C3 instances, and our own SR-IOV-enabled InfiniBand cluster and show that SR-IOV significantly reduces the performance losses caused by virtualization. InfiniBand demonstrates less than 2% loss of bandwidth and less than 10% increase in latency when virtualized with SR-IOV. Ethernet also benefits, although less dramatically, when SR-IOV is enabled on Amazon's cloud.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"92 1","pages":"47:1-47:7"},"PeriodicalIF":0.0,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84094715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
A Tool for Interactive Parallelization 交互式并行化工具
R. Arora, Julio C. Olaya, Madhav Gupta
"Tell me, I'll forget, Show me, I may remember, Involve me, I'll understand." - Confucius This proverb describes the essence of our paper and the motivation behind the development of the Interactive Parallelization Tool (IPT) that can transform serial applications into multiple parallel variants. The end-users of IPT must have an understanding of the basic concepts involved in parallel programming (e.g., data distribution and data gathering). After developing an understanding of the basic parallel programming concepts, IPT can be used by its target audience (domain-experts and students) to semi-automatically generate parallel programs using multiple parallel programming paradigms (MPI, OpenMP, and CUDA), and learn about these paradigms through observation and comparison. This IPT-based personalized learning approach complements the traditional methods of learning and training that usually emphasize the syntax and semantics of one or more programming standards. The main benefit of IPT is that it provides a jumpstart to the domain-experts in using modern HPC platforms for their research and development needs, and hence lowers the adoption barriers to HPC.
“告诉我,我会忘记;展示给我,我会记得;让我参与,我会明白。”这句谚语描述了我们论文的精髓,以及开发交互式并行化工具(IPT)的动机,IPT可以将串行应用程序转换为多个并行变体。IPT的最终用户必须理解并行编程所涉及的基本概念(例如,数据分发和数据收集)。在理解了基本的并行编程概念之后,IPT可以被它的目标受众(领域专家和学生)用来使用多个并行编程范式(MPI、OpenMP和CUDA)半自动地生成并行程序,并通过观察和比较来学习这些范式。这种基于ipt的个性化学习方法补充了传统的学习和培训方法,传统的学习和培训方法通常强调一个或多个编程标准的语法和语义。IPT的主要好处是,它为使用现代HPC平台满足其研究和开发需求的领域专家提供了一个快速起步,从而降低了采用HPC的障碍。
{"title":"A Tool for Interactive Parallelization","authors":"R. Arora, Julio C. Olaya, Madhav Gupta","doi":"10.1145/2616498.2616558","DOIUrl":"https://doi.org/10.1145/2616498.2616558","url":null,"abstract":"\"Tell me, I'll forget,\u0000 Show me, I may remember,\u0000 Involve me, I'll understand.\" - Confucius\u0000 This proverb describes the essence of our paper and the motivation behind the development of the Interactive Parallelization Tool (IPT) that can transform serial applications into multiple parallel variants. The end-users of IPT must have an understanding of the basic concepts involved in parallel programming (e.g., data distribution and data gathering). After developing an understanding of the basic parallel programming concepts, IPT can be used by its target audience (domain-experts and students) to semi-automatically generate parallel programs using multiple parallel programming paradigms (MPI, OpenMP, and CUDA), and learn about these paradigms through observation and comparison. This IPT-based personalized learning approach complements the traditional methods of learning and training that usually emphasize the syntax and semantics of one or more programming standards. The main benefit of IPT is that it provides a jumpstart to the domain-experts in using modern HPC platforms for their research and development needs, and hence lowers the adoption barriers to HPC.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"6 1","pages":"51:1-51:8"},"PeriodicalIF":0.0,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84269698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Accessible Parallelization for the Open Modeling Interface 开放建模接口的可访问并行化
Tom Bulatewicz, Daniel Andresen
As the availability of computing infrastructure continues to increase, so too does the need for accessible means for utilizing those resources. An effective approach is to enable desktop-oriented scientific software tools and frameworks to support execution on high performance cyberinfrastructure in a way that is transparent to the user. We have found this to be the case in our ongoing environmental modeling study in which we are applying multidisciplinary, integrated models to the study of a depleting aquifer. Our models are linked together using the Open Modeling Interface (OpenMI) which provides a composition framework for the sequential execution of model components. In this work we investigate the potential for incorporating parallelism into the OpenMI as a first-class citizen. We present a general solution in which model components may be executed in parallel without requiring changes to their source code. An alternate solution achieves greater parallelism through simultaneous invocations of individual components, but requires them to be modified in some cases. These can result in significant reductions in simulation runtimes on both multi-core desktop machines as well as in high performance computing environments. We demonstrate this potential speedup in a performance study in which the application of the general solution achieved 86% of linear speedup when executed on a high performance machine with 80 cores.
随着计算基础设施的可用性不断增加,对利用这些资源的可访问方法的需求也在增加。一种有效的方法是使面向桌面的科学软件工具和框架能够以对用户透明的方式支持高性能网络基础设施的执行。我们在正在进行的环境建模研究中发现了这种情况,我们正在应用多学科综合模型来研究枯竭的含水层。我们的模型使用开放建模接口(OpenMI)链接在一起,该接口为模型组件的顺序执行提供了一个组合框架。在这项工作中,我们研究了将并行性作为一流公民纳入OpenMI的可能性。我们提出了一个通用的解决方案,其中模型组件可以并行执行,而不需要更改它们的源代码。另一种解决方案通过同时调用各个组件来实现更高的并行性,但在某些情况下需要对它们进行修改。这可以显著减少多核桌面机器和高性能计算环境中的模拟运行时间。我们在性能研究中展示了这种潜在的加速,其中通用解决方案的应用程序在具有80核的高性能机器上执行时实现了86%的线性加速。
{"title":"Accessible Parallelization for the Open Modeling Interface","authors":"Tom Bulatewicz, Daniel Andresen","doi":"10.1145/2616498.2616566","DOIUrl":"https://doi.org/10.1145/2616498.2616566","url":null,"abstract":"As the availability of computing infrastructure continues to increase, so too does the need for accessible means for utilizing those resources. An effective approach is to enable desktop-oriented scientific software tools and frameworks to support execution on high performance cyberinfrastructure in a way that is transparent to the user. We have found this to be the case in our ongoing environmental modeling study in which we are applying multidisciplinary, integrated models to the study of a depleting aquifer. Our models are linked together using the Open Modeling Interface (OpenMI) which provides a composition framework for the sequential execution of model components. In this work we investigate the potential for incorporating parallelism into the OpenMI as a first-class citizen. We present a general solution in which model components may be executed in parallel without requiring changes to their source code. An alternate solution achieves greater parallelism through simultaneous invocations of individual components, but requires them to be modified in some cases. These can result in significant reductions in simulation runtimes on both multi-core desktop machines as well as in high performance computing environments. We demonstrate this potential speedup in a performance study in which the application of the general solution achieved 86% of linear speedup when executed on a high performance machine with 80 cores.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"36 1","pages":"52:1-52:8"},"PeriodicalIF":0.0,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81638902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
MOVIE: Large Scale Automated Analysis of MOVing ImagEs 电影:运动图像的大规模自动分析
Virginia Kuhn, Michael Simeone, Luigi Marini, Dave Bock, Alan B. Craig, Liana Diesendruck, Sandeep Puthanveetil Satheesan
In this paper we describe our efforts at establishing a software workbench for video analysis, annotation, and visualization, using both current and experimental discovery methods. This project builds upon our previous research with video and image analysis, and joins the emergent field of cultural analytics in the digital humanities. Moving image media is particularly ripe for computational analysis given its increasing ubiquity in contemporary culture. Hoping to make video more legible as a big data format, we employ visual media in the public domain and we focus on crowd-sourced annotation, aural and visual analysis and visualization of extracted image data. Our goal is to fill in existing gaps for asking cultural questions about video archives using computers, we also experiment with transformative methods in video research and analysis. Our long term goal is to allow researchers to move with agility from textual description and collection management, to manual inspection, to automated analysis, to visualization of discrete films as well as whole collections.
在本文中,我们描述了我们在使用当前和实验发现方法建立视频分析,注释和可视化软件工作台方面所做的努力。这个项目建立在我们之前对视频和图像分析的研究基础上,并加入了数字人文学科中新兴的文化分析领域。考虑到动态图像媒体在当代文化中的日益普及,它对于计算分析来说尤其成熟。为了让视频作为一种大数据格式变得更清晰,我们采用公共领域的视觉媒体,专注于对提取的图像数据进行众包标注、视听分析和可视化。我们的目标是填补现有的空白,用计算机来询问关于视频档案的文化问题,我们也在视频研究和分析中尝试变革的方法。我们的长期目标是使研究人员能够灵活地从文本描述和收藏管理,到人工检查,到自动分析,到离散电影以及整个收藏的可视化。
{"title":"MOVIE: Large Scale Automated Analysis of MOVing ImagEs","authors":"Virginia Kuhn, Michael Simeone, Luigi Marini, Dave Bock, Alan B. Craig, Liana Diesendruck, Sandeep Puthanveetil Satheesan","doi":"10.1145/2616498.2616529","DOIUrl":"https://doi.org/10.1145/2616498.2616529","url":null,"abstract":"In this paper we describe our efforts at establishing a software workbench for video analysis, annotation, and visualization, using both current and experimental discovery methods. This project builds upon our previous research with video and image analysis, and joins the emergent field of cultural analytics in the digital humanities. Moving image media is particularly ripe for computational analysis given its increasing ubiquity in contemporary culture. Hoping to make video more legible as a big data format, we employ visual media in the public domain and we focus on crowd-sourced annotation, aural and visual analysis and visualization of extracted image data. Our goal is to fill in existing gaps for asking cultural questions about video archives using computers, we also experiment with transformative methods in video research and analysis. Our long term goal is to allow researchers to move with agility from textual description and collection management, to manual inspection, to automated analysis, to visualization of discrete films as well as whole collections.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"77 1","pages":"21:1-21:3"},"PeriodicalIF":0.0,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84449422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Revision Control System (RCS) in computational sciences and engineering curriculum 计算科学与工程课程的修订控制系统(RCS
S. Gowtham
The Revision Control System (RCS) is an essential aspect of software development process and software configuration management. While continuing to be an integral component of the real world, it is often left out of the main stream curriculum in most academic institutions. Instead, students are expected to learn it on their own, as a hobby or as an independent study, out of personal interest. The author describes the experiences gained from attempting to implement a distributed Revision Control System, Git, as part of the computational sciences and engineering curriculum at the undergraduate and graduate levels. The author also describes the advantages for both parties involved: improving the competency of students and preparing them for the real world expectations while providing the teacher an opportunity to provide timely feedback to the students and monitor their progress. The availability of free and open source tools used to analyze and visualize the commit history to the repository helps teachers and students observe submission and feedback patterns respectively.
版本控制系统(RCS)是软件开发过程和软件配置管理的一个重要方面。虽然它仍然是现实世界不可或缺的组成部分,但在大多数学术机构中,它经常被排除在主流课程之外。相反,学生们应该自己学习,作为一种爱好或独立学习,出于个人兴趣。作者描述了从尝试实现分布式版本控制系统(Git)中获得的经验,该系统作为本科和研究生阶段计算科学和工程课程的一部分。作者还描述了双方的优势:提高学生的能力,为现实世界的期望做好准备,同时为教师提供机会,向学生提供及时的反馈,并监督他们的进步。用于分析和可视化提交到存储库的历史的免费和开放源码工具的可用性可以帮助教师和学生分别观察提交和反馈模式。
{"title":"Revision Control System (RCS) in computational sciences and engineering curriculum","authors":"S. Gowtham","doi":"10.1145/2616498.2616576","DOIUrl":"https://doi.org/10.1145/2616498.2616576","url":null,"abstract":"The Revision Control System (RCS) is an essential aspect of software development process and software configuration management. While continuing to be an integral component of the real world, it is often left out of the main stream curriculum in most academic institutions. Instead, students are expected to learn it on their own, as a hobby or as an independent study, out of personal interest. The author describes the experiences gained from attempting to implement a distributed Revision Control System, Git, as part of the computational sciences and engineering curriculum at the undergraduate and graduate levels. The author also describes the advantages for both parties involved: improving the competency of students and preparing them for the real world expectations while providing the teacher an opportunity to provide timely feedback to the students and monitor their progress. The availability of free and open source tools used to analyze and visualize the commit history to the repository helps teachers and students observe submission and feedback patterns respectively.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"360 1","pages":"76:1-76:3"},"PeriodicalIF":0.0,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76441356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
期刊
Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1