Computing and Computational Sciences Directorate
SuperComputing 2009
SC2009 Newsletters
To subscribe to the SC09 Newsletter, click here.
ORNL/UT participation in SC2009 committees
- Media Room : Betsy Riley
- Advance Media Registration : Betsy Riley
- Conference Office Manager : Linda Duncan
- Space Chair : James Ferguson
- Signage Chair : Rebecca Hartman-Baker
- Technical Program Deputy Chair : Ricky Kendall
- Technical Papers Applications Area Committee Members
- Ed D'Azevedo
- George Fann
- Technical Papers Performance Area Committee Member : Jeffrey Vetter
- Technical Papers System Software Area Committee Members
- George Bosilca
- Al Geist
- Rich Graham
- Barney Maccabe
- Tutorials Committee Members
- John Cobb
- Jack Dongarra
- Jeffery Kuehn
- Posters Chair : Jack Dongarra
- Posters Algorithms Committee Members
- Piotr Luszczek
- Pat Worley
- Posters Applications Committee Member : Pat Worley
- Student Cluster Competition Committee Member : Hai Ah Nam
- Awards Deputy Chair : Jack Dongarra
- ACM Gordon Bell Committee Members
- Jack Dongarra
- Doug Kothe
- Exhibits Chair : Becky Verastegui
- Exhibits - Sustainability Chair : Jeff Kuehn
- Architecture Chair : Charles Fisher
- Commodity Network Chair : Rex Duncan
- Fiber Committee Members
- DA Fye
- Zachary Giles
- November Program - Student Volunteers : James Rome
- Student Volunteers Committee Members
- Bruce Loftis
- Robert Whitten
- Student Volunteer Vice Chair : Bruce Loftis
ORNL/UT participation in SC2009 Education Outreach
Information on current employment opportunities and on student programs (RAMS, internships, summer semesters, etc.) ORNL is sponsoring three students for the Education Program: will be available in the ORNL booth -- Debbie McCoy, contact. Research Alliance in Math and Science/ORNL will exhibit at the Student Fair on Thursday, Nov. 19.
ORNL/UT participation in SC2009 MSI Outreach
Debbie McCoy will be participating in the MSI meetings.
ORNL/UT participation in SC2009 technical papers
Terascale Data Organization for Discovering Multivariate Climatic Trends
Authors:
Wesley Kendall (University of Tennessee, Knoxville)
Markus Glatter (University of Tennessee, Knoxville)
Jian Huang (University of Tennessee, Knoxville)
Tom Peterka (Argonne National Laboratory)
Robert Latham (Argonne National Laboratory)
Robert Ross (Argonne National Laboratory)Papers Session - Large-Scale Applications
Tuesday, 10:30AM - 11:00AM Room PB255Abstract:
Current visualization tools lack the ability to perform full-range spatial and temporal analysis on terascale scientific datasets. Two key reasons exist for this shortcoming: I/O and postprocessing on these datasets are being performed in suboptimal manners, and the subsequent data extraction and analysis routines have not been studied in depth at large scales. We resolved these issues through advanced I/O techniques and improvements to current query-driven visualization methods. We show the efficiency of our approach by analyzing over a terabyte of multivariate satellite data and addressing two key issues in climate science: time-lag analysis and drought assessment. Our methods allowed us to reduce the end-to-end execution times on these problems to one minute on a Cray XT4 machine.
Scalable Computation of Streamlines on Very Large Datasets
Authors:
David Pugmire (Oak Ridge National Laboratory)
Hank Childs (Lawrence Berkeley National Laboratory)
Christoph Garth (University of California, Davis)
Sean Ahern (Oak Ridge National Laboratory)
Gunther Weber (Lawrence Berkeley National Laboratory)Papers Session - Large-Scale Applications
Tuesday, 11:30AM - 12:00PM Room PB255Abstract:
Understanding vector fields resulting from large scientific simulations is an important and often difficult task. Streamlines, curves that are tangential to a vector field at each point, are a powerful visualization method in this context. Application of streamline-based visualization to very large vector field data represents a significant challenge due to the non-local and data-dependent nature of streamline computation, and requires careful balancing of computational demands placed on I/O, memory, communication, and processors. In this paper we review two parallelization approaches based on established parallelization paradigms (static decomposition and on-demand loading) and present a novel hybrid algorithm for computing streamlines. Our algorithm is aimed at good scalability and performance across the widely varying computational characteristics of streamline-based problems. We perform performance and scalability studies of all three algorithms on a number of prototypical application problems and demonstrate that our hybrid scheme is able to perform well in different settings.
Instruction-Level Simulation of a Cluster at Scale
Authors:
Edgar Leon (University of New Mexico)
Rolf Riesen (Sandia National Laboratories)
Arthur B. Maccabe (Oak Ridge National Laboratory)
Patrick G. Bridges (University of New Mexico)Papers Session - System Performance Evaluation
Thursday, 01:30PM - 02:00PM - Room PB256Abstract:
Instruction-level simulation is necessary to evaluate new architectures. However, single-node simulation cannot predict the behavior of a parallel application on a supercomputer. We present a scalable simulator that couples a cycle-accurate node simulator with a supercomputer network model. Our simulator executes individual instances of IBM's Mambo PowerPC simulator on hundreds of cores. We integrated a NIC emulator into Mambo and model the network instead of fully simulating it. This decouples the individual node simulators and makes our design scalable. Our simulator runs unmodified parallel message-passing applications on hundreds of nodes. We can change network and detailed node parameters, inject network traffic directly into caches, and use different policies to decide when that is an advantage. This paper describes our simulator in detail, evaluates it, and demonstrates its scalability. We show its suitability for architecture research by evaluating the impact of cache injection on parallel application performance.
Dynamic Task Scheduling for Linear Algebra Algorithms on Distributed-Memory Multicore Systems
Authors:
Fengguang Song (University of Tennessee, Knoxville)
Asim YarKhan (University of Tennessee, Knoxville)
Jack Dongarra (University of Tennessee, Knoxville)Papers Session - Multicore Task Scheduling
Thursday, 01:30PM - 02:00PM Room PB255Abstract:
This paper presents a dynamic task scheduling approach to executing dense linear algebra algorithms on multicore systems (either shared-memory or distributed-memory). We use a task-based library to replace the existing linear algebra subroutines such as PBLAS to transparently provide the same interface and computational function as the ScaLAPACK library. Linear algebra programs are written with the task-based library and executed by a dynamic runtime system. We mainly focus our runtime system design on the metric of performance scalability. We propose a distributed algorithm to solve data dependences without process cooperation. We have implemented the runtime system and applied it to three linear algebra algorithms: Cholesky, LU, and QR factorizations. Our experiments on both shared-memory machines(16, 32 cores) and distributed-memory machines(1024 cores) demonstrate that our runtime system is able to achieve good scalability. Furthermore, we provide analytical analysis to show why the tiled algorithms are scalable and the expected execution time.
Comparative Study of One-Sided Factorizations with Multiple Software Packages on Multi-Core Hardware
Authors:
Emmanuel Agullo (University of Tennessee, Knoxville)
Bilel Hadri (University of Tennessee, Knoxville)
Hatem Ltaief (University of Tennessee, Knoxville)
Jack Dongarra (University of Tennessee, Knoxville)Papers Session - System Performance Evaluation
Thursday, 02:30PM - 03:00PM Room PB256Abstract:
The emergence and continuing use of multi-core architectures require changes in the existing software and sometimes even a redesign of the established algorithms in order to take advantage of now prevailing parallelism. The Parallel Linear Algebra for Scalable Multi-core Architectures (PLASMA) is a project that aims to achieve both high performance and portability across a wide range of multi-core architectures. We present in this paper a comparative study of PLASMA's performance against established linear algebra packages (LAPACK and ScaLAPACK), against new approaches at parallel execution (Task Based Linear Algebra Subroutines - TBLAS), and against equivalent commercial software offerings (MKL, ESSL and PESSL). Our experiments were conducted on one-sided linear algebra factorizations (LU, QR and Cholesky) and used multi-core architectures (based on Intel Xeon EMT64 and IBM Power6). The performance results show improvements brought by new algorithms on up to 32 cores - the largest multi-core system we could access.
ORNL/UT participation in SC2009 tutorials
VGrADS: Enabling e-Science Workflows on Grids and Clouds with Fault Tolerance
Authors:
Lavanya Ramakrishnan (Indiana University)
Daniel Nurmi (University of California, Santa Barbara)
Anirban Mandal (Renaissance Computing Institute)
Charles Koelbel (Rice University)
Dennis Gannon (Microsoft Research)
T. Mark Huang (University of Houston)
Yang-Suk Kee (Oracle)
Graziano Obertelli (University of California, Santa Barbara)
Kiran Thyagaraja (Rice University)
Rich Wolski (University of California, Santa Barbara)
Asim Yarkhan (University of Tennessee, Knoxville)
Dmitrii Zagorodnov (University of California, Santa Barbara)Papers Session - Dynamic Task Scheduling
Thursday, 03:30PM - 04:00PM Room E145-146Abstract:
Today's scientific workflows use distributed heterogeneous resources through diverse grid and cloud interfaces that are often hard to program. In addition, especially for time-sensitive critical applications, predictable quality of service is necessary across these distributed resources. VGrADS' virtual grid execution system (vgES) provides an uniform qualitative resource abstraction over grid and cloud systems. We apply vgES for scheduling a set of deadline sensitive weather forecasting workflows. Specifically, this paper reports on our experiences with (1) virtualized reservations for batch-queue systems, (2) coordinated usage of TeraGrid (batch queue), Amazon EC2 (cloud), our own clusters (batch queue) and Eucalyptus (cloud) resources, and (3) fault tolerance through automated task replication. The combined effect of these techniques was to enable a new workflow planning method to balance performance, reliability and cost considerations. The results point toward improved resource selection and execution management support for a variety of e-Science applications over grids and cloud systems.
S09: VisIt - Visualization and Analysis for Very Large Data Sets
Presenters:
Hank Childs (Lawrence Berkeley National Laboratory)
Sean Ahern (Oak Ridge National Laboratory)Tutorials Session
Sunday, 08:30AM - 12:00PM Room TBDAbstract:
Visualization and analysis plays a crucial role for scientific simulations as an enabling technology: exploring data, confirming hypotheses, and communicating results. This tutorial will focus on VisIt, an open source visualization and analysis tool designed for processing large data. The tool is intended for more than just visualization and is built around five primary use cases: data exploration, quantitative analysis, comparative analysis, visual debugging, and communication of results. VisIt has a client-server design for remote visualization, with a distributed memory parallel server. VisIt won an R&D 100 award in 2005, has been downloaded over 100,000 times, and is being developed by a large community. VisIt is used to visualization and analyze the results of hero runs on eight of the top twelve machines on top500.org. The tutorial will introduce VisIt, demonstrate how to do basic things in VisIt, and discuss how to do advanced analysis and visualizations.
M05: Productive Performance Engineering of Petascale Applications with POINT and VI-HPS
Presenters:
Rick Kufrin (National Center for Supercomputing Applications)
Sameer Shende (University of Oregon)
Brian Wylie (Juelich Supercomputing Centre)
Andreas Knuepfer (Technical University Dresden)
Allen Malony (University of Oregon)
Shirley Moore (University of Tennessee, Knoxville)
Nick Nystrom (Pittsburgh Supercomputing Center)
Felix Wolf (Juelich Supercomputing Centre)
Wolfgang Nagel (Technical University Dresden)Tutorials Session
Monday, 08:30AM - 05:00PM Room TBDAbstract:
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems, focusing on how the tools are used for performance engineering of petascale, scientific applications. Four parallel performance evaluation toolsets from the POINT (Performance Productivity through Open Integrated Tools) and VI-HPS (Virtual Institute – High Productivity Supercomputing) projects are discussed – PerfSuite, TAU, Scalasca, and Vampir. We cover all aspects of performance engineering practice, including instrumentation, measurement (profiling and tracing, timing and hardware counters), performance data storage, analysis, and visualization. Emphasis is placed on how performance tools are used in combination for identifying performance problems and investigating optimization alternatives. The tutorial will include hands-on exercises using a Live-DVD containing all of the tools, helping to prepare participants to apply modern methods for locating and diagnosing typical performance bottlenecks in real-world parallel programs at scale. In addition to the primary presenters, the principals in the POINT and VI-HPS projects will be present.
ORNL/UT participation in SC2009 panels
Applications Architecture Power Puzzle
Moderator: Mark Stalzer (California Institute of Technology)
Panelists:
Thomas Sterling (Louisiana State University)
Allan Snavely (San Diego Supercomputer Center)
Stephen Poole (Oak Ridge National Laboratory)
William Camp (Intel Corporation)Panels Session
Friday, 08:30AM - 10:00AM Room PB252Abstract:
Over the next few years there are two boundary conditions that should constrain computer systems architecture: commodity components and applications performance. Yet, these two seem strangely disconnected. Perhaps we need some human optimization, as opposed to repeated use of Moore’s Law. Our panelists have been given a set of standard components that are on announced vendor roadmaps. They also each get to make one mystery component of no more complexity than a commercially available FPGA. The applications are HPL for linear algebra, Map-Reduce for databases, and a sequence matching algorithm for biology. The panelists have 10 minutes to disclose their systems architecture and mystery component, and estimate performance for the three applications at 1MW of power.
The Road to Exascale: Hardware and Software Challenges
Moderator: William J. Camp (Intel Corporation)
Panelists:
Jack Dongarra (Oak Ridge National Laboratory / University of Tennessee, Knoxville)
Peter Kogge (University of Notre Dame)
Marc Snir (University of Illinois at Urbana-Champaign)
Steve Scott (Cray Inc.)Panels Session
Friday, 10:30AM - 12:00PM Room PB251Abstract:
For three decades we have increased the performance of highest-end computing about 1000-fold each decade. In doing so, we have had to increase the degree of parallelism, cost, footprint and power dissipation of the top systems in each generation. Now we are challenged to move from petascale in 2008-9 to exascale in 2018-19. The obstacles we face are qualitatively harder than any we have dealt with until now and are unprecedented even by the history we build on. To achieve reasonable power, reliability, programmability, cost, and size targets will take major breakthroughs in hardware and software. For example we will have to create and program systems with hundreds of million of processor cores. This panel will review the hardware and software advances needed for exascale computing and will attempt to lay out research, development and engineering roadmaps to successfully overcome the obstacles in our path and create the needed advances.
ORNL/UT participation in SC2009 posters
Numerical Linear Algebra on Emerging Architectures: The PLASMA and MAGMA Projects
Authors:
Jim Demmel (University of California, Berkeley)
Jack Dongarra (University of Tennessee, Knoxville)
Jakub Kurzak (University of Tennessee, Knoxville)
Julien Langou (University of Colorado Denver)
Hatem Ltaief (University of Tennessee, Knoxville)
Stanimire Tomov (University of Tennessee, Knoxville)Posters Session
Tuesday, 05:15PM - 07:00PM Room Oregon Ballroom LobbyAbstract:
The emergence and continuing use of multi-core architectures and graphics processing units require changes in the existing software and sometimes even a redesign of the established algorithms in order to take advantage of now prevailing parallelism. Parallel Linear Algebra for Scalable Multi-core Architectures (PLASMA) and Matrix Algebra on GPU and Multics Architectures (MAGMA) are two projects that aims to achieve high performance and portability across a wide range of multi-core architectures and hybrid systems respectively. We present in this poster a comparative study of PLASMA's performance against established linear algebra packages, e.g. LAPACK and ScaLAPACK as well as equivalent commercial software offerings (Intel MKL, IBM ESSL and IBM PESSL) on two different architectures and some preliminary results of MAGMA on hybrid multi-core and GPU systems.
Performance Analysis and Optimization of Parallel I/O in a Large Scale Groundwater Application on the Cray XT5
Authors:
Vamsi Sripathi (North Carolina State University)
Glenn E. Hammond (Pacific Northwest National Laboratory)
G. (Kumar) Mahinthakumar (North Carolina State University)
Richard T. Mills (Oak Ridge National Laboratory)
Patrick H. Worley (Oak Ridge National Laboratory)
Peter C. Lichtner (Los Alamos National Laboratory)Posters Session
Tuesday, 05:15PM - 07:00PM Room Oregon Ballroom LobbyAbstract:
We describe in this poster the performance analysis and optimization of I/O within a massively parallel groundwater application, PFLOTRAN, on the Cray XT5 at ORNL. A strong scaling study with a 270 million cell test problem from 2,048 to 65,536 cores indicated that a high volume of independent I/O disk access requests and file access operations would severely limit the I/O performance scalability. To avoid the performance penalty at higher processor counts, we implemented a two-phase I/O approach at the application level by splitting the MPI global communicator into multiple sub-communicators. The root process in each sub-communicator is responsible for performing the I/O operations for the entire group and then distributing the data to rest of the group. With this approach we were able to achieve 25X improvement in read I/O and 3X improvement in write I/O resulting in an overall application performance improvement of over 5X at 65,536 cores.
Highly Scalable Bioinformatics Tools on Supercomputers
Authors:
Bhanu Rekapalli (University of Tennessee, Knoxville)
Christian Halloy (University of Tennessee, Knoxville)
Igor B. Jouline (University of Tennessee, Knoxville)Posters Session
Tuesday, 05:15PM - 07:00PM Room Oregon Ballroom LobbyAbstract:
Only a few thousand genomes have been sequenced since 1998, even though there are millions of known organisms on the planet. It is expected that new sequencing technologies will be able to sequence nearly 1000 microbial genomes per hour. This mind-boggling growth of genomic data will require vastly improved computational methods for the pre-processing, analysis, storage and retrieval of sequence data. We have made significant headway in this direction by porting some of the most popular bioinformatics tools such as BLAST and HMMER on supercomputers. Our Highly Scalable Parallel versions of these tools scale very well up to thousands of cores on a CrayXT5 supercomputers (UTK-ORNL). We discuss how we resolved performance issues by cleverly restructuring the input data sets of protein sequences, randomizing and reducing the I/O bottlenecks. For example, HSP-HMMER code can identify the functional domains of millions of proteins 100 times faster than the publicly available MPI-HMMER.
Automating the Knowledge Discovery Process for Computational Quality of Service Research with Scientific Workflow Systems
Authors:
Meng-Shiou Wu (Ames Laboratory)
Colin Campbell (University of Tennessee, Knoxville)Posters Session
Tuesday, 05:15PM - 07:00PM Room Oregon Ballroom LobbyAbstract:
Computational Quality of Service (CQoS) research requires knowledge of performance behaviors acquired from running scientific applications on a variety of platforms in order to develop tuning mechanisms. However, the complexity of current high performance architectures and large-scale scientific packages, the granularity of the 'right' performance data required combined with the numerous options of compilers and external libraries all hamper the discovery of performance knowledge. Without automation of the knowledge-discovering process, CQoS researchers will spend most of their time in the laborious task of data management and analysis. In this research we present our experiences and efforts in automating the process of knowledge discovery for CQoS research. We will discuss the fundamental modules in this process, present our prototype infrastructure, and show how to integrate scientific workflow systems into our infrastructure to automate the complex process of knowledge discovery.
MOON: MapReduce On Opportunistic eNvironments
Authors:
Heshan Lin (Virginia Tech)
Jeremy Archuleta (Virginia Tech)
Xiaosong Ma (North Carolina State University / Oak Ridge National Laboratory)
Wuchun Feng (Virginia Tech)Posters Session
Tuesday, 05:15PM - 07:00PM Room Oregon Ballroom LobbyAbstract:
Existing MapReduce systems run on dedicated resources, where only a small fraction of such resources are ever unavailable. In contrast, we tackle the challenges of hosting MapReduce services on volunteer computing systems, which opportunistically harness idle desktop computers, e.g., Condor. Such systems, when running MapReduce, result in poor performance due to the volatility of resources. Specifically, the data and task replication scheme adopted by existing MapReduce implementations is woefully inadequate for resources with high unavailability. To address this, we extend Hadoop with adaptive task and data scheduling algorithms to offer reliable MapReduce services in volunteer computing systems. Leveraging a hybrid resource architecture, our algorithms distinguish between (1) different types of MapReduce data and (2) different types of node outages in order to strategically place tasks and data on both volatile and dedicated nodes. Our tests demonstrate that MOON can deliver a 3-fold performance improvement to Hadoop in volatile environments.
ORNL/UT participation in SC2009 BOFs
CIFTS: A Coordinated Infrastructure for Fault Tolerant Systems
Primary Session Leader: Pete Beckman (Argonne National Laboratory)
Secondary Session Leaders:
- Al Geist (Oak Ridge National Laboratory)
- Rinku Gupta (Argonne National Laboratory)
Birds-of-a-Feather Session
Tuesday, 12:15PM - 01:15PM Room D137-138Abstract:
The Coordinated Infrastructure for Fault Tolerant Systems (CIFTS) initiative provides a standard framework, through the Fault Tolerance Backplane (FTB), where any component of the software stack can report or be notified of faults through a common interface - thus enabling coordinated fault tolerance and recovery. SC'07 and SC'08 saw an enthusiastic audience of industry leaders, academia, and research participate in the CIFTS BOF. Expanding on our previous success, the objectives of SC'09 BOF are: 1. Discuss the experiences gained, challenges faced in comprehensive fault management on petascale leadership machines, and the impact of the CIFTS framework in this environment. Teams developing FTB-enabled software such as MPICH2, Open MPI, MVAPICH2, BLCR etc., will share their experiences. 2. Discuss the recent enhancements and planned developments for CIFTS and solicit audience feedback. 3. Bring together individuals responsible for exascale computing infrastructures, who have an interest in developing fault tolerance specifically for these environments
2009 HPC Challenge Awards
Primary Session Leader: Jeremy Kepner (MIT Lincoln Laboratory)
Secondary Session Leader: Piotr Luszczek (University of Tennessee, Knoxville)Birds-of-a-Feather Session
Tuesday, 12:15PM - 01:15PM Room E145-146Abstract:
The 2009 HPC Challenge Awards is a competition with awards in two classes: Class 1 and 2. Class 1: Best Performance awards best run submitted to the HPC Challenge website. Since there are multiple tests, the term "best" is subjective. It has been decided by the committee that winners in four categories will be announced: HPL, Global-RandomAccess, EP-STREAM-Triad per system, and Global-FFT. Class 2: Most Elegant awards implementation of three or more of the HPC Challenge benchmarks with special emphasis being placed on: HPL, Global-RandomAccess, STREAM-Triad and Global-FFT. This award would be weighted 50% on performance and 50% on code elegance/clarity/size. Competition in Class 1 offers a rich view of contemporary supercomputers as they compete for supremacy not just in one category but in four. Class 2, on the other hand, offers a glimpse into high end programming technologies and the effectiveness of their implementation.
HPC Centers
Primary Session Leader: Robert M. Whitten (Oak Ridge National Laboratory)
Birds-of-a-Feather Session
Wednesday, 12:15PM - 01:15PM Room B118Abstract:
This BOF provides an open forum for user assistance personnel to discuss topics on interest. Topics include but are not limited to ticket procedures, queue policies, organization and structure of support options, and any pressing topic. This BOF is an ongoing meeting of the HPC Centers working group and all interested parties are encouraged to attend.
Open MPI Community Meeting
Primary Session Leader: Jeff Squyres (Cisco Systems)
Secondary Session Leader: George Bosilca (University of Tennessee, Knoxville)Birds-of-a-Feather Session
Wednesday, 12:15PM - 01:15PM Room E145-146Abstract:
MPI is increasingly being called upon to handle higher scale environments: "manycore", petascale, and faster networks. Years of experience in traditional HPC environments are leading to revolutions in leading-edge hardware platforms; software must evolve to match. The challenges are significant. As we proved last year by being the first MPI implementation to achieve a petaflop, we believe that an diverse open source team representing many different backgrounds, philosophies, and biases (from both industry and academia) is uniquely equipped to meet these challenges. Come hear where Open MPI is going, and how you can (and should!) join our efforts. The meeting will consist of three parts: 1. Members of the Open MPI core development team will present our current status. 2. Discussions of ongoing and future projects. 3. Discuss the Open MPI roadmap and actively soliciting feedback from real-world MPI users and ISVs with MPI-based products (please bring your suggestions!).
International Exascale Software Program
Primary Session Leader: Jack Dongarra (University of Tennessee, Knoxville)
Secondary Session Leader: Pete Beckman (Argonne National Laboratory)
Birds-of-a-Feather Session
Wednesday, 05:30PM - 07:00PM Room PB256Abstract:
Over the last twenty years, the international open source community has increasingly provided the software at the heart of the world’s high-performance computing systems. The community provides everything from operating system components to compilers and advanced math libraries. As an international community, however, we have only loosely coordinated activities and plans for development. The new rapidly changing technologies in multicore, power consumption, GPGPUs, and memory architectures creates an opportunity for the community to work together and build an international program to design, build, and deliver the software so critical the science goals of our institutions. To help plan how the international community could build a partnership to provide the next generation of HPC software to support scientific discovery, we have had a series of workshops. We would like to hold a BOF at SC09 to report on the effort and solicit feedback from the community.
MPI Forum: Preview of the MPI 3 Standard (Comment Session)
Primary Session Leader: Rich Graham (Oak Ridge National Laboratory)
Secondary Session Leader: George Bosilca (University of Tennessee, Knoxville)
Birds-of-a-Feather Session
Wednesday, 05:30PM - 07:00PM Room D135-136Abstract:
With the release of the Message Passing Interface (MPI) standard version 2.2 (http://www.mpi-forum.org) the standard has been consolidated into a single, more coherent document, numerous corrections and clarifications have been made, and the standard has been updated to support the current versions of the C, C++ (C99), and Fortran (2003) standards. The work of this standards body has shifted to focusing on proposed changes needed to fill functionality and scalability gaps. This includes the addition of support for nonblocking collectives, support for fault-tolerance within the standard, improved support for remote memory operations, improved support in threaded environments, and standardized hooks for tool access to information internal to MPI implementations.
OSCAR Community Meeting
Primary Session Leader: Stephen L Scott (Oak Ridge National Laboratory)
Secondary Session Leader: Geoffroy Vallee (Oak Ridge National Laboratory)
Birds-of-a-Feather Session
Wednesday, 05:30PM - 07:00PM Room B119Abstract:
Since the first public release in 2001, there have been well over 200,000 downloads of the Open Source Cluster Application Resources (OSCAR) software stack. OSCAR is a self-extracting cluster configuration, installation, maintenance, and operation suite consisting of "best known practices" for cluster computing. OSCAR has been used on highly ranking clusters in the TOP500 list and is available in both a freely downloadable version as well as commercially supported instantiations. The OSCAR team is comprised of an international group of developers from research laboratories, universities, and industry cooperating in the open source effort. As it has for the past seven years at SuperComputing, the OSCAR BoF will be a focal point for the OSCAR community at SuperComputing (SC09), where both developers and users may gather to discuss the "current state" as well as future directions for the OSCAR software stack. New and potential users and developers are welcome.
ORNL/UT participation in SC2009 workshops
2009 Workshop on Ultra-Scale Visualization
Monday, November 16
This meeting aims to address how to exploit petacale data sets produced from leading-edge scientific simulations by fostering communication among visualization researchers and practitioners, high-performance computing professionals and application scientists. Web Site: http://vis.cs.ucdavis.edu/Ultravis09/
Petascale Data Management using FIESTA
Scott Klasky, Oak Ridge National Laboratory
Component-Based High Performance Computing 2009
Sunday, November 15 and Monday, November 16
This workshop will bring together developers and users of component-based high-performance computing (CBHPC) frameworks and environments. The workshop aims to build an international research community around issues such as the role of CBHPC in high-performance scientific computing, component features unique to HPC, and tools for efficiently developing component applications that exploit massively parallel systems, the Grid, or hybrid, hardware-accelerated high-performance computing environments. Web Site: http://compframe.org/cbhpc2009/
Designing a Component-Based Architecture for the Modeling and Simulation of Nuclear Fuels and Reactors (Extended Abstract)
Jay Billings, Wael Elwasif, Lee Hively, David Bernholdt, John Hetrick and Timothy Bohn
Fifth International Workshop on High Performance Computing for Nano-science and Technology (HPCNano09)
Sunday, November 15
This workshop’s main theme is on how to conduct innovative research and development using cyber physical computing systems for large-scale computations in nano-science and nanotechnology, an exciting field with impact already being felt in materials, engineering, electronics, medicine and other disciplines. Web Site: http://www.hpcnano.org/HPCNano09/
Advisory Committee (HPCNano Past Chairs) : Jack Dongarra, University of Tennessee-Knoxville
Program Committee : Stanimire Tomov, University of Tennessee
Fourth Petascale Data Storage Workshop
Sunday, November 15
This workshop focuses on the data storage problems and emerging solutions found in petascale scientific computing environments, with special attention to problem identification, workload capture, solution interoperability, standards with community buy-in, shared tools – all issues in which community collaboration can be crucial. Web Site: http://www.pdsi-scidac.org/events/PDSW09/
Committe Member : Philip C. Roth, Oak Ridge National Laboratory
Fourth Workshop on Workflows in Support of Large-Scale Science (WORKS09)
Monday, November 16
This workshop focuses on workflows, which have emerged as a key technology that enables large-scale computations on distributed resources. The workshop welcomes contributions on workflow composition, mapping, and robust execution, as well the applications area, where the requirements on the workflow management systems can be derived. Web Site: http://www.isi.edu/works09/
Paper Presenter : Norbert Podhorszki, Scott Klasky, Qing Liu, Hasan Abbasi Jay Lofstead, Karsten Schwan, Matthew Wolf, Fang Zheng, Ciprian Docan, Manish Parashar, Julian Cummings, “Plasma fusion code coupling using scalable I/O services and scientific workflows”
Third International Workshop on High-Performance Reconfigurable Computing Technology and Applications (HPRCTA’09)
Sunday, November 15
This workshop will have talks on the latest trends and developments in the field of computing that uses a combination of conventional microprocessors and field- programmable gate arrays (FPGAs), a rapidly evolving computing paradigm that offers the potential to accelerate computationally intensive scientific applications beyond what is possible on today's mainstream HPC systems. Web Site: http://www.ncsa.uiuc.edu/Conferences/HPRCTA09/
Program Committee Member : Gregory Peterson, University of Tennessee
ORNL/UT participation in SC2009 Gordon Bell Finalists
A Scalable Method for Ab Initio Computation of Free Energies in Nanoscale Systems
Authors:
Markus Eisenbach (Oak Ridge National Laboratory)
Chenggang Zhou (J.P. Morgan Chase & Co)
Donald M. Nicholson (Oak Ridge National Laboratory)
Gregory Brown (Florida State University)
Jeff Larkin (Cray Inc.)
Thomas C. Schulthess (ETH Zürich)
Liquid Water: Obtaining the Right Answer for the Right Reasons
Authors:
Edoardo Apra (Oak Ridge National Laboratory)
Robert J. Harrison (Oak Ridge National Laboratory)
Wibe A. de Jong (Pacific Northwest National Laboratory)
Alistair Rendell (Australian National University)
Vinod Tipparaju (Oak Ridge National Laboratory)
Sotiris Xantheas (Pacific Northwest National Laboratory)
ORNL/UT participation in SC2009 awards won
- Jaguar Takes Number 1 Spot on the Top500 List
- Jaguar, Oak Ridge National Laboratory Receive HPCwire Editors’ Choice Award
- Oak Ridge National Laboratory Wins insideHPC’s Inaugural Community Leadership Award
- HPC Challenge Best Performance Awards
ORNL Booth Speakers
Time |
Tuesday 11/17/2009 |
Wednesday 11/18/2009 |
||
ENERGY SESSION |
HIGH ENERGY PHYSICS/ASTRO SESSION |
|||
| 10:30 - 11:15 | Jeremy Smith | Alexei Kritsuk | ||
| 11:30 - 12:15 | Gil Weigand | Pieter Maris | ||
| 12:15 - 1:15 | Lunch |
Lunch |
||
CLIMATE SESSION |
NANOTECHNOLOGY SESSION |
|||
| 1:30 - 2:15 | Forrest Hoffman | Edoardo Apra | ||
| 2:30 - 3:15 | James Kinter | Markus Eisenbach | ||
| 3:30 - 4:15 | William Putman | Robert Harrison | ||
SOFTWARE |
OPPORTUNITIES WITH ORNL |
|||
| 4:30 - 5:15 | Rich Graham | Wendy Peper | ||
ORNL booth model images (click on images to see a larger version)
ORNL kiosk presentations
Name |
Title |
PDF |
|
| Applied Mathematics | |||
| Valmor de Almeida, Ahmed Khamayseh | Adaptive Hybrid Mesh Refinement for Multiphysics Applications | ||
| The Innovative Computing Laboratory | Cell Processor for Linear Algebra Computations | ||
| Ed D’Azevedo | Computational Mathematics at the Oak Ridge National Laboratory | ||
| Sreekanth Pannala | Micro-Mesoscopic Modeling of Heterogeneous Chemically Reacting Flows (MMMHCRF) Over Catalytic/Solid Surfaces | ||
| George Ostrouchov | ORNL Statistics and Data Sciences | ||
| Kalyan Perumalla | Parallel Discrete Event Simulation (PDES) at ORNL | ||
| Computer Science & Engineering - Middleware | |||
| Nagiza Samatova, Arie Shoshani | Scientific Data Management Center | ||
| The CCA Forum and the TASCS | An Overview of the Common Component Architecture (CCA) | ||
| Jack Dongarra, Terry Moore, Daniel Terpstra, Haihang You | Component PAPI: Measuring Multiple Counter Domains | ||
| The Innovative Computing Laboratory | Dealing with the Scale Problem | ||
| Al Geist | Fault Tolerance Challenges and Solutions | ||
| Jack Dongarra | High Productivity Software Development Tools | ||
| The CCA Forum and the TASCS | How the Common Component Architecture (CCA) Advances Computational Science | ||
| Richard Graham | Open MPI on the Cray XT | ||
| The Innovative Computing Laboratory | PLASMA (Parallel Linear Algebra Software for Multi-Processor Architectures) | ||
| Piotr Luszczek | The Lapack For Clusters (LFC) Project | ||
| Dawid Kurzyniec, Magdalena Slawinska, Jaroslaw Slawinski, Vaidy Sunderam | The Zero-Force MPI Toolkit – Toward Tractable Toolkits for HPC | ||
| Computer Science & Engineering - Performance | |||
| Sadaf Alam, Nikhil Bhatia, Jeff Vetter | Modeling Assertions: Symbolic Models for Petascale Applications and Platforms | ||
| Phil Roth, Jeff Vetter, Weikuan Yu | Understanding and Optimizing Data Input/Output of Large-Scale Scientific Applications | ||
| Patrick Worley | Performance Engineering Research Institute (PERI) | ||
| Patrick Worley | Performance Evaluation & Analysis Consortium (PEAC) End Station | ||
| Piotr Luszczek | The HPC Challenge (HPCC) Benchmark Suite | ||
| Computer Science & Engineering - Storage | |||
| Sudharshan Vazhkudai | Robust Storage Management in the Machine Room and Beyond | ||
| Phil Roth | SciDAC-2 Petascale Data Storage Institute | ||
| Computer Science & Engineering - System Software & Tools | |||
| Al Geist | Electronic Laboratory Notebooks - Key to scientific records and project notes | ||
| Tara Gibson, James Myers | Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them | ||
| Christian Engelmann | The Harness Workbench: Unified and Adaptive Access to Diverse HPC Platforms | ||
| Nikhil Bhatia, Collin McCurdy, Phil Roth, Jeff Vetter, Weikuan Yu | FAST-OS: Petascale Single System Image | ||
| David Bernholdt, Wael Elwasif, Robert Harrison, Aniruddha Shet | High Productivity Language Systems: Next Generation Petascale Programming | ||
| Thomas Naughton, Stephen Scott, Geoffroy Vallée | Open Source Cluster Application Resources (OSCAR) | ||
| Christian Engelmann, Stephen Scott | Reliability, Availability, and Serviceability (RAS) for High-Performance Computing | ||
| Al Geist | Scalable Systems Software Project | ||
| Thomas Naughton, Stephen Scott, Geoffroy Vallée | System-Level Virtualization & OSCAR-V | ||
| Education & Job Opportunities | |||
| Debbie McCoy | Research Alliance in Math and Science | ||
| Future Systems | |||
| Olaf Storaasli | ORNL Field-Programmable Gate Array (FPGA) Research Speeds HPC "up to 100X" | ||
| Sadaf Alam, Jeremy Meredith, Jeff Vetter | Performance and Productivity of Emerging Architectures | ||
| Homeland Security/Defense | |||
| Frank DeNap | SensorNet Technologies and Real-world Deployments | ||
| Knowledge Discovery | |||
| Xiaohui Cui, Yu (Cathy) Jiao, Robert Patton, Justin Beaver | Data Analysis and High Performance Computing | ||
| Budhendra Bhaduri, Auroop Ganguly, Ranga Raju Vatsavai | Geospatial Data Sciences | ||
| Budhendra Bhaduri, Alexandre Sorokine | High-Performance Visualization of Geographic Data | ||
| Budhendra Bhaduri | LandScan Population Research Program | ||
| Jinzhu Gao, Markus Glatter, Jian Huang, Colin Mollenhour | Scalable Data Servers for Large Multivariate Volume Visualization | ||
| Leadership Infrastructure | |||
| Nagi Rao | DOE UltraScience Net: High-Performance Experimental Network Research Testbed | ||
| Scott Klasky | End-to-End Computing at ORNL | ||
| Galen Shipman | Infrastructure | ||
| Jim Rogers | NCCS Hardware | ||
| Josh Lothian | NCCS Network Roadmap | ||
| Buddy Bland | New Jaguar | ||
| Bill Wing | ORNL’s FutureNet Infrastructure | ||
| Ricky Kendall | Scientific Computing Group | ||
| Sean Ahern | Visualization at ORNL’s National Center for Computational Sciences | ||
| Doug Kothe | ORNL Leadership Computing Facility Preparing for the Exascale | ||
| Doug Kothe | Overview of Science at ORNL Leadership Computing Facility | ||
| Al Geist | Lustre Centre of Excellence | ||
| Leadership Science - Astrophysics | |||
| Tony Mezzacappa | Toward the Explosion Mechanism for Core-Collapse Supernovas: An Emerging Picture | ||
| Leadership Science - Biology | |||
| Pratul Agarwal | Accelerating Biomolecular Simulations on Reconfigurable Computing Hardware | ||
| Barbara Beckerman | Biomedical Engineering and Biomedical Informatics | ||
| Nagiza Samatova | BioPilot: Data-Intensive Computing for Complex Biological Systems | ||
| Jeremy Smith | Cellulosic Ethanol: A Molecular Mechanics Force Field for Lignin (Biosystems) | ||
| Igor Jouline | ORNL–University Partnerships in Computational Biology | ||
| Leadership Science - Climate | |||
| Forrest Hoffman | Global Coupled Climate and Carbon Cycle Modeling | ||
| Lawrence Buja, James White III (Trey) | LCF Climate Science Computational End Station | ||
| Richard Mills | SciDAC-2 project: Modeling Multiscale-Multiphase-Multicomponent Subsurface Reactive Flows Using Advance Computing/ | ||
| David Bernholdt | The Earth System Grid: Turning Climate Datasets into Community Resources | ||
| John Drake | The SciDAC CCSM Consortium Project | ||
| Leadership Science - Combustion | |||
| Jacqueline Chen, David Lignell, Chun Yoo, Joseph Oefelein | High Fidelity Simulations for Clean and Efficient Combustion of Alternative Fuels | ||
| Joseph Oefelein, Tomasz Drozda, Vaidya Sankaran, Ramanan Sankaran | High-Fidelity Simulations for Clean and Efficient Combustion of Alternative Fuels (Large Eddy) | ||
| Leadership Science - Fusion | |||
| Scott Klasky | CPES: Toward a first-principles integrated simulation of tokamak edge plasmas | ||
| J. R. Cary | FACETS: Framework Application for Core-Edge Transport Simulations | ||
| Scott Klasky | Gyrokinetic Particle Simulations of Fusion Plasmas | ||
| Don Batchelor, Lee Berry | High-Performance Computing in Magnetic Fusion Energy Research | ||
| Center for Plasma Edge Simulation Team | XGC: Gyrokinetic Particle Simulation of Edge Plasma | ||
| Leadership Science - Materials Science & Nanoscience | |||
| Thomas Schulthess | Fermion Glue in the Hubbard Model: New Insights into the Cuprate Pairing Mechanism with Advanced Computing | ||
| Thomas Schulthess | Statistical Physics of Fracture: Recent Advances through High-Performance Computing | ||
| Leadership Science - Nuclear Physics | |||
| David Dean | Building Nuclei from the Ground Up: Nuclear Coupled-cluster Theory | ||
| Leadership Science - Overview | |||
| Thomas Zacharia | ORNL Collaborators in HPC | ||
| Doug Kothe | Breakthrough Science in the 2008 INCITE Program at ORNL | ||
| NSF - Hardware | |||
| John Cobb | The Neutron Science TeraGrid Gateway: TeraGrid Cyberinfrastructure at ORNL | ||
| Phil Andrews | Petascale Hardware at NICS | ||
| NSF - Organization | |||
| Thomas Zacharia | Joint Institute for Computational Sciences | ||
| Jim Ferguson | Employment Opportunities at NICS | ||
| Jim Ferguson | National Institute for Computational Sciences (NICS): Education, Outreach and Training | ||
| Phil Andrews | National Institute for Computational Sciences (NICS): Overview | ||
| NSF - Partners | |||
| Julia White | Region National Institute for Computational Sciences (NICS): Outstanding Academic and Natural Resources in East Tennessee | ||
| NSF - Software | |||
| Ricky Kendall | Applications National Institute for Computational Sciences (NICS): NSF and ORNL 2011 Model Problems | ||






