Combining Parallelism, Virtualization, Heterogeneity and Reliability: Some cu…

Google Tech Talks, October 28, 2008

ABSTRACT

This talk will begin with an overview of the Computer Systems group within the College of Engineering and IT at The Australian National University. The group's projects fall under the themes of Bio-Engineering, Robotics, Advanced Runtime Systems, Performance Analysis, Parallel Processing and Operating Systems. Depending on audience interest, projects under the latter three themes will be discussed in detail. These include:

OpenMP for Contemporary Clusters: the state of the art for systems based on distributed shared memory, the handling of heterogeneity, and the utilization of advanced networking technologies (InfiniBand).

High Performance Numerical Computing on Service-Oriented Architectures: this work extends the Symphony programming paradigm (Platform Computing), originally developed for financial applications running on enterprise grids. The desirable properties of this model include inherent load balancing in a heterogeneous environment, fault tolerance and relative simplicity of programming. The challenge lies in enabling compute tasks to communicate with low overhead while retaining most of these advantages.

Virtualized HPC Clusters: virtualization has many advantages in the context of a data center with a heterogeneous cluster of sub-clusters. Work on evaluating the performance of virtualized communication configurations is described, together with a scheduling framework for taking advantage of virtual machine …