HPCMASPA Workshop - Sept 26, 2014
Meeting will consist of Technical Paper Presentations (30 min), MiniTalks (15 min), and a Panel Discussion with Q & A
9:30 - 11:00: SESSION 1: Introduction + HPC Components and Topics
Moderator: Narate Taerat
09:30 - 09:45 Introduction - Jim Brandt
09:45 - 10:00 - MiniTalk - Experiences with NVIDIA FERMI and Kepler GPGPU's. P. Romero and N. DeBardeleben (Los Alamos National Laboratory)
10:00 - 10:30 - Paper - Power Monitoring with PAPI for Extreme Scale Architectures and Dataflow-based Programming Models. H. McCraw, J. Ralph, A. Danalis, and J. Dongarra (Innovative Computing Laboratory, Univ of Tennessee Knoxville, USA; ORNL, USA; and Univ of Manchester, UK)
10:30 - 10:45 - MiniTalk - Monitoring Application Resource Utilization on the Intel PHI Coprocessor. J. Brandt and A. Gentile (Sandia National Laboratories, USA)
10:45 - 11:00 - MiniTalk - Memory Reliability and Performance Degradation: Hunting Rabbits with an Elephant Gun?. B. Allan (Sandia National Laboratories, USA)
11:00 - 11:30: Break
11:30 - 13:00: SESSION 2: Monitoring Systems
Moderator: Ben Allan
14:30 - 15:00 - Paper - Methodology and Application of Machine Learning Algorithms To Classify the Performance of High Performance Cluster Components. C. Idler and P. Romero (Los Alamos National Laboratory, USA)
15:00 - 15:30 - Paper - Demonstrating Improved Application Performance Using Dynamic Monitoring and Task Mapping. J. Brandt, K. Devine, A. Gentile, and K. Pedretti (Sandia National Laboratories, USA)
15:30 - 16:00 - Paper - Multiobjective Optimization Technique Based on Monitoring Information to Increase the Performance of Thread Migration on Multicores. O. Lorenzo, T. Pena, J. Cabaleiro, J. Pichel and F. Rivera (CITIUS Centro de Investigacion en Tenoloxias da Informacion, Univ de Santiago de Compostela, Spain)
13:00 - 14:30: Lunch
14:30 - 16:00: SESSION 3: Analysis, Feedback, and Response
Moderator: Nichamon Naksinehaboon
11:30 - 12:00 - Paper - It Takes a Village: Monitoring the Blue Waters Supercomputer. B. D. Semeraro, R. Sisneros, J. Fullop and G. H. Bauer (National Center for Supercomputing Applications (NCSA), Univ of Illinois, USA)
12:00 - 12:30 - Paper - ProMon: Production-Time Application Monitoring. H. Sharifi and J. Cook (Intel and New Mexico State University, USA)
12:30 - 12:45 - MiniTalk - Realtime Monitoring Using Riemann, Syslog-ng and Collectd. F. Wernli (IN2P3 Computing Centre, CNRS, France)
12:45 - 13:00 - MiniTalk - Procmon: Scalable Workload Analysis for the Extreme Data Era. D. Jacobsen, L. Pezzaglia, and S. Canon (Lawrence Berkeley National Laboratory/ National Energy Research Scientific Computing Center (NERSC), USA)
16:00 - 16:30: Break
16:30 - 18:00: SESSION 4: Panel + Wrap up
Panel Discussion and Question and Answer
Moderator: Jim Brandt
Panel:
Jon Cook, NMSU
Joshi Fullop, NCSA
Forest Godfrey, Cray Inc.
Larry Pezzaglia, NERSC
Narate Taerat, Open Grid Computing
Related Future Events: Monitoring Large-Scale HPC Systems: Issues and Approaches - BOF at SC14 Wed Nov 19 5:30-7:00 pm