Architects, administrators, and users of modern high-performance computing (HPC) systems strive to meet goals of energy, resource, and application efficiency. Optimizing for any or all of these can only be accomplished through analysis of appropriate system and application information. While application performance analysis tools can provide application performance insight, the associated overhead typically decreases application performance while the tools are being employed. Thus they are typically used for application performance tuning but not for production application runs. Likewise traditional system monitoring tools in conjunction with analysis tools can provide insight into run-time system resource utilization. However, due to overhead and impact concerns, such tools are often run with collection periods on order of minutes or only used to solve problems and not during normal HPC system operation. There are currently few, if any, tools that provide continuous, low impact, high fidelity system monitoring, analysis, and feedback that meet the increasingly urgent resource efficiency optimization needs of HPC systems.
Modern processors and operating systems being used in HPC systems expose a wealth of information about how system resources, including energy, are being utilized. Lightweight tools that gather and analyze this information could provide feedback, including run-time, to increase application performance; optimize system resource utilization; and drive more efficient future HPC system design.
The goal of this workshop is to provide an opportunity for researchers to exchange new ideas, research, techniques, and tools in the area of HPC monitoring, analysis, and feedback as it relates to increasing efficiency with respect to energy, resource utilization, and application run-time.
Keynote Speaker: William (Bill) T. C. Kramer, NCSA Blue Waters Director and PI
Failure and Resiliency in the Shadow of Exascale – Will our our Current Assumptions Take us in the Right Direction?
Panel: Accessible Analytics and Visualizations
Topics
Data collection, transport, and storage
Analysis of monitored data and system information
Response to and utilization of processed data and system information
Experience reports and System operations
Important dates (AOE):
HPCMASPA 2016 will be held in conjunction with IEEE IPDPS (May 23-27, 2016 in Chicago, IL)
Format
HPCMASPA 2016 will be a full day and will consist of talks from refereed papers, a panel discussion with representative researchers and practitioners, and a keynote speaker.
Talks from refereed papers
HPCMASPA 2016 welcomes submissions of original work not previously published nor under review by another conference or journal. Proceedings of the workshop will be distributed at the conference and will be submitted for inclusion in the IEEE Xplore Digital Library.
Categories:
Guidelines: