Publication details
- Monitoring Energy Consumption With SIOX Autonomous Monitoring Triggered by Abnormal Energy Consumption (Julian M. Kunkel, Alvaro Aguilera, Nathanael Hübbe, Marc Wiedemann, Michaela Zimmer), pp. 8, Springer, EnA-HPC 2014, Technische Universität Dresden, Dresden, 2014
Publication details
Abstract
In the face of the growing complexity of HPC systems, their growing energy costs, and the increasing difficulty to run applications efficiently, a number of monitoring tools have been developed during the last years. SIOX is one such endeavor, with a uniquely holis- tic approach: Not only does it aim to record a certain kind of data, but to make all relevant data available for analysis and optimization. Among other sources, this encompasses data from hardware energy counters and trace data from different hardware/software layers. However, not all data that can be recorded should be recorded. As such, SIOX needs good heuristics to de- termine when and what data needs to be collected, and the energy consumption can provide an important sig- nal about when the system is in a state that deserves closer attention. In this paper, we show that SIOX can use Likwid to collect and report the energy consump- tion of applications, and present how this data can be visualized using SIOX’s web-interface. Furthermore, we outline how SIOX can use this information to intelli- gently adjust the amount of data it collects, allowing it to reduce the monitoring overhead while still providing complete information about critical situations.
BibTeX
@inproceedings{MECWSMTBAE14, author = {Julian M. Kunkel and Alvaro Aguilera and Nathanael Hübbe and Marc Wiedemann and Michaela Zimmer}, title = {{Monitoring Energy Consumption With SIOX Autonomous Monitoring Triggered by Abnormal Energy Consumption}}, year = {2014}, editor = {}, publisher = {Springer}, pages = {8}, conference = {EnA-HPC 2014}, organization = {Technische Universität Dresden}, location = {Dresden}, abstract = {In the face of the growing complexity of HPC systems, their growing energy costs, and the increasing difficulty to run applications efficiently, a number of monitoring tools have been developed during the last years. SIOX is one such endeavor, with a uniquely holis- tic approach: Not only does it aim to record a certain kind of data, but to make all relevant data available for analysis and optimization. Among other sources, this encompasses data from hardware energy counters and trace data from different hardware/software layers. However, not all data that can be recorded should be recorded. As such, SIOX needs good heuristics to de- termine when and what data needs to be collected, and the energy consumption can provide an important sig- nal about when the system is in a state that deserves closer attention. In this paper, we show that SIOX can use Likwid to collect and report the energy consump- tion of applications, and present how this data can be visualized using SIOX’s web-interface. Furthermore, we outline how SIOX can use this information to intelli- gently adjust the amount of data it collects, allowing it to reduce the monitoring overhead while still providing complete information about critical situations.}, }