Publication details

Abstract

In the face of the growing complexity of HPC systems, their growing energy costs, and the increasing difficulty to run applications efficiently, a number of monitoring tools have been developed during the last years. SIOX is one such endeavor, with a uniquely holistic approach: Not only does it aim to record a certain kind of data, but to make all relevant data available for analysis and optimization. Among other sources, this encompasses data from hardware energy counters and trace data from different hardware/software layers. However, not all data that can be recorded should be recorded. As such, SIOX needs good heuristics to determine when and what data needs to be collected, and the energy consumption can provide an important signal about when the system is in a state that deserves closer attention. In this paper, we show that SIOX can use Likwid to collect and report the energy consumption of applications, and present how this data can be visualized using SIOX’s web-interface. Furthermore, we outline how SIOX can use this information to intelligently adjust the amount of data it collects, allowing it to reduce the monitoring overhead while still providing complete information about critical situations.

BibTeX

@inproceedings{MECWSMTBAE14,
	author	 = {Julian M. Kunkel and Alvaro Aguilera and Nathanael Hübbe and Marc Wiedemann and Michaela Zimmer},
	title	 = {{Monitoring Energy Consumption With SIOX: Autonomous Monitoring Triggered by Abnormal Energy Consumption}},
	year	 = {2014},
	editor	 = {},
	publisher	 = {Springer},
	pages	 = {8},
	conference	 = {EnA-HPC 2014},
	organization	 = {Technische Universität Dresden},
	location	 = {Dresden},
	abstract	 = {In the face of the growing complexity of HPC
systems, their growing energy costs, and the increasing
difficulty to run applications efficiently, a number of
monitoring tools have been developed during the last
years. SIOX is one such endeavor, with a uniquely holistic
approach: Not only does it aim to record a certain
kind of data, but to make all relevant data available
for analysis and optimization. Among other sources,
this encompasses data from hardware energy counters
and trace data from different hardware/software layers.
However, not all data that can be recorded should be
recorded. As such, SIOX needs good heuristics to determine
when and what data needs to be collected, and
the energy consumption can provide an important signal
about when the system is in a state that deserves
closer attention. In this paper, we show that SIOX can
use Likwid to collect and report the energy consumption
of applications, and present how this data can be
visualized using SIOX’s web-interface. Furthermore, we
outline how SIOX can use this information to intelligently
adjust the amount of data it collects, allowing it
to reduce the monitoring overhead while still providing
complete information about critical situations.},
}

