Table of Contents
HPC-IODC: HPC I/O in the Data Center Workshop
Managing scientific data at large scale is challenging for scientists but also for the host data center. The storage and file systems deployed within a data center are expected to meet users' requirements for data integrity and high performance across heterogeneous and concurrently running applications.
With new storage technologies and layers in the memory hierarchy, the picture is becoming murkier. To effectively manage the data load within a data center, I/O experts must understand how users expect to use these new storage technologies and what services they should provide in order to enhance user productivity. We seek to ensure a systems-level perspective is included in these discussions.
In this workshop we bring together I/O experts from data centers and application workflows to share current practices for scientific workflows, issues and obstacles for both hardware and the software stack, and R&D to overcome these issues. A common structure of the talks will be provided to focus on relevant aspects and streamline the discussion. Short scientific papers related to the topic are welcome for submission and will be published under open access.
The workshop is organized by
Topics of Interest
The following list of items should be tried to be integrated into the talk if possible. We hope your sites admin will support you to gather the information with little effort.
- Workload characterization
- Scientific Workflow (give a short introduction)
- A typical use-case (if multiple are known, feel free to present more)
- Involved number of files / amount of data
- Job mix
- Node utilization (rel. to peak-performance)
- System view
- Schema of the client/server infrastructure
- Capacities (Tape, Disk, etc.)
- Potential peak-performance of the storage
- Optional: performance results of acceptance tests.
- Software / Middleware used, e.g. NetCDF 4.X, HDF5, …
- Monitoring infrastructure
- Tools and systems used to gather and analyse utilization
- Actual observed performance in production
- Throughput graphs of the storage (e.g. from Ganglia)
- Metadata throughput (Ops/s)
- Files on the storage
- Number of files (if possible per file type)
- Distribution of file sizes
- Issues / Obstacles
- Pain points (what is seen as the biggest problem(s) and suggested solutions, if known)
- Conducted R&D (that aim to mitigate issues)
- Future perspective
- Known or projected future workload characterization
- Scheduled hardware upgrades and new capabilities we should focus on exploiting as a community
- Ideal system characteristics and how it addresses current problems or challenges
- what hardware should be added
- what software should be developed to make things work better (capabilities perspective)
- Items requiring discussion to work through how to address
Talks and (short) proceedings will be published on this webpage. Short papers should be no longer than 5 pages (two column) but we are quite flexible. Additionally, we aim to write a journal paper containing the synthesis of the workshop: current practice, obstacles and observations from the data centers, that is co-authored by workshop participants.
Short Paper Deadlines
- June 10th: Submission deadline of the paper draft (just send the PDF to email@example.com)
- June 30th: Author notification with feedback
- July 7th: Camera-ready publication that will be published on the web-page
The workshop is integrated into ISC-HPC. We welcome everybody to joint the workshop, including:
- I/O experts from data centers and industry.
- Interested domain scientists and computer scientists interested in discussing I/O issues.
You may be interested to join our open mailing list HPC-IODC-15 which is open to discuss HPC-I/O topics.
We especially welcome participants that are willing to give a presentation about the I/O of the representing institutions data center. If you are interested to give a presentation, please send a short email to firstname.lastname@example.org. Note that all presentations should cover the topics mentioned above – sketch a rough presentation outline in your email. (For vendors: do not focus on commercial aspects in your talk).
Agenda is in preparation.
- 9:00 Welcome – Julian Kunkel – Slides
- 9:15 Summary of the DoE Storage Systems and Input/Output Workshop 2014 – Jay Lofstead (Sandia) – Slides
- 10:00 I/O at the German Climate Computing Center (DKRZ) – Julian Kunkel (DKRZ) – Slides
- 10:30 I/O studies at CSCS – Colin McMurtrie (CSCS) – Slides
- 11:00 Coffee Break
- 11:30 I/O in an Industrial Scientific Computing environment – Gerd Büttner (Airbus) – Slides
- 12:00 I/O at the HLRS – Thomas Bönisch (HLRS) – Slides
- 12:30 Discussion round (Benchmarking) – Jay Lofstead
- 13:00 Lunch
- 14:00 I/O at the University of Dresden – Michael Kluge (ZIH Dresden) – Slides
- 14:30 Activities Towards High Availability of Parallel I/O at the K computer – Yuichi Tsujita (RIKEN) – Slides
- 15:00 I/O at Argonne – Florin Isaila (Argonne National Laboratory & University Carlos III) – Slides
- 15:30 I/O at JSC – Wolfgang Frings (Jülich Supercomputing Centre) – Slides
- 16:00 Coffee Break
- 16:30 Percipient Storage: A Big Data and Extreme Compute Storage Architecture – Sai Narasimhamurthy (Seagate) – Slides
- 17:00 Discussion round 2 – Colin McMurtrie
- 17:30 Feedback and Farewell – Julian Kunkel
- 18:00 End