SW Runtime Estimation using Automata Theory and Deep Learning on HPC

Hyunjoon Cheon, Jinseung Ryu, Chan Yeol Park, Yo Sub Han

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

People use high performance computers (HPCs) for computation-intensive tasks. Often these tasks require a lot of running time of their corresponding softwares. It is important to execute several tasks simultaneously for the system utilization while finishing all tasks within their desired deadlines. Therefore, it is important to know the runtime of each computation-intensive task without executing them in order to schedule the tasks on HPC and obtain better system performance. We propose a method for predicting runtime of MPI-based softwares on HPC using automata theory and deep learning. We first analyze a source code of an input program by representing the code as finite automata and measuring their state complexities. Next, we train the execution runtime of each module of our finite automata using deep neural network (DNN) together with its own state complexity. Then we combine all modules and make a single SW-runtime-prediction model. For experiment, we train the proposed model using OSU benchmark data, HPL and two in-house datasets, and present the usefulness of our model. We also demonstrate the adaptability of our model by updating the current model for new inputs using incremental DNN, which is an important feature for coping with new softwares or new systems.

Original languageEnglish
Title of host publicationProceedings - 2020 IEEE International Conference on Autonomic Computing and Self-Organizing Systems Companion, ACSOS-C 2020
EditorsEsam El-Araby, Sven Tomforde, Timothy Wood, Pradeep Kumar, Claudia Raibulet, Ioan Petri, Gabriele Valentini, Phyllis Nelson, Barry Porter
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages7-12
Number of pages6
ISBN (Electronic)9781728184142
DOIs
Publication statusPublished - 2020 Aug
Event1st IEEE International Conference on Autonomic Computing and Self-Organizing Systems Companion, ACSOS-C 2020 - Virtual, Washington, United States
Duration: 2020 Aug 172020 Aug 21

Publication series

NameProceedings - 2020 IEEE International Conference on Autonomic Computing and Self-Organizing Systems Companion, ACSOS-C 2020

Conference

Conference1st IEEE International Conference on Autonomic Computing and Self-Organizing Systems Companion, ACSOS-C 2020
Country/TerritoryUnited States
CityVirtual, Washington
Period20/8/1720/8/21

Bibliographical note

Publisher Copyright:
© 2020 IEEE.

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Hardware and Architecture
  • Control and Optimization
  • Information Systems
  • Computer Science Applications
  • Software
  • Safety, Risk, Reliability and Quality
  • Control and Systems Engineering
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'SW Runtime Estimation using Automata Theory and Deep Learning on HPC'. Together they form a unique fingerprint.

Cite this