Bibliographic record number: 877325

Running HPC applications on many million cores Cloud


Tomić, Draško; Car, Zlatan; Ogrizović, Dario
Running HPC applications on many million cores Cloud // MIPRO 2017 40th International Convention Proceedings / Skala, Karolj ; Trobec, Roman (eds.).
Opatija: Croatian Society for Information and Communication Technology, Electronics and Microelectronics - MIPRO, 2017. pp. 226-231 (lecture, international peer review, full paper (in extenso), scientific)


Title
Running HPC applications on many million cores Cloud

Authors
Tomić, Draško ; Car, Zlatan ; Ogrizović, Dario

Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific

Source
MIPRO 2017 40th International Convention Proceedings / Skala, Karolj ; Trobec, Roman - Opatija : Croatian Society for Information and Communication Technology, Electronics and Microelectronics - MIPRO, 2017, 226-231

ISBN
978-953-233-069-4

Conference
MIPRO 2017 - 40th Jubilee International Convention on Information and Communication Technology, Electronics and Microelectronics

Place and date
Opatija, Croatia, 22-26 May 2017

Type of participation
Lecture

Type of review
International peer review

Keywords
HPC Cloud, Openstack, HPL, NAMD, Cloud controller

Abstract
Despite various hardware and software improvements in Cloud architecture, a huge performance gap still exists between commodity supercomputers and the Cloud when running communication-intensive HPC applications. To find out what prevents these applications from scaling better on the Cloud, we evaluated the HPL and NAMD benchmarks on an HPE Openstack testbed, and NAMD benchmarks on the supercomputer located at the Rijeka University Supercomputing Center. Our results revealed two major bottlenecks: the throughput of the interconnect, and the Cloud orchestration layer, which is responsible, among other things, for managing communication between Cloud instances. We also investigated the influence of jitter, but did not find a significant effect on performance. Our conclusion is that increasing the interconnect throughput alone will not improve the scalability of communication-intensive HPC applications in the Cloud. This is also backed up by NAMD benchmark runs performed at HP Labs and by the HPL benchmark performed at the San Diego Supercomputing Center. We propose two possible scenarios for improving scalability: one with a distributed model of the Cloud orchestration layer, and another with bare-metal containers. Efficient load balancing remains a must if we want to see HPC applications scale over many millions of Cloud cores. For this, we propose a novel SLEM-based load balancing strategy.

Original language
English

Scientific fields
Computer science



WORK AFFILIATIONS


Institutions
Tehnički fakultet, Rijeka,
Pomorski fakultet, Rijeka,
Sveučilište u Rijeci