Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 952293

Accelerating the computation of FLAPW methods on heterogeneous architectures


Davidović, Davor; Fabregat-Traver, Diego; Höhnerbach, Markus; Di Napoli, Edoardo
Accelerating the computation of FLAPW methods on heterogeneous architectures // Concurrency and computation-practice & experience, 30 (2018), 24; e4905, 14 doi:10.1002/cpe.4905 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 952293 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Accelerating the computation of FLAPW methods on heterogeneous architectures

Autori
Davidović, Davor ; Fabregat-Traver, Diego ; Höhnerbach, Markus ; Di Napoli, Edoardo

Izvornik
Concurrency and computation-practice & experience (1532-0626) 30 (2018), 24; E4905, 14

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
FLAPW ; FLEUR ; hybrid BLAS ; multiGPU ; Phi ; portability ; scalability

Sažetak
Legacy codes in computational science and engineering have been very successful in providing essential functionality to researchers. However, they are not capable of exploiting the massive parallelism provided by emerging heterogeneous architectures. The lack of portable performance and scalability puts them at high risk, ie, either they evolve or they are destined to be executed on older platforms and small clusters. One example of a legacy code which would heavily benefit from a modern redesign is FLEUR, a software for electronic structure calculations. In previous work, the computational bottleneck of FLEUR was partially re-engineered to have a modular design that relies on standard building blocks, namely, BLAS and LAPACK libraries. In this paper, we demonstrate how the initial redesign enables the portability to heterogeneous architectures. More specifically, we study different approaches to port the code to architectures consisting of multi-core CPUs equipped with one or more coprocessors such as Nvidia GPUs and Intel Xeon Phis. Our final code attains over 70% of the architectures' peak performance and outperforms Nvidia's and Intel's libraries. On JURECA, the large tier-0 cluster where FLEUR is often executed, the code takes advantage of the full power of the computing nodes, attaining 5× speedup over the sole use of the CPUs.

Izvorni jezik
Engleski

Znanstvena područja
Matematika, Fizika, Računarstvo



POVEZANOST RADA


Ustanove:
Institut "Ruđer Bošković", Zagreb

Profili:

Avatar Url Davor Davidović (autor)

Citiraj ovu publikaciju

Davidović, Davor; Fabregat-Traver, Diego; Höhnerbach, Markus; Di Napoli, Edoardo
Accelerating the computation of FLAPW methods on heterogeneous architectures // Concurrency and computation-practice & experience, 30 (2018), 24; e4905, 14 doi:10.1002/cpe.4905 (međunarodna recenzija, članak, znanstveni)
Davidović, D., Fabregat-Traver, D., Höhnerbach, M. & Di Napoli, E. (2018) Accelerating the computation of FLAPW methods on heterogeneous architectures. Concurrency and computation-practice & experience, 30 (24), e4905, 14 doi:10.1002/cpe.4905.
@article{article, year = {2018}, pages = {14}, DOI = {10.1002/cpe.4905}, chapter = {e4905}, keywords = {FLAPW, FLEUR, hybrid BLAS, multiGPU, Phi, portability, scalability}, journal = {Concurrency and computation-practice and experience}, doi = {10.1002/cpe.4905}, volume = {30}, number = {24}, issn = {1532-0626}, title = {Accelerating the computation of FLAPW methods on heterogeneous architectures}, keyword = {FLAPW, FLEUR, hybrid BLAS, multiGPU, Phi, portability, scalability}, chapternumber = {e4905} }
@article{article, year = {2018}, pages = {14}, DOI = {10.1002/cpe.4905}, chapter = {e4905}, keywords = {FLAPW, FLEUR, hybrid BLAS, multiGPU, Phi, portability, scalability}, journal = {Concurrency and computation-practice and experience}, doi = {10.1002/cpe.4905}, volume = {30}, number = {24}, issn = {1532-0626}, title = {Accelerating the computation of FLAPW methods on heterogeneous architectures}, keyword = {FLAPW, FLEUR, hybrid BLAS, multiGPU, Phi, portability, scalability}, chapternumber = {e4905} }

Časopis indeksira:


  • Current Contents Connect (CCC)
  • Web of Science Core Collection (WoSCC)
    • Science Citation Index Expanded (SCI-EXP)
    • SCI-EXP, SSCI i/ili A&HCI
  • Scopus


Citati





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font