Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi

Parallel Solver for Shifted Systems in a Hybrid CPU--GPU Framework (CROSBI ID 259489)

Prilog u časopisu | izvorni znanstveni rad | međunarodna recenzija

Bosner, Nela ; Bujanović, Zvonimir ; Drmač, Zlatko Parallel Solver for Shifted Systems in a Hybrid CPU--GPU Framework // SIAM journal on scientific computing, 40 (2018), 4; C605-C633. doi: 10.1137/17m1144465

Podaci o odgovornosti

Bosner, Nela ; Bujanović, Zvonimir ; Drmač, Zlatko

engleski

Parallel Solver for Shifted Systems in a Hybrid CPU--GPU Framework

This paper proposes a combination of a hybrid CPU--GPU and a pure GPU software implementation of a direct algorithm for solving shifted linear systems $(A-\sigma I)X=B$ with a large number of complex shifts $\sigma$ and multiple right-hand sides. Such problems often appear, e.g., in control theory when evaluating the transfer function, or as a part of an algorithm performing interpolatory model reduction, as well as when computing pseudospectra and structured pseudospectra, or solving large linear systems of ordinary differential equations. The proposed algorithm first jointly reduces the general full $n\times n$ matrix $A$ and the $n\times m$ full right-hand side matrix $B$ to the controller Hessenberg canonical form that facilitates efficient solution: $A$ is transformed to a so-called $m$-Hessenberg form, and $B$ is made upper triangular. This is implemented as a blocked highly parallel CPU--GPU hybrid algorithm ; individual blocks are reduced by the CPU, and the necessary updates of the rest of the matrix are split among the cores of the CPU and the GPU. To enhance parallelization, the reduction and the updates are overlapped. In the next phase, the reduced $m$-Hessenberg--triangular systems are solved entirely on the GPU, with shifts divided into batches. The benefits of such load distribution are demonstrated by numerical experiments. In particular, we show that our proposed implementation provides an excellent basis for efficient implementations of computational methods in systems and control theory, from evaluation of transfer function to the interpolatory model reduction.

GPU ; Hessenberg matrix ; interpolatory model reduction ; parallel solver ; pseudospectrum ; shifted linear systems ; transfer function

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o izdanju

40 (4)

2018.

C605-C633

objavljeno

1064-8275

1095-7197

10.1137/17m1144465

Povezanost rada

Matematika

Poveznice
Indeksiranost