Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1083902

Parallel reduction of four matrices to condensed form for a generalized matrix eigenvalue algorithm


Bosner, Nela
Parallel reduction of four matrices to condensed form for a generalized matrix eigenvalue algorithm // Numerical algorithms, 86 (2021), 1; 153-178 doi:10.1007/s11075-020-00883-z (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 1083902 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Parallel reduction of four matrices to condensed form for a generalized matrix eigenvalue algorithm

Autori
Bosner, Nela

Izvornik
Numerical algorithms (1017-1398) 86 (2021), 1; 153-178

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
Hessenberg-triangular-triangular--triangular form ; generalized singular value and eigenvalue problem ; Givens rotations ; block and parallel implementations

Sažetak
The VZ algorithm proposed by Charles F. Van Loan (SIMA, 1975) attempts to solve the generalized type of matrix eigenvalue problem ACx = λBDx, where A, B ∈ R^(n×m), C, D ∈ R^(m×n), and m ≥ n, without forming products and inverses. Especially, this algorithm is suitable for solving the generalized singular value problem. Van Loan’s approach first reduces the matrices A, B, C, and D to a condensed form by the finite step initial reduction. The reduction finds orthogonal matrices Q, U, V, and Z, such that QAZ is upper Hessenberg, and QBV, Z^TCU, and V^TDU are upper triangular. In this initial reduction, A is reduced to upper Hessenberg form, while simultaneously preserving triangularity of other three matrices. This is done by Givens rotations, annihilating one by one element of A, and by generating three more rotations applied to other matrices per each annihilation. Such an algorithm is quite inefficient. In our work, we propose a blocked algorithm for the initial reduction, based on aggregated Givens rotations and matrix–matrix multiplications, which are applied in the outer loop updates. This algorithm has another level of blocking, exploited in the inner loop. Further, we also consider a variant of the algorithm in a hybrid CPU–GPU framework, where compute-intensive outer loop updates are performed on GPU, and can be overlapped with the reduction in the next step performed on CPU. On the other hand, application of a sequence of rotations in the inner loop is parallelized on CPU, with balanced operation count per thread. Since a large number of aggregated rotations are produced in every outer loop step, they are simultaneously accumulated before outer loop updates. These adjustments speed up original initial reduction considerably which is confirmed by numerical experiments, and the efficiency of the whole VZ algorithm is increased.

Izvorni jezik
Engleski

Znanstvena područja
Matematika



POVEZANOST RADA


Projekti:
HRZZ-IP-2013-11-9345 - Matematičko modeliranje, analiza i računanje s primjenama na kompleksne mehaničke sustave (MMACACMS) (Drmač, Zlatko, HRZZ - 2013-11) ( CroRIS)

Ustanove:
Prirodoslovno-matematički fakultet, Matematički odjel, Zagreb,
Prirodoslovno-matematički fakultet, Zagreb

Profili:

Avatar Url Nela Bosner (autor)

Citiraj ovu publikaciju:

Bosner, Nela
Parallel reduction of four matrices to condensed form for a generalized matrix eigenvalue algorithm // Numerical algorithms, 86 (2021), 1; 153-178 doi:10.1007/s11075-020-00883-z (međunarodna recenzija, članak, znanstveni)
Bosner, N. (2021) Parallel reduction of four matrices to condensed form for a generalized matrix eigenvalue algorithm. Numerical algorithms, 86 (1), 153-178 doi:10.1007/s11075-020-00883-z.
@article{article, author = {Bosner, Nela}, year = {2021}, pages = {153-178}, DOI = {10.1007/s11075-020-00883-z}, keywords = {Hessenberg-triangular-triangular--triangular form, generalized singular value and eigenvalue problem, Givens rotations, block and parallel implementations}, journal = {Numerical algorithms}, doi = {10.1007/s11075-020-00883-z}, volume = {86}, number = {1}, issn = {1017-1398}, title = {Parallel reduction of four matrices to condensed form for a generalized matrix eigenvalue algorithm}, keyword = {Hessenberg-triangular-triangular--triangular form, generalized singular value and eigenvalue problem, Givens rotations, block and parallel implementations} }
@article{article, author = {Bosner, Nela}, year = {2021}, pages = {153-178}, DOI = {10.1007/s11075-020-00883-z}, keywords = {Hessenberg-triangular-triangular--triangular form, generalized singular value and eigenvalue problem, Givens rotations, block and parallel implementations}, journal = {Numerical algorithms}, doi = {10.1007/s11075-020-00883-z}, volume = {86}, number = {1}, issn = {1017-1398}, title = {Parallel reduction of four matrices to condensed form for a generalized matrix eigenvalue algorithm}, keyword = {Hessenberg-triangular-triangular--triangular form, generalized singular value and eigenvalue problem, Givens rotations, block and parallel implementations} }

Časopis indeksira:


  • Current Contents Connect (CCC)
  • Web of Science Core Collection (WoSCC)
    • Science Citation Index Expanded (SCI-EXP)
    • SCI-EXP, SSCI i/ili A&HCI
  • Scopus


Uključenost u ostale bibliografske baze podataka::


  • MathSciNet
  • Zentrallblatt für Mathematik/Mathematical Abstracts


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font