Choosing proper normalization is essential for discovery of sparse glycan biomarkers

Uh, Hae-Won; Klaric, Lucija; Ugrina, Ivo; Lauc, Gordan; Smilde, Age L.; Houwing-Duistermaat, Jeanine J.

doi:10.1039/C9MO00174C

Pregled bibliografske jedinice broj: 1108043

Choosing proper normalization is essential for discovery of sparse glycan biomarkers

Uh, Hae-Won; Klaric, Lucija; Ugrina, Ivo; Lauc, Gordan; Smilde, Age L.; Houwing-Duistermaat, Jeanine J.

Choosing proper normalization is essential for discovery of sparse glycan biomarkers // Molecular omics, 16 (2020), 3; 231-242 doi:10.1039/C9MO00174C (međunarodna recenzija, članak, znanstveni)

CROSBI ID: 1108043 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Choosing proper normalization is essential for discovery of sparse glycan biomarkers

Autori
Uh, Hae-Won ; Klaric, Lucija ; Ugrina, Ivo ; Lauc, Gordan ; Smilde, Age L. ; Houwing-Duistermaat, Jeanine J.

Izvornik
Molecular omics (2515-4184) 16 (2020), 3; 231-242

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
Glycans ; Normalization ; Algorithms ; Biomarkers ; Calibration ; Glycomics ; Mass Spectrometry ; Glycan ; Biological marker

Sažetak
Rapid progress in high-throughput glycomics analysis enables the researchers to conduct large sample studies. Typically, the between-subject differences in total abundance of raw glycomics data are very large, and it is necessary to reduce the differences, making measurements comparable across samples. Essentially there are two ways to approach this issue: row-wise and column-wise normalization. In glycomics, the differences per subject are usually forced to be exactly zero, by scaling each sample having the sum of all glycan intensities equal to 100%. This total area (row-wise) normalization (TA) results in so-called compositional data, rendering many standard multivariate statistical methods inappropriate or inapplicable. Ignoring the compositional nature of the data, moreover, may lead to spurious results. Alternatively, a log- transformation to the raw data can be performed prior to column-wise normalization and implementing standard statistical tools. Until now, there is no clear consensus on the appropriate normalization method applied to glycomics data. Nor is systematic investigation of impact of TA on downstream analysis available to justify the choice of TA. Our motivation lies in efficient variable selection to identify glycan biomarkers with regard to accurate prediction as well as interpretability of the model chosen.Viaextensive simulations we investigate how different normalization methods affect the performance of variable selection, and compare their performance. We also address the effect of various types of measurement error in glycans: additive, multiplicative and two-component error. We show that when sample-wise differences are not large row-wise normalization (like TA) can have deleterious effects on variable selection and prediction.

Izvorni jezik
Engleski

Znanstvena područja
Matematika, Biologija, Interdisciplinarne prirodne znanosti

POVEZANOST RADA

Ustanove:
Farmaceutsko-biokemijski fakultet, Zagreb,
Prirodoslovno-matematički fakultet, Split,
GENOS d.o.o.

Profili:

Gordan Lauc (autor)

Ivo Ugrina (autor)

Poveznice na cjeloviti tekst rada:

doi pubs.rsc.org

Citiraj ovu publikaciju:

Časopis indeksira:

Current Contents Connect (CCC)
Web of Science Core Collection (WoSCC)

Science Citation Index Expanded (SCI-EXP)
SCI-EXP, SSCI i/ili A&HCI

Scopus
MEDLINE

CROSBI Hrvatska znanstvena bibliografija

Pregled bibliografske jedinice broj: 1108043

Choosing proper normalization is essential for discovery of sparse glycan biomarkers

Poveznice na cjeloviti tekst rada:

Citiraj ovu publikaciju:

Časopis indeksira:

Citati:

Altmetrijski pokazatelji:

Pregled bibliografske jedinice broj: 1108043

Choosing proper normalization is essential for discovery of sparse glycan biomarkers

Poveznice na cjeloviti tekst rada:

Citiraj ovu publikaciju:

Časopis indeksira:

Citati:

Altmetrijski pokazatelji:

Podijeli: