Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 773950

MISSING DATA PROBLEMS IN NON-GAUSSIAN PROBABILITY DISTRIBUTIONS


Gotal Dmitrović, Lovorka; Dušak, Vesna; Dobša, Jasminka
MISSING DATA PROBLEMS IN NON-GAUSSIAN PROBABILITY DISTRIBUTIONS // Informatologia, 49 (2016), 3-4; 129-258 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 773950 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
MISSING DATA PROBLEMS IN NON-GAUSSIAN PROBABILITY DISTRIBUTIONS

Autori
Gotal Dmitrović, Lovorka ; Dušak, Vesna ; Dobša, Jasminka

Izvornik
Informatologia (1330-0067) 49 (2016), 3-4; 129-258

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
missing data ; imputation methods ; probability distribution ; ecoinformatics

Sažetak
Ecology as a scientific discipline has been developing rapidly and becoming the interdisciplinary science based on Information and Communication Technologies (ICT). Discovering, integrating and analyzing a huge amount of heterogeneous data is crucial in exploring complex ecological issues. Ecoinformatics offers tools and approaches for the management of environmental data which it transforms further into information and knowledge. The development of Information Technologies with the special emphasis on the research methods of gathering and analyzing data, their storage and data access, has significantly enhanced the laboratory methods and their reports. The above, influences the data quality, as well as the research itself. Moreover, it provides a stable base for the development and the replacement of missing data. The improper missing data handling can lead to invalid conclusions. Therefore, it is important to use the adequate methods for handling the missing data. This paper compares The Deleting Rows Method (Listwise Deletion Method) and six single imputation methods, namely: Last Observation Carried Forward (LOCF), Hot-deck Imputation, Group Mean Imputation, Estimated Mean Value Imputation (Regression), Mode Imputation , and Median Imputation. For the purposes of this study, the actual, empirical data was collected and used from the non- Gaussian probability distribution of the observed technical system. Mostly, these are asymmetric probability distributions with a tail. Data sets with missing data were created by deleting values with a random number generator. The experiment was repeated three times for each 100%, 95% and 75% sets of the collected data. Experiments have shown that the best imputation data results were provided by Hot-Deck Method, especially when there was a larger number of missing data, which has been confirmed by the Tests of Goodness. The same results, regardless of the set size, were provided by Listwise Deletion Method, which is simpler.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Ustanove:
Fakultet organizacije i informatike, Varaždin,
Sveučilište Sjever, Koprivnica

Poveznice na cjeloviti tekst rada:

Pristup cjelovitom tekstu rada

Citiraj ovu publikaciju:

Gotal Dmitrović, Lovorka; Dušak, Vesna; Dobša, Jasminka
MISSING DATA PROBLEMS IN NON-GAUSSIAN PROBABILITY DISTRIBUTIONS // Informatologia, 49 (2016), 3-4; 129-258 (međunarodna recenzija, članak, znanstveni)
Gotal Dmitrović, L., Dušak, V. & Dobša, J. (2016) MISSING DATA PROBLEMS IN NON-GAUSSIAN PROBABILITY DISTRIBUTIONS. Informatologia, 49 (3-4), 129-258.
@article{article, author = {Gotal Dmitrovi\'{c}, Lovorka and Du\v{s}ak, Vesna and Dob\v{s}a, Jasminka}, year = {2016}, pages = {129-258}, keywords = {missing data, imputation methods, probability distribution, ecoinformatics}, journal = {Informatologia}, volume = {49}, number = {3-4}, issn = {1330-0067}, title = {MISSING DATA PROBLEMS IN NON-GAUSSIAN PROBABILITY DISTRIBUTIONS}, keyword = {missing data, imputation methods, probability distribution, ecoinformatics} }
@article{article, author = {Gotal Dmitrovi\'{c}, Lovorka and Du\v{s}ak, Vesna and Dob\v{s}a, Jasminka}, year = {2016}, pages = {129-258}, keywords = {missing data, imputation methods, probability distribution, ecoinformatics}, journal = {Informatologia}, volume = {49}, number = {3-4}, issn = {1330-0067}, title = {MISSING DATA PROBLEMS IN NON-GAUSSIAN PROBABILITY DISTRIBUTIONS}, keyword = {missing data, imputation methods, probability distribution, ecoinformatics} }

Časopis indeksira:


  • Scopus


Uključenost u ostale bibliografske baze podataka::


  • European Reference Index for the Humanities (ERIH)
  • SCOPUS
  • EBSCO
  • Social Science Research Network – SSRN
  • ProQuest - Advanced Technologies & Aerospace Journals, Computer Science Journals, Illustrata: Technology, Library Science, SciTech Journals, Technology Journals
  • LISA-Library and Information Science Abstracts
  • Scientific Commons
  • Portal of scientific journals of Croatia – HRČAK
  • DOAJ...





Contrast
Increase Font
Decrease Font
Dyslexic Font