Pregled bibliografske jedinice broj: 1212026
Eksploratorna i dubinska analiza u okruženju Velikih podataka
Eksploratorna i dubinska analiza u okruženju Velikih podataka, 2020., diplomski rad, diplomski, Fakultet Elektrotehnike i Računarstva, Zagreb
CROSBI ID: 1212026 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Eksploratorna i dubinska analiza u okruženju Velikih podataka
(Exploratory and Data Mining Analysis in Big Data Environment)
Autori
Tin Ivan Križ
Vrsta, podvrsta i kategorija rada
Ocjenski radovi, diplomski rad, diplomski
Fakultet
Fakultet Elektrotehnike i Računarstva
Mjesto
Zagreb
Datum
07.07
Godina
2020
Stranica
62
Mentor
Pintar, Damir
Ključne riječi
Apache Spark ; R, veliki podaci ; sparklyr ; eksploratorna analiza podataka ; rudarenje podataka ; sustav napojnica ; taksi vožnje
(Apache Spark ; R ; big data ; sparklyr ; exploratory data analysis ; data mining ; tipping system ; taxi rides)
Sažetak
S global datasphere is growing faster every year, data analysis is becoming an increasingly challenging task. This paper goes through a big data exploratory analysis and data mining workflow using a distributed data processing environment. Specifically, it presents an analysis of tips in New York City's taxi rides from January 2009 to June 2016 using the R programming language and the Apache Spark platform. Although twenty-seven features have been explored and three different models have been trained, the research shows that predicting tips in taxi rides is a difficult task and that tipping is mostly motivated by social pressure.
Izvorni jezik
Engleski
Znanstvena područja
Elektrotehnika, Računarstvo
POVEZANOST RADA
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb
Profili:
Damir Pintar
(mentor)