Multimodal semantic forecasting based on conditional generation of future features

Fugošić, Kristijan; Šarić, Josip; Šegvić, Siniša

Pregled bibliografske jedinice broj: 1113435

Multimodal semantic forecasting based on conditional generation of future features

Fugošić, Kristijan; Šarić, Josip; Šegvić, Siniša

Multimodal semantic forecasting based on conditional generation of future features // Pattern Recognition 42nd DAGM German Conference, DAGM GCPR 2020, Tübingen, Germany, September 28 – October 1, 2020, Proceedings / Akata, Zeynep ; Geiger, Andreas ; Sattler, Torsten (ur.).
Tübingen, Njemačka: Springer, 2020. str. 1-14 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)

CROSBI ID: 1113435 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Multimodal semantic forecasting based on conditional generation of future features

Autori
Fugošić, Kristijan ; Šarić, Josip ; Šegvić, Siniša

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Pattern Recognition 42nd DAGM German Conference, DAGM GCPR 2020, Tübingen, Germany, September 28 – October 1, 2020, Proceedings / Akata, Zeynep ; Geiger, Andreas ; Sattler, Torsten - : Springer, 2020, 1-14

ISBN
978-3-030-71277-8

Skup
42nd German Conference on Pattern Recognition (DAGM GCPR 2020)

Mjesto i datum
Tübingen, Njemačka, 28.09.2020. - 01.10.2020

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
Computer vision, semantic segmentation, Semantic forecasting,

Sažetak
This paper considers semantic forecasting in road-driving scenes. Most existing approaches address this problem as deterministic regression of future features or future predictions given observed frames. However, such approaches ignore the fact that future can not always be guessed with certainty. For example, when a car is about to turn around a corner, the road which is currently occluded by buildings may turn out to be either free to drive, or occupied by people, other vehicles or roadworks. When a deterministic model confronts such situation, its best guess is to forecast the most likely outcome. However, this is not acceptable since it defeats the purpose of forecasting to improve security. It also throws away valuable training data, since a deterministic model is unable to learn any deviation from the norm. We address this problem by providing more freedom to the model through allowing it to forecast different futures. We propose to formulate multimodal forecasting as sampling of a multimodal generative model conditioned on the observed frames. Experiments on the Cityscapes dataset reveal that our multimodal model outperforms its deterministic counterpart in short-term forecasting while performing slightly worse in the mid-term case.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo

POVEZANOST RADA

Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb

Profili:

Josip Šarić (autor)