Pregled bibliografske jedinice broj: 1113435
Multimodal semantic forecasting based on conditional generation of future features
Multimodal semantic forecasting based on conditional generation of future features // Pattern Recognition 42nd DAGM German Conference, DAGM GCPR 2020, Tübingen, Germany, September 28 – October 1, 2020, Proceedings / Akata, Zeynep ; Geiger, Andreas ; Sattler, Torsten (ur.).
Tübingen, Njemačka: Springer, 2020. str. 1-14 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 1113435 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Multimodal semantic forecasting based on conditional generation of future features
Autori
Fugošić, Kristijan ; Šarić, Josip ; Šegvić, Siniša
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Pattern Recognition 42nd DAGM German Conference, DAGM GCPR 2020, Tübingen, Germany, September 28 – October 1, 2020, Proceedings
/ Akata, Zeynep ; Geiger, Andreas ; Sattler, Torsten - : Springer, 2020, 1-14
ISBN
978-3-030-71277-8
Skup
42nd German Conference on Pattern Recognition (DAGM GCPR 2020)
Mjesto i datum
Tübingen, Njemačka, 28.09.2020. - 01.10.2020
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
Computer vision, semantic segmentation, Semantic forecasting,
Sažetak
This paper considers semantic forecasting in road-driving scenes. Most existing approaches address this problem as deterministic regression of future features or future predictions given observed frames. However, such approaches ignore the fact that future can not always be guessed with certainty. For example, when a car is about to turn around a corner, the road which is currently occluded by buildings may turn out to be either free to drive, or occupied by people, other vehicles or roadworks. When a deterministic model confronts such situation, its best guess is to forecast the most likely outcome. However, this is not acceptable since it defeats the purpose of forecasting to improve security. It also throws away valuable training data, since a deterministic model is unable to learn any deviation from the norm. We address this problem by providing more freedom to the model through allowing it to forecast different futures. We propose to formulate multimodal forecasting as sampling of a multimodal generative model conditioned on the observed frames. Experiments on the Cityscapes dataset reveal that our multimodal model outperforms its deterministic counterpart in short-term forecasting while performing slightly worse in the mid-term case.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb
Citiraj ovu publikaciju:
Časopis indeksira:
- Scopus