Data source: CROSBI

Human action prediction in collaborative environments based on shared-weight LSTMs with feature dimensionality reduction (CROSBI ID 311978)

Journal article | original scientific paper | international peer review

Petković, Tomislav ; Petrović, Luka ; Marković, Ivan ; Petrović, Ivan Human action prediction in collaborative environments based on shared-weight LSTMs with feature dimensionality reduction // Applied soft computing, 126 (2022), 109245, 12. doi: 10.1016/j.asoc.2022.109245

Statement of responsibility

Petković, Tomislav ; Petrović, Luka ; Marković, Ivan ; Petrović, Ivan

English

Human action prediction in collaborative environments based on shared-weight LSTMs with feature dimensionality reduction

As robots are progressing towards being ubiquitous and an indispensable part of our everyday environments, such as home, offices, healthcare, education, and manufacturing shop floors, efficient and safe collaboration and cohabitation become imperative. Given that, such environments could benefit greatly from accurate human action prediction. In addition to being accurate, human action prediction should be computationally efficient, in order to ensure a timely reaction, and capable of dealing with changing environments, since unstructured interaction and collaboration with humans usually do not assume static conditions. In this paper, we propose a model for human action prediction based on motion cues and gaze using shared-weight Long Short-Term Memory networks (LSTMs) and feature dimensionality reduction. LSTMs have proven to be a powerful tool in processing time series data, especially when dealing with long-term dependencies; however, to maximize their performance, LSTM networks should be fed with informative and quality inputs. Given that, in this paper, we furthermore conducted an extensive input feature analysis based on (i) signal correlation and their strength to act as stand-alone predictors, and (ii) a multilayer perceptron inspired by the autoencoder architecture. We validated the proposed model on the publicly available MoGaze dataset for human action prediction, as well as on a smaller dataset recorded in our laboratory. Our model outperformed alternatives, such as recurrent neural networks, a fully connected LSTM network, and the strongest stand-alone signals (baselines), and can run in real-time on a standard laptop CPU. Since eye gaze might not always be available in a real-world scenario, we have implemented and tested a multilayer perceptron for gaze estimation from more easily obtainable motion cues, such as head orientation and hand position. The estimated gaze signal can be utilized during inference of our LSTM-based model, thus making our action prediction pipeline suitable for real-time practical applications.
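
The abstract mentions an input feature analysis based on signal correlation and on each signal's strength as a stand-alone predictor. As a rough illustration only, and not the authors' procedure, the following minimal sketch ranks candidate input signals by their absolute Pearson correlation with a target and skips signals that are nearly redundant with one already kept. The function names, the signal names, both thresholds, and the toy data are all assumptions made up for this example.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant signal carries no linear information
    return cov / sqrt(var_x * var_y)


def select_features(signals, target, min_relevance=0.1, max_redundancy=0.95):
    """Greedy correlation screening (hypothetical helper, not from the paper).

    1. Rank signals by |correlation with the target| (stand-alone strength).
    2. Drop signals whose relevance is below `min_relevance`.
    3. Drop signals that correlate too strongly with an already kept signal.
    """
    relevance = {name: abs(pearson(values, target)) for name, values in signals.items()}
    ranked = sorted(signals, key=lambda name: relevance[name], reverse=True)
    kept = []
    for name in ranked:
        if relevance[name] < min_relevance:
            continue
        redundant = any(
            abs(pearson(signals[name], signals[other])) >= max_redundancy
            for other in kept
        )
        if not redundant:
            kept.append(name)
    return kept


# Toy data, invented purely for illustration.
signals = {
    "hand_to_object_distance": [0.9, 0.7, 0.5, 0.3, 0.1],
    "hand_to_object_distance_cm": [90, 70, 50, 30, 10],  # same signal, other unit
    "gaze_angle_to_object": [0.8, 0.2, 0.6, 0.1, 0.3],
    "constant_offset": [1.0, 1.0, 1.0, 1.0, 1.0],
}
target = [0.0, 0.1, 0.5, 0.7, 1.0]  # e.g. a score that the object is the next goal

selected = select_features(signals, target)
print(selected)
```

In this toy setup only one of the two distance signals survives, because they are perfectly correlated with each other, and the constant signal is dropped for having no relevance. A screening step of this kind only captures linear relationships, which is one reason a learned reduction such as the autoencoder-inspired perceptron named in the abstract may be used alongside it.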

Human action prediction ; Long short-term memory networks ; Feature dimensionality reduction ; Correlation ; Autoencoder ; Gaze estimation

Publication details

Volume: 126
Year: 2022
Article number: 109245
Number of pages: 12
Status: published
ISSN: 1568-4946
DOI: 10.1016/j.asoc.2022.109245

Related fields

Electrical engineering, Computing, Basic technical sciences
