Developing End-to-End Control Policies for Robotic Swarms Using Deep Q-learning

Wei, Yufei; Nie, Xiaotong; Hiraga, Motoaki; Ohkura, Kazuhiro; Car, Zlatan

doi:10.20965/jaciii.2019.p0920

Pregled bibliografske jedinice broj: 1021446

Developing End-to-End Control Policies for Robotic Swarms Using Deep Q-learning

Wei, Yufei; Nie, Xiaotong; Hiraga, Motoaki; Ohkura, Kazuhiro; Car, Zlatan

Developing End-to-End Control Policies for Robotic Swarms Using Deep Q-learning // Journal of Advanced Computational Intelligence and Intelligent Informatics, 23 (2019), 5; 920-927 doi:10.20965/jaciii.2019.p0920 (međunarodna recenzija, članak, znanstveni)

CROSBI ID: 1021446 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Developing End-to-End Control Policies for Robotic Swarms Using Deep Q-learning

Autori
Wei, Yufei ; Nie, Xiaotong ; Hiraga, Motoaki ; Ohkura, Kazuhiro ; Car, Zlatan

Izvornik
Journal of Advanced Computational Intelligence and Intelligent Informatics (1343-0130) 23 (2019), 5; 920-927

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
swarm robotics, automatic design, deep reinforcement learning, deep Q-learning

Sažetak
In this study, the use of a popular deep reinforcement learning algorithm – deep Q- learning – in developing end-to-end control policies for robotic swarms is explored. Robots only have limited local sensory capabilities ; however, in a swarm, they can accomplish collective tasks beyond the capability of a single robot. Compared with most automatic design approaches proposed so far, which belong to the field of evolutionary robotics, deep reinforcement learning techniques provide two advantages: (i) they enable researchers to develop control policies in an end-to-end fashion ; and (ii) they require fewer computation resources, especially when the control policy to be developed has a large parameter space. The proposed approach is evaluated in a round-trip task, where the robots are required to travel between two destinations as much as possible. Simulation results show that the proposed approach can learn control policies directly from high- dimensional raw camera pixel inputs for robotic swarms.

Izvorni jezik
Engleski

Znanstvena područja
Elektrotehnika, Računarstvo, Strojarstvo

POVEZANOST RADA

Ustanove:
Tehnički fakultet, Rijeka

Profili:

Zlatan Car (autor)

Poveznice na cjeloviti tekst rada:

doi www.fujipress.jp

Citiraj ovu publikaciju:

Časopis indeksira:

Web of Science Core Collection (WoSCC)

Emerging Sources Citation Index (ESCI)

Scopus

CROSBI Hrvatska znanstvena bibliografija

Pregled bibliografske jedinice broj: 1021446

Developing End-to-End Control Policies for Robotic Swarms Using Deep Q-learning

Poveznice na cjeloviti tekst rada:

Citiraj ovu publikaciju:

Časopis indeksira:

Citati:

Altmetrijski pokazatelji:

Pregled bibliografske jedinice broj: 1021446

Developing End-to-End Control Policies for Robotic Swarms Using Deep Q-learning

Poveznice na cjeloviti tekst rada:

Citiraj ovu publikaciju:

Časopis indeksira:

Citati:

Altmetrijski pokazatelji:

Podijeli: