Pregled bibliografske jedinice broj: 1021446
Developing End-to-End Control Policies for Robotic Swarms Using Deep Q-learning
Developing End-to-End Control Policies for Robotic Swarms Using Deep Q-learning // Journal of Advanced Computational Intelligence and Intelligent Informatics, 23 (2019), 5; 920-927 doi:10.20965/jaciii.2019.p0920 (međunarodna recenzija, članak, znanstveni)
CROSBI ID: 1021446 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Developing End-to-End Control Policies for Robotic Swarms Using Deep Q-learning
Autori
Wei, Yufei ; Nie, Xiaotong ; Hiraga, Motoaki ; Ohkura, Kazuhiro ; Car, Zlatan
Izvornik
Journal of Advanced Computational Intelligence and Intelligent Informatics (1343-0130) 23
(2019), 5;
920-927
Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni
Ključne riječi
swarm robotics, automatic design, deep reinforcement learning, deep Q-learning
Sažetak
In this study, the use of a popular deep reinforcement learning algorithm – deep Q- learning – in developing end-to-end control policies for robotic swarms is explored. Robots only have limited local sensory capabilities ; however, in a swarm, they can accomplish collective tasks beyond the capability of a single robot. Compared with most automatic design approaches proposed so far, which belong to the field of evolutionary robotics, deep reinforcement learning techniques provide two advantages: (i) they enable researchers to develop control policies in an end-to-end fashion ; and (ii) they require fewer computation resources, especially when the control policy to be developed has a large parameter space. The proposed approach is evaluated in a round-trip task, where the robots are required to travel between two destinations as much as possible. Simulation results show that the proposed approach can learn control policies directly from high- dimensional raw camera pixel inputs for robotic swarms.
Izvorni jezik
Engleski
Znanstvena područja
Elektrotehnika, Računarstvo, Strojarstvo
Citiraj ovu publikaciju:
Časopis indeksira:
- Web of Science Core Collection (WoSCC)
- Emerging Sources Citation Index (ESCI)
- Scopus