Q-learning by the nth step state and multi-agent negotiation in unknown environment

Job, Josip; Jović, Franjo; Livada, Časlav

Bibliographic record number: 594684

Q-learning by the nth step state and multi-agent negotiation in unknown environment // Tehnicki Vjesnik-Technical Gazette, 19 (2012), 3; 529-534 (international peer review, article, scientific)

CROSBI ID: 594684

Title
Q-learning by the nth step state and multi-agent negotiation in unknown environment

Authors
Job, Josip ; Jović, Franjo ; Livada, Časlav

Source
Tehnicki Vjesnik-Technical Gazette (1330-3651) 19 (2012), 3; 529-534

Type, subtype and category of work
Journal papers, article, scientific

Keywords
agent; learning from reward and punishment; q-learning; reinforcement learning

Abstract
This work presents a new Q-learning procedure in which the agent's decision about the next step is based not on the action that is optimal at that moment but on the usefulness of a future state. Communication between nearby agents has been implemented so that the agents signal their future actions to each other, which contributes to a better choice of actions for each agent. The new method is named Q-learning by the nth step and multi-agent negotiation. The test results for this algorithm are compared with those of the basic QL algorithm, the comparison is also shown graphically, and the advantages of the new algorithm are listed. On average, a 40 % decrease in collisions is obtained during the learning procedure.
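The idea in the abstract can be illustrated with a minimal Python sketch. Only `q_update` is the standard one-step tabular Q-learning rule, which stands in here for the basic QL baseline. Everything else is an assumption made for illustration and is not taken from the paper: the action set `ACTIONS`, the learned transition table `model`, the greedy rollout, and the function name `choose_by_future_state`. The record does not describe how the nth-step state is actually evaluated, and the multi-agent negotiation step is not sketched at all.

```python
from collections import defaultdict

# Hypothetical action set for a grid-like environment (assumption).
ACTIONS = ("up", "down", "left", "right")


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard one-step tabular Q-learning update (the basic QL baseline)."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * td_error
    return q[(state, action)]


def state_value(q, state):
    """Usefulness of a state, taken here as its best Q-value (assumption)."""
    return max(q[(state, a)] for a in ACTIONS)


def choose_by_future_state(q, model, state, n=2):
    """Pick the first action whose n-step rollout ends in the most useful state.

    `model` maps (state, action) -> next state for transitions seen so far;
    unseen transitions are assumed to leave the state unchanged. After the
    first action, the rollout follows the greedy action under `q`.
    This is an illustrative guess, not the authors' procedure.
    """
    best_action, best_value = None, float("-inf")
    for first in ACTIONS:
        s = model.get((state, first), state)
        for _ in range(n - 1):
            greedy = max(ACTIONS, key=lambda a: q[(s, a)])
            s = model.get((s, greedy), s)
        value = state_value(q, s)
        if value > best_value:  # strict comparison: ties keep the earlier action
            best_action, best_value = first, value
    return best_action


# Tiny usage example with made-up states 0..3.
q = defaultdict(float)
updated = q_update(q, 0, "right", 1.0, 1)

q2 = defaultdict(float)
q2[(1, "right")] = 1.0
q2[(2, "up")] = 5.0
model = {(0, "right"): 1, (1, "right"): 2, (0, "left"): 3}
chosen = choose_by_future_state(q2, model, 0, n=2)
```

In this toy setup the agent prefers `"right"` from state 0 because the state reached two steps ahead has the highest stored value, even though every immediate Q-value in state 0 is zero.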

Original language
English

Scientific fields
Computer science

AFFILIATIONS

Institutions:
Fakultet elektrotehnike, računarstva i informacijskih tehnologija Osijek

Profiles:

Franjo Jović (author)