Learning Suboptimal Broadcasting Intervals in Multi- Agent Systems

Tolić, Domagoj; Palunko, Ivana

izvor podataka: crosbi ✓

Learning Suboptimal Broadcasting Intervals in Multi- Agent Systems (CROSBI ID 237135)

Prilog u časopisu | izvorni znanstveni rad | međunarodna recenzija

Tolić, Domagoj ; Palunko, Ivana Learning Suboptimal Broadcasting Intervals in Multi- Agent Systems // IFAC-PapersOnLine, 50 (2017), 1; 4144-4149. doi: 10.1016/j.ifacol.2017.08.802

Podaci o odgovornosti

Autori

Tolić, Domagoj ; Palunko, Ivana

Osnovni podaci na izvornom jeziku
Osnovni podaci na ostalim jezicima

Jezik

engleski

Naslov

Learning Suboptimal Broadcasting Intervals in Multi- Agent Systems

Sažetak

In this paper, agents learn how often to exchange information with neighbors in cooperative Multi-Agent Systems (MASs) such that their user-defined cost functions are minimized. The investigated cost functions capture trade-offs between the MAS local control performance and energy consumption of each agent in the presence of exogenous disturbances. Agent energy consumption is critical for prolonging the MAS mission and is comprised of both control (e.g., acceleration, velocity) and communication efforts. The proposed methodology starts off by computing upper bounds on asynchronous broadcasting intervals that provably stabilize the MAS. Subsequently, we utilize these upper bounds as optimization constraints and employ an online learning algorithm based on Least Square Policy Iteration (LSPI) to minimize the cost function for each agent. Consequently, the obtained broadcasting intervals adapt to the most recent information (e.g., delayed and noisy agents' inputs and/or outputs) received from neighbors and provably stabilize the MAS. Chebyshev polynomials are utilized as the approximator in the LSPI while Kalman Filtering (KF) handles sampled, corrupted and delayed data. The proposed methodology is exemplified in a consensus control problem with general linear agent dynamics.

Ključne riječi

Multi-Agent Systems, Decentralized Control, Reinforcement Learning, Optimal Control

Napomena

nije evidentirano

Jezik

nije evidentirano

Naslov

nije evidentirano

Sažetak

nije evidentirano

Ključne riječi

nije evidentirano

Napomena

nije evidentirano

Podaci o izdanju

Časopis

IFAC-PapersOnLine

Volumen (broj)

50 (1)

Godina

2017.

Stranice rada

4144-4149

Status objave rada

objavljeno

ISSN

2405-8971

e-ISSN

2405-8963

DOI

10.1016/j.ifacol.2017.08.802

Povezanost rada

Povezane osobe

Domagoj Tolić (autor/i)

Ivana Palunko (autor/i)

Povezane ustanove

Sveučilište u Dubrovniku (275) (autorova ustanova)

Povezani projekti

Upravljanje dinamičkim sustavima (rezultat rada na projektu)

Područje

Elektrotehnika

Poveznice

doi.org

sciencedirect.com

Indeksiranost

Scopus