Automated Aerial Suspended Cargo Delivery through Reinforcement Learning

Faust, Aleksandra; Palunko, Ivana; Cruz, Patricio; Fierro, Rafael; Tapia, Lydia

doi:10.1016/j.artint.2014.11.009

Pregled bibliografske jedinice broj: 733775

Automated Aerial Suspended Cargo Delivery through Reinforcement Learning

Faust, Aleksandra; Palunko, Ivana; Cruz, Patricio; Fierro, Rafael; Tapia, Lydia

Automated Aerial Suspended Cargo Delivery through Reinforcement Learning // Artificial intelligence, 247 (2017), 381-398 doi:10.1016/j.artint.2014.11.009 (međunarodna recenzija, članak, znanstveni)

CROSBI ID: 733775 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Automated Aerial Suspended Cargo Delivery through Reinforcement Learning

Autori
Faust, Aleksandra ; Palunko, Ivana ; Cruz, Patricio ; Fierro, Rafael ; Tapia, Lydia

Izvornik
Artificial intelligence (0004-3702) 247 (2017); 381-398

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
Reinforcement learning ; aerial load transportation ; quadrotors

Sažetak
Cargo-bearing Unmanned aerial vehicles (UAVs) have tremendous potential to assist humans in food, medicine, and supply deliveries. For time-critical cargo delivery tasks, UAVs need to be able to navigate their environments and deliver suspended payloads with bounded load displacement. As a constraint balancing task for joint UAV- suspended load system dynamics, this task poses a challenge. This article presents a reinforcement learning approach to aerial cargo delivery tasks in environments with static obstacles. We first learn a minimal residual oscillations task policy in obstacle- free environments that find trajectories with minimized residual load displacement with a specifically designed feature vector for value function approximation. With insights of learning from the cargo delivery problem, we define a set of formal criteria for class of robotics problems where learning can occur in a simplified problem space and transfer to a broader problem space. Exploiting this property, we create a path tracking method that suppresses load displacement. As an extension to tasks in environments with static obstacles where the load displacement needs to be bounded throughout the trajectory, sampling-based motion planning generates collision-free paths. Next, a reinforcement learning agent transforms these paths into trajectories that maintain the bound on the load displacement while following the collision-free path in a timely manner. We verify the approach both in simulation and in experiments on quadrotor with suspended load.

Izvorni jezik
Engleski

Znanstvena područja
Elektrotehnika, Računarstvo

POVEZANOST RADA

Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb

Profili:

Ivana Palunko (autor)

Poveznice na cjeloviti tekst rada:

doi www.sciencedirect.com

Citiraj ovu publikaciju:

Časopis indeksira:

Current Contents Connect (CCC)
Web of Science Core Collection (WoSCC)

Science Citation Index Expanded (SCI-EXP)
SCI-EXP, SSCI i/ili A&HCI

Scopus

CROSBI Hrvatska znanstvena bibliografija

Pregled bibliografske jedinice broj: 733775

Automated Aerial Suspended Cargo Delivery through Reinforcement Learning

Poveznice na cjeloviti tekst rada:

Citiraj ovu publikaciju:

Časopis indeksira:

Citati:

Altmetrijski pokazatelji:

Pregled bibliografske jedinice broj: 733775

Automated Aerial Suspended Cargo Delivery through Reinforcement Learning

Poveznice na cjeloviti tekst rada:

Citiraj ovu publikaciju:

Časopis indeksira:

Citati:

Altmetrijski pokazatelji:

Podijeli: