logo
ResearchBunny Logo
Revealing principles of autonomous thermal soaring in windy conditions using vulture-inspired deep reinforcement-learning

Biology

Revealing principles of autonomous thermal soaring in windy conditions using vulture-inspired deep reinforcement-learning

Y. Flato, R. Harel, et al.

Explore the fascinating world of thermal soaring, a technique beloved by birds and gliders that utilizes updrafts of hot air. This groundbreaking study by Yoav Flato, Roi Harel, Aviv Tamar, Ran Nathan, and Tsevi Beatus offers insights into motion control learning, unveiling new efficiency metrics and demonstrating how trained networks evolve their functionalities. Immerse yourself in the rich complexities of this model-problem that bridges nature and technology.

00:00
00:00
~3 min • Beginner • English
Abstract
Thermal soaring, a technique used by birds and gliders to utilize updrafts of hot air, is an appealing model-problem for studying motion control and how it is learned by animals and engineered autonomous systems. Thermal soaring has rich dynamics and nontrivial constraints, yet it uses few control parameters and is becoming experimentally accessible. Following recent developments in applying reinforcement learning methods for training deep neural-network (deep-RL) models to soar autonomously both in simulation and real gliders, here we develop a simulation-based deep-RL system to study the learning process of thermal soaring. We find that this process has learning bottlenecks, we define a new efficiency metric and use it to characterize learning robustness, we compare the learned policy to data from soaring vultures, and find that the neurons of the trained network divide into function clusters that evolve during learning. These results pose thermal soaring as a rich yet tractable model-problem for the learning of motion control.
Publisher
Nature Communications
Published On
Jun 10, 2024
Authors
Yoav Flato, Roi Harel, Aviv Tamar, Ran Nathan, Tsevi Beatus
Tags
thermal soaring
motion control
deep reinforcement learning
learning bottlenecks
policy comparison
function clusters
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny