Skip to main content
To KTH's start page To KTH's start page

Safety-aware adaptive reinforcement learning with applications to Brushbot navigation

Time: Fri 2018-02-23 11.00 - 12.00

Location: Seminar room (Rumsnr: A:641), Osquldas väg 10, Q-huset, våningsplan 6, KTH Campus

Participating: Motoya Ohnishi

Opponent: Sylvain Potuaud

Supervisor: Prof. Mikael Johansson

Export to calendar

This thesis presents a safety-aware learning framework that employs an adaptive model learning method together with barrier certificates for systems with possibly nonstationary agent dynamics.

To extract the dynamic structure of the model, we use a sparse optimization technique, and the resulting model will be used in combination with control barrier certificates which constrain feedback controllers only when safety is about to be violated. Under some mild assumptions, solutions to the constrained feedback-controller optimization are guaranteed to be globally optimal, and the monotonic improvement of a feedback controller is thus ensured. In addition, we reformulate the (action-)value function approximation to make any kernel-based nonlinear function estimation method applicable. We then employ a state-of-the-art kernel adaptive filtering technique for the (action-)value function approximation. The resulting framework is verified experimentally on a Brushbot, whose dynamics is unknown and highly complex.