Research

Implicitly Quantized Representations for Reinforcement Learning

Learning representations for reinforcement learning (RL) has shown much promise for continuous control. In this project, we investigate using vector quantization to prevent …

Aidan Scannell

Function-Space Bayesian Deep Learning for Sequential Learning

Sequential learning paradigms pose challenges for gradient-based deep learning due to difficulties incorporating new data and retaining prior knowledge. While Gaussian processes …

Aidan Scannell

Investigating Bayesian Neural Network Dynamics Models for Model-Based Reinforcement Learning

This project seeks to evaluate and compare different approaches for learning dynamics models in model-based RL. In particular, we plan to compare different approximate inference …

Aidan Scannell

Mode-Constrained Exploration for Model-Based Reinforcement Learning

This work presents a learning-based control method for navigating to a target state in unknown, or partially unknown, multimodal dynamical systems. In particular, it develops a …

Aidan Scannell

Trajectory Optimisation in Learned Multimodal Dynamical Systems

This work presents a two-stage method to perform trajectory optimisation in multimodal dynamical systems with unknown nonlinear stochastic transition dynamics. The method finds …

Aidan Scannell

Identifiable Mixtures of Sparse Variational Gaussian Process Experts

This work introduces a variational lower bound for the Mixture of Gaussian Process Experts model with a GP-based gating network based on sparse GPs. The model (and inference) are …

Aidan Scannell