Mode-constrained Model-based Reinforcement Learning via Gaussian Processes
We present a model-based RL algorithm that constrains training to a single dynamic mode with high probability. This is a difficult problem because the mode constraint is a hidden …