Mode-Constrained Exploration for Model-Based Reinforcement Learning