Search

Home
Publications
Talks
Posts
Projects
CV
Notes

Notes

Literate Dotfiles
Literate Emacs Configuration

Cluster log in with SSH
Conda environments
Debugging on a cluster
Hydra submitit launcher
Running Jupyter Notebooks on GPU Clusters

Python

Model-based value expansion
Vision Language Models as Reward Models

Contents

Reading

Model-based value expansion
Notes on model-based value expansion (MVE)
Vision Language Models as Reward Models
Notes on using VLMs as reward models for RL

Last updated on Sep 9, 2018

© 2025 Aidan Scannell. This work is licensed under CC BY NC ND 4.0

Published with Wowchemy — the free, open source website builder that empowers creators.

Cite