mechanistic interpretability, reinforcement learning, sokoban, planning
2023 – Present
interpretability, hypothesis testing, estimators, deep learning theory
2021 – 2023
gaussian processes, bayesian neural networks, markov chain monte carlo, bayesian, deep learning theory
2017 – 2021