I am a fifth-year PhD student in the Department of Computer Science at the University of Toronto. I am fortunate to be advised by Jimmy Ba and Sheila McIlraith. I am also affiliated with the Vector Institute.
In 2023, I was a student researcher at Google working on data efficiency for LLMs with the Gemini and Blueshift teams.
My research interests lie in AI reasoning. Research highlights include:
- Automatic prompt engineering (resulting in a popular prompting trick that gives insight into the psychology of GPT).
- OpenWebMath, a dataset of all the mathematical documents on Common Crawl, and Llemma, the best open language model for mathematical reasoning.
- STEVE-1, a Minecraft agent that can follow visual and text instructions using only keyboard and mouse commands.
- Llemma: An Open Language Model For Mathematics2023
- OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text2023
- STEVE-1: A Generative Model for Text-to-Behavior in MinecraftIn Advances in Neural Information Processing Systems, 2023Spotlight
- Large Language Models are Human-Level Prompt EngineersIn The Eleventh International Conference on Learning Representations , 2023
- You Can’t Count on Luck: Why Decision Transformers and RvS Fail in Stochastic EnvironmentsIn Advances in Neural Information Processing Systems, 2022
- Planning from Pixels using Inverse Dynamics ModelsIn International Conference on Learning Representations, 2021