I work on large language models with a focus on post-training which includes SFT, RL, reasoning capabilities, and model evaluation. This site is where I write to think: sharing notes, essays, and ongoing explorations about LLMs, model behavior, and learning systems.
Thoughts, notes, and essays on AI, language models, and other stuff!