About

I'm a researcher and writer based in San Francisco.

I did my PhD in computer science at Stanford, where I designed long-context architectures for sequence modeling and memory. In a past life, I was a math & physics nerd at Cornell. In between, I've been a quant, a research engineer/scientist, and even a teacher. Most recently, I was a research scientist at Cartesia, training the next generation of model architectures.

Selected Writings

    All posts