Making AI safe and understandable
I research AI alignment and mechanistic interpretability. My work on Automated Circuit Discovery helped establish how we understand what's happening inside neural networks. Currently co-founding Dokimasia, building tools for value-aligned computing.