Martin Wattenberg

NEWs & publications

No items found.
Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models
February 18, 2025
archetypal-sae-adaptive-and-stable-dictionary-learning-for-concept-extraction-in-large-vision-models
Open Problems in Mechanistic Interpretability
January 27, 2025
open-problems-in-mechanistic-interpretability