Mechanistic Interpretability

1 article in this category

AI NewsMachine LearningMechanistic Interpretability

OpenAI's weight-sparse transformers achieve 1-in-1000 weight sparsity, enabling interpretable circuits for safer AI

Nov 14, 2025