Skip to main content
← All Tags

Mechanistic Interpretability

1 article in this category

AI NewsMachine LearningMechanistic Interpretability

OpenAI Researchers Train Weight Sparse Transformers to Expose Interpretable Circuits

OpenAI's weight-sparse transformers achieve 1-in-1000 weight sparsity, enabling interpretable circuits for safer AI

Read more