Skip to main content
← All Tags

Interpretability

2 articles in this category

AI NewsMachine LearningInterpretability

Identifying Influential LLM Interactions at Scale with SPEX and ProxySPEX

SPEX and ProxySPEX enable interaction discovery at scale, with ProxySPEX reducing computational costs by 10x through hierarchical structural assumptions.

Read more
AI NewsLarge language modelsInterpretability

Google Releases Gemma Scope 2 to Deepen Understanding of LLM Behavior

Google’s Gemma Scope 2 suite of tools enhances LLM interpretability, addressing crucial safety concerns like jailbreaks and hallucinations.

Read more