Want to create an interactive transcript for this episode?
Podcast: Arxiv Papers
Episode: Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs