Anatomy of Attention Sinks in Vision Transformers
A mechanistic dissection of attention sinks across five vision transformer families reveals fundamentally different mechanisms behind similar-looking behavior.