November 21, 2024
Emergent Architectural Dynamics of Neural Token Compression in Large Language Models
William Helms, Konstantin Papadopoulos, Sergei Morozov, et al.