From Bytes to Ideas: Language Modeling with Autoregressive U-Nets Paper • 2506.14761 • Published Jun 17, 2025 • 17
TokAlign: Efficient Vocabulary Adaptation via Token Alignment Paper • 2506.03523 • Published Jun 4, 2025
zip2zip: Inference-Time Adaptive Vocabularies for Language Models via Token Compression Paper • 2506.01084 • Published Jun 1, 2025 • 7
Pretraining Language Models to Ponder in Continuous Space Paper • 2505.20674 • Published May 27, 2025 • 2