AdaLomo: Low-memory Optimization with Adaptive Learning Rate Paper • 2310.10195 • Published Oct 16, 2023 • 4
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper • 2512.07525 • Published 22 days ago • 55