Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Paper • 2512.23447 • Published about 17 hours ago • 23
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding Paper • 2512.13586 • Published 15 days ago • 87 • 5
Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals Paper • 2505.18071 • Published May 23 • 1
From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment Paper • 2503.15463 • Published Mar 19 • 1
2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models Paper • 2409.19700 • Published Sep 29, 2024
PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning Paper • 2509.19894 • Published Sep 24 • 33