Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation
Paper • 2512.24271 • Published • 50
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
Reasoning Palette: Modulating Reasoning via Latent Contextualization for Controllable Exploration for (V)LMs