WeDetect: Fast Open-Vocabulary Object Detection as Retrieval Paper • 2512.12309 • Published 23 days ago • 2
IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Generation Paper • 2512.10730 • Published 25 days ago • 3
IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Generation Paper • 2512.10730 • Published 25 days ago • 3
ViSpeak Collection ViSpeak: Visual Instruction Feedback in Streaming Videos • 5 items • Updated Oct 29, 2025
LOVE-R1 Collection LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning • 5 items • Updated Oct 27, 2025