V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising
Paper • 2603.16792 • Published • 3
CodePercept: Code-Grounded Visual STEM Perception for MLLMs
From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning