Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics Paper • 2602.02343 • Published Feb 2 • 13
On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks Paper • 2602.00130 • Published Jan 28 • 3
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 207
Vision Language Models Quantization Collection Vision Language Models (VLMs) quantized by Neural Magic • 20 items • Updated Mar 4, 2025 • 6
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 12 items • Updated about 1 hour ago • 36
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 6 items • Updated Mar 2 • 23
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 • 494