openai/clip-vit-large-patch14 Zero-Shot Image Classification β’ 0.4B β’ Updated Sep 15, 2023 β’ 7.71M β’ 1.94k
microsoft/Phi-3-vision-128k-instruct Text Generation β’ 4B β’ Updated 25 days ago β’ 22.1k β’ 969
Running on Zero MCP Featured 139 Multimodal OCR2 π» 139 nanonets ocr / smoldocling / monkey ocr / typhoon ocr
docling-project/SmolDocling-256M-preview Image-Text-to-Text β’ 0.3B β’ Updated Sep 17, 2025 β’ 51.6k β’ 1.6k
Running on Zero 16 Explainable-Vision-Language-Model π₯Ά 16 Generate a video visualizing how a model attends to an image while generating text
mistralai/Mistral-7B-Instruct-v0.2 Text Generation β’ 7B β’ Updated Jul 24, 2025 β’ 2.38M β’ β’ 3.04k
google/vit-base-patch16-224 Image Classification β’ 86.6M β’ Updated Sep 5, 2023 β’ 4.47M β’ β’ 915