LLaVA-CoT: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Jul 21, 2025 • 131
apple/aimv2-large-patch14-224-distilled Image Feature Extraction • 0.3B • Updated Jul 8, 2025 • 48 • 2
apple/aimv2-large-patch14-336-distilled Image Feature Extraction • 0.3B • Updated Jul 8, 2025 • 31 • 7
apple/aimv2-large-patch14-224-lit Zero-Shot Image Classification • 0.4B • Updated Jul 8, 2025 • 3.43k • 8