Tom Butler
butlert
AI & ML interests
None yet
Organizations
Models
- Open-Orca/Mistral-7B-OpenOrca
  Text Generation • Updated • 2.96k • 688
- butlert/Llama-2-7b-chat-hf-sharded-bf16-fine-tuned-adapters
  Updated • 2
- openai/clip-vit-base-patch32
  Zero-Shot Image Classification • Updated • 20.1M • 884
- liuhaotian/LLaVA-Lightning-MPT-7B-preview
  Text Generation • Updated • 28 • 54
llava models
- liuhaotian/LLaVA-Lightning-MPT-7B-preview
  Text Generation • Updated • 28 • 54
- liuhaotian/llava-v1.6-mistral-7b
  Image-Text-to-Text • 8B • Updated • 11.2k • 245
- liuhaotian/llava-v1.5-7b
  Image-Text-to-Text • Updated • 223k • 543
- bczhou/TinyLLaVA-1.5B
  Image-Text-to-Text • 2B • Updated • 74 • 19
Optimized Vision Language Models
- Efficient-Large-Model/VILA-2.7b
  Text Generation • 3B • Updated • 152 • 15
- NousResearch/Obsidian-3B-V0.5
  Text Generation • Updated • 103 • 178
- bczhou/TinyLLaVA-1.5B
  Image-Text-to-Text • 2B • Updated • 74 • 19
- liuhaotian/LLaVA-Lightning-MPT-7B-preview
  Text Generation • Updated • 28 • 54
Papers
- Extending Context Window of Large Language Models via Semantic Compression
  Paper • 2312.09571 • Published • 16
- LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
  Paper • 2311.05437 • Published • 51
- LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
  Paper • 2312.02949 • Published • 14
- TinyLLaVA: A Framework of Small-scale Large Multimodal Models
  Paper • 2402.14289 • Published • 20
Image Classification
- CLIP Interrogator
  Running on A10G • 2.97k • Generate art prompts and style tags from any image
- openai/clip-vit-large-patch14
  Zero-Shot Image Classification • 0.4B • Updated • 19.3M • 1.97k
- altndrr/cased
  Image Classification • 0.4B • Updated • 62 • 1
- Clip Playground
  Runtime error • 5
Datasets
reasoning