Active filters: fp4
imgailab/flux1-trtx-schnell-fp4-blackwell
llmat/Mistral-7B-Instruct-v0.3-NVFP4
Text Generation • 4B • Updated • 12
llmat/Mistral-Small-Instruct-2409-NVFP4
Text Generation • 13B • Updated • 9
2imi9/gpt-oss-20B-NVFP4A16-BF16
Text Generation • 21B • Updated • 2.71k • 3
nvidia/Phi-4-multimodal-instruct-NVFP4
4B • Updated • 2.39k • 7
nvidia/Phi-4-reasoning-plus-NVFP4
8B • Updated • 1.43k • 6
Text Generation • 8B • Updated • 19.9k • 5
Text Generation • 17B • Updated • 19.5k • 5
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4
Text Generation • 5B • Updated • 5.51k • 13
xxrjun/DeepSeek-R1-0528-FP4
394B • Updated • 1
Sunbird/Sunflower-14B-4bit-fp4-bnb
Text Generation • 15B • Updated
Sunbird/Sunflower-32B-4bit-fp4-bnb
Text Generation • 33B • Updated
RedHatAI/Llama-3.1-8B-Instruct-NVFP4
Text Generation • 5B • Updated • 15.1k
Text Generation • 9B • Updated • 392
Text Generation • 5B • Updated • 1.01k • 1
RedHatAI/Mistral-Small-3.2-24B-Instruct-2506-NVFP4
Text Generation • 14B • Updated • 11.9k • 6
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-NVFP4
Text Generation • 229B • Updated • 944 • 2
RedHatAI/Qwen3-235B-A22B-Instruct-2507-NVFP4
Text Generation • 136B • Updated • 1.39k • 4
prithivMLmods/Nanonets-OCR2-3B-AWQ-nvfp4
Image-Text-to-Text • 3B • Updated • 101
eousphoros/DeepSeek-V3.2-NVFP4
Text Generation • 387B • Updated • 54 • 5
Daemontatox/Qwen3-L-NVFP4
Text Generation • 133B • Updated • 2
trithemius/Velvet-14B-nvfp4
8B • Updated • 3
josephdowling10/Mixtral-8x7B-Instruct-v0.1-NVFP4
Text Generation • 23B • Updated • 22
Shifusen/L3.3-70B-Magnum-v4-SE-NVFP4
Text Generation • 41B • Updated • 7
Firworks/Snowpiercer-15B-v4-nvfp4
9B • Updated • 6
cybermotaz/nemotron3-nano-nvfp4-w4a16
Text Generation • 18B • Updated • 6.69k • 11
Shifusen/Strawberrylemonade-L3-70B-v1.2-NVFP4
Text Generation • 41B • Updated • 16
cybermotaz/qwen3-vl-2b-thinking-nvfp4-w4a16
Image-Text-to-Text • 2B • Updated • 30 • 1
cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16
Image-Text-to-Text • 3B • Updated • 29 • 1
cybermotaz/qwen3-vl-8b-thinking-nvfp4-w4a16
Image-Text-to-Text • 5B • Updated • 251 • 2