- RedHatAI/Mistral-Small-3.2-24B-Instruct-2506-NVFP4
  Text Generation • 14B • Updated • 5.89k • 3
- RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4
  Text Generation • 133B • Updated • 4.02k • 3
- RedHatAI/Qwen3-235B-A22B-Instruct-2507-NVFP4
  Text Generation • 136B • Updated • 848 • 4
- RedHatAI/Qwen3-235B-A22B-NVFP4
  Text Generation • 136B • Updated • 327
AI & ML interests
OpenSource and AI
September 2025: Collection of third-party generative AI models validated by Red Hat AI for use across the Red Hat AI Product Portfolio.
- RedHatAI/DeepSeek-R1-0528-quantized.w4a16
  Text Generation • 104B • Updated • 717 • 12
- RedHatAI/Qwen3-8B-FP8-dynamic
  Text Generation • 8B • Updated • 12.3k • 9
- RedHatAI/Kimi-K2-Instruct-quantized.w4a16
  Text Generation • 146B • Updated • 214 • 12
- RedHatAI/gemma-3n-E4B-it-FP8-dynamic
  Text Generation • 8B • Updated • 154k • 3
May 2025: Collection of third-party generative AI models validated by Red Hat AI for use across the Red Hat AI Product Portfolio.
- RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic
  Image-Text-to-Text • 109B • Updated • 26.4k • 27
- RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16
  Image-Text-to-Text • 20B • Updated • 21.3k • 12
- RedHatAI/Llama-4-Scout-17B-16E-Instruct
  Image-Text-to-Text • 109B • Updated • 2.79k
- RedHatAI/Llama-4-Maverick-17B-128E-Instruct
  Image-Text-to-Text • 402B • Updated • 41 • 2
Collection of quantized Gemma 3 models created by Google.
- RedHatAI/gemma-3-27b-it-quantized.w4a16
  Any-to-Any • 7B • Updated • 13.7k • 10
- RedHatAI/gemma-3-12b-it-quantized.w4a16
  Any-to-Any • 4B • Updated • 1.2k • 2
- RedHatAI/gemma-3-4b-it-quantized.w4a16
  Any-to-Any • 2B • Updated • 1.09k • 2
- RedHatAI/gemma-3-1b-it-quantized.w8a8
  Text Generation • 1B • Updated • 46.7k
Quantized variants of the Llama 4 release by Meta.
- RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic
  Image-Text-to-Text • 109B • Updated • 26.4k • 27
- RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16
  Image-Text-to-Text • 20B • Updated • 21.3k • 12
- RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8
  Image-Text-to-Text • 402B • Updated • 228 • 2
- RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16
  Image-Text-to-Text • 59B • Updated • 279 • 1
Quantized variants of Mistral Small 3.1 (2503) Instruct.
- RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic
  Image-Text-to-Text • 24B • Updated • 17.7k • 9
- RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8
  Image-Text-to-Text • 24B • Updated • 4.9k • 5
- RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16
  Image-Text-to-Text • 5B • Updated • 74.2k • 10
Quantized variants of Meta Llama 3.3, an instruction-tuned multilingual large language model (LLM) available in a 70B size (text in/text out).
Quantized Granite models from IBM Research.
- RedHatAI/granite-3.1-8b-instruct-quantized.w8a8
  Text Generation • 8B • Updated • 248 • 2
- RedHatAI/granite-3.1-2b-base-quantized.w8a8
  Text Generation • 3B • Updated • 41
- RedHatAI/granite-3.1-8b-instruct-quantized.w4a16
  Text Generation • 1B • Updated • 757 • 1
- RedHatAI/granite-3.1-2b-instruct-quantized.w8a8
  Text Generation • 3B • Updated • 25
October 2025: Collection of third-party generative AI models validated by Red Hat AI for use across the Red Hat AI Product Portfolio.
- RedHatAI/gpt-oss-120b
  Text Generation • 120B • Updated • 706 • 3
- RedHatAI/gpt-oss-20b
  Text Generation • 22B • Updated • 6.7k • 5
- RedHatAI/Qwen3-Coder-480B-A35B-Instruct-FP8
  Text Generation • 480B • Updated • 111 • 2
- RedHatAI/whisper-large-v3-turbo-quantized.w4a16
  Automatic Speech Recognition • 0.2B • Updated • 270 • 6
Collection of quantized Whisper models created by OpenAI.
- RedHatAI/whisper-large-v3-turbo-quantized.w4a16
  Automatic Speech Recognition • 0.2B • Updated • 270 • 6
- RedHatAI/whisper-large-v3-turbo-quantized.w8a8
  Automatic Speech Recognition • 0.9B • Updated • 66 • 4
- RedHatAI/whisper-large-v3-turbo-FP8-dynamic
  Automatic Speech Recognition • 0.9B • Updated • 329 • 6
- RedHatAI/whisper-tiny-FP8-Dynamic
  Automatic Speech Recognition • 57.8M • Updated • 94
Collection of quantized Qwen 3 models from Alibaba Cloud.
Quantized variants of the Phi-4 family of small language and multimodal models by Microsoft.
Quantized variants of Qwen 2.5 Instruct and Qwen VL models.
- RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w8a8
  Image-to-Text • 8B • Updated • 1.63k • 8
- RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w4a16
  Image-to-Text • 3B • Updated • 850 • 7
- RedHatAI/Qwen2.5-7B-quantized.w8a8
  Text Generation • 8B • Updated • 97 • 1
- RedHatAI/Qwen2.5-VL-72B-Instruct-FP8-dynamic
  Image-to-Text • 73B • Updated • 8.65k • 14
Collection of kernels from vLLM built using https://github.com/huggingface/kernel-builder