Red Hat AI

Enterprise

company

Verified

https://www.redhat.com/en/products/ai

RedHat_AI

RedHatOfficial

AI & ML interests

Open source and AI

Recent Activity

nm-research updated a model about 17 hours ago

RedHatAI/Llama-4-Maverick-17B-128E-Instruct-NVFP4

nm-research updated a model 6 days ago

RedHatAI/Qwen3-30B-A3B-NVFP4

nm-research updated a model 6 days ago

RedHatAI/Qwen3-235B-A22B-NVFP4

RedHatAI's collections (16)

NVFP4 Models

RedHatAI/Mistral-Small-3.2-24B-Instruct-2506-NVFP4

Text Generation • 14B • Updated 6 days ago • 5.89k • 3
RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4

Text Generation • 133B • Updated 6 days ago • 4.02k • 3
RedHatAI/Qwen3-235B-A22B-Instruct-2507-NVFP4

Text Generation • 136B • Updated 6 days ago • 848 • 4
RedHatAI/Qwen3-235B-A22B-NVFP4

Text Generation • 136B • Updated 6 days ago • 327

Red Hat AI validated models - September 2025

September 2025 collection of third-party generative AI models validated by Red Hat AI for use across the Red Hat AI Product Portfolio.

RedHatAI/DeepSeek-R1-0528-quantized.w4a16

Text Generation • 104B • Updated Oct 13 • 717 • 12
RedHatAI/Qwen3-8B-FP8-dynamic

Text Generation • 8B • Updated Sep 22 • 12.3k • 9
RedHatAI/Kimi-K2-Instruct-quantized.w4a16

Text Generation • 146B • Updated Oct 13 • 214 • 12
RedHatAI/gemma-3n-E4B-it-FP8-dynamic

Text Generation • 8B • Updated Oct 13 • 154k • 3

Red Hat AI validated models - May 2025

May 2025 collection of third-party generative AI models validated by Red Hat AI for use across the Red Hat AI Product Portfolio.

RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic

Image-Text-to-Text • 109B • Updated Sep 22 • 26.4k • 27
RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16

Image-Text-to-Text • 20B • Updated Sep 22 • 21.3k • 12
RedHatAI/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • 109B • Updated Sep 22 • 2.79k
RedHatAI/Llama-4-Maverick-17B-128E-Instruct

Image-Text-to-Text • 402B • Updated Sep 22 • 41 • 2

Gemma-3 Quantized

Collection of quantized Gemma 3 models created by Google.

RedHatAI/gemma-3-27b-it-quantized.w4a16

Any-to-Any • 7B • Updated Jun 9 • 13.7k • 10
RedHatAI/gemma-3-12b-it-quantized.w4a16

Any-to-Any • 4B • Updated Jun 9 • 1.2k • 2
RedHatAI/gemma-3-4b-it-quantized.w4a16

Any-to-Any • 2B • Updated Jun 9 • 1.09k • 2
RedHatAI/gemma-3-1b-it-quantized.w8a8

Text Generation • 1B • Updated Jun 6 • 46.7k

Llama 4 Quantized

Quantized variants of the Llama 4 release by Meta.

RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic

Image-Text-to-Text • 109B • Updated Sep 22 • 26.4k • 27
RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16

Image-Text-to-Text • 20B • Updated Sep 22 • 21.3k • 12
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8

Image-Text-to-Text • 402B • Updated Sep 22 • 228 • 2
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16

Image-Text-to-Text • 59B • Updated Jun 12 • 279 • 1

Mistral-Small-3.1 (2503) Instruct Quantized

Quantized variants of Mistral Small 3.1 (2503) Instruct.

RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic

Image-Text-to-Text • 24B • Updated Oct 29 • 17.7k • 9
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8

Image-Text-to-Text • 24B • Updated Oct 29 • 4.9k • 5
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16

Image-Text-to-Text • 5B • Updated Oct 29 • 74.2k • 10

Llama 3.3 70B Instruct Quantized

Quantized variants of Meta Llama 3.3, a multilingual, instruction-tuned generative large language model (LLM) with 70B parameters (text in/text out).

RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8

Text Generation • 71B • Updated Sep 22 • 13.9k • 12
RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic

Text Generation • Updated Sep 22 • 124k • 13
RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16

Text Generation • 11B • Updated Sep 22 • 1.94k • 3

Granite Quantized

Quantized Granite models from IBM Research.

RedHatAI/granite-3.1-8b-instruct-quantized.w8a8

Text Generation • 8B • Updated Sep 25 • 248 • 2
RedHatAI/granite-3.1-2b-base-quantized.w8a8

Text Generation • 3B • Updated Feb 28 • 41
RedHatAI/granite-3.1-8b-instruct-quantized.w4a16

Text Generation • 1B • Updated Sep 22 • 757 • 1
RedHatAI/granite-3.1-2b-instruct-quantized.w8a8

Text Generation • 3B • Updated Feb 28 • 25

Red Hat AI validated models - October 2025

October 2025 collection of third-party generative AI models validated by Red Hat AI for use across the Red Hat AI Product Portfolio.

RedHatAI/gpt-oss-120b

Text Generation • 120B • Updated Oct 13 • 706 • 3
RedHatAI/gpt-oss-20b

Text Generation • 22B • Updated Oct 31 • 6.7k • 5
RedHatAI/Qwen3-Coder-480B-A35B-Instruct-FP8

Text Generation • 480B • Updated Oct 13 • 111 • 2
RedHatAI/whisper-large-v3-turbo-quantized.w4a16

Automatic Speech Recognition • 0.2B • Updated Oct 13 • 270 • 6

Speculator Models

RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3

Text Generation • 1.0B • Updated 8 days ago • 8.23k • 1
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3

Text Generation • 2B • Updated 8 days ago • 1.32k • 1
RedHatAI/Qwen3-8B-speculator.eagle3

Text Generation • 1B • Updated 8 days ago • 27.3k

Docling Models

ibm-granite/granite-docling-258M

Image-Text-to-Text • 0.3B • Updated Sep 23 • 115k • 1.05k
docling-project/docling-layout-heron

42.9M • Updated Jul 2 • 669k • 25
docling-project/docling-models

Updated 7 days ago • 725k • 186
docling-project/CodeFormulaV2

0.3B • Updated Aug 11 • 23.7k • 2

Whisper Quantized

Collection of quantized variants of the Whisper models created by OpenAI.

RedHatAI/whisper-large-v3-turbo-quantized.w4a16

Automatic Speech Recognition • 0.2B • Updated Oct 13 • 270 • 6
RedHatAI/whisper-large-v3-turbo-quantized.w8a8

Automatic Speech Recognition • 0.9B • Updated Apr 22 • 66 • 4
RedHatAI/whisper-large-v3-turbo-FP8-dynamic

Automatic Speech Recognition • 0.9B • Updated Apr 22 • 329 • 6
RedHatAI/whisper-tiny-FP8-Dynamic

Automatic Speech Recognition • 57.8M • Updated Apr 22 • 94

Qwen3 Quantized

Collection of quantized Qwen 3 models from Alibaba Cloud.

RedHatAI/Qwen3-4B-quantized.w4a16

Text Generation • 1B • Updated May 13 • 7k • 3
RedHatAI/Qwen3-32B-FP8-dynamic

Text Generation • 33B • Updated May 13 • 999 • 15
RedHatAI/Qwen3-0.6B-FP8-dynamic

Text Generation • 0.8B • Updated May 12 • 3.91k
RedHatAI/Qwen3-8B-FP8-dynamic

Text Generation • 8B • Updated Sep 22 • 12.3k • 9

Phi-4 Quantized

Quantized variants of the Phi-4 family of small language and multimodal models by Microsoft.

RedHatAI/phi-4-quantized.w4a16

Text Generation • 3B • Updated Sep 25 • 632 • 4
RedHatAI/phi-4-FP8-dynamic

Text Generation • 15B • Updated Sep 25 • 330
RedHatAI/phi-4-quantized.w8a8

Text Generation • 15B • Updated Sep 25 • 2.55k • 2

Qwen 2.5 Quantized

Quantized variants of Qwen 2.5 Instruct and Qwen 2.5 VL models.

RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w8a8

Image-to-Text • 8B • Updated Oct 2 • 1.63k • 8
RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w4a16

Image-to-Text • 3B • Updated Apr 3 • 850 • 7
RedHatAI/Qwen2.5-7B-quantized.w8a8

Text Generation • 8B • Updated Dec 3, 2024 • 97 • 1
RedHatAI/Qwen2.5-VL-72B-Instruct-FP8-dynamic

Image-to-Text • 73B • Updated Apr 25 • 8.65k • 14

vLLM Kernels

Collection of kernels from vLLM built using https://github.com/huggingface/kernel-builder

RedHatAI/quantization

Updated Jul 27 • 6
RedHatAI/moe

Updated Jul 25 • 3

Team members 50
