inference-optimization/Ministral-3-14B-Instruct-2512-NVFP4 • Text Generation • Updated 4 days ago • 171
inference-optimization/Qwen3-235B-A22B-Instruct-2507-quantized.w4a16 • Text Generation • 32B • Updated 5 days ago • 158
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16 • Text Generation • 32B • Updated 5 days ago • 180
RedHatAI/Qwen3-235B-A22B-Instruct-2507-quantized.w8a8 • Text Generation • 235B • Updated 5 days ago • 99
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w8a8 • Text Generation • 235B • Updated 5 days ago • 174