Models (129,925)

Active filters: trl

JackBinary/Qwen3.5-24B-A3B-Claude-Opus-Gemini-3.1-Pro-Reasoning-Distilled-heretic

Text Generation • 24B • Updated 5 days ago • 53 • 1

Karan6124/llama3-8b-dpo-orca-adapter

Text Generation • Updated 4 days ago • 30 • 1

thelamapi/next-70b-GGUF

Text Generation • 71B • Updated 4 days ago • 1.3k • 1

khazarai/Nizami-1.7B

Text Generation • 2B • Updated 4 days ago • 33 • 1

N-Bot-Int/OpenElla-StoryWriter-TypeB

Text Generation • 1B • Updated 2 days ago • 118 • 1

kth8/gemma-3-270m-it-SuperGPQA-Classifier

Text Generation • 0.3B • Updated 3 days ago • 36 • 1

N-Bot-Int/OpenElla-StoryWriter-TypeB-GGUF

Text Generation • 1B • Updated 2 days ago • 162 • 1

filter-with-espresso/Qwen2.5-14B-Instruct-moltbook-finetune-v9

Updated about 21 hours ago • 1

Dorjzodovsuren/MongolianTTS_elevenlabs

Text Generation • 3B • Updated about 18 hours ago • 45 • 1

Simonc-44/Cygnis-Alpha-2-7B-v0.1

Updated about 20 hours ago • 1

mradermacher/Qwen3.5-24B-A3B-Claude-Opus-Gemini-3.1-Pro-Reasoning-Distilled-heretic-i1-GGUF

24B • Updated about 14 hours ago • 2.1k • 1

mirazrafi/NSFW-RP-RolePlay-LoRA-Qwen-3.5-4B

Text Generation • Updated 2 days ago • 194 • 4

arif-butt/finetuned-llama-3.2-1b-it

Updated 16 days ago • 1

mirazrafi/NSFW-RP-RolePlay-LoRA-Qwen-3.5-9B

Text Generation • Updated 2 days ago • 171 • 1

lewtun/dummy-trl-model

Reinforcement Learning • Updated Jan 24, 2023 • 1

ybelkada/gpt-neo-125m-detox

Reinforcement Learning • Updated Feb 17, 2023 • 18

ybelkada/gpt-neo-125m-detoxified-long-context

Reinforcement Learning • Updated Feb 17, 2023 • 4

dshin/flan-t5-ppo

Reinforcement Learning • Updated Mar 11, 2023 • 2 • 1

SummerSigh/T5-Base-Rule-Of-Thumb-RM

Reinforcement Learning • Updated Mar 12, 2023 • 1

dshin/flan-t5-ppo-testing

Reinforcement Learning • Updated Mar 12, 2023 • 1 • 1

SummerSigh/T5-Base-EvilPrompterRM

Reinforcement Learning • 0.2B • Updated Mar 18, 2023 • 2

dshin/flan-t5-ppo-testing-violation

Reinforcement Learning • Updated Mar 12, 2023

dshin/flan-t5-ppo-user-b

Reinforcement Learning • Updated Mar 12, 2023 • 1

dshin/flan-t5-ppo-user-h-use-violation

Reinforcement Learning • Updated Mar 13, 2023

dshin/flan-t5-ppo-user-f-use-violation

Reinforcement Learning • Updated Mar 13, 2023 • 1

dshin/flan-t5-ppo-user-e-use-violation

Reinforcement Learning • Updated Mar 13, 2023 • 2

dshin/flan-t5-ppo-user-a-use-violation

Reinforcement Learning • Updated Mar 13, 2023 • 1

dshin/flan-t5-ppo-user-h-batch-size-8-epoch-0

Reinforcement Learning • Updated Mar 13, 2023

dshin/flan-t5-ppo-user-e-batch-size-8-epoch-0

Reinforcement Learning • Updated Mar 13, 2023

dshin/flan-t5-ppo-user-h-batch-size-8-epoch-0-use-violation

Reinforcement Learning • Updated Mar 13, 2023 • 1