AI & ML interests
None yet
Organizations
None yet
models 33
wuschelschulz/gemma_12b_reasoning_reward_hacking_SFT
Updated
wuschelschulz/gemma_1_reasoning_reward_hacking_SFT_debug
Updated
wuschelschulz/gemma-3-12b-reasoning
Updated
wuschelschulz/debug_gemma-3-12b-reasoning
Updated
wuschelschulz/gemma_1_reasoning_reward_hacking_SFT
Updated
wuschelschulz/gemma_1_reasoning_model_only
Updated
wuschelschulz/debug_gemma_1_reasoning_reward_hacking_SFT
Updated
wuschelschulz/gemma-3-1b-persona-ab-grpo
Text Generation
• Updated
wuschelschulz/gemma-3-1b-persona-ab-sft
Text Generation
• Updated
wuschelschulz/SFT_reasoning_Gemma_3_1B_unsloth_reward_hacking_SFT
Updated