DavidAU/OpenAi-GPT-oss-20b-abliterated-uncensored-NEO-Imatrix-gguf Text Generation • 21B • Updated Nov 17 • 129k • 367
gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated Oct 29 • 58
The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against Llm Jailbreaks and Prompt Injections Paper • 2510.09023 • Published Oct 10 • 10
When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs Paper • 2508.03365 • Published Aug 5 • 4 • 2
MemeSafetyBench Collection [EMNLP'25] A Benchmark for Assessing VLM Safety with Real-World Memes • 2 items • Updated Sep 9