00 — Overview
LLM Jailbreaking
Safety filters trained into LLMs can be bypassed without touching the code. Learn the classic jailbreak techniques — DAN, fictional framing, the grandma exploit — and why they work.
Beginner·30 min·5 tasks
// By the end of this module
→Understand the difference between prompt injection and jailbreaking
→Use roleplay, hypothetical framing, and encoding to bypass content filters
→Apply multi-turn conversation techniques to erode model guardrails
→Recognise why jailbreaks are fundamentally different from traditional vulns
// Prerequisites
