Bugs, jailbreaks, and prompt injections: what do they really have in common
Bugs, jailbreaks, prompt injections. Three different problems, one common root: an LLM does not follow rules written in code — it has learned behaviors from billions of examples. This is why fixing it is much more complex than applying a simple patch.