Archive
AI Security

The Detector That Counts Probabilities

Archive
Easy
100pts0 solves
One heuristic jailbreak detector flags prompts whose language-model perplexity exceeds a threshold — useful against character-level adversarial strings. Which metric?
Show hint
A noun that also means confusion.

Archive — no submissions accepted

This challenge is preserved for reference. Play live challenges at /challenges.