AI Security

The Line Between Extraction And Leak

Archive

Easy

100pts0 solves

An attack where the model is tricked into reciting its system prompt or hidden instructions to the user is usually called what two-word term?

Show hint

The first word says what; the second says what happens to it.

Archive — no submissions accepted

This challenge is preserved for reference. Play live challenges at /challenges.