LLM Regression Testing
Hard200 pts0 solves
After updating your prompt, 3 previously working cases fail. You had no way to catch this before deploying.
What prevents regressions?
Flag format: CONGRESS{[prevention_method]}
Example: CONGRESS{canary_deployment}
Hint
A curated set of examples that must pass before any prompt change ships.