LLM-as-a-Judge Biases
Archive | Easy
When an LLM evaluates two responses, it tends to prefer the first one (_____(1) bias) and the longer one (_____(2) bias).
Name both biases.
Flag format: CONGRESS{1:[bias],2:[bias]}
Example: CONGRESS{1:recency,2:anchoring}
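As a rough illustration of the flag format above, the sketch below builds and parses a flag string in Python. The regular expression is an assumption inferred from the example line (lowercase letters only for each bias name), and `parse_flag` and `build_flag` are hypothetical helper names, not part of the challenge platform.

```python
import re

# Assumed pattern, inferred from the example flag; the real checker may differ.
FLAG_PATTERN = re.compile(r"CONGRESS\{1:([a-z]+),2:([a-z]+)\}")


def parse_flag(flag: str):
    """Return the two bias names if the flag matches the format, else None."""
    match = FLAG_PATTERN.fullmatch(flag.strip())
    if match is None:
        return None
    return match.group(1), match.group(2)


def build_flag(first: str, second: str) -> str:
    """Assemble a flag from two bias names, lowercased to match the example."""
    return f"CONGRESS{{1:{first.lower()},2:{second.lower()}}}"
```

For instance, `parse_flag` applied to the example flag returns the pair `("recency", "anchoring")`, and a string missing the `1:` and `2:` labels returns `None`.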
Hint: One is about order, one is about length.
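One common way to probe the order preference described in the challenge is to ask the judge twice with the two responses swapped and see whether the verdict follows the position or the content. The sketch below is a minimal illustration under stated assumptions: `Judge` is a hypothetical callable standing in for an LLM call, and the two toy judges are hard-coded stand-ins, not real model behavior.

```python
from typing import Callable

# Hypothetical judge signature: returns 0 if the first response shown wins, 1 otherwise.
Judge = Callable[[str, str], int]


def order_sensitive(judge: Judge, a: str, b: str) -> bool:
    """True if the winning response changes when the same pair is shown in swapped order."""
    winner_ab = a if judge(a, b) == 0 else b
    winner_ba = b if judge(b, a) == 0 else a
    return winner_ab != winner_ba


def toy_first_judge(first: str, second: str) -> int:
    # Stand-in for an LLM call: always picks whichever response is shown first.
    return 0


def toy_length_judge(first: str, second: str) -> int:
    # Stand-in for an LLM call: picks the longer response; ties go to the first.
    return 0 if len(first) >= len(second) else 1
```

With these toy judges, the swap test flags the first one as order sensitive, while the second one stays consistent across orderings because it only looks at length. A length preference therefore needs a different check, such as comparing responses of matched length.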
Archive — no submissions accepted
This challenge is preserved for reference.