Archive
Evaluation & Benchmarks

LLM-as-a-Judge Biases

Easy
100 pts, 53 solves
When an LLM evaluates two responses, it tends to prefer the first one (_____(1) bias) and the longer one (_____(2) bias). Name both biases.
Flag format: CONGRESS{1:[bias],2:[bias]}
Example: CONGRESS{1:recency,2:anchoring}
Hint: One is about order, one is about length.
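
For reference, the two preferences described in the challenge can be probed empirically. The following is a minimal sketch, not a definitive method: it shows each pair of responses to a judge in both orders, then counts how often the verdict flips with order and how often the longer response wins. The `Judge` signature, `toy_judge`, and the sample `pairs` are hypothetical stand-ins. A real setup would replace `toy_judge` with a call to an actual model.

```python
from typing import Callable, List, Tuple

# Hypothetical judge signature (an assumption for illustration): takes the
# response shown first and the response shown second, and returns 0 if it
# prefers the first or 1 if it prefers the second.
Judge = Callable[[str, str], int]


def toy_judge(first: str, second: str) -> int:
    """Stand-in judge, not a real model: picks the longer response,
    and falls back to the first one when the lengths are equal."""
    return 1 if len(second) > len(first) else 0


def order_flip_rate(judge: Judge, pairs: List[Tuple[str, str]]) -> float:
    """Fraction of pairs whose winner changes when the order is swapped."""
    flips = 0
    for a, b in pairs:
        winner_ab = a if judge(a, b) == 0 else b
        winner_ba = b if judge(b, a) == 0 else a
        if winner_ab != winner_ba:
            flips += 1
    return flips / len(pairs)


def longer_win_rate(judge: Judge, pairs: List[Tuple[str, str]]) -> float:
    """Fraction of verdicts won by the longer response, counted over both
    presentation orders and skipping pairs of equal length."""
    wins = 0
    total = 0
    for a, b in pairs:
        if len(a) == len(b):
            continue
        longer = max(a, b, key=len)
        for first, second in ((a, b), (b, a)):
            winner = first if judge(first, second) == 0 else second
            wins += int(winner == longer)
            total += 1
    return wins / total if total else 0.0


# Hypothetical sample data: two unequal-length pairs and one equal-length pair.
pairs = [
    ("short", "a much longer answer"),
    ("same1", "same2"),
    ("x" * 10, "y" * 3),
]

print(order_flip_rate(toy_judge, pairs))
print(longer_win_rate(toy_judge, pairs))
```

Judging every pair in both orders is the key design choice: a verdict that survives the swap cannot be explained by presentation order alone, which keeps the two measurements separate.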

Archive — no submissions accepted

This challenge is preserved for reference. Play live challenges at /challenges.