Vision Hallucinations
Hard200 pts0 solves
A VLM describes 'a red car in the background' of a photo that contains no car.
What is this failure mode?
Flag format: CONGRESS{describes_[problem]}
Example: CONGRESS{describes_wrong_colors}
Hint
The model confidently describes things that don't exist in the image.