OCR vs VLM

Easy100 pts0 solves
OCR extracts text from images. VLMs go further. What can VLMs do that OCR cannot? Flag format: CONGRESS{understands_[what]} Example: CONGRESS{understands_font_styles}
Hint
VLMs understand spatial relationships, tables, and what the layout means.