Prefill vs Decode
Easy, 100 pts, 0 solves
Processing the input prompt is fast: the model handles all prompt tokens at once. Generating output is slow: the model produces one token at a time.
Describe how each phase processes its tokens.
Flag format: CONGRESS{prefill:[mode],decode:[mode]}
Example: CONGRESS{prefill:sequential,decode:parallel}
Hint
Input tokens are all known up front, so no position has to wait for another to be computed. Each output token depends on all the tokens before it.
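
The two phases can be sketched with a toy model that counts forward passes. This is only an illustration under stated assumptions: `toy_forward`, `generate`, and `VOCAB_SIZE` are hypothetical names, the "model" is a running sum rather than a neural network, and the decode loop recomputes the whole sequence instead of reusing a KV cache as a real inference engine would.

```python
from typing import List, Tuple

VOCAB_SIZE = 50  # hypothetical toy vocabulary size


def toy_forward(tokens: List[int]) -> List[int]:
    """Stand-in for one model pass: one next-token prediction per position.

    Position i only uses tokens[0..i], mimicking a causal mask.
    """
    preds = []
    running = 0
    for t in tokens:
        running += t
        preds.append((running + 1) % VOCAB_SIZE)
    return preds


def generate(prompt: List[int], n_new: int) -> Tuple[List[int], int, int]:
    """Return (full sequence, prefill passes, decode passes)."""
    if n_new < 1:
        raise ValueError("n_new must be at least 1")

    # Prefill: the whole prompt is available, so one pass covers every position.
    preds = toy_forward(prompt)
    prefill_passes = 1
    seq = list(prompt)
    seq.append(preds[-1])  # the first new token comes out of the prefill pass

    # Decode: each pass needs the token produced by the previous pass,
    # so the passes cannot be merged into one.
    decode_passes = 0
    while len(seq) < len(prompt) + n_new:
        preds = toy_forward(seq)  # a real system would reuse cached state here
        seq.append(preds[-1])
        decode_passes += 1
    return seq, prefill_passes, decode_passes


if __name__ == "__main__":
    seq, prefill_passes, decode_passes = generate([3, 5, 7], n_new=4)
    print("sequence:", seq)
    print("prefill passes:", prefill_passes)
    print("decode passes:", decode_passes)
```

In this sketch the number of prefill passes stays at one however long the prompt is, while the number of decode passes grows with the number of tokens generated.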