Batching Strategies
Medium150 pts0 solves
Static batching wastes GPU cycles because short requests pad to the longest sequence. A better approach dynamically adds/removes requests from the batch.
What is this called?
Flag format: CONGRESS{technique_in_snake_case}
Hint
Requests flow in and out of the batch as they complete, no padding waste.