Distillation

Hard200 pts0 solves
You want GPT-4 quality at 1/100th cost. Use GPT-4 to generate training data, then fine-tune a small model on it. Name both roles. Flag format: CONGRESS{teacher:[what],student:[what]} Example: CONGRESS{teacher:human_expert,student:neural_net}
Hint
The large model teaches, the small model learns to imitate.