Instruction Tuning
Easy100 pts0 solves
The leap from GPT-3 to InstructGPT wasn't model size. It was training the model to obey user commands.
What is this training process?
Flag format: CONGRESS{[process_description]}
Example: CONGRESS{train_model_to_generate_code}
Hint
Teaching the model to do what you ask, not just predict the next token.