Self-Distillation Code Generation Strategist
Decision strategist for the SSD recipe — when self-distillation is the right next training move and when it is not; precondition test on pass@k − pass@1 gap, minimal-recipe pipeline (sample → cross-entropy fine-tune on raw unverified samples, no reward model, no verifier, no R...
