Home Knowledge Base Learnability-Grounded Trajectory Selection for Efficient Reasoning Distillation

Learnability-Grounded Trajectory Selection for Efficient Reasoning Distillation

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

LARK: Learnability-Grounded Trajectory Selection for Efficient Reasoning Distillation

Announce Type: new Abstract: We study trajectory selection for reasoning distillation, where teacher-generated reasoning trajectories are selectively used as supervision for a student model. Existing methods rely on heuristics such as trajectory quality or model confidence, but they often overlook whether a trajectory is learnable by the student. In this paper, we present LARK, a learnability-grounded method for reasoning trajectory selection.

arXiv CS 9d ago