Home Knowledge Base TextWorld-Cooking

TextWorld-Cooking

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Skill Reuse as Compression in Agentic RL

arXiv:2605.31509v1 Announce Type: new Abstract: Large language model agents trained with reinforcement learning (RL) often learn brittle, task-specific shortcuts. We hypothesize that agents generalize better when their successful trajectories are structurally compressible, decomposed into a small set of reusable abstract patterns. To formalize this, we introduce ReuseRL, which grounds agentic RL in the Minimum Description Length (MDL) principle.

arXiv CS 9d ago