Aligning Tree-Search Policies

Related Articles from SNS

Aligning Tree-Search Policies with Fixed Token Budgets in Test-Time Scaling of LLMs

Abstract: Tree-search decoding is an effective form of test-time scaling for large language models (LLMs), but real-world deployment often imposes a fixed per-query token budget that varies across settings. Existing tree-search policies are largely budget-agnostic, treating the budget merely as a termination condition, thereby risking late-stage over-branching or premature termination. We propose Budget-Guided MCTS (BG-MCTS), a tree-search decoding algorithm that...

arXiv CS 5d ago