Home Knowledge Base Cognitive MCTS-Guided Process Alignment

Cognitive MCTS-Guided Process Alignment

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

COMPASS: Cognitive MCTS-Guided Process Alignment for Safe Search Agents

Announce Type: new Abstract: LLM-powered search agents enable multi-step reasoning and tool use. However, these capabilities introduce retrieval-induced safety degradation, as harmful intents may decompose into seemingly innocuous sub-queries that lead to unsafe outcomes. Existing alignment methods struggle to capture sparse safety signals and fail to supervise diverse violations across multi-step interactions.

arXiv CS 9d ago