Ask or Assume? Uncertainty-Aware Clarification-Seeking in Coding Agents

arXiv CS Friday 05 June 2026, 04:00 UTC By Nicholas Edwards, Sebastian Schuster 1 min read

Key Points

arXiv:2603.26233v2 Announce Type: replace Abstract: As Large Language Model (LLM) agents are increasingly deployed in open-ended domains like software engineering, they frequently encounter underspecified instructions that lack crucial context. While human developers naturally resolve underspecification by asking clarifying questions, current agents are largely optimized for autonomous execution. In this work, we systematically evaluate the clarification-seeking abilities of LLM agents on an underspecified variant of SWE-bench Verified. We propose an uncertainty-aware multi-agent scaffold that decouples underspecification detection from code execution. Across both proprietary and open-weight frontier LLMs, our scaffold achieves a 69.40% task resolve rate, significantly outperforming a standard single-agent setup and closing the performance gap with agents operating on fully specified instructions. Furthermore, we find that the multi-agent system exhibits well-calibrated information-seeking behavior, conserving queries on simple tasks while proactively seeking information on more complex issues. These findings indicate that current models can be turned into proactive collaborators, where agents independently recognize when to ask questions to elicit missing information in real-world, underspecified tasks.

Coding Agents arXiv:2603.26233v2 (ORG) LLM (ORG)

Originally published by arXiv CS Read original →

Ask or Assume? Uncertainty-Aware Clarification-Seeking in Coding Agents

Related Stories

Xbox CEO says current margins 'cannot continue' in public letter to staff

'A little goes a long way': New York's candy stores sweeten economic gloom

I'd have vetoed foreign sale of UK tech giant, says Business Secretary

I'd have vetoed foreign sale of UK tech giant, says Business Secretary