Home › Entertainment › Think Like a Pilot: Fine-Grained Long-Horizon UAV Navigation

Entertainment

Think Like a Pilot: Fine-Grained Long-Horizon UAV Navigation

arXiv CS Monday 08 June 2026, 04:00 UTC By Xiangyi Zheng, Xiangyu Wang, Qinan Liao, Zimu Tang, Yue Liao, Dongyue Lyu, Guodong Wang, Junjie Liu, Si Liu 1 min read

Key Points

arXiv:2606.06836v1 Announce Type: new Abstract: Language-guided UAV agents must execute long-horizon semantic instructions while producing smooth, physically feasible continuous flight commands, yet existing Vision-Language Navigation (VLN) benchmarks typically use discrete or coarse actions and existing UAV Vision-Language-Action (VLA) tasks focus on short, atomic maneuvers. To address this gap in UAV task settings, we introduce \textbf{FLIGHT}, a \textbf{F}ine-grained \textbf{L}ong-horizon \textbf{I}nstruction-\textbf{G}uided benchmark for \textbf{H}ybrid UAV navigation and reasoning \textbf{T}asks, which combines multi-stage instructions with dense 6-DoF trajectory annotations across two dataset splits: Fine-grained VLN and Long-horizon Flow. To endow the UAV agent with the capability of real-time in-flight reasoning over task execution status and mission planning, while simultaneously accommodating high-frequency, real-time precise control, we further propose \textbf{FLIGHT VLA}, an asynchronous architecture that decouples a low-frequency Streaming Pilot Vision-Language Model (VLM) for task-state reasoning from a high-frequency diffusion action model for continuous control, supervised by explicit \textbf{Pilot Reasoning} texts that summarize the current flight state and anticipate the next subgoal. In closed-loop evaluation, FLIGHT VLA consistently surpasses representative VLN and VLA baselines on our FLIGHT benchmarks, achieving stronger multi-stage completion, subgoal adherence, and terminal control. Its trained Streaming Pilot Reasoning VLM further improves UAV video reasoning, validating the effectiveness of our design.

UAV (ORG) Vision-Language Navigation (ORG) VLN (ORG) VLA (ORG)

Originally published by arXiv CS Read original →

Think Like a Pilot: Fine-Grained Long-Horizon UAV Navigation

Related Stories

Buyer swoops in for actress Dakota Johnson’s $6 million midcentury modern gem in L.A.

Should you send that midnight text? 11 essential rules for phone etiquette

The best films of 2026 so far

Mentors, muses and new music: conductor and composer Ryan Wigglesworth