Home Knowledge Base Benchmarking Living-Screen-Native GUI Agents

Benchmarking Living-Screen-Native GUI Agents

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Benchmarking Living-Screen-Native GUI Agents on Short-Video Platforms

arXiv:2606.04701v1 Announce Type: new Abstract: GUI agents today assume a static screen, where the world is frozen between two actions. However, real interfaces such as short-video applications violate this assumption, as their content keeps playing, and a competent user must decide what to watch and for how long. We formalize this task as Living-Screen-Native GUI agents and introduce LivingScreen, the first benchmark instantiating it on short-video platforms, with a faithful browser-based...

arXiv CS 6d ago