APB
No mentions found
This entity hasn't been tracked yet, or Iris is still building its knowledge base.
Related Articles from SNS
APB-V: Accelerating Long-Video Understanding via Sequence-Parallelism-aware Approximate Attention
Announce Type: replace Abstract: The efficiency of long-video inference remains a critical bottleneck, mainly due to the dense computation in the prefill stage of Large Multimodal Models (LMMs). Existing methods either compress visual embeddings or apply sparse attention on a single GPU, yielding limited acceleration or degraded performance and restricting LMMs from handling longer, more complex videos. To overcome these issues, we propose APB-V, a sequence-parallel framework with optimized...
Amplicon/Protein Bead Display enables quantitative in vitro biochemistry at scale
Scalable methods for producing and characterizing protein libraries are essential for generating standardized datasets needed to train models linking sequence to function. Here, we present Amplicon/Protein Bead Display (APB-Display), which expresses and purifies >100,000 protein variants in vitro in 1 day. APB-Display uses particle-templated emulsification to generate libraries of hydrogel beads that covalently display many copies of a given protein variant and its encoding DNA.
Agent Planning Benchmark: A Diagnostic Framework for Planning Capabilities in LLM Agents
Announce Type: replace Abstract: Planning is central to LLM agents: before acting, an agent must decompose goals, select tools, reason over constraints, and decide when a task is infeasible. Yet existing agent evaluations often report only end-to-end success, making it difficult to determine whether failures stem from planning or execution. We introduce Agent Planning Benchmark (APB), a planning-specific diagnostic benchmark with 4,209 multimodal cases across 22 domains and five settings,...
Agent Planning Benchmark: A Diagnostic Framework for Planning Capabilities in LLM Agents
Announce Type: new Abstract: Planning is central to LLM agents: before acting, an agent must decompose goals, select tools, reason over constraints, and decide when a task is infeasible. Yet existing agent evaluations often report only end-to-end success, making it difficult to determine whether failures stem from planning or execution. We introduce \textbf{Agent Planning Benchmark (APB)}, a planning-specific diagnostic benchmark with 4,209 multimodal cases across 22 domains and five...