Home Knowledge Base the Directional Gap

the Directional Gap

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

MomentKV: Closing the Directional Gap in KV Cache Eviction for Long-Context Inference

arXiv:2606.01563v1 Announce Type: new Abstract: Autoregressive decoding in Transformer-based language models relies on the KV cache, whose memory footprint grows linearly with sequence length and becomes the primary bottleneck for long-context inference. KV cache eviction addresses this by retaining a fixed-size subset of key-value pairs and discarding the rest.

arXiv CS 8d ago

Gooey: A GPU-accelerated UI framework for Zig

A GPU-accelerated UI framework for Zig, targeting macOS (Metal), Linux (Vulkan/Wayland), and Browser (WASM/WebGPU). Join the Gooey discord Early Development: API is evolving. Example app built with Gooey — chat-zig, an Anthropic Claude client using the Zig 0.16 std.

Hacker News 7d ago

Gemma 4 12B: A unified, encoder-free multimodal model

Introducing Gemma 4 12B: a unified, encoder-free multimodal model Today, we are introducing Gemma 4 12B, our latest model designed to bring agentic multimodal intelligence directly to laptops. Bridging the gap between our edge-friendly E4B and our more advanced 26B Mixture of Experts (MoE), Gemma 4 12B packages powerful capabilities inside a reduced memory footprint. It is also our first mid-sized model to feature native audio inputs.

Hacker News 7d ago

AGENTSERVESIM: A Hardware-aware Simulator for Multi-Turn LLM Agent Serving

Announce Type: new Abstract: Multi-turn LLM agents interleave model calls with external tool invocations, shifting serving from stateless request processing to stateful program execution. Serving these workloads requires scheduling, KV-cache management, and routing policies that use program-level context, including turn dependencies, tool-induced gaps, and reusable KV state. Evaluating such policies directly on real systems is costly, since each design point may require dedicated accelerator...

arXiv CS 1d ago

Electron Paramagnetic Resonance Study of Radical Species on NaNbO3@CeO2-Modified Carbon Vulcan XC72 Gas Diffusion Electrode for Electrochemical Degradation of Paracetamol via Electro-Fenton

Announce Type: new Abstract: While electrochemical oxidation is a promising technology for water treatment, a fundamental understanding of the specific radical mechanisms involved in pharmaceutical degradation has remained limited. This study addresses this gap by employing Electron Paramagnetic Resonance (EPR) spectroscopy to directly quantify the radical species generated during the degradation of paracetamol using a novel gas diffusion electrode (GDE) modified with NaNbO3 nanocubes and...

arXiv Physics 17h ago

ANNEAL: Adapting LLM Agents via Governed Symbolic Patch Learning

arXiv:2605.16309v2 Announce Type: replace Abstract: LLM-based agents can recover from individual execution errors, yet they repeatedly fail on the same fault when the underlying process knowledge--operator schemas, preconditions, and constraints--remains unrepaired. Existing self-evolving approaches address this gap by updating prompts, memory, or model weights, but none directly repair the symbolic structures that encode how tasks are executed, and few provide the governance guarantees...

arXiv CS 1d ago

MMG2Skill: Can Agents Distill In-the-Wild Guides into Self-Evolving Skills?

arXiv:2606.01993v1 Announce Type: new Abstract: Abundant procedural knowledge on the Web holds great potential for helping agents solve long-horizon tasks. However, such knowledge is often multimodal, heterogeneous, noisy, and implicitly assumes human executors, making it difficult to use directly as the skills required by agents. To bridge the gap between human-oriented guides and agent-executable skills, we formalize this problem as guide-to-skill learning: converting in-the-wild guides...

arXiv CS 8d ago

Two Bridges, One Pathway: From VLMs to Generalizable VLAs with Embodied Trajectory-Coupled Data

Announce Type: new Abstract: Vision-language models (VLMs) are powerful general-purpose reasoners, yet converting them into robot control policies (VLAs) is surprisingly difficult. The root cause is a two-fold gap: VLMs are trained on internet-scale images with language-understanding objectives, while VLAs must perceive robot scenes and predict motor actions. Fine-tuning a VLM directly on robot action data forces the model to cross both gaps at once -- the learning curve is steep and the...

arXiv CS 1d ago

Learning quality scores for chromatin accessibility bigWig tracks using Machine Learning

High-throughput chromatin accessibility assays such as bulk and single-cell ATAC-seq have generated large collections of processed signal tracks in bigWig format, which are widely used for visualisation, data integration, and Machine Learning (ML)-based analyses. Despite their central role, systematic quality control (QC) frameworks operating directly at the level of bigWig signal tracks remain underdeveloped. This gap limits the ability to assess data reliability and hampers robust...

bioRxiv 3d ago

MAdam: Metric-Aware Multi-Objective Adam

Announce Type: new Abstract: Multi-objective optimization (MOO) underlies many machine learning problems, yet MOO solvers across the loss-balancing, gradient-balancing, and Pareto-based families almost universally hand their reconciled directions to Adam~\cite{kingma2015adam}. We show this coupling introduces two systematic gaps between the solver's intent and the optimizer's execution. The first is a \emph{weighting mismatch}: Adam's second-moment denominator entangles the time-varying...

arXiv CS 7d ago