Unified Framework for Cross
No mentions found
This entity hasn't been tracked yet, or Iris is still building its knowledge base.
Related Articles from SNS
CIPER: A Unified Framework for Cross-view Image-retrieval and Pose-estimation
arXiv:2606.05011v1 Announce Type: new Abstract: Cross-view geo-localization estimates the geographic location of a ground image by matching it against an aerial image database. Existing methods tackle this through either large-scale retrieval or precise pose estimation, but not both: retrieval-based methods enable wide-area search at the cost of localization accuracy, while pose estimation methods achieve high precision within only a narrow search space. Naively cascading these pipelines...
Cross-Layer Subspace Coupling for LLM Compression: A Unifying Framework and Its Empirical Limits
Announce Type: new Abstract: Recent SVD based compression methods for large language models like SVD LLM and Basis Sharing can be unified under one optimization problem. While mathematical proofs and tests on Pythia models show this unified approach improves weight reconstruction error by up to 46% percent it fails in practical tasks.
Cross-Layer Subspace Coupling for LLM Compression: A Unifying Framework and Its Empirical Limits
arXiv:2605.30836v2 Announce Type: replace Abstract: Recent SVD based compression methods for large language models like SVD LLM and Basis Sharing can be unified under one optimization problem. While mathematical proofs and tests on Pythia models show this unified approach improves weight reconstruction error by up to 46% percent it fails in practical tasks.
A Unified Framework for Virtual Wave Transform: From Generalized Formulation to Excitation-Specific Projection
Announce Type: cross Abstract: We present a unified theoretical framework for the mapping between diffusive and wave-like dynamics, formulated as a spectral integral operator acting on temporal fields. By introducing an analytic continuation in the complex frequency plane, we establish an explicit correspondence between thermal diffusion and a virtual wave field governed by a hyperbolic equation. This mapping is shown to define a causal, compact Fredholm operator that acts as a nonstationary...
Symmetric Divergence and Normalized Similarity: A Unified Topological Framework for Representation Analysis
arXiv:2606.06342v1 Announce Type: cross Abstract: Topological Data Analysis (TDA) offers a principled, intrinsic lens for comparing neural representations. However, existing paired topological divergences (e.g., RTD) are limited by heuristic asymmetry and, more critically, unbounded scores that depend on sample size, hindering reliable cross-scenario benchmarking. To address these challenges, we develop a unified topological toolkit serving two complementary needs: fine-grained structural...
OneVLA: A Unified Framework for Embodied Tasks
arXiv:2606.01241v1 Announce Type: new Abstract: Navigation and manipulation are fundamental capabilities of embodied intelligence, enabling robots to interpret natural language commands and interact physically with their surroundings. However, current Vision-Language-Action (VLA) models remain constrained by task-specific architectures, specializing in either navigation or manipulation, which hinders the development of general-purpose robotic agents. To bridge this gap, we introduce OneVLA,...
OneVLA: A Unified Framework for Embodied Tasks
Announce Type: replace Abstract: Navigation and manipulation are fundamental capabilities of embodied intelligence, enabling robots to interpret natural language commands and interact physically with their surroundings. However, current Vision-Language-Action (VLA) models remain constrained by task-specific architectures, specializing in either navigation or manipulation, which hinders the development of general-purpose robotic agents. To bridge this gap, we introduce OneVLA, a unified...
Beyond tokens: a unified framework for latent communication in LLM-based multi-agent systems
arXiv:2606.05711v2 Announce Type: replace Abstract: Multi-agent systems built on large language models (LLMs) have become a prevailing paradigm for tackling complex reasoning, planning, and tool-use tasks. The dominant communication protocol in such systems is natural language: agents exchange messages token-by-token, verbalising their internal reasoning so that peers can read, verify, and respond. While convenient and interpretable, this protocol suffers from three structural drawbacks --...
Beyond tokens: a unified framework for latent communication in LLM-based multi-agent systems
arXiv:2606.05711v1 Announce Type: new Abstract: Multi-agent systems built on large language models (LLMs) have become a prevailing paradigm for tackling complex reasoning, planning, and tool-use tasks. The dominant communication protocol in such systems is natural language: agents exchange messages token-by-token, verbalising their internal reasoning so that peers can read, verify, and respond. While convenient and interpretable, this protocol suffers from three structural drawbacks -- high...
StandardE2E: A Unified Framework for End-to-End Autonomous Driving Datasets
arXiv:2606.04271v1 Announce Type: new Abstract: Autonomous driving has shifted from modular perception-prediction-planning stacks toward end-to-end (E2E) models that map sensor inputs directly to vehicle control, often regularized by auxiliary tasks such as 3D detection, motion forecasting, and HD-map perception. Progress is driven by a fast-growing ecosystem of sensor-rich driving datasets, yet each ships its own file formats, APIs, coordinate conventions, and modality coverage, leaving...