the Entropy Regime
Related Articles from SNS
Stage-1 Controls the Entropy Regime, Not the Outcome
arXiv:2606.09059v1 Announce Type: new Abstract: Two-stage post-training -- a Stage-1 warm-start (supervised fine-tuning, SFT, or on-policy distillation, OPD) followed by Stage-2 reinforcement learning (RL) -- is increasingly used for vision-language models (VLMs). We ask what Stage-1 actually controls in a small-data study using Qwen2.5-VL-7B with a same-modality 72B VLM teacher for OPD. First, the three warm-starts reach a narrow $53$--$54\%$ band on Geometry3K internal validation,...
The Need for an External Observer Formalizing the Sufficiency Gap: A Mathematical Extension of Mixture Identifiability and Contextual Grounding in Sequence Models
arXiv:2605.26711v2 Announce Type: replace Abstract: We construct a binary mixed-regime process with one deterministic textual regime and one random regime governed by an unobserved latent state. Even an ideal infinite-capacity sequence predictor that exactly recovers the text-only marginal law can become overconfident when the observed prefix is compatible with the wrong latent regime. The resulting entropy difference is not an ordinary optimization error; it is a sufficiency gap caused by...
The Entropic Signature of Class Speciation in Diffusion Models
arXiv:2602.09651v2 Announce Type: replace-cross Abstract: Diffusion models do not recover semantic structure uniformly over time. Instead, samples transition from semantic ambiguity to class commitment within a narrow regime. Recent theoretical work attributes this transition to dynamical instabilities along class-separating directions, but practical methods to detect and exploit these windows in trained models are still limited.
Code Lifespan Survival Analysis (CLSA): Predicting the Survival of Source Code Lines Using AST-Aware Mining
arXiv:2606.04993v1 Announce Type: new Abstract: Context: Predicting which source lines will be deleted - and when - matters for maintenance, technical debt, and review prioritization. Existing MSR approaches work at file or method granularity, masking individual-statement risk. Objective: We introduce Code Lifespan Survival Analysis (CLSA), the first framework to model code survival at individual-line granularity.
vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models
arXiv:2603.04444v4 Announce Type: replace Abstract: As large language models (LLMs) diversify across modalities, capabilities, and cost profiles, the problem of intelligent request routing, that is, selecting the right model for each query at inference time, has become a critical systems challenge. We present vLLM Semantic Router, a signal-driven decision routing framework for Mixture-of-Modality (MoM) model deployments. The architecture follows two complementary Shannon-inspired views.
Information Geometry of Intracellular Compartment Coupling Reveals Transcriptomic State Transitions in Single Cells
Single-cell transcriptomic analyses typically characterize cellular states using gene-expression variability, dimensionality reduction, and trajectory inference. However, existing approaches provide limited insight into how transcriptomic information is organized across interacting intracellular compartments. Here we introduce Compartment Coupling Entropy (CCE), an information-geometric framework that quantifies the organization of transcriptomic coupling between spliced and unspliced RNA...
Ground-state phase diagram of Rydberg atoms in a triangular-prism array
arXiv:2606.01116v1 Announce Type: cross Abstract: We study the ground-state phase diagram of Rydberg atoms in a triangular-prism optical tweezer array using the density matrix renormalization group. By tuning the detuning-to-Rabi-frequency ratio and the Rydberg blockade radius, the system realizes several density-wave phases with spontaneous breaking of translational and leg-exchange symmetries. Unlike two-leg Rydberg ladders with $\mathbb{Z}_2$ leg-exchange symmetry, the triangular prism...
Float8@2bits: Entropy Coding Enables Data-Free Model Compression
arXiv:2601.22787v2 Announce Type: replace Abstract: Post-training compression is currently divided into two contrasting regimes. On the one hand, fast, data-free, and model-agnostic methods (e.g., NF4 or HQQ) offer maximum accessibility but suffer from functional collapse at extreme bit-rates below 4 bits. On the other hand, techniques leveraging calibration data or extensive recovery training achieve superior fidelity but impose high computational constraints and face uncertain robustness...
From Reward-Hack Activations to Agentic Risk States: Context-Calibrated Mechanistic Monitoring in LLM Agents
arXiv:2606.06223v1 Announce Type: new Abstract: Language-model agents act through repeated cycles of observation, reasoning, and action selection, making safety monitoring depend on both internal model state and environment context. We study reward-hacking monitors in ReAct-style agents acting in Gameable ALFWorld and WebShop. Agents are instrumented with activation-based reward-hack scores, token-level entropy, and decision-context features.
Pseudoentanglement in constant depth: How trivial states can have non-trivial entanglement structure
arXiv:2605.31448v1 Announce Type: cross Abstract: We construct a family of 2D-local constant-depth quantum circuits that output states whose entanglement entropy across a specified cut cannot be estimated in quantum polynomial time. As constant-depth quantum circuits can be learned from polynomially many quantum samples, our resulting pseudoentangled states are implicitly public-key and not pseudorandom. This separates pseudoentanglement from pseudorandomness in the shallow-circuit regime:...