Home Knowledge Base S2

S2

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

See Less, Specify More: Visual Evidence Budgets for Generalizable VLAs

arXiv:2606.02735v2 Announce Type: replace Abstract: Generalization remains a central bottleneck for vision-language-action (VLA) models: under distractors, appearance shifts, and semantically similar tasks, the policy must often infer local execution details from coarse instructions while also deciding which parts of the image matter for control. We present S2 (See Less, Specify More), a framework for improving VLA generalization by training the executor under a cleaner interface. Specify...

arXiv CS 1d ago

See Less, Specify More: Visual Evidence Budgets for Generalizable VLAs

Announce Type: new Abstract: Generalization remains a central bottleneck for vision-language-action (VLA) models: under distractors, appearance shifts, and semantically similar tasks, the policy must often infer local execution details from coarse instructions while also deciding which parts of the image matter for control. We present S2 (See Less, Specify More), a framework for improving VLA generalization by training the executor under a cleaner interface. Specify More preserves the...

arXiv CS 7d ago

Show HN: RePlaya – self-hosted browser session replay with live tailing

Hi HN, I'm one of the founders of s2.dev. RePlaya (https://github.com/s2-streamstore/replaya) is a self-hosted browser session replay tool using rrweb (https://github.com/rrweb-io/rrweb).It occurred to me that a durable stream per session would be a much neater architectural foundation for much of what you'd want from such a tool. As a unique feature, it also made live tailing straightforward because the player can read from the same stream the recorder is appending to.

Hacker News 8d ago

CBS News' Bari Weiss addresses 'unfortunate' firing of Scott Pelley

IE 11 is not supported. For an optimal experience visit our site on another browser. Now Playing CBS News' Bari Weiss addresses 'unfortunate' firing of Scott Pelley 03:28 UP

NBC News 6d ago

CR-JEPA: Cross-Modal Joint-Embedding Predictive Learning for Remote Sensing Image Retrieval

arXiv:2606.00706v2 Announce Type: replace Abstract: Cross-modal remote sensing image retrieval aims to retrieve semantically related scenes across heterogeneous sensing modalities. This remains challenging because paired observations may differ substantially in imaging physics, spatial resolution, spectral configuration, and visual appearance. Moreover, a single retrieval projection trained with one objective may be insufficient to jointly support cross-modal semantic alignment and...

arXiv CS 1d ago

Robot-DIFT: Correspondence-Sensitive Diffusion Features for Contact-Rich Robot Manipulation

arXiv:2602.11934v2 Announce Type: replace Abstract: Robot manipulation often fails in the final millimeters: a policy may recognize the right object yet miss the pose offsets, boundaries, or pre-contact alignments needed for action. We argue that such failures arise when semantic invariance suppresses correspondence cues for closed-loop control, or when these cues are not exposed to the policy in a usable form. Modern visual encoders provide strong semantic abstractions, but contact-rich...

arXiv CS 1d ago

A Single-Loop Bilevel Deep Learning Method for Optimal Control of Obstacle Problems

arXiv:2601.04120v2 Announce Type: replace-cross Abstract: Optimal control of obstacle problems arises in a wide range of applications and is computationally challenging due to its nonsmoothness, nonlinearity, and bilevel structure. Classical numerical approaches rely on mesh-based discretization and typically require solving a sequence of costly subproblems. In this work, we propose a single-loop bilevel deep learning method, which is mesh-free, scalable to high-dimensional and complex...

arXiv CS 7d ago

Babel-USB: USB drive with every file

This project turns an ESP32-S3 development board into an infinite filesystem inspired by the digital Library of Babel. - Buy an ESP32-S3 development board - ideally one shaped as a USB stick for maximum bewilderment. This is the one I got, though you don't necessarily need this exact variant.

Hacker News 2h ago

KIT's Submission to Cross-Lingual Voice Cloning in IWSLT 2026

Announce Type: new Abstract: Cross-lingual voice cloning aims to generate speech in a target language while preserving speaker identity from a source-language reference. This task is central to speech translation and is the focus of the IWSLT 2026 Cross-Lingual Voice Cloning track. A key challenge is maintaining intelligibility and naturalness in the presence of accent variation and domain-specific vocabulary.

arXiv CS 2d ago

Introduction – Rust for Python Programmers

Rust for Python Programmers: Complete Training Guide A comprehensive guide to learning Rust for developers with Python experience. This guide covers everything from basic syntax to advanced patterns, focusing on the conceptual shifts required when moving from a dynamically-typed, garbage-collected language to a statically-typed systems language with compile-time memory safety. How to Use This Book Self-study format: Work through Part I (ch 1–6) first — these map closely to Python concepts...

Hacker News 4d ago