Rendering Fidelity
No mentions found
This entity hasn't been tracked yet, or Iris is still building its knowledge base.
Related Articles from SNS
WebSpline: Structure-Informed Splines for Real-Time 3D Gaussians from Monocular Videos
arXiv:2606.02096v1 Announce Type: new Abstract: Dynamic scene reconstruction from monocular videos remains highly challenging, as existing methods often struggle to balance global structural coherence and local fine-grained details under limited multi-view cues. To address this challenge, we propose WebSpline, a novel dynamic 3D Gaussian framework that enables structurally coherent and high-fidelity reconstruction from monocular videos with fast rendering. The core of WebSpline is the...
TextFake: Benchmarking AI-Generated Image Detection on Text-Rich Images
arXiv:2606.01050v1 Announce Type: new Abstract: Recent AI-generated image (AIGI) detectors perform well on natural-image benchmarks, but their behavior on text-rich forgeries, such as fabricated screenshots, documents, and news pages prevalent in misinformation, remains untested. We introduce TextFake, a 20,000-image benchmark for text-rich AIGI detection spanning 28 languages, 4 topic categories, and 2 scene modalities.
GN0: Toward a Unified Paradigm for Generation, Evaluation, and Policy Learning in Visual-Language Navigation
arXiv:2606.03682v1 Announce Type: new Abstract: Embodied navigation connects intelligent agents with the physical world and is fundamental for general robotic intelligence. Limited availability and quality of navigation data have constrained Vision-and-Language Navigation (VLN) systems' generalization and long-horizon capabilities. To address this, we curate diverse 3D scenes and develop an automated pipeline for large-scale navigation data, resulting in the GN-Matrix dataset.
Optimizing Rank for High-Fidelity Implicit Neural Representations
arXiv:2512.14366v2 Announce Type: replace Abstract: Implicit Neural Representations (INRs) based on vanilla Multi-Layer Perceptrons (MLPs) are widely believed to be incapable of representing high-frequency content. This has directed research efforts towards architectural interventions, such as coordinate embeddings or specialized activation functions, to represent high-frequency signals. In this paper, we challenge the notion that the low-frequency bias of vanilla MLPs is an intrinsic,...
DeblurSplat: SfM-free 3D Gaussian Splatting with Event Camera for Robust Deblurring
arXiv:2509.18898v2 Announce Type: replace Abstract: In this paper, we propose the first Structure-from-Motion (SfM)-free deblurring 3D Gaussian Splatting method via event camera, dubbed DeblurSplat. We address the motion-deblurring problem in two ways. First, we leverage the pretrained capability of the dense stereo module (DUSt3R) to directly obtain accurate initial point clouds from blurred images.
SparseStreet: Sparse Gaussian Splatting for Real-Time Street Scene Simulation
arXiv:2606.03909v1 Announce Type: new Abstract: While 3D Gaussian Splatting has shown promising results in street scene reconstruction, existing methods require massive numbers of Gaussian primitives to capture fine details, leading to prohibitive storage costs and slow rendering speeds. We observe that dynamic objects (e.g., vehicles and pedestrians) demand high-fidelity representations to maintain temporal consistency, while static background regions often contain substantial redundancy....
Mid-Infrared Single-Photon Edge Enhanced Imaging based on Nonlinear Vortex Filtering
Announce Type: new Abstract: Edge enhanced imaging via the spiral phase contrast enables to reveal the phase or amplitude gradients of a target, which has been proved useful in feature recognition, machine vision, and object identification. A long quest is to extend the operation wavelength into the mid-infrared (MIR) region, as highly demanded in various fields including infrared sensing, astronomic observation, and biomedical diagnosis.
Evaluating Reasoning Fidelity in Visual Text Generation
Announce Type: new Abstract: Recent text-to-image (T2I) models can render highly legible and well-structured text within images, enabling applications including document generation and slide generation. However, it remains unclear whether such systems faithfully preserve reasoning ability when complex solutions must be expressed directly through rendered text, or whether they merely imitate surface-level patterns.
A Machine Learning Enabled MDO for Bio-Inspired Autonomous Underwater Gliders
arXiv:2602.08508v2 Announce Type: replace Abstract: The preliminary design of AUGs is intrinsically challenging due to the strong coupling between the external hydrodynamic shape, the hydrostatic balance, the structural integrity, and internal packaging constraints. This complexity is further amplified for bio-inspired configurations, whose rich geometric parametrizations lead to high-dimensional design spaces that are difficult to explore using conventional optimization approaches. This...
REFINE: Super-efficient 3D Gaussian Splatting Pruning via Rendering-Free Primitive Importance
new Abstract: Existing pruning methods for 3D Gaussian splatting (3DGS) suffer from either severe quality degradation or prohibitive computational overhead. In this paper, we propose REFINE, a highly accelerated 3DGS pruning framework centered on a novel rendering-free primitive importance metric. Our approach leverages an analytically approximated, rendering-aware Hessian field to quantify the expected perceptual error induced by the removal of individual primitives.