Home Knowledge Base Scale Precision

Scale Precision

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Ablation Study of Block Size, Weight Precision, and Scale Precision in NVFP4 Inference for Low-Power Edge-Efficient Neural Networks

Announce Type: new Abstract: Energy-efficient edge inference requires reducing arithmetic cost, memory traffic, and hardware overhead. This paper presents an ablation-focused study of NVFP4 LUT-based inference for edge-efficient neural networks. The proposed NVLUT framework combines 4-bit NVFP4 activations, two-level scaling, LUT-based mantissa computation, voltage-scaled storage, and selective ECC protection.

arXiv CS 2d ago

Characterizing the Impact of NVFP4 Quantization for Low-Power Edge AI Deployment

arXiv:2606.06527v2 Announce Type: replace Abstract: Energy-efficient neural-network inference at the edge requires reducing arithmetic cost, memory traffic, computation energy, and storage overhead while maintaining acceptable accuracy. This paper presents an ablation-focused study of NVFP4 quantization for edge-efficient neural networks, with emphasis on the relationship between activation precision, weight precision, block-size scaling, retraining, and model accuracy. NVFP4 activations are...

arXiv CS 1d ago

Scale When Needed: Adaptive Neuron-level Mixed Precision Quantization Aware Training

arXiv:2605.25054v2 Announce Type: replace Abstract: Deploying deep neural networks on resource-constrained 6G edge devices demands aggressive compression with minimal accuracy loss. Quantization-Aware Training (QAT) has emerged as a leading compression approach; however, existing mixed-precision methods typically operate at coarse layer- or channel-level granularity. These methods often rely on heuristic or search-based bit-allocation strategies, which may overlook fine-grained variability...

arXiv CS 2d ago

MixFP4: Enhancing NVFP4 with Adaptive FP4/INT4 Block Representations

Announce Type: new Abstract: As large language models continue to scale, fine-grained block-scaled low-precision formats such as NVFP4 are increasingly adopted for their substantial throughput and memory benefits. However, a single FP4 micro-format often mismatches heterogeneous block-level tensor statistics. To address this without changing the standard block-scaled MMA/GEMM execution path, we propose MixFP4, a mixed micro-format extension to NVFP4 that selects between two stored FP4...

arXiv CS 9d ago

CIPER: A Unified Framework for Cross-view Image-retrieval and Pose-estimation

arXiv:2606.05011v1 Announce Type: new Abstract: Cross-view geo-localization estimates the geographic location of a ground image by matching it against an aerial image database. Existing methods tackle this through either large-scale retrieval or precise pose estimation, but not both: retrieval-based methods enable wide-area search at the cost of localization accuracy, while pose estimation methods achieve high precision within only a narrow search space. Naively cascading these pipelines...

arXiv CS 6d ago

Ultralow shot noise limited giant passive resonant gyroscope for Earth rotation measurement

Announce Type: new Abstract: Optical gyroscopes directly measure the Earth's rotation and are promising instruments for real-time geophysical observations and Earth orientation parameter (EOP) determination requiring both high precision and high temporal resolution. Large-scale ring laser gyroscopes (RLGs) currently reach rotational resolutions around $10^{-11}\,\mathrm{(rad/s)/\sqrt{Hz}}$, but their quantum noise limits make it challenging to meet the requirements of future...

arXiv Physics 2d ago

3D-printed nozzle array could streamline production of drug-delivery microparticles

3D-printed nozzle array could streamline production of drug-delivery microparticles Lisa Lock Scientific Editor Robert Egan Associate Editor MIT researchers have demonstrated a low-cost design for specialized electronic nozzles, called triaxial electrospray emitters, that could be used to manufacture time-release drug-delivery particles or self-healing materials efficiently and at scale. Triaxial electrospray emitters use electricity to precisely dispense three liquids from microscopic...

Phys.org 1d ago

Precision measurement under impact—when the balance itself becomes the object of measurement

How do you take measurements using one of the most sensitive scales in the world? Researchers at TU Wien have demonstrated how the measurement process affects not only the object being measured but also the scale itself, and where the absolute limits of precision lie.

Phys.org 23h ago

Spatial Deformation Mechnisim of Meta-Atom Coupling and Scaling

arXiv:2606.04796v1 Announce Type: new Abstract: Metasurfaces enable precise manipulation of light-matter interactions, and meta-atom coupling and scaling dominates their resonant properties and functional responses. Conventionally, coupled-mode theory (CMT), coupled dipole theory (CDT) and full-wave simulation are widely adopted to analyze such coupling effects.

arXiv Physics 6d ago

ANNS-AMP: Accelerating Approximate Nearest Neighbor Search via Adaptive Mixed-Precision Computing

arXiv:2606.07156v1 Announce Type: new Abstract: Approximate nearest neighbor search(ANNS) is a critical kernel in modern applications such as LLM and recommendation systems. However,its efficiency is fundamentally limited by the need to compute distances between a query and a massive number of high-dimensional vectors,most of which are non-neighbors. Existing approaches reduce redundancy via index optimization or early termination,but remain constrained by fixed-precision computation,leading...

arXiv CS 2d ago