Scale Precision
No mentions found
This entity hasn't been tracked yet, or Iris is still building its knowledge base.
Related Articles from SNS
Ablation Study of Block Size, Weight Precision, and Scale Precision in NVFP4 Inference for Low-Power Edge-Efficient Neural Networks
Announce Type: new Abstract: Energy-efficient edge inference requires reducing arithmetic cost, memory traffic, and hardware overhead. This paper presents an ablation-focused study of NVFP4 LUT-based inference for edge-efficient neural networks. The proposed NVLUT framework combines 4-bit NVFP4 activations, two-level scaling, LUT-based mantissa computation, voltage-scaled storage, and selective ECC protection.
Characterizing the Impact of NVFP4 Quantization for Low-Power Edge AI Deployment
arXiv:2606.06527v2 Announce Type: replace Abstract: Energy-efficient neural-network inference at the edge requires reducing arithmetic cost, memory traffic, computation energy, and storage overhead while maintaining acceptable accuracy. This paper presents an ablation-focused study of NVFP4 quantization for edge-efficient neural networks, with emphasis on the relationship between activation precision, weight precision, block-size scaling, retraining, and model accuracy. NVFP4 activations are...
Scale When Needed: Adaptive Neuron-level Mixed Precision Quantization Aware Training
arXiv:2605.25054v2 Announce Type: replace Abstract: Deploying deep neural networks on resource-constrained 6G edge devices demands aggressive compression with minimal accuracy loss. Quantization-Aware Training (QAT) has emerged as a leading compression approach; however, existing mixed-precision methods typically operate at coarse layer- or channel-level granularity. These methods often rely on heuristic or search-based bit-allocation strategies, which may overlook fine-grained variability...
MixFP4: Enhancing NVFP4 with Adaptive FP4/INT4 Block Representations
Announce Type: new Abstract: As large language models continue to scale, fine-grained block-scaled low-precision formats such as NVFP4 are increasingly adopted for their substantial throughput and memory benefits. However, a single FP4 micro-format often mismatches heterogeneous block-level tensor statistics. To address this without changing the standard block-scaled MMA/GEMM execution path, we propose MixFP4, a mixed micro-format extension to NVFP4 that selects between two stored FP4...
CIPER: A Unified Framework for Cross-view Image-retrieval and Pose-estimation
arXiv:2606.05011v1 Announce Type: new Abstract: Cross-view geo-localization estimates the geographic location of a ground image by matching it against an aerial image database. Existing methods tackle this through either large-scale retrieval or precise pose estimation, but not both: retrieval-based methods enable wide-area search at the cost of localization accuracy, while pose estimation methods achieve high precision within only a narrow search space. Naively cascading these pipelines...
Ultralow shot noise limited giant passive resonant gyroscope for Earth rotation measurement
Announce Type: new Abstract: Optical gyroscopes directly measure the Earth's rotation and are promising instruments for real-time geophysical observations and Earth orientation parameter (EOP) determination requiring both high precision and high temporal resolution. Large-scale ring laser gyroscopes (RLGs) currently reach rotational resolutions around $10^{-11}\,\mathrm{(rad/s)/\sqrt{Hz}}$, but their quantum noise limits make it challenging to meet the requirements of future...
3D-printed nozzle array could streamline production of drug-delivery microparticles
3D-printed nozzle array could streamline production of drug-delivery microparticles Lisa Lock Scientific Editor Robert Egan Associate Editor MIT researchers have demonstrated a low-cost design for specialized electronic nozzles, called triaxial electrospray emitters, that could be used to manufacture time-release drug-delivery particles or self-healing materials efficiently and at scale. Triaxial electrospray emitters use electricity to precisely dispense three liquids from microscopic...
Precision measurement under impact—when the balance itself becomes the object of measurement
How do you take measurements using one of the most sensitive scales in the world? Researchers at TU Wien have demonstrated how the measurement process affects not only the object being measured but also the scale itself, and where the absolute limits of precision lie.
Spatial Deformation Mechnisim of Meta-Atom Coupling and Scaling
arXiv:2606.04796v1 Announce Type: new Abstract: Metasurfaces enable precise manipulation of light-matter interactions, and meta-atom coupling and scaling dominates their resonant properties and functional responses. Conventionally, coupled-mode theory (CMT), coupled dipole theory (CDT) and full-wave simulation are widely adopted to analyze such coupling effects.
ANNS-AMP: Accelerating Approximate Nearest Neighbor Search via Adaptive Mixed-Precision Computing
arXiv:2606.07156v1 Announce Type: new Abstract: Approximate nearest neighbor search(ANNS) is a critical kernel in modern applications such as LLM and recommendation systems. However,its efficiency is fundamentally limited by the need to compute distances between a query and a massive number of high-dimensional vectors,most of which are non-neighbors. Existing approaches reduce redundancy via index optimization or early termination,but remain constrained by fixed-precision computation,leading...