STaR-Quant

Related Articles from SNS

STaR-Quant: State-Time Consistent Post-Training Quantization for Diffusion Large Language Models

arXiv:2606.04945v1 (announce type: new)

Abstract: Diffusion large language models (DLLMs) have recently emerged as a promising alternative to autoregressive LLMs by generating text through iterative masked denoising with bidirectional context. However, their large model sizes and iterative denoising process introduce substantial memory and computational overhead, motivating post-training quantization for efficient deployment. In this paper, we identify two key challenges for low-bit DLLM...

arXiv CS 6d ago

STaR-Quant: State-Time Consistent Post-Training Quantization for Diffusion Large Language Models

arXiv:2606.04945v2 (announce type: replace)

Abstract: Diffusion large language models (DLLMs) have recently emerged as a promising alternative to autoregressive LLMs by generating text through iterative masked denoising with bidirectional context. However, their large model sizes and iterative denoising process introduce substantial memory and computational overhead, motivating post-training quantization for efficient deployment. In this paper, we identify two key challenges for low-bit DLLM...

arXiv CS 1d ago