Home Science Local MixVR: Breaking the Communication-Sample...
Science

Local MixVR: Breaking the Communication-Sample Dependence in Distributed Learning

Key Points

arXiv:2606.01128v1 Announce Type: new Abstract: Communication overhead is a crucial bottleneck in scalable distributed learning. While existing methods aim to efficiently utilize data points, such as Local SGD, Minibatch SGD, and their accelerated variants, they still exhibit communication-round complexity that scales with the total number of samples $N$. In this paper, we introduce Local MixVR, a distributed framework that integrates local updates with variance-reduction techniques to...

arXiv:2606.01128v1 Announce Type: new Abstract: Communication overhead is a crucial bottleneck in scalable distributed learning. While existing methods aim to efficiently utilize data points, such as Local SGD, Minibatch SGD, and their accelerated variants, they still exhibit communication-round complexity that scales with the total number of samples $N$. In this paper, we introduce Local MixVR, a distributed framework that integrates local updates with variance-reduction techniques to mitigate local noise. We show that Local MixVR is the first distributed method to eliminate the dependence of communication complexity on $N$, achieving a complexity that scales only with the number of workers $M$. In common regimes where $M
Local SGD (ORG) Minibatch SGD (ORG) Local MixVR (PERSON) Minibatch Accelerated SGD (ORG)
Originally published by arXiv CS Read original →