Showing 1–20 of 83 results
Date            Name
May 2, 2023     Revisiting Gradient Clipping: Stochastic bias and tight convergence guarantees
Feb 1, 2019     Decentralized Stochastic Optimization and Gossip Algorithms with Compressed Communication
Nov 3, 2020     A Linearly Convergent Algorithm for Decentralized Optimization: Sending Less Bits for Free!
Sep 4, 2020     On Communication Compression for Distributed Optimization on Heterogeneous Data
Feb 18, 2020    Is Local SGD Better than Minibatch SGD?
May 30, 2024    Towards Faster Decentralized Stochastic Optimization with Communication Compression
Jul 12, 2023    Locally Adaptive Federated Learning
Jun 6, 2025     Exploiting Similarity for Computation and Communication-Efficient Decentralized Optimization
Oct 18, 2012    Variable Metric Random Pursuit
Jun 10, 2020    Extrapolation for Large-batch Training in Deep Learning
Jul 9, 2019     Unified Optimal Analysis of the (Stochastic) Gradient Method
Sep 11, 2019    The Error-Feedback Framework: Better Rates for SGD with Delayed Gradients and Compressed Communication
Nov 10, 2021    Linear Speedup in Personalized Collaborative Learning
Oct 14, 2019    SCAFFOLD: Stochastic Controlled Averaging for Federated Learning
Oct 8, 2021     RelaySum for Decentralized Deep Learning on Heterogeneous Data
Jan 25, 2025    Scalable Decentralized Learning with Teleportation
Mar 5, 2024     Non-convex Stochastic Composite Optimization with Polyak Momentum
Jun 1, 2018     Global linear convergence of Newton's method without strong-convexity or Lipschitz gradients
Jul 31, 2020    On the Convergence of SGD with Biased Gradients
Feb 18, 2022    Tackling benign nonconvexity with smoothing and stochastic gradients