| Date | Name |
| --- | --- |
| Oct 22, 2023 | A Quadratic Synchronization Rule for Distributed Deep Learning |
| Sep 28, 2022 | Online Policy Optimization for Robust MDP |
| Sep 10, 2024 | Functionally Constrained Algorithm Solves Convex Simple Bilevel Problems |
| Feb 13, 2025 | Task Generalization With AutoRegressive Compositional Structure: Can Learning From $D$ Tasks Generalize to $D^{T}$ Tasks? |
| Apr 28, 2022 | Theory and Algorithms for Diffusion Processes on Riemannian Manifolds |
| May 25, 2019 | Exposure Bias versus Self-Recovery: Are Distortions Really Incremental for Autoregressive Text Generation? |
| Feb 5, 2021 | Provably Efficient Algorithms for Multi-Objective Competitive RL |
| May 4, 2024 | Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning |
| Nov 23, 2020 | On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm Perspective |
| Jan 2, 2023 | On Finding Small Hyper-Gradients in Bilevel Optimization: Hardness Results and Improved Analysis |
| Jun 10, 2025 | Solving Convex-Concave Problems with $\tilde{\mathcal{O}}(\epsilon^{-4/7})$ Second-Order Oracle Complexity |
| Apr 18, 2021 | Complexity Lower Bounds for Nonconvex-Strongly-Concave Min-Max Optimization |
| Nov 27, 2025 | On the Condition Number Dependency in Bilevel Optimization |
| Sep 1, 2025 | Multitask Battery Management with Flexible Pretraining |
| Sep 17, 2025 | PiERN: Token-Level Routing for Integrating High-Precision Computation and Reasoning |
| Dec 13, 2018 | A Probe Towards Understanding GAN and VAE Models |
| Nov 27, 2018 | A protonated brownmillerite electrolyte for superior low-temperature proton conductivity |
| Feb 13, 2022 | Sion's Minimax Theorem in Geodesic Metric Spaces and a Riemannian Extragradient Algorithm |
| Jan 18, 2017 | Step Stone Effect: A sp anti-bonding Mediated Long-Range Ferromagnetism in Cr-doped Carrier-Free Bi2Te3 |
| Feb 15, 2024 | Efficient Sampling on Riemannian Manifolds via Langevin MCMC |