/ Date/ Name

Machine Learning

cs.LG

/ Date/ Name

/ Date/ Name

Showing 841–860 of 3,135 results

/ Date/ Name

May 23, 2025PreMoE: Proactive Inference for Efficient Mixture-of-Experts May 22, 2025Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning May 22, 2025Losing is for Cherishing: Data Valuation Based on Machine Unlearning and Shapley Value May 21, 2025Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval May 21, 2025Multiple Weaks Win Single Strong: Large Language Models Ensemble Weak Reinforcement Learning Agents into a Supreme One May 21, 2025Neural Collapse is Globally Optimal in Deep Regularized ResNets and Transformers May 20, 2025Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training May 20, 2025KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation May 20, 2025Scale-invariant Attention May 20, 2025Model-Independent Machine Learning Approach for Nanometric Axial Localization and Tracking May 20, 2025Fast, close, non-singular and property-preserving approximations of entropic measures May 20, 2025From stability of Langevin diffusion to convergence of proximal MCMC for non-log-concave sampling May 20, 2025Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models May 19, 2025Thinking Short and Right Over Thinking Long: Serving LLM Reasoning Efficiently and Accurately May 18, 2025Neural Thermodynamics: Entropic Forces in Deep and Universal Representation Learning May 18, 2025Distributional Soft Actor-Critic with Harmonic Gradient for Safe and Efficient Autonomous Driving in Multi-lane Scenarios May 18, 2025PoLO: Proof-of-Learning and Proof-of-Ownership at Once with Chained Watermarking May 17, 2025HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate Class May 16, 2025Improving Medium Range Severe Weather Prediction through Transformer Post-processing of AI Weather Forecasts May 16, 2025Stepwise Guided Policy Optimization: Coloring your Incorrect Reasoning in GRPO

← Previous Next →