Showing 841–860 of 3,135 results
/ Date/ Name
May 23, 2025PreMoE: Proactive Inference for Efficient Mixture-of-ExpertsMay 22, 2025Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised ReasoningMay 22, 2025Losing is for Cherishing: Data Valuation Based on Machine Unlearning and Shapley ValueMay 21, 2025Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context RetrievalMay 21, 2025Multiple Weaks Win Single Strong: Large Language Models Ensemble Weak Reinforcement Learning Agents into a Supreme OneMay 21, 2025Neural Collapse is Globally Optimal in Deep Regularized ResNets and TransformersMay 20, 2025Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional TrainingMay 20, 2025KORGym: A Dynamic Game Platform for LLM Reasoning EvaluationMay 20, 2025Scale-invariant AttentionMay 20, 2025Model-Independent Machine Learning Approach for Nanometric Axial Localization and TrackingMay 20, 2025Fast, close, non-singular and property-preserving approximations of entropic measuresMay 20, 2025From stability of Langevin diffusion to convergence of proximal MCMC for non-log-concave samplingMay 20, 2025Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language ModelsMay 19, 2025Thinking Short and Right Over Thinking Long: Serving LLM Reasoning Efficiently and AccuratelyMay 18, 2025Neural Thermodynamics: Entropic Forces in Deep and Universal Representation LearningMay 18, 2025Distributional Soft Actor-Critic with Harmonic Gradient for Safe and Efficient Autonomous Driving in Multi-lane ScenariosMay 18, 2025PoLO: Proof-of-Learning and Proof-of-Ownership at Once with Chained WatermarkingMay 17, 2025HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate ClassMay 16, 2025Improving Medium Range Severe Weather Prediction through Transformer Post-processing of AI Weather ForecastsMay 16, 2025Stepwise Guided Policy Optimization: Coloring your Incorrect Reasoning in GRPO