Showing 1–17 of 17 results
/ Date/ Name
Mar 5, 2026Progressive Residual Warmup for Language Model PretrainingNov 23, 2024Botfip-LLM: An Enhanced Multimodal Scientific Computing Framework Leveraging Knowledge Distillation from Large Language ModelsJun 27, 2025GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation ScalingJan 18, 2024Bootstrapping OTS-Funcimg Pre-training Model (Botfip) -- A Comprehensive Symbolic Regression FrameworkJan 21, 2024Multi-Agent Generative Adversarial Interactive Self-Imitation Learning for AUV Formation Control and Obstacle AvoidanceJan 23, 2025UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language ModelsSep 30, 2025Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient ReasonersFeb 17, 2025Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem SolvingJul 12, 2024Unifying Sequences, Structures, and Descriptions for Any-to-Any Protein Generation with the Large Multimodal Model HelixProtXAug 26, 2024Category-Theoretical and Topos-Theoretical Frameworks in Machine Learning: A SurveySep 12, 2023Use neural networks to recognize students' handwritten letters and incorrect symbolsAug 8, 2019Incremental Reinforcement Learning --- a New Continuous Reinforcement Learning Frame Based on Stochastic Differential Equation methodsJun 26, 2025Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-TuningFeb 12, 2026Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language ModelsMar 28, 2024A noise-tolerant, resource-saving probabilistic binary neural network implemented by the SOT-MRAM compute-in-memory systemFeb 1, 2025UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language ModelsJun 14, 2023Curricular Subgoals for Inverse Reinforcement Learning