"au:"Tianhao Chen"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Tianhao Chen"" — arXiv2 Search

Showing 1–17 of 17 results

/ Date/ Name

Mar 5, 2026Progressive Residual Warmup for Language Model Pretraining Nov 23, 2024Botfip-LLM: An Enhanced Multimodal Scientific Computing Framework Leveraging Knowledge Distillation from Large Language Models Jun 27, 2025GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling Jan 18, 2024Bootstrapping OTS-Funcimg Pre-training Model (Botfip) -- A Comprehensive Symbolic Regression Framework Jan 21, 2024Multi-Agent Generative Adversarial Interactive Self-Imitation Learning for AUV Formation Control and Obstacle Avoidance Jan 23, 2025UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models Sep 30, 2025Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners Feb 17, 2025Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving Jul 12, 2024Unifying Sequences, Structures, and Descriptions for Any-to-Any Protein Generation with the Large Multimodal Model HelixProtX Aug 26, 2024Category-Theoretical and Topos-Theoretical Frameworks in Machine Learning: A Survey Sep 12, 2023Use neural networks to recognize students' handwritten letters and incorrect symbols Aug 8, 2019Incremental Reinforcement Learning --- a New Continuous Reinforcement Learning Frame Based on Stochastic Differential Equation methods Jun 26, 2025Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning Feb 12, 2026Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Mar 28, 2024A noise-tolerant, resource-saving probabilistic binary neural network implemented by the SOT-MRAM compute-in-memory system Feb 1, 2025UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models Jun 14, 2023Curricular Subgoals for Inverse Reinforcement Learning