arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Mengru Wang"" — arXiv2 Search
Showing 1–5 of 5 results
/ Date
/ Name
Apr 14, 2026
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
Feb 26, 2026
SkillNet: Create, Evaluate, and Connect AI Skills
Feb 2, 2026
Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics
May 20, 2025
Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training
Jul 2, 2024
To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models