arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Wenxuan Wang"" — arXiv2 Search
Showing 1–3 of 3 results
/ Date
/ Name
May 20, 2025
Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training
Mar 23, 2025
STShield: Single-Token Sentinel for Real-Time Jailbreak Detection in Large Language Models
Dec 3, 2022
Smoothing Policy Iteration for Zero-sum Markov Games