arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Jian Liang"" — arXiv2 Search
Showing 1–5 of 5 results
/ Date
/ Name
Apr 23, 2026
Understanding and Mitigating Spurious Signal Amplification in Test-Time Reinforcement Learning for Math Reasoning
Dec 2, 2025
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
Jan 22, 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Dec 27, 2024
DeepSeek-V3 Technical Report
Aug 16, 2023
DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory