arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Siyu Yuan"" — arXiv2 Search
Showing 1–5 of 5 results
/ Date
/ Name
May 20, 2025
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation
Apr 10, 2025
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning
Feb 16, 2025
DEEPER Insight into Your User: Directed Persona Refinement for Dynamic Persona Modeling
Jan 5, 2025
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
Jul 7, 2024
MINDECHO: Role-Playing Language Agents for Key Opinion Leaders