arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Zhoujun Li"" — arXiv2 Search
Showing 1–3 of 3 results
/ Date
/ Name
May 20, 2025
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation
Feb 20, 2025
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
Jun 11, 2024
McEval: Massively Multilingual Code Evaluation