arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Shanghaoran Quan"" — arXiv2 Search
Showing 1–4 of 4 results
/ Date
/ Name
May 29, 2025
ScaleLong: A Multi-Timescale Benchmark for Long Video Understanding
Feb 26, 2025
LongEval: A Comprehensive Analysis of Long-Text Generation Through a Plan-based Paradigm
Feb 20, 2025
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
Jan 2, 2025
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings