arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Shiwen Ni"" — arXiv2 Search
Showing 1–4 of 4 results
/ Date
/ Name
Nov 11, 2025
Automatic Paper Reviewing with Heterogeneous Graph Reasoning over LLM-Simulated Reviewer-Author Debates
Aug 21, 2025
A Survey on Large Language Model Benchmarks
May 29, 2025
ScaleLong: A Multi-Timescale Benchmark for Long Video Understanding
Feb 20, 2025
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines