arXiv2 Search: au:"Wendi Li"
Showing 1–3 of 3 results
Feb 23, 2026
LAD: Learning Advantage Distribution for Reasoning
Feb 4, 2026
Uncertainty Quantification in LLM Agents: Foundations, Emerging Challenges, and Opportunities
Sep 27, 2025
General Exploratory Bonus for Optimistic Exploration in RLHF