arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Pierre Ménard"" — arXiv2 Search
Showing 1–4 of 4 results
/ Date
/ Name
Apr 23, 2026
A single algorithm for both restless and rested rotting bandits
Apr 17, 2026
The Harder Path: Last Iterate Convergence for Uncoupled Learning in Zero-Sum Games with Bandit Feedback
Feb 12, 2026
Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments
Sep 21, 2025
ARE: Scaling Up Agent Environments and Evaluations