"au:"Pierre Ménard"" — arXiv2 Search

Showing 1–4 of 4 results

/ Date/ Name

Apr 23, 2026  A single algorithm for both restless and rested rotting bandits
Apr 17, 2026  The Harder Path: Last Iterate Convergence for Uncoupled Learning in Zero-Sum Games with Bandit Feedback
Feb 12, 2026  Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments
Sep 21, 2025  ARE: Scaling Up Agent Environments and Evaluations