arXiv2
Search
Dark
/ Date
/ Name
Aa
W
Search
/ Date
/ Name
"au:"Bryan L. M. de Oliveira"" — arXiv2 Search
Showing 1–4 of 4 results
/ Date
/ Name
Nov 5, 2025
Learning Without Critics? Revisiting GRPO in Classical Reinforcement Learning Environments
Oct 17, 2024
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning
Jan 25, 2026
Do Reasoning Models Ask Better Questions? A Formal Information-Theoretic Analysis on Multi-Turn LLM Games
Feb 17, 2025
InfoQuest: Evaluating Multi-Turn Dialogue Agents for Open-Ended Conversations with Hidden Context