arXiv2
au:"Ulyana Piterbarg" (arXiv2 Search)
Showing 1–8 of 8 results
May 30, 2023
NetHack is Hard to Hack
Dec 12, 2023
diff History for Neural Language Agents
Oct 3, 2024
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis
Sep 21, 2025
ARE: Scaling Up Agent Environments and Evaluations
Oct 23, 2020
Capturing missing physics in climate model parameterizations using neural differential equations
Sep 3, 2025
Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents
Feb 12, 2026
Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments
Nov 20, 2024
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games