"au:"Roberta Raileanu"" — arXiv2 Search

Showing 1–7 of 7 results

Feb 6, 2026: AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents
Nov 17, 2025: Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance
Jul 3, 2025: AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench
Jun 27, 2025: The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
Feb 20, 2025: MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Jul 31, 2024: The Llama 3 Herd of Models
Feb 13, 2024: GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements