"au:"Yoram Bachrach"" — arXiv2 SearchShowing 1–8 of 8 results
/ Date/ Name
Mar 27, 2026AIRA_2: Overcoming Bottlenecks in AI Research AgentsMar 3, 2026APRES: An Agentic Paper Revision and Evaluation SystemFeb 6, 2026AIRS-Bench: a Suite of Tasks for Frontier AI Research Science AgentsNov 19, 2025What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation DiversityNov 17, 2025Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM PerformanceJul 3, 2025AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-benchJun 27, 2025The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT ImprovementsFeb 20, 2025MLGym: A New Framework and Benchmark for Advancing AI Research Agents