"au:"Despoina Magka"" — arXiv2 Search
Showing 1–9 of 9 results
Date / Name
Feb 6, 2026 / AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents
Jun 27, 2025 / The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
Feb 4, 2014 / Acyclicity Notions for Existential Rules and Their Application to Query Answering in Ontologies
Jul 3, 2025 / AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench
Nov 17, 2025 / Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance
Nov 19, 2025 / What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
Mar 3, 2026 / APRES: An Agentic Paper Revision and Evaluation System
Feb 20, 2025 / MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Mar 27, 2026 / AIRA_2: Overcoming Bottlenecks in AI Research Agents