The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements — arXiv2