Date / Name
May 23, 2024 / Lessons from the Trenches on Reproducible Evaluation of Language Models
Apr 21, 2023 / Emergent and Predictable Memorization in Large Language Models
Apr 3, 2023 / Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
Feb 24, 2023 / ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics
Feb 12, 2024 / Suppressing Pink Elephants with Direct Principle Feedback
Jul 25, 2024 / Self-Directed Synthetic Dialogues and Revisions Technical Report
Jun 24, 2024 / From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models
Jun 2, 2023 / GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration
Nov 3, 2022 / Crosslingual Generalization through Multitask Finetuning
Nov 9, 2022 / BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Jan 9, 2023 / SantaCoder: don't reach for the stars!
May 9, 2023 / StarCoder: may the source be with you!
Jan 24, 2025 / Humanity's Last Exam
Oct 16, 2023 / Llemma: An Open Language Model For Mathematics
Mar 12, 2025 / PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
Dec 19, 2022 / BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Nov 30, 2022 / Explicit Knowledge Transfer for Weakly-Supervised Code Generation
Jun 6, 2024 / Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
Jul 20, 2024 / Consent in Crisis: The Rapid Decline of the AI Data Commons
Jun 24, 2024 / The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources