Showing 1–20 of 21 results
/ Date/ Name
Dec 9, 2020Generative Adversarial Networks for Annotated Data Augmentation in Data Sparse NLUDec 7, 2020Evaluating Cross-Lingual Transfer Learning Approaches in Multilingual Conversational Agent ModelsApr 1, 2025Multi-Token AttentionDec 8, 2023PathFinder: Guided Search over Multi-Step Reasoning PathsDec 15, 2022ROSCOE: A Suite of Metrics for Scoring Step-by-Step ReasoningJan 30, 2024Efficient Tool Use with Chain-of-Abstraction ReasoningSep 5, 2023Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction TuningDec 16, 2022ALERT: Adapting Language Models to Reasoning TasksJul 31, 2025CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasksMar 20, 2024Reverse Training to Nurse the Reversal CurseJul 28, 2024Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-JudgeJan 29, 2026Self-Improving Pretraining: using post-trained models to pretrain better modelsMar 12, 2024Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLMOct 4, 2023DOMINO: A Dual-System for Multi-step Visual Language ReasoningJun 5, 2025Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language ModelsSep 29, 2025The Era of Real-World Human Interaction: RL from User ConversationsAug 26, 2025StepWiser: Stepwise Generative Judges for Wiser ReasoningAug 8, 2023Shepherd: A Critic for Language Model GenerationAug 5, 2024Self-Taught EvaluatorsMay 29, 2024Contextual Position Encoding: Learning to Count What's Important