Showing 1–20 of 80 results
/ Date/ Name
Jun 13, 2019Unsupervised Video Interpolation Using Cycle ConsistencySep 17, 2019Megatron-LM: Training Multi-Billion Parameter Language Models Using Model ParallelismJan 2, 2021End-to-End Training of Neural Retrievers for Open-Domain Question AnsweringMay 10, 2022Reducing Activation Recomputation in Large Transformer ModelsDec 15, 2021Few-shot Instruction Prompts for Pretrained Language Models to Detect Social BiasesOct 6, 2022Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language ModelsFeb 22, 2020Training Question Answering Models From Synthetic DataMar 2, 2020Style Example-Guided Text Generation using Generative Adversarial TransformersDec 25, 2019Neural ODEs for Image Segmentation with Level SetsMay 13, 2020Large Scale Multi-Actor Generative Dialog ModelingApr 9, 2021Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LMJul 5, 2021Long-Short Transformer: Efficient Transformers for Language and VisionFeb 8, 2022Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language ModelsOct 12, 2022Context Generation Improves Open Domain Question AnsweringJan 18, 2024ChatQA: Surpassing GPT-4 on Conversational QA and RAGNov 13, 2025Music Flamingo: Scaling Music Understanding in Audio Language ModelsSep 26, 2025RLP: Reinforcement as a Pretraining ObjectiveMar 19, 2026Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy DistillationMar 14, 2026MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World VideosJun 9, 2022Factuality Enhanced Language Models for Open-Ended Text Generation