Showing 1–11 of 11 results
/ Date/ Name
Aug 21, 2024Critique-out-Loud Reward ModelsDec 1, 2022The Effect of Data Dimensionality on Neural Network PrunabilityNov 30, 20223D Neural Field Generation using Triplane DiffusionMay 30, 2024Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference ModelsFeb 7, 2024Hydra: Sequentially-Dependent Draft Heads for Medusa DecodingNov 15, 2023Striped Attention: Faster Ring Attention for Causal TransformersJun 17, 2024Vid3D: Synthesis of Dynamic 3D Scenes using 2D Video DiffusionMay 24, 2023Dynamic Masking Rate Schedules for MLM PretrainingNov 7, 2024Scaling Laws for PrecisionJun 11, 2025Unsupervised Elicitation of Language ModelsFeb 17, 2025Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding