Showing 1–13 of 13 results
Date          Name
Jul 31, 2020  Explainable Prediction of Text Complexity: The Missing Preliminaries for Text Simplification
Jul 31, 2020  Neural Language Generation: Formulation, Methods, and Evaluation
Apr 29, 2025  HyPerAlign: Interpretable Personalized LLM Alignment via Hypothesis Generation
Apr 21, 2026  Personalized Benchmarking: Evaluating LLMs by Individual Preferences
Jun 11, 2022  Why is constrained neural language generation particularly challenging?
Apr 16, 2025  Evaluating the Goal-Directedness of Large Language Models
Jan 2, 2019   Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation
Oct 14, 2019  Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder
Oct 15, 2024  RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals
Jun 2, 2024   BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling
Feb 2, 2021   The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Jun 9, 2022   Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Jun 22, 2022  GEMv2: Multilingual NLG Benchmarking in a Single Line of Code