Showing 1–20 of 21 results
/ Date/ Name
Mar 20, 2023Language Model Behavior: A Comprehensive SurveyApr 21, 2025Bigram Subnetworks: Mapping to Next Tokens in Transformer Language ModelsNov 15, 2023When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource LanguagesJun 10, 2021Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language ModelsMay 17, 2020Encodings of Source Syntax: Similarities in NMT Representations Across Target LanguagesMar 20, 2024Different Tokenization Schemes Lead to Comparable Performance in Spanish Number AgreementMar 13, 2024Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial TopicsMay 22, 2022The Geometry of Multilingual Language Model RepresentationsOct 22, 2024Scalable Influence and Fact Tracing for Large Language Model PretrainingAug 29, 2023Characterizing Learning Curves During Language Model Pre-Training: Learning, Forgetting, and StabilityOct 5, 2021Word Acquisition in Neural Language ModelsMay 26, 2023Characterizing and Measuring Linguistic Dataset DriftNov 15, 2023Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language ModelsMar 5, 2025On the Acquisition of Shared Grammatical Representations in Bilingual Language ModelsAug 19, 2024Goldfish: Monolingual Language Models for 350 LanguagesOct 28, 2025Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and CulturesSep 4, 2022Do Large Language Models know what humans know?Oct 11, 2023Crosslingual Structural Priming and the Pre-Training Dynamics of Bilingual Language ModelsMar 1, 2024A Bit of a Problem: Measurement Disparities in Dataset Sizes Across LanguagesMar 27, 2026How Open Must Language Models be to Enable Reliable Scientific Inference?