Showing 21–40 of 251 results
/ Date/ Name
Sep 30, 2022Improving Policy Learning via Language Dynamics DistillationJan 25, 2023XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language ModelsMay 24, 2023Trusting Your Evidence: Hallucinate Less with Context-aware DecodingAug 31, 2023The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 LanguagesJun 27, 2012A Joint Model of Language and Perception for Grounded Attribute LearningJul 21, 2017End-to-end Neural Coreference ResolutionFeb 25, 2018NL2Bash: A Corpus and Semantic Parser for Natural Language Interface to the Linux Operating SystemJul 9, 2024Scaling Retrieval-Based Language Models with a Trillion-Token DatastoreJun 26, 2024Evaluating Copyright Takedown Methods for Language ModelsDec 13, 2024Byte Latent Transformer: Patches Scale Better Than TokensOct 22, 2024Altogether: Image Captioning via Re-aligning Alt-textJan 27, 2025Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware SparsityDec 19, 2022The case for 4-bit precision: k-bit Inference Scaling LawsNov 18, 2022DS-1000: A Natural and Reliable Benchmark for Data Science Code GenerationMar 16, 2023ART: Automatic multi-step reasoning and tool-use for large language modelsApr 12, 2024Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context LengthNov 8, 2024Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward PassDec 12, 2024Memory Layers at ScaleJan 6, 2025CAT: Content-Adaptive Image TokenizationSep 5, 2023Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning