Showing 1–20 of 31 results
/ Date/ Name
Oct 15, 2021Multitask Prompted Training Enables Zero-Shot Task GeneralizationApr 12, 2022What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?Nov 13, 2020Pattern Problems related to the Arithmetic Kakeya ConjectureOct 9, 2024Pixtral 12BNov 3, 2022Crosslingual Generalization through Multitask FinetuningNov 9, 2022BLOOM: A 176B-Parameter Open-Access Multilingual Language ModelOct 10, 2023Mistral 7BJul 17, 2025VoxtralDec 9, 2025Monitoring Deployed AI Systems in Health CareNov 3, 2023FinGPT: Large Generative Models for a Small LanguageMay 1, 2025Wilson polygons and the topology of zero-dimensional systemsFeb 12, 2026DRACO: a Cross-Domain Benchmark for Deep Research Accuracy, Completeness, and ObjectivityJun 21, 2023OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text DocumentsOct 22, 2025OpenGuardrails: A Configurable, Unified, and Scalable Guardrails Platform for Large Language ModelsJun 12, 2025MagistralJan 13, 2026Ministral 3Mar 26, 2026Voxtral TTSOct 27, 2022What Language Model to Train if You Have One Million GPU Hours?May 9, 2023StarCoder: may the source be with you!May 27, 2025Power-Capping Metric Evaluation for Improving Energy Efficiency in HPC Applications