"au:"Thomas Wang"" — arXiv2 Search

Showing 1–20 of 31 results

Oct 15, 2021  Multitask Prompted Training Enables Zero-Shot Task Generalization
Apr 12, 2022  What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?
Nov 13, 2020  Pattern Problems related to the Arithmetic Kakeya Conjecture
Oct 9, 2024  Pixtral 12B
Nov 3, 2022  Crosslingual Generalization through Multitask Finetuning
Nov 9, 2022  BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Oct 10, 2023  Mistral 7B
Jul 17, 2025  Voxtral
Dec 9, 2025  Monitoring Deployed AI Systems in Health Care
Nov 3, 2023  FinGPT: Large Generative Models for a Small Language
May 1, 2025  Wilson polygons and the topology of zero-dimensional systems
Feb 12, 2026  DRACO: a Cross-Domain Benchmark for Deep Research Accuracy, Completeness, and Objectivity
Jun 21, 2023  OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
Oct 22, 2025  OpenGuardrails: A Configurable, Unified, and Scalable Guardrails Platform for Large Language Models
Jun 12, 2025  Magistral
Jan 13, 2026  Ministral 3
Mar 26, 2026  Voxtral TTS
Oct 27, 2022  What Language Model to Train if You Have One Million GPU Hours?
May 9, 2023  StarCoder: may the source be with you!
May 27, 2025  Power-Capping Metric Evaluation for Improving Energy Efficiency in HPC Applications