Showing 21–39 of 39 results
/ Date/ Name
Jun 17, 2025Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain PerspectiveFeb 4, 2025CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level RoutingApr 13, 2026CocoaBench: Evaluating Unified Digital Agents in the WildSep 4, 2018Texar: A Modularized, Versatile, and Extensible Toolkit for Text GenerationNov 16, 2023SegMix: A Simple Structure-Aware Data Augmentation MethodSep 19, 2023SlimPajama-DC: Understanding Data Combinations for LLM TrainingJun 28, 2024Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMsApr 10, 2025Token Level Routing Inference System for Edge DevicesNov 12, 2025PAN: A World Model for General, Interactable, and Long-Horizon World SimulationAug 3, 2025How Does Controllability Emerge In Language Models During Pretraining?Mar 12, 2026IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RLOct 9, 2022ASDOT: Any-Shot Data-to-Text Generation with Pretrained Language ModelsAug 16, 2025Data Mixing Optimization for Supervised Fine-Tuning of Large Language ModelsJan 4, 2026LAPS: A Length-Aware-Prefill LLM Serving SystemMay 3, 2018Towards Better Text Understanding and Retrieval through Kernel Entity Salience ModelingMay 19, 2025Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language ModelsNov 6, 2024Crystal: Illuminating LLM Abilities on Language and CodeAug 18, 2025Vision-G1: Towards General Vision Language Reasoning with Multi-Domain Data CurationSep 9, 2025K2-Think: A Parameter-Efficient Reasoning System