"au:"Zhuoshu Li"" — arXiv2 Search

Showing 1–9 of 9 results

Dec 2, 2025  DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
Mar 8, 2024  DeepSeek-VL: Towards Real-World Vision-Language Understanding
Jan 22, 2025  DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Dec 27, 2024  DeepSeek-V3 Technical Report
Sep 12, 2022  On the Factory Floor: ML Engineering for Industrial-Scale Ads Recommendation Models
Feb 24, 2025  An Exploratory Study on How AI Awareness Impacts Human-AI Design Collaboration
Jul 2, 2024  Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models
Oct 8, 2024  Data Efficiency for Large Recommendation Models
May 7, 2024  DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model