"au:"Zhuoshu Li"" — arXiv2 SearchShowing 1–9 of 9 results
/ Date/ Name
Dec 2, 2025DeepSeek-V3.2: Pushing the Frontier of Open Large Language ModelsMar 8, 2024DeepSeek-VL: Towards Real-World Vision-Language UnderstandingJan 22, 2025DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement LearningDec 27, 2024DeepSeek-V3 Technical ReportSep 12, 2022On the Factory Floor: ML Engineering for Industrial-Scale Ads Recommendation ModelsFeb 24, 2025An Exploratory Study on How AI Awareness Impacts Human-AI Design CollaborationJul 2, 2024Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language ModelsOct 8, 2024Data Efficiency for Large Recommendation ModelsMay 7, 2024DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model