arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Liqun Deng"" — arXiv2 Search
Showing 1–3 of 3 results
/ Date
/ Name
May 7, 2025
Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Apr 10, 2025
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Nov 4, 2024
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity