arXiv2
"au:"Shuaipeng Li"" — arXiv2 Search
Showing 1–2 of 2 results
May 21, 2025
Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought
Jul 16, 2024
MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training