arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Yinfei Pan"" — arXiv2 Search
Showing 1–3 of 3 results
/ Date
/ Name
Apr 9, 2026
HAWK: Head Importance-Aware Visual Token Pruning in Multimodal Models
Oct 22, 2024
FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs
Nov 21, 2025
E$^3$-Pruner: Towards Efficient, Economical, and Effective Layer Pruning for Large Language Models