arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Shao Tang"" — arXiv2 Search
Showing 1–2 of 2 results
/ Date
/ Name
Feb 20, 2025
Scaling Down, Serving Fast: Compressing and Deploying Efficient LLMs for Recommendation Systems
Feb 7, 2025
LLM Query Scheduling with Prefix Reuse and Latency Constraints