arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Sukhan Lee"" — arXiv2 Search
Showing 1–6 of 6 results
/ Date
/ Name
Dec 14, 2024
Accelerating Retrieval-Augmented Generation
Jul 8, 2025
Per-Row Activation Counting on Real Hardware: Demystifying Performance Overheads
Sep 2, 2024
Duplex: A Device for Large Language Models with Mixture of Experts, Grouped Query Attention, and Continuous Batching
Aug 15, 2022
Online 3D Bin Packing Reinforcement Learning Solution with Buffer
Nov 9, 2021
CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional Autoencoders
Mar 10, 2026
PIM-SHERPA: Software Method for On-device LLM Inference by Resolving PIM Memory Attribute and Layout Inconsistencies