"au:"Mingu Lee"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Mingu Lee"" — arXiv2 Search

Showing 1–20 of 24 results

/ Date/ Name

Oct 11, 2019Query-by-example on-device keyword spotting Oct 10, 2019Orthogonality Constrained Multi-Head Attention For Keyword Spotting Apr 13, 2024On Speculative Decoding for Multimodal Large Language Models Mar 27, 2025An NLP-Driven Approach Using Twitter Data for Tailored K-pop Artist Recommendations Feb 5, 2026Double-P: Hierarchical Top-P Sparse Attention for Long-Context LLMs Jul 11, 2024What to Say and When to Say it: Live Fitness Coaching as a Testbed for Situated Interaction Aug 16, 2023Painter: Teaching Auto-regressive Language Models to Draw Sketches Jun 6, 2023Deductive Verification of Chain-of-Thought Reasoning Apr 2, 2024HyperCLOVA X Technical Report Nov 1, 2023Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem Solving Oct 24, 2024AdaEDL: Early Draft Stopping for Speculative Decoding of Large Language Models via an Entropy-based Lower Bound on Token Acceptance Probability Jan 30, 2026Fast Forward: Accelerating LLM Prefill with Predictive FFN Sparsity Feb 21, 2024Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement Jun 13, 2024ToSA: Token Selective Attention for Efficient Vision Transformers Jun 30, 2023Look, Remember and Reason: Grounded reasoning in videos with language models Jun 28, 2025VOCABTRIM: Vocabulary Pruning for Efficient Speculative Decoding in LLMs Feb 9, 2026QUOKA: Query-Oriented KV Selection For Efficient LLM Prefill Mar 18, 2026Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing Mar 9, 2026ConFu: Contemplate the Future for Better Speculative Sampling Feb 29, 2024Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs