"au:"Aditya Desai"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Aditya Desai"" — arXiv2 Search

Showing 1–20 of 22 results

/ Date/ Name

Feb 24, 2021Density Sketches for Sampling and Estimation May 26, 2023Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time Oct 7, 2025vAttention: Verified Sparse Attention Dec 19, 2024HashAttention: Semantic Sparsity for Faster Inference Nov 3, 2023Heterogeneous federated collaborative filtering using FAIR: Federated Averaging in Random Subspaces Aug 4, 2021Random Offset Block Embedding Array (ROBE) for CriteoTB Benchmark MLPerf DLRM Model : 1000$\times$ Compression and 3.1$\times$ Faster Inference Feb 24, 2021Semantically Constrained Memory Allocation (SCMA) for Embedding in Efficient Recommendation Systems Jul 21, 2022Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing Jul 21, 2022The trade-offs of model size in large recommendation models : A 10000 $\times$ compressed criteo-tb DLRM model (100 GB parameters to mere 10MB)Oct 17, 2023In defense of parameter sharing for model-compression Feb 6, 2026SOCKET: SOft Collision Kernel EsTimator for Sparse Attention Oct 29, 2020Active Sampling Count Sketch (ASCS) for Online Sparse Estimation of a Trillion Scale Covariance Matrix Oct 8, 2024Sketch to Adapt: Fine-Tunable Sketches for Efficient LLM Adaptation Oct 7, 2025Barbarians at the Gate: How AI is Upending Systems Research Feb 26, 2021Beyond Convolutions: A Novel Deep Learning Approach for Raw Seismic Data Ingestion Feb 12, 2025The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks Sep 1, 2015Program Synthesis using Natural Language Feb 6, 2025vCache: Verified Semantic Prompt Caching Dec 16, 2025Let the Barbarians In: How AI Can Accelerate Systems Performance Research Jan 2, 2021Smart Car Features using Embedded Systems and IoT