Showing 1–20 of 42 results
Date / Name
May 26, 2023 / Large Language Models as Tool Makers
Jan 19, 2024 / Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Jun 3, 2019 / Adversarially Robust Generalization Just Requires More Unlabeled Data
Sep 7, 2020 / GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training
Feb 22, 2021 / A Theory of Label Propagation for Subpopulation Shift
May 28, 2019 / Gram-Gauss-Newton Method: Learning Overparameterized Neural Networks for Regression Problems
Sep 22, 2020 / Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot
Oct 17, 2022 / What Makes Convolutional Models Great on Long Sequence Modeling?
Jul 19, 2022 / Is Vertical Logistic Regression Privacy-Preserving? A Comprehensive Privacy Analysis and Beyond
Nov 14, 2023 / REST: Retrieval-Based Speculative Decoding
Jul 29, 2024 / FlexAttention for Efficient High-Resolution Vision-Language Models
Feb 29, 2024 / DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Nov 7, 2024 / SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
May 28, 2023 / Reward Collapse in Aligning Large Language Models
Jul 8, 2025 / A Survey on Latent Reasoning
Oct 29, 2025 / Scaling Latent Reasoning via Looped Language Models
Jun 1, 2020 / Locally Differentially Private (Contextual) Bandits Learning
Jun 15, 2021 / First Place Solution of KDD Cup 2021 & OGB Large-Scale Challenge Graph Prediction Track
Apr 11, 2024 / JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Jul 5, 2023 / Scaling In-Context Demonstrations with Structured Attention