"au:"Tianle Cai"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Tianle Cai"" — arXiv2 Search

Showing 1–20 of 42 results

/ Date/ Name

May 26, 2023Large Language Models as Tool Makers Jan 19, 2024Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Jun 3, 2019Adversarially Robust Generalization Just Requires More Unlabeled Data Sep 7, 2020GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training Feb 22, 2021A Theory of Label Propagation for Subpopulation Shift May 28, 2019Gram-Gauss-Newton Method: Learning Overparameterized Neural Networks for Regression Problems Sep 22, 2020Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot Oct 17, 2022What Makes Convolutional Models Great on Long Sequence Modeling?Jul 19, 2022Is Vertical Logistic Regression Privacy-Preserving? A Comprehensive Privacy Analysis and Beyond Nov 14, 2023REST: Retrieval-Based Speculative Decoding Jul 29, 2024FlexAttention for Efficient High-Resolution Vision-Language Models Feb 29, 2024DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Nov 7, 2024SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models May 28, 2023Reward Collapse in Aligning Large Language Models Jul 8, 2025A Survey on Latent Reasoning Oct 29, 2025Scaling Latent Reasoning via Looped Language Models Jun 1, 2020Locally Differentially Private (Contextual) Bandits Learning Jun 15, 2021First Place Solution of KDD Cup 2021 & OGB Large-Scale Challenge Graph Prediction Track Apr 11, 2024JetMoE: Reaching Llama2 Performance with 0.1M Dollars Jul 5, 2023Scaling In-Context Demonstrations with Structured Attention