"au:"Ziwei He"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Ziwei He"" — arXiv2 Search

Showing 1–20 of 28 results

/ Date/ Name

Mar 19, 2019A Comparative Study of Different Approaches for Tracking Communities in Evolving Social Networks Jan 9, 2025TreeKV: Smooth Key-Value Cache Compression with Tree Structures Nov 13, 2023Fovea Transformer: Efficient Long-Context Modeling with Structured Fine-to-Coarse Attention May 24, 2023Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator Mar 3, 2025WeightedKV: Attention Scores Weighted Key-Value Cache Merging for Large Language Models Oct 8, 2025AWM: Accurate Weight-Matrix Fingerprint for Large Language Models Mar 3, 2026CoDAR: Continuous Diffusion Language Models are More Powerful Than You Think Feb 3, 2026One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache Feb 9, 2026Pretraining with Token-Level Adaptive Latent Chain-of-Thought Dec 8, 2023Towards Controlled Table-to-Text Generation with Scientific Reasoning May 14, 2022RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQL May 1, 2025FreqKV: Key-Value Compression in Frequency Domain for Context Window Extension Aug 4, 2025Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction Sep 27, 2025PonderLM-2: Pretraining LLM with Latent Thoughts in Continuous Space Mar 2, 2026PonderLM-3: Adaptive Token-Wise Pondering with Differentiable Masking Feb 9, 2023Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization Feb 24, 2023Adapting Knowledge for Few-shot Table-to-Text Generation Aug 8, 2025Fourier-VLM: Compressing Vision Tokens in the Frequency Domain for Large Vision-Language Models Jun 17, 2025LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs Jan 30, 2026FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation