Showing 1–20 of 28 results
/ Date/ Name
Mar 19, 2019A Comparative Study of Different Approaches for Tracking Communities in Evolving Social NetworksJan 9, 2025TreeKV: Smooth Key-Value Cache Compression with Tree StructuresNov 13, 2023Fovea Transformer: Efficient Long-Context Modeling with Structured Fine-to-Coarse AttentionMay 24, 2023Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT OperatorMar 3, 2025WeightedKV: Attention Scores Weighted Key-Value Cache Merging for Large Language ModelsOct 8, 2025AWM: Accurate Weight-Matrix Fingerprint for Large Language ModelsMar 3, 2026CoDAR: Continuous Diffusion Language Models are More Powerful Than You ThinkFeb 3, 2026One Size Does Not Fit All: Token-Wise Adaptive Compression for KV CacheFeb 9, 2026Pretraining with Token-Level Adaptive Latent Chain-of-ThoughtDec 8, 2023Towards Controlled Table-to-Text Generation with Scientific ReasoningMay 14, 2022RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQLMay 1, 2025FreqKV: Key-Value Compression in Frequency Domain for Context Window ExtensionAug 4, 2025Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache EvictionSep 27, 2025PonderLM-2: Pretraining LLM with Latent Thoughts in Continuous SpaceMar 2, 2026PonderLM-3: Adaptive Token-Wise Pondering with Differentiable MaskingFeb 9, 2023Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge MemorizationFeb 24, 2023Adapting Knowledge for Few-shot Table-to-Text GenerationAug 8, 2025Fourier-VLM: Compressing Vision Tokens in the Frequency Domain for Large Vision-Language ModelsJun 17, 2025LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMsJan 30, 2026FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation