Showing 1–20 of 23 results
/ Date/ Name
Oct 17, 2024SeerAttention: Learning Intrinsic Sparse Attention in Your LLMsJun 10, 2025SeerAttention-R: Sparse Attention Adaptation for Long ReasoningJan 23, 2021Contrastive Prototype Learning with Augmented Embeddings for Few-Shot LearningJan 11, 2024A Composable Dynamic Sparse Dataflow Architecture for Efficient Event-based Vision Processing on FPGAApr 22, 2024Co-designing a Sub-millisecond Latency Event-based Eye Tracking System with Submanifold Sparse CNNApr 17, 2024Event-Based Eye Tracking. AIS 2024 Challenge SurveyFeb 19, 2020Algorithm-hardware Co-design for Deformable ConvolutionApr 26, 2021HAO: Hardware-aware neural Architecture Optimization for Efficient InferenceApr 15, 2022COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal RetrievalMar 26, 2022A Roadmap for Big ModelFeb 24, 2023DyBit: Dynamic Bit-Precision Numbers for Efficient Quantized Neural Network InferenceNov 16, 2024Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of ExpertsMar 11, 2021WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-TrainingOct 27, 2021Towards artificial general intelligence via a multimodal foundation modelNov 6, 2024TATAA: Programmable Mixed-Precision Transformer Acceleration with a Transformable Arithmetic ArchitectureJun 4, 2025Rectified Sparse AttentionJan 6, 2026MiMo-V2-Flash Technical ReportDec 14, 2023Random resistive memory-based deep extreme point learning machine for unified visual processingFeb 3, 2026HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache SharingFeb 24, 2026TOM: A Ternary Read-only Memory Accelerator for LLM-powered Edge Intelligence