Showing 1–20 of 55 results
Date / Name
Oct 24, 2022 / NASA: Neural Architecture Search and Acceleration for Hardware Inspired Hybrid Networks
Oct 14, 2022 / An Efficient FPGA Accelerator for Point Cloud
Feb 3, 2023 / PDPU: An Open-Source Posit Dot-Product Unit for Deep Learning Applications
Dec 12, 2018 / An Active-Passive Measurement Study of TCP Performance over LTE on High-speed Rails
Sep 21, 2024 / SPEED: A Scalable RISC-V Vector Processor Enabling Efficient Multi-Precision DNN Inference
Sep 26, 2024 / Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores
May 25, 2025 / Enable Lightweight and Precision-Scalable Posit/IEEE-754 Arithmetic in RISC-V Cores for Transprecision Computing
Jul 16, 2024 / Co-Designing Binarized Transformer and Hardware Accelerator for Efficient End-to-End Edge Deployment
May 25, 2025 / FastMamba: A High-Speed and Efficient Mamba Accelerator on FPGA with Accurate Quantization
Nov 3, 2025 / Memory-Efficient Training with In-Place FFT Implementation
Nov 21, 2024 / TaQ-DiT: Time-aware Quantization for Diffusion Transformers
Sep 18, 2020 / Hardware Accelerator for Multi-Head Attention and Position-Wise Feed-Forward in the Transformer
Sep 6, 2019 / Training Deep Neural Networks Using Posit Number System
Jun 24, 2021 / Transform-Based Feature Map Compression for CNN Inference
May 8, 2019 / A Hardware-Oriented and Memory-Efficient Method for CTC Decoding
Oct 18, 2022 / Accelerate Three-Dimensional Generative Adversarial Networks Using Fast Algorithm
Aug 16, 2023 / S2R: Exploring a Double-Win Transformer-Based Framework for Ideal and Blind Super-Resolution
Apr 11, 2012 / Reduced-Complexity Column-Layered Decoding and Implementation for LDPC Codes
Sep 19, 2024 / A High-Throughput Hardware Accelerator for Lempel-Ziv 4 Compression Algorithm
May 6, 2024 / Trio-ViT: Post-Training Quantization and Acceleration for Softmax-Free Efficient Vision Transformer