"au:"WenGuang Chen"" — arXiv2 Search

Showing 1–20 of 28 results

Jan 18, 2021 - DFOGraph: An I/O- and Communication-Efficient System for Distributed Fully-out-of-Core Graph Processing
Mar 17, 2022 - Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory
Jan 1, 2025 - Decoupling Knowledge and Reasoning in Transformers: A Modular Architecture with Generalized Cross-Attention
Apr 21, 2021 - GraphTheta: A Distributed Graph Neural Network Learning System With Flexible Training Strategy
Mar 17, 2025 - A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
Aug 2, 2022 - Toward 6G TK$μ$ Extreme Connectivity: Architecture, Key Technologies and Experiments
Sep 20, 2020 - TADOC: Text Analytics Directly on Compression
Oct 13, 2019 - LiveGraph: A Transactional Graph Storage System with Purely Sequential Adjacency List Scans
Aug 17, 2020 - AIPerf: Automated machine learning as an AI-HPC benchmark
Sep 10, 2024 - KAG: Boosting LLMs in Professional Domains via Knowledge Augmented Generation
Nov 17, 2025 - Larger Datasets Can Be Repeated More: A Theoretical Analysis of Multi-Epoch Scaling in Linear Regression
Nov 24, 2025 - How Learning Rate Decay Wastes Your Best Data in Curriculum-Based LLM Pretraining
Apr 28, 2022 - Mat2Stencil: A Modular Matrix-Based DSL for Explicit and Implicit Matrix-Free PDE Solvers on Structured Grid
Nov 15, 2017 - Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler
Jul 24, 2023 - PUMA: Secure Inference of LLaMA-7B in Five Minutes
Mar 7, 2026 - Making LLMs Optimize Multi-Scenario CUDA Kernels Like Experts
Apr 2, 2020 - RisGraph: A Real-Time Streaming System for Evolving Graphs to Support Sub-millisecond Per-update Analysis at Millions Ops/s
Jul 20, 2022 - Mixed-Precision Inference Quantization: Radically Towards Faster inference speed, Lower Storage requirement, and Lower Loss
Oct 29, 2015 - WarpLDA: a Cache Efficient O(1) Algorithm for Latent Dirichlet Allocation
Oct 8, 2016 - SaberLDA: Sparsity-Aware Learning of Topic Models on GPUs