Showing 1–15 of 15 results
/ Date/ Name
Nov 24, 2018Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware ImplicationsFeb 19, 2015Scalable Bayesian Optimization Using Deep Neural NetworksApr 15, 2016Parallelizing Word2Vec in Shared and Distributed MemoryNov 21, 2015BlackOut: Speeding up Recurrent Neural Network Language Models With Very Large VocabulariesSep 30, 2011Fast Updates on Read-Optimized Databases Using Multi-Core CPUsMar 25, 2015GraphMat: High performance graph analytics made productiveJul 27, 2016PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed ArchitecturesAug 17, 2017Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific DataJan 30, 2026Unveiling the Potential of Quantization with MXFP4: Strategies for Quantization Error ReductionApr 10, 2017Banshee: Bandwidth-Efficient DRAM Caching Via Software/Hardware CooperationJul 8, 2021First-Generation Inference Accelerator Deployment at FacebookMay 2, 2018Glow: Graph Lowering Compiler Techniques for Neural NetworksAug 31, 2017Galactos: Computing the Anisotropic 3-Point Correlation Function for 2 Billion GalaxiesNov 18, 2016Parallelizing Word2Vec in Multi-Core and Many-Core ArchitecturesMay 26, 2021Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale