Date / Name
Sep 4, 2019 / Compositional Embeddings Using Complementary Partitions for Memory-Efficient Recommendation Systems
Jun 5, 2025 / Beyond the Buzz: A Pragmatic Take on Inference Disaggregation
Mar 20, 2020 / Deep Learning Training in Facebook Data Centers: Design of Scale-up and Scale-out Systems
Apr 12, 2021 / Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models
Jul 15, 2017 / Ternary Residual Networks
Sep 12, 2023 / A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale
May 29, 2019 / A Study of BFLOAT16 for Deep Learning Training
May 2, 2017 / Ternary Neural Networks with Fine-Grained Quantization
Jun 2, 2016 / Distributed Hessian-Free Optimization for Deep Neural Network
Nov 12, 2014 / Identification of Helicopter Dynamics based on Flight Data using Nature Inspired Techniques
Nov 16, 2010 / Fast GPGPU Data Rearrangement Kernels using CUDA
May 31, 2019 / Deep Learning Recommendation Model for Personalization and Recommendation Systems
Sep 25, 2019 / Mixed Dimension Embeddings with Application to Memory-Efficient Recommendation Systems
Feb 15, 2018 / A Progressive Batching L-BFGS Method for Machine Learning
Sep 15, 2016 / On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
Oct 26, 2018 / Efficient Distributed Hessian Free Algorithm for Large-scale Empirical Risk Minimization via Accumulating Sample Strategy
Jan 15, 2020 / SEERL: Sample Efficient Ensemble Reinforcement Learning
Jun 6, 2019 / The Architectural Implications of Facebook's DNN-based Personalized Recommendation
Jan 31, 2017 / Mixed Low-precision Deep Learning Inference using Dynamic Fixed Point
Nov 1, 2010 / Fast Histograms using Adaptive CUDA Streams