Showing 141–160 of 249 results
/ Date/ Name
Jul 15, 2021MXDAG: A Hybrid Abstraction for Cluster ApplicationsJul 15, 2021Improving I/O Performance for Exascale Applications through Online Data Layout ReorganizationJul 4, 2021KAISA: An Adaptive Second-Order Optimizer Framework for Deep Neural NetworksJun 11, 2021Bandwidth-Optimal Random Shuffling for GPUsMay 30, 2021Maximizing Parallelism in Distributed Training for Huge Neural NetworksMay 3, 2021Analyzing scientific data sharing patterns for in-network data cachingApr 16, 2021Sync-Switch: Hybrid Parameter Synchronization for Distributed Deep LearningApr 16, 2021Evaluation of Portable Acceleration Solutions for LArTPC Simulation Using Wire-Cell ToolkitApr 12, 2021Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation ModelsApr 11, 2021Shuffler: A Large Scale Data Management Tool for ML in Computer VisionMar 28, 2021MergeComp: A Compression Scheduler for Scalable Communication-Efficient Distributed TrainingMar 16, 2021An Efficient Vectorization Scheme for Stencil ComputationMar 4, 2021Pandemic Drugs at Pandemic Speed: Infrastructure for Accelerating COVID-19 Drug Discovery with Hybrid Machine Learning- and Physics-based Simulations on High Performance ComputersFeb 5, 2021Cache Blocking Technique to Large Scale Quantum Computing Simulation on SupercomputersJan 26, 2021C-for-Metal: High Performance SIMD Programming on Intel GPUsJan 15, 2021SoftNER: Mining Knowledge Graphs From Cloud IncidentsJan 14, 2021Towards Practical Adam: Non-Convexity, Convergence Theory, and Mini-Batch AccelerationNov 22, 2020TaiJi: Longest Chain Availability with BFT Fast ConfirmationNov 2, 2020The Persistence of False Memory: Brain in a Vat Despite Perfect ClocksOct 30, 2020State sharding model on the blockchain