Showing 61–80 of 249 results
/ Date/ Name
Jan 18, 2025MOFA: Discovering Materials for Carbon Capture with a GenAI- and Simulation-Based WorkflowJan 12, 2025AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous CloudsDec 10, 2024Hydraulis: Balancing Large Transformer Model Training via Co-designing Parallel Strategies and Data AssignmentDec 2, 2024FlexSP: Accelerating Large Language Model Training via Flexible Sequence ParallelismNov 15, 2024Clock Synchronization Is Almost Impossible with Bounded MemoryNov 13, 2024LSH-MoE: Communication-efficient MoE Training via Locality-Sensitive HashingNov 11, 2024Topological Characterization of Stabilizing ConsensusNov 10, 2024Dynamic Resource Manager for Automating Deployments in the Computing ContinuumNov 1, 2024SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compileOct 17, 2024Malleus: Straggler-Resilient Hybrid Parallel Training of Large-scale Models via Malleable Data and Model ParallelizationOct 17, 2024Harnessing Your DRAM and SSD for Sustainable and Accessible LLM Inference with Mixed-Precision and Multi-level CachingOct 15, 2024Accelerating Python Applications with Dask and ProxyStoreSep 24, 2024Flight: A FaaS-Based Framework for Complex and Hierarchical Federated LearningSep 5, 2024Spindle: Efficient Distributed Training of Multi-Task Large Models via Wavefront SchedulingAug 26, 2024Employing Artificial Intelligence to Steer Exascale Workflows with ColmenaAug 13, 2024TaPS: A Performance Evaluation Suite for Task-based Execution FrameworksAug 1, 2024DynamoLLM: Designing LLM Inference Clusters for Performance and Energy EfficiencyJul 26, 2024Optimizing Checkpoint-Restart Mechanisms for HPC with DMTCP in Containers at NERSCJul 21, 2024Secure Web Objects: Building Blocks for Metaverse Interoperability and DecentralizationJul 16, 2024Building AI Agents for Autonomous Clouds: Challenges and Design Principles