Date / Name
Sep 1, 2025 / LobRA: Multi-tenant Fine-tuning over Heterogeneous Data
Aug 28, 2025 / Fast and Scalable Mixed Precision Euclidean Distance Calculations Using GPU Tensor Cores
Aug 21, 2025 / HFX: Joint Design of Algorithms and Systems for Multi-SLO Serving and Fast Scaling
Aug 12, 2025 / P/D-Device: Disaggregated Large Language Model between Cloud and Devices
Aug 6, 2025 / Dynamic Solutions for Hybrid Quantum-HPC Resource Allocation
Jul 29, 2025 / Collaborative State Machines: A Better Programming Model for the Cloud-Edge-IoT Continuum
Jul 28, 2025 / Advancing Compositional LLM Reasoning with Structured Task Relations in Interactive Multimodal Communications
Jul 10, 2025 / KVFlow: Efficient Prefix Caching for Accelerating LLM-Based Multi-Agent Workflows
Jul 1, 2025 / DynoStore: A wide-area distribution system for the management of data over heterogeneous storage
May 29, 2025 / D-Rex: Heterogeneity-Aware Reliability Framework and Adaptive Algorithms for Distributed Storage
May 8, 2025 / Empowering Scientific Workflows with Federated Agents
May 7, 2025 / HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific Insights
May 7, 2025 / ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling
May 6, 2025 / Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems
Apr 29, 2025 / Hetu v2: A General and Scalable Deep Learning System with Hierarchical and Heterogeneous Single Program Multiple Data Annotations
Apr 11, 2025 / Assessing the Elephant in the Room in Scheduling for Current Hybrid HPC-QC Clusters
Apr 1, 2025 / New Improvements in Solving Large LABS Instances Using Massively Parallelizable Memetic Tabu Search
Mar 23, 2025 / WLB-LLM: Workload-Balanced 4D Parallelism for Large Language Model Training
Mar 17, 2025 / WRATH: Workload Resilience Across Task Hierarchies in Task-based Parallel Programming Frameworks
Jan 22, 2025 / Practical quantum federated learning and its experimental demonstration