Showing 21–32 of 32 results
Date | Name
Jun 3, 2025 | APEX: Asynchronous Parallel CPU-GPU Execution for Online LLM Inference on Constrained GPUs
Dec 18, 2025 | Taming the Memory Footprint Crisis: System Design for Production Diffusion LLM Serving
Apr 8, 2026 | ConfigSpec: Profiling-Based Configuration Selection for Distributed Edge–Cloud Speculative LLM Serving
Apr 3, 2015 | ALEA: Fine-grain Energy Profiling with Basic Block Sampling
Oct 27, 2017 | Power Modelling for Heterogeneous Cloud-Edge Data Centers
May 13, 2016 | Energy Optimization of Memory Intensive Parallel Workloads
Dec 30, 2014 | Methods and Metrics for Fair Server Assessment under Real-Time Financial Workloads
Jun 14, 2016 | BDDT-SCC: A Task-parallel Runtime for Non Cache-Coherent Multicores
Sep 12, 2017 | ENORM: A Framework for Edge NOde Resource Management
Nov 14, 2025 | DiffPro: Joint Timestep and Layer-Wise Precision Optimization for Efficient Diffusion Inference
May 6, 2025 | MARCO: Multi-Agent Code Optimization with Real-Time Knowledge Integration for High-Performance Computing
Jan 15, 2026 | WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching