Showing 1–17 of 17 results
/ Date/ Name
Apr 29, 2024Performance-Aligned LLMs for Generating Fast CodeDec 19, 2024HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel LanguagesOct 8, 2016Analogues of the $3x + 1$ Problem in Polynomial Rings of Characteristic 2Dec 17, 2025Optimizing Agentic Language Model Inference via Speculative Tool CallsJun 29, 2023HPC-Coder: Modeling Parallel Programs using Large Language ModelsJan 23, 2024Can Large Language Models Write Parallel Code?Jun 26, 2025ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation TasksOct 20, 2025Integrating Performance Tools in Model Reasoning for GPU Kernel OptimizationNov 9, 2021A Survey and Empirical Evaluation of Parallel Deep Learning FrameworksJul 15, 2025Modeling Code: Is Text All You Need?Apr 13, 2026Record-Remix-Replay: Hierarchical GPU Kernel Optimization using Evolutionary SearchApr 15, 2026LongCoT: Benchmarking Long-Horizon Chain-of-Thought ReasoningNov 23, 2020Integrating Deep Learning in Domain Sciences at ExascaleNov 7, 2025LLMs as Packagers of HPC SoftwareMay 13, 2025Leveraging AI for Productive and Trustworthy HPC Software: Challenges and Research DirectionsApr 3, 2026Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN TrainingJan 23, 2024Automated Programmatic Performance Analysis of Parallel Programs