"au:"Daniel Nichols"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Daniel Nichols"" — arXiv2 Search

Showing 1–17 of 17 results

/ Date/ Name

Apr 29, 2024Performance-Aligned LLMs for Generating Fast Code Dec 19, 2024HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages Oct 8, 2016Analogues of the $3x + 1$ Problem in Polynomial Rings of Characteristic 2 Dec 17, 2025Optimizing Agentic Language Model Inference via Speculative Tool Calls Jun 29, 2023HPC-Coder: Modeling Parallel Programs using Large Language Models Jan 23, 2024Can Large Language Models Write Parallel Code?Jun 26, 2025ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks Oct 20, 2025Integrating Performance Tools in Model Reasoning for GPU Kernel Optimization Nov 9, 2021A Survey and Empirical Evaluation of Parallel Deep Learning Frameworks Jul 15, 2025Modeling Code: Is Text All You Need?Apr 13, 2026Record-Remix-Replay: Hierarchical GPU Kernel Optimization using Evolutionary Search Apr 15, 2026LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning Nov 23, 2020Integrating Deep Learning in Domain Sciences at Exascale Nov 7, 2025LLMs as Packagers of HPC Software May 13, 2025Leveraging AI for Productive and Trustworthy HPC Software: Challenges and Research Directions Apr 3, 2026Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training Jan 23, 2024Automated Programmatic Performance Analysis of Parallel Programs