Date / Name
Aug 21, 2018 / Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks
Apr 23, 2018 / goSLP: Globally Optimized Superword Level Parallelism Framework
Jul 23, 2024 / SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention
Mar 28, 2023 / Dias: Dynamic Rewriting of Pandas Code
Apr 6, 2025 / Automated Verification of Soundness of DNN Certifiers
Nov 20, 2024 / Transforming the Hybrid Cloud for Emerging AI Workloads
Jun 3, 2025 / PandasBench: A Benchmark for the Pandas API
Oct 8, 2020 / DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates
Jun 27, 2023 / SENSEi: Input-Sensitive Compilation for Accelerating GNNs
Mar 27, 2024 / ConstraintFlow: A DSL for Specification and Verification of Neural Network Analyses
Mar 21, 2026 / MINISA: Minimal Instruction Set Architecture for Next-gen Reconfigurable Inference Accelerator
Nov 21, 2025 / TensorRight: Automated Verification of Tensor Graph Rewrites
Aug 25, 2023 / TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs
May 21, 2023 / Learning Large Graph Property Prediction via Graph Segment Training
Feb 14, 2023 / COMET: Neural Cost Model Explanation Framework
Jun 27, 2023 / FLuRKA: Fast and accurate unified Low-Rank & Kernel Attention
May 31, 2025 / COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning
Mar 27, 2025 / PilotDB: Database-Agnostic Online Approximate Query Processing with A Priori Error Guarantees (Technical Report)
Feb 6, 2026 / RuleFlow: Generating Reusable Program Optimizations with LLMs
Oct 11, 2025 / ACT: Automatically Generating Compiler Backends from Tensor Accelerator ISA Descriptions