Showing 1–11 of 11 results
/ Date/ Name
Nov 1, 2018Rethinking floating point for deep learningDec 24, 2014Fast Convolutional Nets With fbfft: A GPU Performance EvaluationApr 17, 2020Efficient, arbitrarily high precision hardware logarithmic arithmetic for linear algebraFeb 28, 2017Billion-scale similarity search with GPUsAug 9, 2017An evaluation of large-scale methods for image instance and class discoveryJul 7, 2025any4: Learned 4-bit Numeric Representation for LLMsMay 18, 2020Improving the Effectiveness of Traceability Link Recovery using Hierarchical Bayesian NetworksDec 22, 2023Generative AI Beyond LLMs: System Implications of Multi-Modal GenerationJan 16, 2024The Faiss libraryJan 26, 2023GPU-based Private Information Retrieval for On-Device Machine Learning InferenceMay 5, 2024Is Flash Attention Stable?