Showing 1–20 of 23 results
/ Date/ Name
Feb 3, 2023Learning a Fourier Transform for Linear Relative Positional Encodings in TransformersApr 11, 2024Auctions with LLM SummariesMay 30, 2017Recurrent Estimation of DistributionsOct 4, 2024Linear Transformer Topological Masking with Graph Random FeaturesNov 13, 2018Discourse in Multimedia: A Case Study in Information ExtractionMay 29, 2017Contextual Explanation NetworksOct 14, 2024Optimal Time Complexity Algorithms for Computing General Random Walk Graph Kernels on Sparse GraphsFeb 2, 2023Efficient Graph Field Integrators Meet Point CloudsOct 22, 2020Scalable Hierarchical Agglomerative ClusteringJun 22, 2024Fast Tree-Field Integrators: From Low Displacement Rank to Topological TransformersJul 22, 2024Conditional Language Policy: A General Framework for Steerable Multi-Objective FinetuningOct 13, 2024EUGens: Efficient, Unified, and General Dense LayersOct 15, 2021On Learning the Transformer KernelOct 20, 2023Scalable Neural Network KernelsNov 29, 2012Exact and Efficient Parallel Inference for Nonparametric Mixture ModelsFeb 4, 2025Learning the RoPEs: Better 2D and 3D Position Encodings with STRINGFeb 1, 2023FAVOR#: Sharp Attention Kernel Approximations via New Classes of Positive Random FeaturesAug 22, 2012A non-parametric mixture model for topic modeling over timeJul 28, 2020Big Bird: Transformers for Longer SequencesJun 29, 2015Bayesian Nonparametric Kernel-Learning