Showing 1–20 of 29 results
Date / Name
Oct 19, 2022 / On the Adversarial Robustness of Mixture of Experts
Jun 10, 2021 / Scaling Vision with Sparse Mixture of Experts
Sep 15, 2023 / Scaling Laws for Sparsely-Connected Foundation Models
Jan 29, 2024 / Routers in Vision Mixture of Experts: An Empirical Study
Jun 6, 2022 / Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts
Mar 2, 2017 / Active Learning for Accurate Estimation of Linear Models
Feb 9, 2016 / Online Active Linear Regression via Thresholding
Feb 27, 2024 / Stable LM 2 1.6B Technical Report
Oct 14, 2020 / Deep Ensembles for Low-Data Transfer Learning
Sep 28, 2020 / Scalable Transfer Learning with Expert Models
Oct 13, 2020 / Which Model to Transfer? Finding the Needle in the Growing Haystack
Aug 2, 2023 / From Sparse to Soft Mixtures of Experts
Feb 26, 2018 / Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
Feb 27, 2014 / Learning multifractal structure in large networks
Feb 24, 2022 / Learning to Merge Tokens in Vision Transformers
Mar 1, 2017 / Human Interaction with Recommendation Systems
Sep 14, 2022 / PaLI: A Jointly-Scaled Multilingual Language-Image Model
Oct 7, 2021 / Sparse MoEs meet Efficient Ensembles
May 29, 2023 / PaLI-X: On Scaling up a Multilingual Vision and Language Model
Feb 10, 2023 / Scaling Vision Transformers to 22 Billion Parameters