"au:"Yasaman Bahri"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Yasaman Bahri"" — arXiv2 Search

Showing 1–20 of 22 results

/ Date/ Name

Jun 18, 2020Exact posterior distributions of wide Bayesian neural networks Jun 30, 2021The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning Oct 11, 2018Bayesian Deep Convolutional Networks with Many Channels are Gaussian Processes Jun 18, 2020Infinite attention: NNGP and NTK for deep attention networks Feb 18, 2019Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent Mar 11, 2013Detecting Majorana fermions in quasi-one-dimensional topological phases using nonlocal order parameters Feb 23, 2018Sensitivity and Generalization in Neural Networks: an Empirical Study Feb 14, 2025Closed-Form Training Dynamics Reveal Learned Features and Linear Structure in Word2Vec-like Models Feb 16, 2026Symmetry in language statistics shapes the geometry of model representations Mar 5, 2024Quantum Many-Body Physics Calculations with Large Language Models Nov 1, 2017Deep Neural Networks as Gaussian Processes Mar 4, 2020The large learning rate phase of deep learning: the catapult mechanism Jul 15, 2013Localization and topology protected quantum coherence at the edge of 'hot' matter Aug 28, 2014Stable non-Fermi liquid phase of itinerant spin-orbit coupled ferromagnets Feb 12, 2021Explaining Neural Scaling Laws Sep 4, 2023Les Houches Lectures on Deep Learning at Large & Infinite Width May 24, 2025On the Emergence of Linear Analogies in Word Embeddings Jun 9, 2022Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models Jan 29, 2026Context Structure Reshapes the Representational Geometry of Language Models Jun 14, 2018Dynamical Isometry and a Mean Field Theory of CNNs: How to Train 10,000-Layer Vanilla Convolutional Neural Networks