Showing 1–20 of 33 results
/ Date/ Name
Jul 26, 2007Measurement of the Fermi Constant by FASTSep 24, 2020A Gradient Flow Framework For Analyzing Network PruningMay 13, 2024Train Faster, Perform Better: Modular Adaptive Training in Over-Parameterized ModelsOct 13, 2023Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic TaskJun 10, 2021Beyond BatchNorm: Towards a Unified Understanding of Normalization in Deep LearningMay 3, 2021MemX: An Attention-Aware Smart Eyewear System for Personalized Moment Auto-captureJan 24, 2022Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear DevicesNov 21, 2023Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable TasksNov 15, 2022Mechanistic Mode ConnectivityMar 19, 2014Performance Analysis of Location Profile RoutingFeb 12, 2002Precise Measurement of Muon Capture on the ProtonFeb 13, 2026Multi-Head Attention as a Source of Catastrophic Forgetting in MoE TransformersFeb 13, 2026SD-MoE: Spectral Decomposition for Effective Expert SpecializationSep 10, 2020OrthoReg: Robust Network Pruning Using Orthonormality RegularizationOct 26, 2023In-Context Learning Dynamics with Random Binary SequencesMay 23, 2022Orchestra: Unsupervised Federated Learning via Globally Consistent ClusteringFeb 18, 2021HVAQ: A High-Resolution Vision-Based Air Quality DatasetApr 9, 2021A Reinforcement-Learning-Based Energy-Efficient Framework for Multi-Task Video Analytics PipelineMar 24, 2014The Mason Test: A Defense Against Sybil Attacks in Wireless Networks Without Trusted AuthoritiesFeb 4, 2021How do Quadratic Regularizers Prevent Catastrophic Forgetting: The Role of Interpolation