"au:"Devansh Arpit"" — arXiv2 Search

Showing 1–20 of 32 results

Date / Name

Dec 11, 2025 / TPV: Parameter Perturbations Through the Lens of Test Prediction Variance
Jan 19, 2024 / Causal Layering via Conditional Entropy
Jan 15, 2024 / Editing Arbitrary Propositions in LLMs without Subject Labels
Aug 11, 2023 / BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Aug 4, 2023 / Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Jul 18, 2023 / REX: Rapid Exploration and eXploitation for AI Agents
Mar 10, 2023 / On the Unlikelihood of D-Separation
Jan 25, 2023 / Salesforce CausalAI Library: A Fast and Scalable Framework for Causal Analysis of Time Series and Tabular Data
Oct 21, 2021 / Ensemble of Averages: Improving Model Selection and Boosting Performance in Domain Generalization
Oct 19, 2021 / Momentum Contrastive Autoencoder: Using Contrastive Learning for Latent Space Distribution Matching in WAE
Oct 19, 2021 / Learning Rich Nearest Neighbor Representations from Self-supervised Ensembles
Sep 20, 2021 / Merlion: A Machine Learning Library for Time Series
Dec 28, 2020 / Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization
Feb 21, 2020 / The Break-Even Point on Optimization Trajectories of Deep Neural Networks
Feb 20, 2020 / Neural Bayes: A Generic Parameterization Method for Unsupervised Representation Learning
Oct 1, 2019 / Predicting with High Correlation Features
Jun 5, 2019 / How to Initialize your Network? Robust Initialization for WeightNorm & ResNets
Jan 11, 2019 / The Benefits of Over-parameterization at Initialization in Deep ReLU Networks
Oct 6, 2018 / h-detach: Modifying the LSTM Gradient Towards Better Optimization
Jun 22, 2018 / On the Spectral Bias of Neural Networks