"au:"Anirudh Goyal"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Anirudh Goyal"" — arXiv2 Search

Showing 41–60 of 102 results

/ Date/ Name

Dec 18, 2024A Systematic Examination of Preference Learning through the Lens of Instruction-Following Aug 27, 2024Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning Jul 13, 2020S2RMs: Spatially Structured Recurrent Modules Oct 1, 2025Rethinking Thinking Tokens: LLMs as Improvement Operators Dec 19, 2024Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning Nov 1, 2022Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning Mar 21, 2022Test-time Adaptation with Slot-Centric Models Mar 5, 2019Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future May 26, 2019State-Reification Networks: Improving Generalization by Modeling the Distribution of Hidden Representations Jul 2, 2019Learning the Arrow of Time Jul 6, 2021Discrete-Valued Neural Communication Jul 2, 2021Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning Apr 2, 2018Recall Traces: Backtracking Models for Efficient Reinforcement Learning Nov 13, 2017ACtuAL: Actor-Critic Under Adversarial Learning Sep 16, 2025Metacognitive Reuse: Turning Recurring LLM Reasoning Into Concise Behaviors Feb 21, 2025Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs Jun 14, 2021Variational Causal Networks: Approximate Bayesian Inference over Causal Structures Feb 22, 2021Towards Causal Representation Learning Mar 2, 2025Unnatural Languages Are Not Bugs but Features for LLMs Mar 1, 2026Alien Science: Sampling Coherent but Cognitively Unavailable Research Directions from Idea Atoms

← Previous Next →