Showing 41–60 of 102 results
Date          Name
Dec 18, 2024  A Systematic Examination of Preference Learning through the Lens of Instruction-Following
Aug 27, 2024  Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning
Jul 13, 2020  S2RMs: Spatially Structured Recurrent Modules
Oct 1, 2025   Rethinking Thinking Tokens: LLMs as Improvement Operators
Dec 19, 2024  Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning
Nov 1, 2022   Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning
Mar 21, 2022  Test-time Adaptation with Slot-Centric Models
Mar 5, 2019   Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future
May 26, 2019  State-Reification Networks: Improving Generalization by Modeling the Distribution of Hidden Representations
Jul 2, 2019   Learning the Arrow of Time
Jul 6, 2021   Discrete-Valued Neural Communication
Jul 2, 2021   Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
Apr 2, 2018   Recall Traces: Backtracking Models for Efficient Reinforcement Learning
Nov 13, 2017  ACtuAL: Actor-Critic Under Adversarial Learning
Sep 16, 2025  Metacognitive Reuse: Turning Recurring LLM Reasoning Into Concise Behaviors
Feb 21, 2025  Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs
Jun 14, 2021  Variational Causal Networks: Approximate Bayesian Inference over Causal Structures
Feb 22, 2021  Towards Causal Representation Learning
Mar 2, 2025   Unnatural Languages Are Not Bugs but Features for LLMs
Mar 1, 2026   Alien Science: Sampling Coherent but Cognitively Unavailable Research Directions from Idea Atoms