Showing 1–11 of 11 results
/ Date/ Name
Apr 14, 2026Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic ReasoningNov 6, 2025NVIDIA Nemotron Nano V2 VLApr 1, 2020Work in Progress: Temporally Extended Auxiliary TasksJul 26, 2019On Hard Exploration for Reinforcement Learning: a Case Study in PommermanJul 25, 2019Action Guidance with MCTS for Deep Reinforcement LearningJul 24, 2019Terminal Prediction as an Auxiliary Task for Deep Reinforcement LearningJul 22, 2019Agent Modeling as Auxiliary Task for Deep Reinforcement LearningApr 20, 2019Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team CompetitionApr 10, 2019Safer Deep RL with Shallow MCTS: A Case Study in PommermanNov 30, 2018Using Monte Carlo Tree Search as a Demonstrator within Asynchronous Deep RLOct 12, 2018A Survey and Critique of Multiagent Deep Reinforcement Learning