Showing 1–20 of 43 results
/ Date/ Name
Sep 17, 2021POAR: Efficient Policy Optimization via Online Abstract State Representation LearningNov 5, 2025Scaling Agent Learning via Experience SynthesisSep 27, 2021Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled ExplorationMar 26, 2025ShieldAgent: Shielding Agents via Verifiable Safety Policy ReasoningOct 3, 2025ARMs: Adaptive Red-Teaming Agent against Multimodal Models with Plug-and-Play AttacksFeb 18, 2024AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question DecompositionMay 6, 2026DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI AgentsOct 5, 2023Safe Reinforcement Learning via Hierarchical Adaptive Chance-Constraint SafeguardsDec 9, 2024SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent ExplanationsJul 5, 2024MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?Jul 17, 2024AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge BasesFeb 3, 2025MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video GenerationOct 16, 2024Preference Optimization with Multi-Sample ComparisonsMar 2, 2026Towards Principled Dataset Distillation: A Spectral Distribution PerspectiveFeb 27, 2024Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation ModelsMay 29, 2025SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language ModelsMar 19, 2025MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation ModelsOct 18, 2024Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language AlignmentApr 27, 2025Anyprefer: An Agentic Framework for Preference Data SynthesisOct 14, 2024MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models