arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Zelei Cheng"" — arXiv2 Search
Showing 1–9 of 9 results
/ Date
/ Name
May 5, 2024
RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation
Sep 15, 2025
Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization
Oct 6, 2023
TrialView: An AI-powered Visual Analytics System for Temporal Event Data in Clinical Trials
Mar 10, 2025
UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality
Feb 6, 2020
Mitigating Query-Flooding Parameter Duplication Attack on Regression Models with High-Dimensional Gaussian Mechanism
Feb 8, 2025
A Survey on Explainable Deep Reinforcement Learning
Apr 9, 2026
Decomposing the Delta: What Do Models Actually Learn from Preference Pairs?
Sep 19, 2025
GPO: Learning from Critical Steps to Improve LLM Reasoning
Oct 18, 2024
Soft-Label Integration for Robust Toxicity Classification