"au:"Zhao"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Zhao"" — arXiv2 Search

Showing 121–140 of 1,353 results

/ Date/ Name

Feb 12, 2026CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use Feb 11, 2026Voxtral Realtime Feb 10, 2026Conceptual Design of a Novel Highly Granular Crystal Electromagnetic Calorimeter for Future Higgs Factories Feb 10, 2026Egocentric Bias in Vision-Language Models Feb 9, 2026LLaDA2.1: Speeding Up Text Diffusion via Token Editing Feb 8, 2026MIND: Benchmarking Memory Consistency and Action Control in World Models Feb 6, 2026The Non-Eruptive Reconfiguration of a Quiescent Filament After a Nearby Active Region Emergence Feb 3, 2026Minimizing Makespan in Sublinear Time via Weighted Random Sampling Feb 3, 2026Variational and Monte Carlo Methods for Bayesian Inversion of Dynamic Subsurface Flow Simulations Using Seismic and Fluid Pressure Data Jan 30, 2026A unified framework for hot accretion flows with finite angular momentum: from Bondi-like to disc-like regimes Jan 30, 2026Inference-time Alignment via Sparse Junction Steering Jan 29, 2026HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing Jan 28, 2026Thermal emission spectra of the ultra-hot Jupiter WASP-33 b Jan 27, 2026Cross-Session Decoding of Neural Spiking Data via Task-Conditioned Latent Alignment Jan 23, 2026Longitudinal Dynamics of Large and Small Systems from a 3D Bayesian Calibration of RHIC Top-energy Collision Data Jan 20, 2026Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow Jan 19, 2026Locating the missing large-scale emission in the jet of M87* with short EHT baselines Jan 19, 2026A Benchmark for Language Models in Real-World System Building Jan 19, 2026Unleashing Efficient Asynchronous RL Post-Training via Staleness-Constrained Rollout Coordination Jan 17, 2026Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

← Previous Next →