Date / Name
Feb 12, 2026 / CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use
Feb 11, 2026 / Voxtral Realtime
Feb 10, 2026 / Conceptual Design of a Novel Highly Granular Crystal Electromagnetic Calorimeter for Future Higgs Factories
Feb 10, 2026 / Egocentric Bias in Vision-Language Models
Feb 9, 2026 / LLaDA2.1: Speeding Up Text Diffusion via Token Editing
Feb 8, 2026 / MIND: Benchmarking Memory Consistency and Action Control in World Models
Feb 6, 2026 / The Non-Eruptive Reconfiguration of a Quiescent Filament After a Nearby Active Region Emergence
Feb 3, 2026 / Minimizing Makespan in Sublinear Time via Weighted Random Sampling
Feb 3, 2026 / Variational and Monte Carlo Methods for Bayesian Inversion of Dynamic Subsurface Flow Simulations Using Seismic and Fluid Pressure Data
Jan 30, 2026 / A unified framework for hot accretion flows with finite angular momentum: from Bondi-like to disc-like regimes
Jan 30, 2026 / Inference-time Alignment via Sparse Junction Steering
Jan 29, 2026 / HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing
Jan 28, 2026 / Thermal emission spectra of the ultra-hot Jupiter WASP-33 b
Jan 27, 2026 / Cross-Session Decoding of Neural Spiking Data via Task-Conditioned Latent Alignment
Jan 23, 2026 / Longitudinal Dynamics of Large and Small Systems from a 3D Bayesian Calibration of RHIC Top-energy Collision Data
Jan 20, 2026 / Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow
Jan 19, 2026 / Locating the missing large-scale emission in the jet of M87* with short EHT baselines
Jan 19, 2026 / A Benchmark for Language Models in Real-World System Building
Jan 19, 2026 / Unleashing Efficient Asynchronous RL Post-Training via Staleness-Constrained Rollout Coordination
Jan 17, 2026 / Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces