Date / Name
Jan 13, 2026 / LLMs as Assessors: Right for the Right Reason?
Jan 13, 2026 / Reverse Flow Matching: A Unified Framework for Online Reinforcement Learning with Diffusion and Flow Policies
Jan 9, 2026 / Transformer Is Inherently a Causal Learner
Jan 8, 2026 / Optimal Lower Bounds for Online Multicalibration
Jan 8, 2026 / GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Jan 7, 2026 / Unlocking the Pre-Trained Model as a Dual-Alignment Calibrator for Post-Trained LLMs
Jan 5, 2026 / DeeperBrain: A Neuro-Grounded EEG Foundation Model Towards Universal BCI
Dec 30, 2025 / Fast reconstruction-based ROI triggering via anomaly detection in the CYGNO optical TPC
Dec 29, 2025 / Yggdrasil: Bridging Dynamic Speculation and Static Runtime for Latency-Optimal Tree-Based LLM Decoding
Dec 28, 2025 / The Reward Model Selection Crisis in Personalized Alignment
Dec 24, 2025 / Parallel Token Prediction for Language Models
Dec 23, 2025 / CHAMMI-75: Pre-training multi-channel models with heterogeneous microscopy images
Dec 23, 2025 / Context-Sensitive Abstractions for Reinforcement Learning with Parameterized Actions
Dec 23, 2025 / TS-Arena -- A Live Forecast Pre-Registration Platform
Dec 20, 2025 / Conscious Data Contribution via Community-Driven Chain-of-Thought Distillation
Dec 18, 2025 / Few-Shot Specific Emitter Identification via Integrated Complex Variational Mode Decomposition and Spatial Attention Transfer
Dec 18, 2025 / Interpretable Deep Learning for Stock Returns: A Consensus-Bottleneck Asset Pricing Model
Dec 17, 2025 / FlowBind: Efficient Any-to-Any Generation with Bidirectional Flows
Dec 16, 2025 / EXAONE Path 2.5: Pathology Foundation Model with Multi-Omics Alignment
Dec 15, 2025 / Olmo 3