Showing 1–20 of 23 results
/ Date/ Name
Mar 9, 2026Revealing Behavioral Plasticity in Large Language Models: A Token-Conditional PerspectiveMar 9, 2026Enhancing Cross-View UAV Geolocalization via LVLM-Driven Relational ModelingDec 15, 2025Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation ModelNov 25, 2025Soft Adaptive Policy OptimizationOct 16, 2025Qwen3Guard Technical ReportSep 22, 2025Qwen3-Omni Technical ReportAug 19, 2025Polarization-Resolved Chlorophyll Imaging for Non-Invasive Plant Tissue Assessment Using a Silicon-Rich Nitride Metalens ArrayJul 20, 2025RefCritic: Training Long Chain-of-Thought Critic Models with Refinement FeedbackJul 11, 2025Photonic bandgap properties of hyperuniform systems self-assembled in a microfluidic channelMay 31, 2025A Foundation Model for Non-Destructive Defect Identification from Vibrational SpectraMay 15, 2025WorldPM: Scaling Human Preference ModelingJan 2, 2025CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo RatingsOct 28, 2024Transferable Post-training via Inverse Value LearningSep 18, 2024Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-ImprovementAug 20, 2024Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language ModelJun 19, 2024Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language ModelsJun 3, 2024Towards Scalable Automated Alignment of LLMs: A SurveyMay 28, 2024Online Merging Optimizers for Boosting Rewards and Mitigating Tax in AlignmentFeb 27, 2024SoFA: Shielded On-the-fly Alignment via Priority Rule FollowingOct 4, 2023Quantifying and mitigating the impact of label errors on model disparity metrics