Date          Name
Mar 9, 2026   Revealing Behavioral Plasticity in Large Language Models: A Token-Conditional Perspective
Feb 3, 2026   UNIKIE-BENCH: Benchmarking Large Multimodal Models for Key Information Extraction in Visual Documents
Jan 29, 2026  Qwen3-ASR Technical Report
Jan 22, 2026  Qwen3-TTS Technical Report
Nov 25, 2025  Soft Adaptive Policy Optimization
Oct 16, 2025  Qwen3Guard Technical Report
Sep 22, 2025  Qwen3-Omni Technical Report
Jul 20, 2025  RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback
May 15, 2025  WorldPM: Scaling Human Preference Modeling
Jan 2, 2025   CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
Sep 18, 2024  Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement