Showing 1–20 of 22 results
Date / Name
Mar 26, 2026 / Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale
Mar 10, 2026 / InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing
Dec 19, 2025 / OpenAI GPT-5 System Card
Dec 18, 2025 / PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence
Nov 18, 2025 / ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning
Sep 26, 2025 / MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Aug 25, 2025 / InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Aug 8, 2025 / gpt-oss-120b & gpt-oss-20b Model Card
Jul 7, 2025 / Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
Mar 25, 2025 / LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
Mar 3, 2025 / Building Machine Learning Challenges for Anomaly Detection in Science
Dec 21, 2024 / OpenAI o1 System Card
Oct 16, 2024 / ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs
May 20, 2024 / MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark
Mar 26, 2024 / InternLM2 Technical Report
Feb 8, 2024 / Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts
Dec 21, 2023 / T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step
Oct 20, 2023 / BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues
Dec 28, 2021 / TAGPerson: A Target-Aware Generation Pipeline for Person Re-identification
Aug 14, 2021 / MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding