Showing 1–20 of 24 results
/ Date/ Name
Mar 26, 2026Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion ScaleMar 17, 2026Demystifing Video ReasoningFeb 27, 2026AIDABench: AI Data Analytics BenchmarkFeb 23, 2026A Very Big Video Reasoning SuiteOct 31, 2025Phased DMD: Few-step Distribution Matching Distillation via Score Matching within SubintervalsSep 26, 2025MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document ParsingAug 25, 2025InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and EfficiencyAug 7, 2025Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical ValidityJan 14, 2025Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion ModelsOct 16, 2024ProSA: Assessing and Understanding the Prompt Sensitivity of LLMsJul 30, 2024OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniverse Computation BalanceJul 10, 2024VEnhancer: Generative Space-Time Enhancement for Video GenerationJun 12, 2024OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with TextMay 28, 2024DSDL: Data Set Description Language for Bridging Modalities and Tasks in AI DataMay 20, 2024MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics BenchmarkMar 26, 2024InternLM2 Technical ReportDec 21, 2023T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by StepNov 28, 2023HumanGaussian: Text-Driven 3D Human Generation with Gaussian SplattingOct 20, 2023BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn DialoguesMay 22, 2023RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars