Showing 1–13 of 13 results
/ Date/ Name
Mar 26, 2026Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion ScaleJan 29, 2026MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric MethodsNov 18, 2025ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific ReasoningNov 4, 2025RxnCaption: Reformulating Reaction Diagram Parsing as Visual Prompt Guided CaptioningSep 26, 2025MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document ParsingAug 25, 2025InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and EfficiencyJun 12, 2024OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with TextMay 28, 2024DSDL: Data Set Description Language for Bridging Modalities and Tasks in AI DataMar 26, 2024InternLM2 Technical ReportFeb 29, 2024WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext DatasetFeb 8, 2024SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language ModelsApr 28, 2023LLaMA-Adapter V2: Parameter-Efficient Visual Instruction ModelNov 16, 2021INTERN: A New Learning Paradigm Towards General Vision