Date / Name
Dec 18, 2025 / DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Oct 27, 2025 / Lost in Tokenization: Context as the Key to Unlocking Biomolecular Understanding in Scientific LLMs
Sep 20, 2025 / From Uniform to Heterogeneous: Tailoring Policy Optimization to Every Token's Nature
Oct 5, 2025 / Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning
Jun 9, 2025 / GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition
Jan 29, 2026 / MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods
Aug 29, 2025 / Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Oct 30, 2025 / OmniDocLayout: Towards Diverse Document Layout Generation via Coarse-to-Fine LLM Learning
Oct 2, 2024 / DRUPI: Dataset Reduction Using Privileged Information
Mar 7, 2026 / Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training
Mar 16, 2026 / Molecular Identifier Visual Prompt and Verifiable Reinforcement Learning for Chemical Reaction Diagram Parsing
Dec 1, 2025 / TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition
Apr 12, 2026 / Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs
May 17, 2022 / Exploring the Interactive Guidance for Unified and Effective Image Matting
Aug 25, 2023 / MLLM-DataEngine: An Iterative Refinement Approach for MLLM
Apr 28, 2023 / LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model
Nov 21, 2023 / ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
Apr 3, 2024 / SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation
Jan 29, 2024 / InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Jun 4, 2024 / OpenDataLab: Empowering General Artificial Intelligence with Open Datasets