Showing 1–20 of 22 results
/ Date/ Name
Aug 17, 2023CMB: A Comprehensive Medical Benchmark in ChineseApr 29, 2024MileBench: Benchmarking MLLMs in Long ContextNov 6, 2024Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLMMar 26, 2026Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten MathSep 17, 2024Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMsMar 8, 2025A Survey on Post-training of Large Language ModelsSep 4, 2024LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid ArchitectureSep 21, 2023AceGPT, Localizing Large Language Models in ArabicFeb 19, 2025From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in EducationDec 28, 2024Exploring Compositional Generalization of Multimodal LLMs for Medical ImagingOct 1, 2025CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts GenerationNov 21, 2022TCBERT: A Technical Report for Chinese Topic Classification BERTMar 18, 2025Aligning Multimodal LLM with Human Preference: A SurveyJun 18, 2025Enhancing Vector Quantization with Distributional Matching: A Theoretical and Empirical StudyMar 30, 2026Towards a Medical AI ScientistFeb 10, 2026LiveMedBench: A Contamination-Free Medical Benchmark for LLMs with Automated Rubric EvaluationMay 29, 2025Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied AgentsJul 4, 2025SAMed-2: Selective Memory Enhanced Medical Segment Anything ModelApr 2, 2020R3: A Reading Comprehension Benchmark Requiring Reasoning ProcessesNov 16, 2023HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs