Showing 461–480 of 1,726 results
/ Date/ Name
Aug 7, 2025MELLA: Bridging Linguistic Capability and Cultural Groundedness for Low-Resource Language MLLMsAug 5, 2025FilBench: Can LLMs Understand and Generate Filipino?Aug 5, 2025VLMQ: Token Saliency-Driven Post-Training Quantization for Vision-language ModelsJul 23, 2025Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your VoiceJul 22, 2025Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical ReportJul 21, 2025BEnchmarking LLMs for Ophthalmology (BELO) for Ophthalmological Knowledge and ReasoningJul 20, 2025RefCritic: Training Long Chain-of-Thought Critic Models with Refinement FeedbackJul 19, 2025Docopilot: Improving Multimodal Models for Document-Level UnderstandingJul 10, 2025Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and MethodologyJul 7, 2025Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic CapabilitiesJul 7, 2025Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document RestorationJul 4, 2025Improving Social Determinants of Health Documentation in French EHRs Using Large Language ModelsJul 2, 2025AI4Research: A Survey of Artificial Intelligence for Scientific ResearchJul 2, 2025Intrinsic Fingerprint of LLMs: Continue Training is NOT All You Need to Steal A Model!Jun 27, 2025The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT ImprovementsJun 27, 2025GenEscape: Hierarchical Multi-Agent Generation of Escape Room PuzzlesJun 24, 2025Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical StudyJun 24, 2025Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio GenerationJun 22, 2025RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic ManipulationJun 22, 2025PP-DocBee2: Improved Baselines with Efficient Data for Multimodal Document Understanding