Date / Name
Aug 19, 2023 / DiffusionTrack: Diffusion Model For Multi-Object Tracking
Sep 9, 2024 / MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Jan 8, 2025 / OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis
Oct 15, 2025 / NExT-OMNI: Towards Any-to-Any Omnimodal Foundation Models with Discrete Flow Matching
May 24, 2024 / DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception
Mar 12, 2022 / VariabilityTrack: Multi-Object Tracking with Variable Speed Object Movement
Oct 30, 2024 / IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking
Apr 14, 2025 / GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents
Apr 28, 2025 / VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning
Sep 14, 2023 / VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue
Oct 2, 2024 / PersonaMath: Boosting Mathematical Reasoning via Persona-Driven Data Augmentation
May 26, 2025 / OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction
Dec 15, 2023 / Marathon: A Race Through the Realm of Long Context with Large Language Models
Sep 2, 2025 / Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR
Feb 13, 2026 / Learning Ordinal Probabilistic Reward from Preferences
Aug 6, 2018 / Machine Learning Promoting Extreme Simplification of Spectroscopy Equipment
Jan 26, 2023 / Compact Transformer Tracker with Correlative Masked Modeling
Jul 30, 2024 / Autogenic Language Embedding for Coherent Point Tracking
Jun 25, 2024 / Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
May 28, 2024 / Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models