Showing 41–60 of 89 results
/ Date/ Name
Nov 26, 2024StableAnimator: High-Quality Identity-Preserving Human Image AnimationMay 30, 2025ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RLOct 5, 2025Video-in-the-Loop: Span-Grounded Long Video QA with Interleaved ReasoningMar 18, 2025HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environments with Dynamic Multi-Human InteractionsFeb 9, 2026ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow DistillationMar 3, 2026Size Scaling Law for Radiation Losses of Modes in Photonic Crystal Surface Emitting DevicesFeb 12, 2026Towards On-Policy SFT: Distribution Discriminant Theory and its Applications in LLM TrainingMar 23, 2024An edge detection-based deep learning approach for tear meniscus height measurementFeb 21, 2023Parallel Sentence-Level Explanation Generation for Real-World Low-Resource ScenariosNov 30, 2023VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion ModelsJan 5, 2023All in Tokens: Unifying Output Space of Visual Tasks via Soft TokenApr 5, 2023ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic RulesNov 24, 2020Temporal Action Detection with Multi-level SupervisionDec 5, 2024MageBench: Bridging Large Multimodal Models to AgentsDec 14, 2024UCDR-Adapter: Exploring Adaptation of Pre-Trained Vision-Language Models for Universal Cross-Domain RetrievalSep 29, 2025InfoAgent: Advancing Autonomous Information-Seeking AgentsMar 14, 2025HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language ModelsJul 24, 2025Information Entropy-Based Framework for Quantifying Tortuosity in Meibomian Gland Uneven AtrophyApr 23, 2025Subject-driven Video Generation via Disentangled Identity and MotionMar 12, 2026FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance