Showing 1–20 of 46 results
/ Date/ Name
Dec 1, 2025AlignVid: Training-Free Attention Scaling for Semantic Fidelity in Text-Guided Image-to-Video GenerationOct 1, 2025CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts GenerationMar 23, 2026Manifold-Aware Exploration for Reinforcement Learning in Video GenerationOct 7, 2025Deforming Videos to Masks: Flow Matching for Referring Video SegmentationMay 6, 2026D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion ModelsSep 29, 2022Make-A-Video: Text-to-Video Generation without Text-Video DataJul 29, 2024Model Agnostic Hybrid Sharding For Heterogeneous Distributed InferenceDec 3, 2024Beyond Generation: Unlocking Universal Editing via Self-Supervised Fine-TuningJun 18, 2025Enhancing Vector Quantization with Distributional Matching: A Theoretical and Empirical StudyJun 5, 2025When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and UnderstandingDec 9, 2025OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and ManipulationMar 16, 2026AC-Foley: Reference-Audio-Guided Video-to-Audio Synthesis with Acoustic TransferMay 5, 2026AHPA: Adaptive Hierarchical Prior Alignment for Diffusion TransformersJun 29, 2022RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution RobustnessMar 21, 2024AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing TasksOct 28, 2024Meta-Learning for Speeding Up Large Model Inference in Decentralized EnvironmentsNov 3, 2025Enhancing Diffusion-based Restoration Models via Difficulty-Adaptive Reinforcement Learning with IQA RewardDec 3, 2024VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual interventionJun 15, 2024Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering IncorrectlyMar 19, 2025Temporal Regularization Makes Your Video Generator Stronger