Showing 1–20 of 24 results
/ Date/ Name
Oct 24, 2023CVPR 2023 Text Guided Video Editing CompetitionDec 22, 2022Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video GenerationMar 3, 2025Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion ModelsJan 15, 2024Towards A Better Metric for Text-to-Video GenerationJun 1, 2022Label-Efficient Online Continual Object Detection in Streaming VideoApr 24, 2025VEU-Bench: Towards Comprehensive Understanding of Video EditingMar 10, 2024Reframe Anything: LLM Agent for Open World Video ReframingMar 18, 2025Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal ControlOct 2, 2025OpusAnimation: Code-Based Dynamic Chart GenerationJun 25, 2024Zero-Shot Long-Form Video Understanding through ScreenplayJun 4, 2025RSVP: Reasoning Segmentation via Visual Prompting and Multi-modal Chain-of-ThoughtAug 26, 2025SoccerNet 2025 Challenges ResultsDec 12, 2024Video Repurposing from User Generated Content: A Large-scale Dataset and BenchmarkMay 29, 2023Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion ModelsOct 16, 2023DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video EditingAug 24, 2022Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA TaskSep 27, 2023Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video GenerationJun 9, 2024Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data PerspectivesOct 5, 2025ChronoEdit: Towards Temporal Reasoning for Image Editing and World SimulationJan 7, 2025Cosmos World Foundation Model Platform for Physical AI