"au:"Jay Wu"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Jay Wu"" — arXiv2 Search

Showing 1–20 of 24 results

/ Date/ Name

Oct 24, 2023CVPR 2023 Text Guided Video Editing Competition Dec 22, 2022Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation Mar 3, 2025Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models Jan 15, 2024Towards A Better Metric for Text-to-Video Generation Jun 1, 2022Label-Efficient Online Continual Object Detection in Streaming Video Apr 24, 2025VEU-Bench: Towards Comprehensive Understanding of Video Editing Mar 10, 2024Reframe Anything: LLM Agent for Open World Video Reframing Mar 18, 2025Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control Oct 2, 2025OpusAnimation: Code-Based Dynamic Chart Generation Jun 25, 2024Zero-Shot Long-Form Video Understanding through Screenplay Jun 4, 2025RSVP: Reasoning Segmentation via Visual Prompting and Multi-modal Chain-of-Thought Aug 26, 2025SoccerNet 2025 Challenges Results Dec 12, 2024Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark May 29, 2023Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models Oct 16, 2023DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing Aug 24, 2022Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task Sep 27, 2023Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation Jun 9, 2024Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives Oct 5, 2025ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation Jan 7, 2025Cosmos World Foundation Model Platform for Physical AI