"au:"Xiaoshi Wu"" — arXiv2 Search

Showing 1–15 of 15 results

Jun 15, 2023  Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
May 1, 2024  Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Mar 23, 2023  CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching
Mar 25, 2023  Human Preference Score: Better Aligning Text-to-Image Models with Human Preference
Aug 12, 2021  Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision
Oct 17, 2025  Latent Diffusion Model without Variational Autoencoder
Mar 20, 2024  Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
Dec 18, 2025  Kling-Omni Technical Report
Dec 23, 2025  SemanticGen: Video Generation in Semantic Space
Mar 27, 2024  ECNet: Effective Controllable Text-to-Image Diffusion Models
Dec 2, 2021  Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks
Apr 4, 2024  CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Dec 12, 2025  SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder
Jul 3, 2023  JourneyDB: A Benchmark for Generative Image Understanding
Aug 5, 2025  HPSv3: Towards Wide-Spectrum Human Preference Score