Showing 1–15 of 15 results
/ Date/ Name
Jun 15, 2023Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image SynthesisMay 1, 2024Deep Reward Supervisions for Tuning Text-to-Image Diffusion ModelsMar 23, 2023CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-MatchingMar 25, 2023Human Preference Score: Better Aligning Text-to-Image Models with Human PreferenceAug 12, 2021Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal VisionOct 17, 2025Latent Diffusion Model without Variational AutoencoderMar 20, 2024Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific AdaptationDec 18, 2025Kling-Omni Technical ReportDec 23, 2025SemanticGen: Video Generation in Semantic SpaceMar 27, 2024ECNet: Effective Controllable Text-to-Image Diffusion ModelsDec 2, 2021Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot TasksApr 4, 2024CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept MatchingDec 12, 2025SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational AutoencoderJul 3, 2023JourneyDB: A Benchmark for Generative Image UnderstandingAug 5, 2025HPSv3: Towards Wide-Spectrum Human Preference Score