Showing 1–20 of 48 results
Date          Name
Sep 10, 2021  Instance-Conditioned GAN
May 9, 2021   Graph Attention Networks with Positional Embeddings
May 15, 2023  Improved baselines for vision-language pre-training
Apr 26, 2023  Controllable Image Generation via Collage Representations
Dec 7, 2020   Generating unseen complex scenes: are we there yet?
Jan 15, 2026  Inference-time Physics Alignment of Video Generative Models with Latent World Models
Jun 6, 2024   Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance
Dec 14, 2023  A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
Aug 15, 2025  Controlling Multimodal LLMs via Reward-guided Decoding
Oct 22, 2025  Improving the Physics of Video Generation with VJEPA-2 Reward Signal
Nov 5, 2024   On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
Feb 21, 2025  Improving the Scaling Laws of Synthetic Data with Deliberate Practice
Feb 15, 2023  Learning to Substitute Ingredients in Recipes
Mar 21, 2024  DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning
Jun 5, 2025   DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models
Aug 14, 2025  Increasing the Utility of Synthetic Images through Chamfer Guidance
Oct 22, 2025  The Intricate Dance of Prompt Complexity, Quality, Diversity, and Consistency in T2I Models
Oct 25, 2021  Parameter Prediction for Unseen Deep Architectures
Jul 20, 2022  Revisiting Hotels-50K and Hotel-ID
Jan 3, 2024   GPS-SSL: Guided Positive Sampling to Inject Prior Into Self-Supervised Learning