Showing 1–20 of 20 results
/ Date/ Name
Jun 18, 2025DeVisE: Behavioral Testing of Medical Large Language ModelsJan 8, 2025GaussianVideo: Efficient Video Representation via Hierarchical Gaussian SplattingJul 17, 2024Evaluating Linguistic Capabilities of Multimodal LLMs in the Lens of Few-Shot LearningApr 18, 2024Sequential Compositional Generalization in Multimodal ModelsOct 18, 2023Harnessing Dataset Cartography for Improved Compositional Generalization in TransformersSep 15, 2023Hyperspectral Image Denoising via Self-Modulating Convolutional Neural NetworksAug 24, 2023Spherical Vision Transformer for 360-degree Video Saliency PredictionJul 17, 2023CLIP-Guided StyleGAN Inversion for Text-Driven Real Image EditingMay 10, 2023HyperE2VID: Improving Event-Based Video Reconstruction via HypernetworksApr 30, 2023EVREAL: Towards a Comprehensive Benchmark and Analysis Suite for Event-based Video ReconstructionApr 12, 2023VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEsMar 13, 2023ST360IQ: No-Reference Omnidirectional Image Quality Assessment with Spherical Vision TransformersNov 5, 2022Disentangling Content and Motion for Text-Based Neural Video ManipulationSep 18, 2022Perception-Distortion Trade-off in the SR Space Spanned by Flow ModelsDec 8, 2020CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractionsJun 17, 2020Burst Photography for Learning to Enhance Extremely Dark ImagesMar 17, 2020Burst Denoising of Dark ImagesAug 22, 2018Manipulating Attributes of Natural Scenes via HallucinationDec 1, 2016Learning to Generate Images of Outdoor Scenes from Attributes and Semantic LayoutsJan 15, 2016Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures