Showing 21–40 of 85 results
| Date | Name |
| --- | --- |
| Apr 14, 2017 | Cross-media Similarity Metric Learning with Unified Deep Networks |
| Apr 7, 2017 | An Overview of Cross-media Retrieval: Concepts, Methodologies, Benchmarks and Challenges |
| Aug 31, 2022 | SIM-Trans: Structure Information Modeling Transformer for Fine-grained Visual Categorization |
| Sep 25, 2017 | Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN |
| Aug 31, 2017 | Fine-grained Visual-textual Representation Learning |
| Aug 16, 2017 | Modality-specific Cross-modal Similarity Measurement with Recurrent Attention Network |
| Feb 7, 2018 | Deep Reinforcement Learning for Image Hashing |
| Mar 10, 2018 | Deep Cross-media Knowledge Transfer |
| Nov 21, 2023 | Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval |
| May 11, 2024 | FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment |
| Mar 15, 2023 | Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos |
| Dec 12, 2024 | Selective Visual Prompting in Vision Mamba |
| Mar 17, 2025 | SCAP: Transductive Test-Time Adaptation via Supportive Clique-based Attribute Prompting |
| Jun 15, 2025 | Balancing Preservation and Modification: A Region and Semantic Aware Metric for Instruction-Based Image Editing |
| Aug 24, 2025 | Investigating Domain Gaps for Indoor 3D Object Detection |
| Feb 27, 2026 | Venus: Benchmarking and Empowering Multimodal Large Language Models for Aesthetic Guidance and Cropping |
| Feb 9, 2026 | TiFRe: Text-guided Video Frame Reduction for Efficient Video Multi-modal Large Language Models |
| Apr 17, 2026 | Repurposing 3D Generative Model for Autoregressive Layout Generation |
| Sep 7, 2023 | Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory |
| Mar 28, 2023 | PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout |