Date          Name
Apr 1, 2022   On the Importance of Asymmetry for Siamese Representation Learning
Apr 13, 2017  Spatial Memory for Context Reasoning in Object Detection
Jan 25, 2024  Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Mar 29, 2018  Iterative Visual Reasoning Beyond Convolutions
Mar 28, 2019  TensorMask: A Foundation for Dense Object Segmentation
Jun 13, 2024  An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels
May 7, 2015   Webly Supervised Learning of Convolutional Networks
Feb 7, 2017   An Implementation of Faster RCNN with Study for Region Sampling
Apr 5, 2021   An Empirical Study of Training Self-Supervised Vision Transformers
Nov 20, 2014  Learning a Recurrent Visual Representation for Image Caption Generation
Nov 20, 2020  Exploring Simple Siamese Representation Learning
Mar 9, 2020   Improved Baselines with Momentum Contrastive Learning
Apr 9, 2019   Multi-Target Embodied Question Answering
Nov 22, 2021  Benchmarking Detection Transfer Learning with Vision Transformers
Oct 11, 2021  Towards Demystifying Representation Learning with Non-contrastive Self-supervision
Mar 10, 2022  LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Jan 2, 2023   ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders
Nov 23, 2022  EurNet: Efficient Multi-Range Relational Modeling of Spatial Multi-Relational Data
Jun 8, 2023   R-MAE: Regions Meet Masked Autoencoders
Feb 15, 2024  Revisiting Feature Prediction for Learning Visual Representations from Video