Showing 1–20 of 21 results
/ Date/ Name
Jun 17, 2017Rethinking Atrous Convolution for Semantic Image SegmentationFeb 7, 2018Encoder-Decoder with Atrous Separable Convolution for Semantic Image SegmentationDec 2, 2019View-Invariant Probabilistic Embedding for Human PoseSep 11, 2018Searching for Efficient Multi-Scale Architectures for Dense Image PredictionFeb 25, 2019FEELVOS: Fast End-to-End Embedding Learning for Video Object SegmentationDec 2, 2020Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information MaximizationJan 15, 2020EEV: A Large-Scale Dataset for Studying Evoked Expressions from VideoJan 10, 2019Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image SegmentationJan 11, 2024Distilling Vision-Language Models on Millions of VideosJul 6, 2023VideoGLUE: Video General Understanding Evaluation of Foundation ModelsNov 20, 2022Learning to Generate Image Embeddings with User-level Differential PrivacySep 30, 2018Modeling Uncertainty with Hedged Instance EmbeddingOct 23, 2020View-Invariant, Occlusion-Robust Probabilistic Embedding for Human PoseDec 13, 2017MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction FeaturesMar 12, 2015FaceNet: A Unified Embedding for Face Recognition and ClusteringJun 17, 2021DeepLab2: A TensorFlow Library for Deep LabelingAug 13, 2024Imagen 3Feb 20, 2024VideoPrism: A Foundational Visual Encoder for Video UnderstandingMar 28, 2023Structured Video-Language Modeling with Temporal Grouping and Spatial GroundingDec 9, 2021Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision