Date          Name
Jun 17, 2022  CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Nov 1, 2024   Randomized Autoregressive Visual Generation
May 21, 2025  ThinkRec: Thinking-based recommendation via LLM
Mar 28, 2020  CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks
Nov 14, 2023  Towards Open-Ended Visual Recognition with Large Language Model
Jan 28, 2026  MALLOC: Benchmarking the Memory-aware Long Sequence Compression for Large Sequential Recommendation
Apr 2, 2019   Thickened 2D Networks for Efficient 3D Medical Image Segmentation
Dec 20, 2019  C2FNAS: Coarse-to-Fine Neural Architecture Search for 3D Medical Image Segmentation
Apr 22, 2024  GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting
Feb 9, 2026   Autoregressive Image Generation with Masked Bit Modeling
Jul 8, 2022   kMaX-DeepLab: k-means Mask Transformer
Aug 4, 2023   Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Dec 12, 2020  Mask Guided Matting via Progressive Refinement Network
Jun 4, 2021   Glance-and-Gaze Vision Transformer
Jun 11, 2024  An Image is Worth 32 Tokens for Reconstruction and Generation
Oct 12, 2020  Shape-Texture Debiased Neural Network Training
Jan 28, 2023  CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans
Apr 10, 2023  Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation
Nov 30, 2023  A Simple Video Segmenter by Tracking Objects Along Axial Trajectories
Jun 13, 2024  Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization