"au:"Qihang Yu"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Qihang Yu"" — arXiv2 Search

Showing 21–40 of 48 results

/ Date/ Name

Jun 17, 2021DeepLab2: A TensorFlow Library for Deep Labeling Feb 8, 2021TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation May 30, 2022TubeFormer-DeepLab: Video Mask Transformer Jun 12, 2023Compositor: Bottom-up Clustering and Compositing for Robust Part and Object Segmentation Mar 30, 2023A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision Apr 12, 2024COCONut: Modernizing COCO Segmentation Feb 26, 2025Dictionary-based Framework for Interpretable and Consistent Object Parsing Apr 30, 2025ReVision: Refining Video Diffusion with Explicit 3D Motion Modeling Nov 25, 2020Can Temporal Information Help with Contrastive Self-Supervised Learning?Oct 4, 2022MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models Feb 19, 2020When Radiology Report Generation Meets Knowledge Graph Feb 4, 2025COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation Feb 27, 2025Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation Jan 13, 2025Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens Apr 6, 2026A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens Apr 16, 2026Frequency-Aware Flow Matching for High-Quality Image Generation Dec 2, 2021PartImageNet: A Large, High-Quality Dataset of Parts Mar 13, 2025FlowTok: Flowing Seamlessly Across Text and Image Tokens Jun 4, 2024Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Jun 29, 2023ReMaX: Relaxing for Better Training on Efficient Panoptic Segmentation

← Previous Next →