"au:"Xianhang Li"" — arXiv2 Search

Showing 1–20 of 24 results

May 11, 2023: An Inverse Scaling Law for CLIP Training

Jun 25, 2020: SmallBigNet: Integrating Core and Contextual Views for Video Classification

Jun 3, 2021: CT-Net: Channel Tensorization Network for Video Classification

May 3, 2022: In Defense of Image Pre-Training for Spatiotemporal Recognition

May 30, 2024: Scaling White-Box Transformers for Vision

Feb 9, 2022: L2B: Learning to Bootstrap Robust Models for Combating Label Noise

Jun 27, 2023: CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a \$10,000 Budget; An Extra \$4,000 Unlocks 81.8% Accuracy

Aug 6, 2024: MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Sep 29, 2025: Rethinking JEPA: Compute-Efficient Video SSL with Frozen Teachers

Jan 21, 2026: OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

Oct 11, 2023: 3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers

Mar 23, 2024: 3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge

Apr 21, 2022: Fast AdvProp

Jul 21, 2023: Consistency-guided Meta-Learning for Bootstrapping Semi-Supervised Medical Image Segmentation

May 7, 2025: OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Nov 25, 2024: CLIPS: An Enhanced CLIP Framework for Learning with Synthetic Captions

Jun 12, 2024: What If We Recaption Billions of Web Images with LLaMA-3?

Dec 20, 2022: Unleashing the Power of Visual Prompting At the Pixel Level

Jan 9, 2024: Revisiting Adversarial Training at Scale

Jun 8, 2024: Medical Vision Generalist: Unifying Medical Imaging Tasks in Context