Showing 1–20 of 21 results
/ Date/ Name
Apr 23, 2026Unlocking the Power of Critical Factors for 3D Visual Geometry EstimationApr 22, 2026Exploring Spatial Intelligence from a Generative PerspectiveApr 21, 2026MMControl: Unified Multi-Modal Control for Joint Audio-Video GenerationApr 8, 2026TC-AE: Unlocking Token Capacity for Deep Compression AutoencodersOct 8, 2025Evolutionary Profiles for Protein Fitness PredictionOct 8, 2025scPPDM: A Diffusion Model for Single-Cell Drug-Response PredictionSep 28, 2025HieraTok: Multi-Scale Visual Tokenizer Improves Image Reconstruction and GenerationFeb 25, 2025Revisiting Convolution Architecture in the Realm of DNA Foundation ModelsOct 12, 2024Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein InteractionsJun 18, 2024GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation ModelsJun 5, 2024Floating Anchor Diffusion Model for Multi-motif ScaffoldingJun 4, 2024Generative Active Learning for Long-tailed Instance SegmentationFeb 6, 2024MobileVLM V2: Faster and Stronger Baseline for Vision Language ModelDec 28, 2023MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile DevicesOct 18, 2023De novo protein design using geometric vector field networksAug 12, 2023SegPrompt: Boosting Open-world Segmentation via Category-level Prompt LearningJul 24, 2023CTVIS: Consistent Training for Online Video Instance SegmentationJan 19, 2022Poseur: Direct Human Pose Regression with TransformersMay 29, 2021FCPose: Fully Convolutional Multi-Person Pose Estimation with Dynamic Instance-Aware ConvolutionsMar 29, 2021TFPose: Direct Human Pose Estimation with Transformers