Showing 1–20 of 101 results
/ Date/ Name
Mar 19, 2020Domain-Adaptive Few-Shot LearningNov 28, 2019Every Frame Counts: Joint Learning of Video Segmentation and Optical FlowDec 10, 2019Learning Depth-Guided Convolutions for Monocular 3D Object DetectionMar 24, 2021Learning Versatile Neural Architectures by Propagating Network CodesJun 11, 2021HR-NAS: Searching Efficient High-Resolution Neural Architectures with Lightweight TransformersFeb 13, 2023UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal ModelingJul 1, 2024Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot LearningOct 4, 2023LanguageMPC: Large Language Models as Decision Makers for Autonomous DrivingApr 7, 2022DaViT: Dual Attention Vision TransformersJan 27, 2023Understanding Self-Supervised Pretraining with Part-Aware Representation LearningApr 6, 2023Visual Dependency Transformers: Dependency Tree Emerges from Reversed AttentionApr 7, 2023Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction FollowingMay 22, 2023VDT: General-purpose Video Diffusion Transformers via Mask ModelingApr 27, 2023Quadric Representations for LiDAR Odometry, Mapping and LocalizationOct 30, 2024MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank ExpertsOct 12, 2023Tree-Planner: Efficient Close-loop Task Planning with Large Language ModelsMar 28, 2025REMAC: Self-Reflective and Self-Evolving Multi-Agent Collaboration for Long-Horizon Robot ManipulationJun 27, 2023Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical PropertiesOct 3, 2023RSRD: A Road Surface Reconstruction Dataset and Benchmark for Safe and Comfortable Autonomous DrivingOct 3, 2023Generalizable Long-Horizon Manipulations with Large Language Models