Showing 1–18 of 18 results
/ Date/ Name
Aug 24, 20254D Visual Pre-training for Robot LearningJun 28, 2023Subclass-balancing Contrastive Learning for Long-tailed RecognitionSep 9, 2023When to Learn What: Model-Adaptive Data Augmentation CurriculumOct 3, 2024Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap FeaturesDec 31, 2025RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied IntelligenceJan 2, 2024BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous DrivingJul 2, 2025AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile ManipulationMar 13, 2025HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action ModelMay 17, 2025H2R: A Human-to-Robot Data Augmentation for Robot Pre-training from VideosSep 26, 2025Orochi: Versatile Biomedical Image ProcessorSep 26, 2025WoW: Towards a World omniscient World model Through Embodied InteractionApr 9, 2026HEX: Humanoid-Aligned Experts for Cross-Embodiment Whole-Body ManipulationJun 14, 2024DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative PlanningJun 7, 2025SpikePingpong: Spike Vision-based Fast-Slow Pingpong Robot SystemMar 14, 2026URDF-Anything+: Autoregressive Articulated 3D Models Generation for Physical SimulationJan 8, 2026LaST$_{0}$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action ModelDec 18, 2024RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot ManipulationNov 2, 2025URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model