"au:"Mingyu Ding"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Mingyu Ding"" — arXiv2 Search

Showing 1–20 of 101 results

/ Date/ Name

Mar 19, 2020Domain-Adaptive Few-Shot Learning Nov 28, 2019Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow Dec 10, 2019Learning Depth-Guided Convolutions for Monocular 3D Object Detection Mar 24, 2021Learning Versatile Neural Architectures by Propagating Network Codes Jun 11, 2021HR-NAS: Searching Efficient High-Resolution Neural Architectures with Lightweight Transformers Feb 13, 2023UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling Jul 1, 2024Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning Oct 4, 2023LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving Apr 7, 2022DaViT: Dual Attention Vision Transformers Jan 27, 2023Understanding Self-Supervised Pretraining with Part-Aware Representation Learning Apr 6, 2023Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention Apr 7, 2023Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following May 22, 2023VDT: General-purpose Video Diffusion Transformers via Mask Modeling Apr 27, 2023Quadric Representations for LiDAR Odometry, Mapping and Localization Oct 30, 2024MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts Oct 12, 2023Tree-Planner: Efficient Close-loop Task Planning with Large Language Models Mar 28, 2025REMAC: Self-Reflective and Self-Evolving Multi-Agent Collaboration for Long-Horizon Robot Manipulation Jun 27, 2023Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties Oct 3, 2023RSRD: A Road Surface Reconstruction Dataset and Benchmark for Safe and Comfortable Autonomous Driving Oct 3, 2023Generalizable Long-Horizon Manipulations with Large Language Models