Date / Name
Sep 16, 2025 / 3D Aware Region Prompted Vision Language Model
Apr 5, 2022 / Autoregressive 3D Shape Generation via Canonical Mapping
Jun 3, 2024 / SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models
May 4, 2023 / TUVF: Learning Generalizable Texture UV Radiance Fields
Dec 5, 2024 / NaVILA: Legged Robot Vision-Language-Action Model for Navigation
Jul 10, 2021 / Learning 3D Dense Correspondence via Canonical Point Autoencoder
Aug 29, 2018 / Searching Toward Pareto-Optimal Device-Aware Neural Architectures
Dec 5, 2024 / NVILA: Efficient Frontier Visual Language Models
Jun 21, 2018 / DPP-Net: Device-aware Progressive Search for Pareto-optimal Neural Architectures
Jul 16, 2025 / EgoVLA: Learning Vision-Language-Action Models from Egocentric Human Videos
Sep 9, 2018 / Visual Relationship Prediction via Label Clustering and Incorporation of Depth Information
Nov 26, 2018 / InstaNAS: Instance-aware Neural Architecture Search
Oct 17, 2025 / OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
Apr 23, 2026 / Long-Horizon Manipulation via Trace-Conditioned VLA Planning