"au:"Zonghao Guo"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Zonghao Guo"" — arXiv2 Search

Showing 1–20 of 23 results

/ Date/ Name

Oct 19, 2024DPVS-Shapley:Faster and Universal Contribution Evaluation Component in Federated Learning Dec 18, 2024LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer Apr 11, 2022Semantic Segmentation for Point Cloud Scenes via Dilated Graph Feature Aggregation and Pyramid Decoders Aug 13, 2022Bidirectional Feature Globalization for Few-shot Semantic Segmentation of 3D Point Cloud Scenes Dec 2, 2025GeoViS: Geospatially Rewarded Visual Search for Remote Sensing Visual Grounding Aug 26, 2025EMind: A Foundation Model for Multi-task Electromagnetic Signals Understanding Mar 9, 2026MERLIN: Building Low-SNR Robust Multimodal LLMs for Electromagnetic Signals Mar 16, 2025Will Pre-Training Ever End? A First Step Toward Next-Generation Foundation MLLMs via Self-Improving Systematic Cognition Mar 31, 2025XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?Sep 16, 2025MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe Nov 26, 2025LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs Mar 27, 2025Video-R1: Reinforcing Video Reasoning in MLLMs Mar 13, 2026Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation May 27, 2025GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution Mar 17, 2025KARL: Knowledge-Aware Reasoning and Reinforcement Learning for Knowledge-Intensive Visual Grounding Oct 21, 2025ProCLIP: Progressive Vision-Language Alignment via LLM-based Embedder Dec 29, 2025MM-UAVBench: How Well Do Multimodal Large Language Models See, Think, and Plan in Low-Altitude UAV Scenarios?May 19, 2022Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection Oct 6, 2021Long-tailed Distribution Adaptation Mar 18, 2024LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images