Showing 441–460 of 2,609 results
/ Date/ Name
Nov 13, 2025AHA! Animating Human Avatars in Diverse Scenes with Gaussian SplattingNov 10, 2025Lightning Grasp: High Performance Procedural Grasp Synthesis with Contact FieldsNov 10, 2025StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video GenerationNov 10, 2025StreamKV: Streaming Video Question-Answering with Segment-based KV Cache Retrieval and CompressionNov 10, 2025FoCLIP: A Feature-Space Misalignment Framework for CLIP-Based Image Manipulation and DetectionNov 7, 2025Neural Image Abstraction Using Long Smoothing B-SplinesNov 6, 2025NVIDIA Nemotron Nano V2 VLNov 4, 2025RxnCaption: Reformulating Reaction Diagram Parsing as Visual Prompt Guided CaptioningNov 1, 2025Challenging DINOv3 Foundation Model under Low Inter-Class Variability: A Case Study on Fetal Brain UltrasoundOct 31, 2025Phased DMD: Few-step Distribution Matching Distillation via Score Matching within SubintervalsOct 31, 2025RzenEmbed: Towards Comprehensive Multimodal RetrievalOct 30, 2025Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF BenchmarkOct 30, 2025Detecting Unauthorized Vehicles using Deep Learning for Smart Cities: A Case Study on BangladeshOct 29, 2025Generative Image Restoration and Super-Resolution using Physics-Informed Synthetic Data for Scanning Tunneling MicroscopyOct 29, 2025Diffusion-Driven Progressive Target Manipulation for Source-Free Domain AdaptationOct 28, 2025Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and GenerationOct 28, 2025ResNet: Enabling Deep Convolutional Neural Networks through Residual LearningOct 27, 2025EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoTOct 27, 2025Video-Thinker: Sparking "Thinking with Videos" via Reinforcement LearningOct 26, 2025IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction