Showing 421–440 of 2,609 results
/ Date/ Name
Nov 23, 2025Synthetic Curriculum Reinforces Compositional Text-to-Image GenerationNov 22, 2025MVS-TTA: Test-Time Adaptation for Multi-View Stereo via Meta-Auxiliary LearningNov 22, 2025A Multi-Stage Deep Learning Framework with PKCP-MixUp Augmentation for Pediatric Liver Tumor Diagnosis Using Multi-Phase Contrast-Enhanced CTNov 21, 2025TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-MakingNov 21, 2025PostCam: Camera-Controllable Novel-View Video Generation with Query-Shared Cross-AttentionNov 21, 2025Closing the Performance Gap Between AI and Radiologists in Chest X-Ray ReportingNov 20, 2025Generative Augmented Reality: Paradigms, Technologies, and Future ApplicationsNov 20, 2025Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual GenerationNov 20, 2025WWE-UIE: A Wavelet & White Balance Efficient Network for Underwater Image EnhancementNov 19, 2025MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert SkippingNov 19, 2025GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AINov 19, 2025Zero-Shot Open-Vocabulary Human Motion Grounding with Test-Time TrainingNov 19, 2025Adaptive thresholding pattern for fingerprint forgery detectionNov 17, 2025OlmoEarth: Stable Latent Image Modeling for Multimodal Earth ObservationNov 17, 2025Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and ChallengesNov 17, 20253DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at ScaleNov 17, 2025SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy OptimizationNov 17, 2025Recurrent Autoregressive Diffusion: Global Memory Meets Local AttentionNov 16, 2025DEMIST: Decoupled Multi-stream latent diffusion for Quantitative Myelin Map SynthesisNov 13, 2025Depth Anything 3: Recovering the Visual Space from Any Views