Showing 661–680 of 2,609 results
/ Date/ Name
May 28, 2025Chain-of-Talkers (CoTalk): Fast Human Annotation of Dense Image CaptionsMay 28, 2025ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich ManipulationMay 28, 2025Learning World Models for Interactive Video GenerationMay 28, 2025LiDARDustX: A LiDAR Dataset for Dusty Unstructured Road EnvironmentsMay 25, 2025DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous DrivingMay 24, 2025On Denoising Walking Videos for Gait RecognitionMay 23, 2025DanceTogether! Identity-Preserving Multi-Person Interactive Video GenerationMay 22, 2025Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-TuningMay 22, 2025NTIRE 2025 challenge on Text to Image Generation Model Quality AssessmentMay 21, 2025UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation LearningMay 20, 2025Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional TrainingMay 20, 2025Model-Independent Machine Learning Approach for Nanometric Axial Localization and TrackingMay 20, 2025From stability of Langevin diffusion to convergence of proximal MCMC for non-log-concave samplingMay 20, 2025Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language ModelsMay 19, 2025VSA: Faster Video Diffusion with Trainable Sparse AttentionMay 19, 2025RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought ReasoningMay 19, 2025Rethinking Features-Fused-Pyramid-Neck for Object DetectionMay 16, 2025QVGen: Pushing the Limit of Quantized Video Generative ModelsMay 16, 2025PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance AlignmentMay 16, 2025From Embeddings to Accuracy: Comparing Foundation Models for Radiographic Classification