Showing 1–20 of 49 results
Date  Name
Apr 23, 2026  Black-Box Skill Stealing Attack from Proprietary LLM Agents: An Empirical Study
Apr 21, 2026  Wan-Image: Pushing the Boundaries of Generative Visual Intelligence
Apr 13, 2026  Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music
Feb 16, 2026  DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI
Oct 8, 2025  A Giant Peanut-shaped Ultra-High-Energy Gamma-Ray Emitter Off the Galactic Plane
Sep 6, 2025  Red-Teaming Coding Agents from a Tool-Invocation Perspective: An Empirical Security Assessment
Aug 23, 2025  ProtoEHR: Hierarchical Prototype Learning for EHR-based Healthcare Predictions
Jun 23, 2025  Instability in Diffusion ODEs: An Explanation for Inaccurate Image Reconstruction
Jun 11, 2025  Efficient Part-level 3D Object Generation via Dual Volume Packing
May 11, 2025  Seed1.5-VL Technical Report
Apr 23, 2025  Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light
Mar 26, 2025  Wan: Open and Advanced Large-Scale Video Generative Models
Dec 16, 2024  MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes
Sep 9, 2024  Measurement of the Free Neutron Lifetime in a Magneto-Gravitational Trap with In Situ Detection
Jul 26, 2024  Wolf: Dense Video Captioning with a World Summarization Framework
Jul 22, 2024  PsyDI: Towards a Personalized and Progressively In-depth Chatbot for Psychological Measurements
Jun 14, 2024  An experimental search for an explanation of the difference between beam and bottle neutron lifetime measurements
May 29, 2024  Enhancing Vision-Language Model with Unmasked Token Alignment
Dec 21, 2023  TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification
Nov 28, 2023  Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following