Showing 1–20 of 26 results
/ Date/ Name
Nov 29, 2018Traffic Danger Recognition With Surveillance Cameras Without Training DataDec 21, 2023VideoPoet: A Large Language Model for Zero-Shot Video GenerationJun 30, 2023SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMsJun 15, 2023DocumentNet: Bridging the Data Gap in Document Pre-TrainingJan 14, 2022Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube ProposalsFeb 6, 2024Unified Discrete Diffusion for Categorical DataDec 10, 2022MAGVIT: Masked Generative Video TransformerOct 9, 2023Language Model Beats Diffusion -- Tokenizer is Key to Visual GenerationJul 22, 2018MOBA-Slice: A Time Slice Based Evaluation Framework of Relative Advantage between Teams in MOBA GamesFeb 1, 2020Training-free Monocular 3D Event Detection System for Traffic SurveillanceMay 26, 2024Towards Multi-Task Multi-Modal Models: A Video Generative PerspectiveMay 22, 2024A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual GenerationMay 5, 2024Polarization Purity and Dispersion Characteristics of Nested Antiresonant Nodeless Hollow-Core Optical Fiber at Near- and Short-wave-IR Wavelengths for Quantum CommunicationsDec 8, 2024Language-Guided Image Tokenization for GenerationJun 13, 2025Tracing LLM Reasoning Processes with Strategic Games: A Framework for Planning, Revision, and Resource-Constrained Decision MakingApr 21, 2022Comparing value of travel time and value of travel time saving with heterogeneity in travelersDec 11, 2023Photorealistic Video Generation with Diffusion ModelsJul 7, 2025Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic CapabilitiesNov 9, 2021Generation and dynamics of soliton and soliton molecules from a VSe2/GO-based fiber laserSep 24, 2024MaskBit: Embedding-free Image Generation via Bit Tokens