Showing 1–20 of 20 results
/ Date/ Name
Apr 24, 2022Progressive Learning for Image Retrieval with Hybrid-Modality QueriesMar 6, 2023A Redistribution Framework for Diffusion AuctionsJul 24, 2024Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language ModelsOct 28, 2025Repurposing Synthetic Data for Fine-grained Search Agent SupervisionJun 28, 2025A Systematic Study of Compositional Syntactic Transformer Language ModelsMay 28, 2025EvolveSearch: An Iterative Self-Evolving Search AgentMar 1, 2020Fine-grained Video-Text Retrieval with Hierarchical Graph ReasoningSep 16, 2025ReSum: Unlocking Long-Horizon Search Intelligence via Context SummarizationMar 11, 2021WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-TrainingOct 28, 2025AgentFold: Long-Horizon Web Agents with Proactive Context ManagementJun 14, 2020Team RUC_AIM3 Technical Report at Activitynet 2020 Task 2: Exploring Sequential Events Detection for Dense Video CaptioningAug 3, 2020The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020)Oct 15, 2019Integrating Temporal and Spatial Attentions for VATEX Video Captioning Challenge 2019Oct 28, 2025Tongyi DeepResearch Technical ReportOct 28, 2025ParallelMuse: Agentic Parallel Thinking for Deep Information SeekingAug 7, 2025WebWatcher: Breaking New Frontier of Vision-Language Deep Research AgentSep 16, 2025WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement LearningJul 11, 2019Activitynet 2019 Task 3: Exploring Contexts for Dense Captioning Events in VideosJun 22, 2018RUC+CMU: System Report for Dense Captioning Events in VideosAug 15, 2019Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards