Date / Name
Jan 20, 2020 / Nested-Wasserstein Self-Imitation Learning for Sequence Generation
Aug 9, 2018 / Policy Optimization as Wasserstein Gradient Flows
Jun 10, 2024 / TRINS: Towards Multimodal Language Models that Can Read
May 4, 2020 / Reward Constrained Interactive Recommendation with Natural Language Feedback
Nov 2, 2018 / Sequence Generation with Guider Network
Mar 14, 2024 / AutoLoRA: Automatically Tuning Matrix Ranks in Low-Rank Adaptation Based on Meta Learning
Oct 13, 2024 / TapWeight: Reweighting Pretraining Objectives for Task-Adaptive Pretraining
Jul 27, 2024 / LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models
Apr 10, 2025 / Defense against Prompt Injection Attacks via Mixture of Encodings
Jan 29, 2026 / FunPRM: Function-as-Step Process Reward Model with Meta Reward Correction for Code Generation
Feb 21, 2020 / GenDICE: Generalized Offline Estimation of Stationary Values
Feb 19, 2019 / Scalable Thompson Sampling via Optimal Transport
Dec 17, 2025 / DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding
May 4, 2020 / Improving Adversarial Text Generation by Modeling the Distant Future
Dec 30, 2017 / Learning Structural Weight Uncertainty for Sequential Decision-Making
Apr 15, 2026 / AIBuildAI: An AI Agent for Automatically Building AI Models
Jul 23, 2023 / Learning Navigational Visual Representations with Semantic Map Supervision
Jul 18, 2022 / STT: Soft Template Tuning for Few-Shot Adaptation
May 9, 2023 / Towards Building the Federated GPT: Federated Instruction Tuning
Mar 17, 2019 / Topic-Guided Variational Autoencoders for Text Generation