Date / Name
Aug 15, 2020 / Graph Edit Distance Reward: Learning to Edit Scene Graph
Jul 17, 2023 / AlpaGasus: Training A Better Alpaca with Fewer Data
Jun 5, 2023 / InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models
May 3, 2023 / Backdoor Learning on Sequence to Sequence Models
Jun 11, 2024 / OPTune: Efficient Online Preference Tuning
Oct 16, 2024 / OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities
May 16, 2025 / Creativity or Brute Force? Using Brainteasers as a Window into the Problem-Solving Abilities of Large Language Models
May 3, 2023 / PTP: Boosting Stability and Performance of Prompt Tuning with Perturbation-Based Regularizer
Jul 9, 2021 / Task-Aware Sampling Layer for Point-Wise Analysis
Feb 19, 2024 / Your Vision-Language Model Itself Is a Strong Filter: Towards High-Quality Instruction Tuning with Data Selection
May 23, 2023 / Prompting Language-Informed Distribution for Compositional Zero-Shot Learning
Mar 14, 2023 / How Many Demonstrations Do You Need for In-context Learning?
Feb 26, 2025 / Self-rewarding correction for mathematical reasoning
May 21, 2025 / Learning to Reason via Mixture-of-Thought for Logical Reasoning
Jul 31, 2023 / Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection
Aug 23, 2023 / From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning
Mar 3, 2024 / Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation
Aug 19, 2025 / MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence
Oct 18, 2023 / Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning
Oct 23, 2023 / HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models