Showing 21–40 of 46 results
/ Date/ Name
Mar 6, 2025RA-DP: Rapid Adaptive Diffusion Policy for Training-Free High-frequency Robotics ReplanningOct 20, 2025Diffusion Models as Dataset Distillation PriorsFeb 25, 2026DySCO: Dynamic Attention-Scaling Decoding for Long-Context Language ModelsJun 1, 2023EEL: Efficiently Encoding Lattices for RerankingSep 18, 2024To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoningJul 25, 2025Multi-Task Dense Prediction Fine-Tuning with Mixture of Fine-Grained ExpertsJan 19, 2026Beyond Single-shot Writing: Deep Research Agents are Unreliable at Multi-turn Report RevisionNov 2, 2022A Joint Framework Towards Class-aware and Class-agnostic Alignment for Few-shot SegmentationJun 9, 2022Diagnosing Ensemble Few-Shot ClassifiersNov 16, 2023Crafting In-context Examples according to LMs' Parametric KnowledgeMay 6, 2024CityLLaVA: Efficient Fine-Tuning for VLMs in City ScenarioApr 18, 2024AmbigDocs: Reasoning across Documents on Different Entities under the Same NameMay 31, 2025Inter-Passage Verification for Multi-evidence Multi-answer QAJun 10, 2025Learning to Reason Across Parallel Samples for LLM ReasoningSep 24, 2025Language Models that Think, Chat BetterFeb 2, 2026Advancing General-Purpose Reasoning Models with Modular Gradient SurgeryApr 13, 2026Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic TasksApr 17, 2026Detecting and Suppressing Reward Hacking with Gradient FingerprintsOct 24, 2023MuSR: Testing the Limits of Chain-of-thought with Multistep Soft ReasoningJun 3, 2024LoFiT: Localized Fine-tuning on LLM Representations