Showing 1–20 of 28 results
/ Date/ Name
May 7, 2026On Time, Within Budget: Constraint-Driven Online Resource Allocation for Agentic WorkflowsJul 2, 2024Integrate the Essence and Eliminate the Dross: Fine-Grained Self-Consistency for Free-Form Language GenerationMay 30, 2025Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time ScalingAug 17, 2024CogLM: Tracking Cognitive Development of Large Language ModelsAug 24, 2024Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient ReasoningJan 29, 2026Do Not Waste Your Rollouts: Recycling Search Experience for Efficient Test-Time ScalingSep 29, 2024Instruction Embedding: Latent Representations of Instructions Towards Task IdentificationFeb 2, 2025LLM-Powered Benchmark Factory: Reliable, Generic, and EfficientOct 10, 2025Diagnosing and Mitigating System Bias in Self-Rewarding RLFeb 27, 2025Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer AggregationFeb 19, 2025From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGENJan 19, 2024Generative Dense Retrieval: Memory Can Be a BurdenFeb 19, 2025Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient EvaluationFeb 17, 2025InsBank: Evolving Instruction Subset for Ongoing AlignmentMar 7, 2025Speculative Decoding for Multi-Sample InferenceAug 25, 2024Poor-Supervised Evaluation for SuperLLM via Mutual ConsistencyFeb 17, 2025UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective OptimizationMay 27, 2025Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-GeneratorOct 5, 2025PatternKV: Flattening KV Representation Expands Quantization HeadroomSep 1, 2025Do Retrieval Augmented Language Models Know When They Don't Know?