Showing 1–11 of 11 results
/ Date/ Name
Feb 15, 2025Bone Soups: A Seek-and-Soup Model Merging Approach for Controllable Multi-Objective GenerationSep 26, 2025Learning More with Less: A Dynamic Dual-Level Down-Sampling Framework for Efficient Policy OptimizationJul 30, 2025G-Core: A Simple, Scalable and Balanced RLHF TrainerMay 18, 2022ERNIE-Search: Bridging Cross-Encoder with Dual-Encoder via Self On-the-fly Distillation for Dense Passage RetrievalMar 17, 2022ERNIE-GeoL: A Geography-and-Language Pre-trained Model and its Applications in Baidu MapsAug 4, 2025CAPO: Towards Enhancing LLM Reasoning through Generative Credit AssignmentOct 4, 2025Merge and Guide: Unifying Model Merging and Guided Decoding for Controllable Multi-Objective GenerationJul 5, 2021NOTE: Solution for KDD-CUP 2021 WikiKG90M-LSCAug 11, 2025WeChat-YATT: A Scalable, Simple, Efficient, and Production Ready Training LibrarySep 8, 2020Masked Label Prediction: Unified Message Passing Model for Semi-Supervised ClassificationSep 29, 2025From Faithfulness to Correctness: Generative Reward Models that Think Critically