Showing 21–34 of 34 results
Date | Name
Dec 12, 2020 | SenSeNet: Neural Keyphrase Generation with Document Structure
May 30, 2024 | Accurate and Reliable Predictions with Mutual-Transport Ensemble
Jun 17, 2024 | Full-ECE: A Metric For Token-level Calibration on Large Language Models
Aug 27, 2024 | BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline
Mar 17, 2025 | Efficient Motion-Aware Video MLLM
Mar 28, 2024 | Checkpoint Merging via Bayesian Optimization in LLM Pretraining
Sep 4, 2025 | Towards a Unified View of Large Language Model Post-Training
Oct 10, 2024 | Extracting and Combining Abilities For Building Multi-lingual Ability-enhanced Large Language Models
May 26, 2025 | Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers
Feb 11, 2026 | Gradients Must Earn Their Influence: Unifying SFT with Generalized Entropic Objectives
Jun 11, 2022 | Bridging the Gap Between Training and Inference of Bayesian Controllable Language Models
Sep 16, 2020 | Tasty Burgers, Soggy Fries: Probing Aspect Robustness in Aspect-Based Sentiment Analysis
Mar 6, 2026 | FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling
Jun 22, 2020 | ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion