Date / Name
Jun 7, 2020 / Language Models as Fact Checkers?
Dec 31, 2020 / CLEAR: Contrastive Learning for Sentence Representation
Jun 3, 2021 / Luna: Linear Unified Nested Attention
Apr 18, 2021 / On the Influence of Masking Policies in Intermediate Pre-training
Jan 29, 2025 / Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Apr 12, 2022 / Detection, Disambiguation, Re-ranking: Autoregressive Entity Linking as a Multi-Task Problem
Aug 30, 2023 / LM-Infinite: Zero-Shot Extreme Length Generalization for Large Language Models
Sep 30, 2024 / The Perfect Blend: Redefining RLHF with Mixture of Judges
Apr 7, 2015 / The Performance Analysis of Coded Cache in Wireless Fading Channel
Feb 16, 2024 / SPAR: Personalized Content-Based Recommendation via Long Engagement Attention
Jan 18, 2025 / Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
Jul 31, 2024 / The Llama 3 Herd of Models
May 18, 2025 / Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation
Jun 2, 2022 / BayesFormer: Transformer with Uncertainty Estimation
Nov 4, 2022 / Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler
Dec 7, 2021 / Reducing Target Group Bias in Hate Speech Detectors
May 22, 2023 / Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefixes
May 20, 2025 / Reinforcement Learning from User Feedback
Oct 24, 2024 / Improving Model Factuality with Fine-grained Critique-based Evaluator
Jan 16, 2025 / Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment