"au:"Sinong Wang"" — arXiv2 Search

/ Date/ Name


Showing 21–40 of 45 results


Jun 7, 2020  Language Models as Fact Checkers?
Dec 31, 2020  CLEAR: Contrastive Learning for Sentence Representation
Jun 3, 2021  Luna: Linear Unified Nested Attention
Apr 18, 2021  On the Influence of Masking Policies in Intermediate Pre-training
Jan 29, 2025  Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Apr 12, 2022  Detection, Disambiguation, Re-ranking: Autoregressive Entity Linking as a Multi-Task Problem
Aug 30, 2023  LM-Infinite: Zero-Shot Extreme Length Generalization for Large Language Models
Sep 30, 2024  The Perfect Blend: Redefining RLHF with Mixture of Judges
Apr 7, 2015  The Performance Analysis of Coded Cache in Wireless Fading Channel
Feb 16, 2024  SPAR: Personalized Content-Based Recommendation via Long Engagement Attention
Jan 18, 2025  Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
Jul 31, 2024  The Llama 3 Herd of Models
May 18, 2025  Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation
Jun 2, 2022  BayesFormer: Transformer with Uncertainty Estimation
Nov 4, 2022  Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler
Dec 7, 2021  Reducing Target Group Bias in Hate Speech Detectors
May 22, 2023  Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefixes
May 20, 2025  Reinforcement Learning from User Feedback
Oct 24, 2024  Improving Model Factuality with Fine-grained Critique-based Evaluator
Jan 16, 2025  Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
