Showing 1–20 of 43 results
Date          Name
Oct 7, 2022   Calibrating Factual Knowledge in Pretrained Language Models
May 15, 2021  Premise-based Multimodal Reasoning: Conditional Inference on Joint Textual and Visual Clues
May 2, 2022   Robust Fine-tuning via Perturbation and Interpolation from In-batch Instances
May 17, 2023  Statistical Knowledge Assessment for Large Language Models
Dec 19, 2022  Statistical Dataset Evaluation: Reliability, Difficulty, and Validity
May 24, 2023  ImageNetVC: Zero- and Few-Shot Visual Commonsense Evaluation on 1000 ImageNet Categories
Apr 20, 2021  Problems and Countermeasures in Natural Language Processing Evaluation
Oct 9, 2024   Self-Boosting Large Language Models with Synthetic Preference Data
Jan 2, 2024   Further Explanations on "SAT Requires Exhaustive Search"
Jan 21, 2021  ParaSCI: A Large Scientific Paraphrase Dataset for Longer Paraphrase Generation
Dec 31, 2022  A Survey on In-context Learning
Mar 26, 2022  A Roadmap for Big Model
Sep 11, 2023  Large Language Model for Science: A Study on P vs. NP
May 23, 2023  Can Language Models Understand Physical Concepts?
Jan 15, 2024  Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding
Dec 16, 2024  Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Mar 7, 2023   A Challenging Benchmark for Low-Resource Learning
Oct 9, 2024   Data Selection via Optimal Control for Language Models
May 20, 2025  Reward Reasoning Model
May 20, 2025  Think Only When You Need with Large Hybrid-Reasoning Models