Showing 1–12 of 12 results
/ Date/ Name
May 30, 2023Less Likely Brainstorming: Using Language Models to Generate Alternative HypothesesFeb 20, 2024TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue SummarizationOct 14, 2021Making Document-Level Information Extraction Right for the Right ReasonsMay 19, 2025ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language ModelsApr 16, 2024MiniCheck: Efficient Fact-Checking of LLMs on Grounding DocumentsMay 25, 2022Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error DetectorsOct 28, 2021RadBERT-CL: Factually-Aware Contrastive Learning For Radiology Report ClassificationJun 20, 2025A Liquid-Nitrogen-Cooled Ca+ Ion Optical Clock with a Systematic Uncertainty of 4.4E-19Apr 1, 2025Is the Top Still Spinning? Evaluating Subjectivity in Narrative UnderstandingJan 11, 2022Prior Knowledge Enhances Radiology Report GenerationFeb 16, 2022Measurement of infrared magic wavelength for an all-optical trapping of $^{40}$Ca$^{+}$ ion clockNov 25, 2020Using Radiomics as Prior Knowledge for Thorax Disease Classification and Localization in Chest X-rays