"au:"Laura Dietz"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Laura Dietz"" — arXiv2 Search

Showing 1–20 of 21 results

/ Date/ Name

May 21, 2024A Workbench for Autograding Retrieve/Generate Systems Dec 22, 2024LLM-based relevance assessment still can't replace human relevance assessment Apr 27, 2025LLM-Evaluation Tropes: Perspectives on the Validity of LLM-Evaluations Jan 19, 2026Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?Oct 18, 2023Retrieve-Cluster-Summarize: An Alternative to End-to-End Training for Query-specific Article Generation Apr 13, 2023Perspectives on Large Language Models for Relevance Judgment May 1, 2018On the Equivalence of Generative and Discriminative Formulations of the Sequential Dependence Model Jan 21, 2026Supporting Humans in Evaluating AI Summaries of Legal Depositions Feb 1, 2024An Exam-based Evaluation Approach Beyond Traditional Relevance Judgments Jan 19, 2026Incorporating Q&A Nuggets into Retrieval-Augmented Generation Apr 18, 2019Knowledge-rich Image Gist Understanding Beyond Literal Meaning Oct 17, 2024Best in Tau@LLMJudge: Criteria-Based Relevance Evaluation with Llama3 Sep 8, 2025UNH at CheckThat! 2025: Fine-tuning Vs Prompting in Claim Extraction Sep 30, 2025Auto-ARGUE: LLM-Based Report Generation Evaluation Dec 20, 2019Report on the First HIPstIR Workshop on the Future of Information Retrieval Dec 21, 2023Fine-grained Forecasting Models Via Gaussian Process Blurring Effect May 13, 2017Benchmark for Complex Answer Retrieval Jul 13, 2025Criteria-Based LLM Relevance Judgments Sep 23, 2018Understanding the Gist of Images - Ranking of Concepts for Multimedia Indexing Jul 13, 2025Does UMBRELA Work on Other LLMs?