Showing 1–20 of 36 results
Date / Name
Sep 17, 2023 / Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles
Nov 10, 2022 / CREATIVESUMM: Shared Task on Automatic Summarization for Creative Writing
Dec 15, 2022 / Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation
May 23, 2023 / LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Jul 24, 2020 / SummEval: Re-evaluating Summarization Evaluation
Oct 9, 2024 / ReIFE: Re-evaluating Instruction-Following Evaluation
Dec 8, 2021 / Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand
May 28, 2023 / Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Dec 20, 2022 / Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization
May 11, 2018 / TutorialBank: A Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation
Jun 4, 2019 / Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model
Jun 26, 2019 / Creating A Neural Pedagogical Agent by Jointly Learning to Review and Assess
Nov 11, 2021 / AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization
Apr 24, 2024 / Prompt Leakage effect and defense strategies for multi-turn LLM interactions
Oct 30, 2024 / Evaluating Cultural and Social Awareness of LLM Web Agents
Jun 1, 2021 / ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive Summarization with Argument Mining
Dec 14, 2021 / Exploring Neural Models for Query-Focused Summarization
May 25, 2022 / Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors
Aug 22, 2018 / Sarcasm Analysis using Conversation Context
Apr 17, 2021 / Multi-Perspective Abstractive Answer Summarization