Showing 1–11 of 11 results
/ Date/ Name
Apr 3, 2026Poison Once, Exploit Forever: Environment-Injected Memory Poisoning Attacks on Web AgentsOct 18, 2025Automated Composition of Agents: A Knapsack Approach for Agentic Component SelectionOct 13, 2025R-WoM: Retrieval-augmented World Model For Computer-use AgentsFeb 25, 2025On Synthetic Data Strategies for Domain-Specific Generative RetrievalOct 14, 2024PRACTIQ: A Practical Conversational Text-to-SQL dataset with Ambiguous and Unanswerable QueriesAug 15, 2024RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented GenerationApr 24, 2024Towards a Holistic Evaluation of LLMs on Factual Knowledge RecallMay 25, 2023UNITE: A Unified Benchmark for Text-to-SQL EvaluationJan 21, 2023Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL RobustnessDec 17, 2022Importance of Synthesizing High-quality Data for Text-to-SQL ParsingSep 28, 2022Improving Text-to-SQL Semantic Parsing with Fine-grained Query Understanding