Showing 1–20 of 24 results
/ Date/ Name
Nov 19, 2023An Interactive Query Generation Assistant using LLM-based Prompt Modification and User FeedbackDec 6, 2024ConQRet: Benchmarking Fine-Grained Evaluation of Retrieval Augmented Argumentation with LLM JudgesDec 6, 2024PyTerrier-GenRank: The PyTerrier Plugin for Reranking with Large Language ModelsDec 19, 2022NusaCrowd: Open Source Initiative for Indonesian NLP ResourcesFeb 2, 2021The GEM Benchmark: Natural Language Generation, its Evaluation and MetricsDec 6, 2021NL-Augmenter: A Framework for Task-Sensitive Natural Language AugmentationMay 31, 2020Benchmarking BioRelEx for Entity Tagging and Relation ExtractionApr 26, 2025Generative Product Recommendations for Implicit Superlative QueriesJan 26, 2026BabyReasoningBench: Generating Developmentally-Inspired Reasoning Tasks for Evaluating Baby Language ModelsJan 29, 2024KAUCUS: Knowledge Augmented User Simulators for Training Language Model AssistantsJul 8, 2021CANDLE: Decomposing Conditional and Conjunctive Queries for Task-Oriented Dialogue SystemsJun 17, 2025InsertRank: LLMs can reason over BM25 scores to Improve Listwise RerankingApr 18, 2020Syn-QG: Syntactic and Shallow Semantic Rules for Question GenerationApr 3, 2024DUQGen: Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query GenerationNov 12, 2022Lessons from Digital India for the Right to Internet AccessJan 14, 2025A Multi-Encoder Frozen-Decoder Approach for Fine-Tuning Large Language ModelsMar 21, 2026RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric GenerationJun 9, 2022Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language modelsMar 23, 2024QueryExplorer: An Interactive Query Generation Assistant for Search and ExplorationJun 6, 2022A Bird's-Eye Tutorial of Graph Attention Architectures