"au:"Nikita Nangia"" — arXiv2 Search

Showing 1–18 of 18 results

Sep 30, 2020 CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models
Apr 18, 2017 A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
May 24, 2019 Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark
Jul 25, 2017 The RepEval 2017 Shared Task: Multi-Genre Natural Language Inference with Sentence Representations
Apr 17, 2018 ListOps: A Diagnostic Dataset for Latent Tree Learning
Jun 1, 2021 What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks?
Dec 16, 2021 QuALITY: Question Answering with Long Input Texts, Yes!
Apr 11, 2022 Single-Turn Debate Does Not Help Humans Answer Hard Reading-Comprehension Questions
Sep 18, 2025 PILOT: Steering Synthetic Data Generation with Psychological & Linguistic Output Targeting
Mar 12, 2022 What Makes Reading Comprehension Questions Difficult?
Jan 18, 2023 Discrete Latent Structure in Neural Networks
May 2, 2019 SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems
Oct 15, 2021 BBQ: A Hand-Built Bias Benchmark for Question Answering
Jun 9, 2022 Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Aug 26, 2022 What Do NLP Researchers Believe? Results of the NLP Community Metasurvey
Oct 19, 2022 Two-Turn Debate Doesn't Help Humans Answer Hard Reading Comprehension Questions
Jul 1, 2019 Natural Language Understanding with the Quora Question Pairs Dataset
Apr 15, 2021 Does Putting a Linguist in the Loop Improve NLU Data Collection?