arXiv2
Search results for au:"Jebish Purbey" (arXiv2 Search)
Showing 1–4 of 4 results
Dec 3, 2025
Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia
Oct 28, 2025
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures
Apr 9, 2025
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation
Nov 29, 2024
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge