"au:"Timothy Baldwin"" — arXiv2 Search
Showing 1–6 of 6 results
Apr 24, 2026
Superminds Test: Actively Evaluating Collective Intelligence of Agent Society via Probing Agents
Oct 14, 2025
On the Interplay between Human Label Variation and Model Fairness
Feb 3, 2025
Training and Evaluating with Human Label Variation: An Empirical Study
Aug 5, 2024
To Aggregate or Not to Aggregate. That is the Question: A Case Study on Annotation Subjectivity in Span Prediction
Aug 30, 2023
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Mar 18, 2021
Evaluating Document Coherence Modelling