arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Huan Zhang"" — arXiv2 Search
Showing 1–6 of 6 results
/ Date
/ Name
Mar 20, 2025
The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination
Feb 18, 2025
Rethinking Diverse Human Preference Learning through Principal Component Analysis
Oct 29, 2024
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models
Apr 12, 2024
Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational data
Mar 29, 2023
Queer In AI: A Case Study in Community-Led Participatory AI
Jun 15, 2020
The Limit of the Batch Size