"au:"Huan Zhang"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Huan Zhang"" — arXiv2 Search

Showing 1–6 of 6 results

/ Date/ Name

Mar 20, 2025The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination Feb 18, 2025Rethinking Diverse Human Preference Learning through Principal Component Analysis Oct 29, 2024DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models Apr 12, 2024Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational data Mar 29, 2023Queer In AI: A Case Study in Community-Led Participatory AI Jun 15, 2020The Limit of the Batch Size