"au:"Max Hasin"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Max Hasin"" — arXiv2 Search

Showing 1–6 of 6 results

/ Date/ Name

Dec 18, 2023Evaluating Language-Model Agents on Realistic Autonomous Tasks Mar 21, 2025HCAST: Human-Calibrated Autonomy Software Tasks Sep 2, 2022AutoPET Challenge: Combining nn-Unet with Swin UNETR Augmented by Maximum Intensity Projection Classifier Feb 21, 2026The science and practice of proportionality in AI risk evaluations Apr 8, 2024Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding Mar 18, 2025Measuring AI Ability to Complete Long Software Tasks