arXiv2
"au:"Max Hasin"" — arXiv2 Search
Showing 1–6 of 6 results
Dec 18, 2023
Evaluating Language-Model Agents on Realistic Autonomous Tasks
Mar 21, 2025
HCAST: Human-Calibrated Autonomy Software Tasks
Sep 2, 2022
AutoPET Challenge: Combining nn-Unet with Swin UNETR Augmented by Maximum Intensity Projection Classifier
Feb 21, 2026
The science and practice of proportionality in AI risk evaluations
Apr 8, 2024
Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding
Mar 18, 2025
Measuring AI Ability to Complete Long Software Tasks