"au:"Tobias Gerstenberg"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Tobias Gerstenberg"" — arXiv2 Search

Showing 1–20 of 22 results

/ Date/ Name

Nov 4, 2024Imagining and building wise machines: The centrality of AI metacognition May 6, 2025A Communication-First Account of Explanation Jun 21, 2023Understanding Social Reasoning in Language Models with Language Models Jul 16, 2025Modeling Open-World Cognition as On-Demand Synthesis of Probabilistic Models Jun 22, 2024To Err is Robotic: Rapid Value-Based Trial-and-Error during Deployment Mar 28, 2024STaR-GATE: Teaching Language Models to Ask Clarifying Questions Oct 30, 2023MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks Oct 31, 2025Spot The Ball: A Benchmark for Visual Social Inference May 28, 2025Causal-PIK: Causality-based Physical Reasoning with a Physics-Informed Kernel Apr 17, 2024Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models Dec 13, 2022Explanations Can Reduce Overreliance on AI Systems During Decision-Making Jul 14, 2021Do Humans Trust Advice More if it Comes from AI? An Analysis of Human-AI Interactions Sep 18, 2024Human-like Affective Cognition in Foundation Models Oct 2, 2024MARPLE: A Benchmark for Long-Horizon Inference Apr 22, 2024Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels Oct 26, 2023Social Contract AI: Aligning AI Assistants with Implicit Group Norms May 11, 2019Explaining intuitive difficulty judgments by modeling physical effort and risk Feb 12, 2022Uncalibrated Models Can Improve Human-AI Collaboration Jul 25, 2017Physical problem solving: Joint planning with symbolic, geometric, and dynamic constraints Jun 9, 2022Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models