Showing 1–20 of 22 results
/ Date/ Name
Nov 4, 2024Imagining and building wise machines: The centrality of AI metacognitionMay 6, 2025A Communication-First Account of ExplanationJun 21, 2023Understanding Social Reasoning in Language Models with Language ModelsJul 16, 2025Modeling Open-World Cognition as On-Demand Synthesis of Probabilistic ModelsJun 22, 2024To Err is Robotic: Rapid Value-Based Trial-and-Error during DeploymentMar 28, 2024STaR-GATE: Teaching Language Models to Ask Clarifying QuestionsOct 30, 2023MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment TasksOct 31, 2025Spot The Ball: A Benchmark for Visual Social InferenceMay 28, 2025Causal-PIK: Causality-based Physical Reasoning with a Physics-Informed KernelApr 17, 2024Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language ModelsDec 13, 2022Explanations Can Reduce Overreliance on AI Systems During Decision-MakingJul 14, 2021Do Humans Trust Advice More if it Comes from AI? An Analysis of Human-AI InteractionsSep 18, 2024Human-like Affective Cognition in Foundation ModelsOct 2, 2024MARPLE: A Benchmark for Long-Horizon InferenceApr 22, 2024Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference LabelsOct 26, 2023Social Contract AI: Aligning AI Assistants with Implicit Group NormsMay 11, 2019Explaining intuitive difficulty judgments by modeling physical effort and riskFeb 12, 2022Uncalibrated Models Can Improve Human-AI CollaborationJul 25, 2017Physical problem solving: Joint planning with symbolic, geometric, and dynamic constraintsJun 9, 2022Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models