Showing 1–20 of 20 results
/ Date/ Name
Mar 3, 2026Architecting Trust in Artificial Epistemic AgentsFeb 12, 2026Intelligent AI DelegationDec 18, 2025Distributional AGI SafetyDec 3, 2025Full-Stack Alignment: Co-Aligning AI and Institutions with Thick Models of ValueSep 12, 2025Virtual Agent EconomiesJun 20, 2025Resource Rational Contractualism Should Guide AI AlignmentFeb 19, 2025Multi-Agent Risks from Advanced AIFeb 1, 2025Defense Against the Dark Prompts: Mitigating Best-of-N Jailbreaking with Prompt EvaluationJan 30, 2025Model-Free RL Agents Demonstrate System 1-Like IntentionalityJan 29, 2025AI Governance through MarketsAug 30, 2024Beyond Preferences in AI AlignmentApr 24, 2024The Ethics of Advanced AI AssistantsApr 23, 2024A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AIAug 31, 2023Science Communications for Explainable Artificial IntelligenceAug 30, 2023Strengthening the EU AI Act: Defining Key Terms on AI ManipulationJun 19, 2023Concept Extrapolation: A Conceptual PrimerOct 5, 2022The Influence of Explainable Artificial Intelligence: Nudging Behaviour or Boosting Capability?Sep 14, 2022Solutions to preference manipulation in recommender systems require knowledge of meta-preferencesJun 21, 2022Preference Change in Persuasive RoboticsMar 20, 2022Recognising the importance of preference change: A call for a coordinated multidisciplinary research effort in the age of AI