Showing 1–20 of 37 results
/ Date/ Name
Apr 30, 2025Characterizing AI Agents for Alignment and GovernanceSep 22, 2025AI, Digital Platforms, and the New Systemic RiskSep 1, 2022In conversation with Artificial Intelligence: aligning language models with human valuesApr 21, 2023ChatGPT, Large Language Technologies, and the Bumpy Road of Benefiting HumanityJul 8, 2023Typology of Risks of Generative Text-to-Image ModelsFeb 9, 2021The Use and Misuse of Counterfactuals in Ethical Machine LearningFeb 13, 2025AI Safety for EveryoneOct 30, 2019Mathematical decisions and non-causal elements of explainable AIJun 2, 2023Reconciling Governmental Use of Online Targeting With DemocracyJun 2, 2022Algorithmic Fairness and Structural Injustice: Insights from Feminist Political PhilosophyMar 1, 2021Reasons, Values, Stakeholders: A Philosophical Framework for Explainable Artificial IntelligenceSep 13, 2021Fairness and Data Protection Impact AssessmentsJan 15, 2024Two Types of AI Existential Risk: Decisive and AccumulativeOct 1, 2024Measurement challenges in AI catastrophic risk governance and safety frameworksFeb 9, 2024Discipline and Label: A WEIRD Genealogy and Social Theory of Data AnnotationMay 22, 2024CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language ModelsNov 14, 2024Democratic AI is Possible. The Democracy Levels Framework Shows How It Might WorkJan 7, 2026Legal Alignment for Safe and Ethical AISep 9, 2021User Tampering in Reinforcement Learning Recommender SystemsMar 15, 2026Bridging the Gap in the Responsible AI Divides