Showing 21–37 of 37 results
Date | Name
Dec 8, 2021 | Ethical and social risks of harm from Language Models
Mar 31, 2024 | A Review of Modern Recommender Systems Using Generative Models (Gen-RecSys)
Mar 22, 2024 | Explanation Hacking: The perils of algorithmic recourse
Oct 17, 2024 | Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models
Apr 15, 2024 | Foundational Challenges in Assuring Alignment and Safety of Large Language Models
Sep 29, 2025 | Generative Value Conflicts Reveal LLM Priorities
Sep 5, 2024 | Beyond Model Interpretability: Socio-Structural Explanations in Machine Learning
Aug 21, 2024 | Epistemic Injustice in Generative AI
Aug 15, 2024 | The Future of Open Human Feedback
Oct 7, 2025 | EVALUESTEER: Measuring Reward Model Steerability Towards Values and Preferences
Feb 19, 2025 | Multi-Agent Risks from Advanced AI
Feb 24, 2026 | International AI Safety Report 2026
Dec 3, 2025 | Full-Stack Alignment: Co-Aligning AI and Institutions with Thick Models of Value
Nov 24, 2024 | A Taxonomy of Systemic Risks from General-Purpose AI
Dec 20, 2024 | The Only Way is Ethics: A Guide to Ethical Research with Large Language Models
Sep 10, 2025 | The More You Automate, the Less You See: Hidden Pitfalls of AI Scientist Systems
Aug 10, 2025 | Position: Beyond Sensitive Attributes, ML Fairness Should Quantify Structural Injustice via Social Determinants