Showing 1–20 of 53 results
/ Date/ Name
Sep 24, 2024HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI InteractionsOct 3, 2020Multilevel Text Alignment with Cross-Document AttentionMar 8, 2024Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMsAug 30, 2025Social World ModelsOct 24, 2025TOM-SWE: User Mental Modeling For Software Engineering AgentsJul 14, 2022Learning to translate by learning to communicateJun 3, 2023COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive StatementsNov 15, 2021Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language DetectionMay 16, 2020RPD: A Distance Function Between Word EmbeddingsOct 18, 2023SOTOPIA: Interactive Evaluation for Social Intelligence in Language AgentsJan 29, 2021Challenges in Automated Debiasing for Toxic Language DetectionNov 27, 2019Evaluating Commonsense in Pre-trained Language ModelsApr 19, 2025SOTOPIA-S4: a user-friendly system for flexible, customizable, and large-scale social simulationMar 11, 2026Mind the Sim2Real Gap in User Simulation for Agentic TasksFeb 16, 2022A PDE-free, neural network-based eddy viscosity model coupled with RANS equationsJan 4, 2022An equivariant neural operator for developing nonlocal tensorial constitutive modelsOct 16, 2020Linguistically-Informed Transformations (LIT): A Method for Automatically Generating Contrast SetsJul 25, 2023WebArena: A Realistic Web Environment for Building Autonomous AgentsMay 15, 2024PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language ModelsOct 27, 2023Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory