Date | Name
May 24, 2023 | Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models
Jun 29, 2025 | SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions
Jan 1, 2025 | AutoPresent: Designing Structured Visuals from Scratch
Aug 11, 2025 | 1-2-3 Check: Enhancing Contextual Privacy in LLM via Multi-Agent Reasoning
Jul 8, 2025 | OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety
Apr 13, 2026 | GoodPoint: Learning Constructive Scientific Paper Feedback from Author Responses
Nov 22, 2019 | Parallel Distributed Logistic Regression for Vertical Federated Learning without Third-Party Coordinator
Jul 20, 2024 | Consent in Crisis: The Rapid Decline of the AI Data Commons
Apr 15, 2025 | Rethinking Theory of Mind Benchmarks for LLMs: Towards A User-Centered Perspective
Sep 1, 2024 | User-Driven Value Alignment: Understanding Users' Perceptions and Strategies for Addressing Biased and Discriminatory Statements in AI Companions
Oct 24, 2023 | FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Dec 19, 2024 | Bridging the Data Provenance Gap Across Text, Speech and Video
Feb 18, 2025 | Ambig-SWE: Interactive Agents to Overcome Underspecificity in Software Engineering
Nov 5, 2025 | The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents
Nov 4, 2025 | Training Proactive and Personalized LLM Agents
Oct 21, 2024 | BIG5-CHAT: Shaping LLM Personalities Through Training on Human-Grounded Data
Sep 22, 2025 | The PIMMUR Principles: Ensuring Validity in Collective Behavior of LLM Societies
Apr 17, 2026 | Imperfectly Cooperative Human-AI Interactions: Comparing the Impacts of Human and AI Attributes in Simulated and User Studies
Oct 20, 2020 | Learning nonlocal constitutive models with neural networks
May 1, 2023 | Inference of relative permeability curves in reservoir rocks with ensemble Kalman method