au:"Sid Black" - arXiv2 Search
Showing 1–8 of 8 results
Dec 8, 2025
Auditing Games for Sandbagging
Jul 18, 2025
The Levers of Political Persuasion with Conversational AI
Dec 31, 2020
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Apr 14, 2022
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Apr 21, 2025
RepliBench: Evaluating the Autonomous Replication Capabilities of Language Model Agents
Nov 22, 2022
Interpreting Neural Networks through the Polytope Lens
Apr 9, 2026
More Capable, Less Cooperative? When LLMs Fail At Zero-Cost Collaboration
Dec 31, 2025
Do Large Language Models Know What They Are Capable Of?