arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Chatrik Singh Mangat"" — arXiv2 Search
Showing 1–7 of 7 results
/ Date
/ Name
May 17, 2025
Confirmation bias: A challenge for scalable oversight
Apr 8, 2025
From Stability to Inconsistency: A Study of Moral Preferences in LLMs
Mar 29, 2025
FindTheFlaws: Annotated Errors for Detecting Flawed Reasoning and Scalable Oversight Research
Sep 25, 2024
Characterizing stable regions in the residual stream of LLMs
Sep 23, 2024
Evaluating Synthetic Activations composed of SAE Latents in GPT-2
Dec 6, 2022
Low Mass X-ray Binary Simulation Data Release
Apr 4, 2022
Quasi-stationary sequences of hyper massive neutron stars with exotic equations of state