"au:"D. Bak"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"D. Bak"" — arXiv2 Search

Showing 1–13 of 13 results

/ Date/ Name

Oct 10, 2024The Geometry of Concepts: Sparse Autoencoder Feature Structure Feb 8, 2024GenEFT: Understanding Statics and Dynamics of Model Generalization via Effective Theory Apr 25, 2025Scaling Laws For Scalable Oversight Oct 10, 2024Investigating Representation Universality: Case Study on Genealogical Representations Mar 5, 2025Towards Understanding Distilled Reasoning Models: A Representational Approach Feb 3, 2025Harmonic Loss Trains Interpretable AI Models May 22, 1995Area-preserving Structure and Anomalies in 1+1-dimensional Quantum Gravity Jun 30, 1994Schroedinger Self-adjoint Extension and Quantum Field Theory Nov 5, 1996String-Inspired Gravity Coupled to Yang-Mills Fields Jun 19, 1995Area-Preserving Structure of Massless Matter-Gravity Fields in 1+1 Dimensions Dec 8, 2022Gate Error Analysis of Tunable Coupling Architecture in the Large-scale Superconducting Quantum System Feb 26, 2026A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring Oct 20, 2025Any-Depth Alignment: Unlocking Innate Safety Alignment of LLMs to Any-Depth