"au:"Been Kim"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Been Kim"" — arXiv2 Search

Showing 21–40 of 60 results

/ Date/ Name

Jul 9, 2020Concept Bottleneck Models Oct 9, 2025Neologism Learning for Controllability and Self-Verbalization Mar 28, 2025QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?Dec 9, 2024Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty Jul 16, 2019Explaining Classifiers with Causal Concept Effect (CaCE)Sep 19, 2025How many classes do we need to see for novel class discovery?Jul 1, 2025Escaping Plato's Cave: JAM for Aligning Independently Trained Vision and Language Models May 30, 2018To Trust Or Not To Trust A Classifier Sep 21, 2023State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding May 29, 2023Gaussian Process Probes (GPP) for Uncertainty-Aware Probing Feb 25, 2022Human-Centered Concept Explanations for Neural Networks Jun 17, 2022Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis Jun 22, 2018xGEMs: Generating Examplars to Explain Black-Box Models Nov 17, 2021Acquisition of Chess Knowledge in AlphaZero Jul 8, 2016Proceedings of the 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016)Oct 18, 2023Getting aligned on representational alignment Mar 1, 2022Advanced Methods for Connectome-Based Predictive Modeling of Human Intelligence: A Novel Approach Based on Individual Differences in Cortical Topography Jun 12, 2017SmoothGrad: removing noise by adding noise Nov 10, 2020Debugging Tests for Model Explanations May 31, 2021DISSECT: Disentangled Simultaneous Explanations via Concept Traversals

← Previous Next →