Date          Name
Jul 9, 2020   Concept Bottleneck Models
Oct 9, 2025   Neologism Learning for Controllability and Self-Verbalization
Mar 28, 2025  QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?
Dec 9, 2024   Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty
Jul 16, 2019  Explaining Classifiers with Causal Concept Effect (CaCE)
Sep 19, 2025  How many classes do we need to see for novel class discovery?
Jul 1, 2025   Escaping Plato's Cave: JAM for Aligning Independently Trained Vision and Language Models
May 30, 2018  To Trust Or Not To Trust A Classifier
Sep 21, 2023  State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding
May 29, 2023  Gaussian Process Probes (GPP) for Uncertainty-Aware Probing
Feb 25, 2022  Human-Centered Concept Explanations for Neural Networks
Jun 17, 2022  Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis
Jun 22, 2018  xGEMs: Generating Examplars to Explain Black-Box Models
Nov 17, 2021  Acquisition of Chess Knowledge in AlphaZero
Jul 8, 2016   Proceedings of the 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016)
Oct 18, 2023  Getting aligned on representational alignment
Mar 1, 2022   Advanced Methods for Connectome-Based Predictive Modeling of Human Intelligence: A Novel Approach Based on Individual Differences in Cortical Topography
Jun 12, 2017  SmoothGrad: removing noise by adding noise
Nov 10, 2020  Debugging Tests for Model Explanations
May 31, 2021  DISSECT: Disentangled Simultaneous Explanations via Concept Traversals