Showing 1–20 of 26 results
Date / Name
May 21, 2021 / Towards Realization of Augmented Intelligence in Dermatology: Advances and Future Directions
Feb 1, 2023 / SkinCon: A skin disease dataset densely annotated by domain experts for fine-grained model debugging and analysis
Mar 15, 2022 / Disparities in Dermatology AI Performance on a Diverse, Curated Clinical Image Set
Nov 15, 2021 / Disparities in Dermatology AI: Assessments Using Diverse Clinical Images
Jul 8, 2025 / A Systematic Analysis of Declining Medical Safety Messaging in Generative AI Models
Sep 13, 2023 / Towards Reliable Dermatology Evaluation Benchmarks
Feb 12, 2025 / SycEval: Evaluating LLM Sycophancy
Jan 16, 2024 / RIDGE: Reproducibility, Integrity, Dependability, Generalizability, and Efficiency Assessment of Medical Image Segmentation Models
Mar 4, 2025 / BiasICL: In-Context Learning and Demographic Biases of Vision Language Models
Jul 3, 2025 / MedVAL: Toward Expert-Level Medical Text Validation with Language Models
Apr 24, 2024 / Assessing The Potential Of Mid-Sized Language Models For Clinical QA
Sep 12, 2022 / Development and Clinical Evaluation of an AI Support Tool for Improving Telemedicine Photo Quality
Dec 2, 2024 / Best Practices for Large Language Models in Radiology
Feb 4, 2026 / Visual concept ranking uncovers medical shortcuts used by large multimodal models
Jul 6, 2022 / Towards Transparency in Dermatology Image Datasets with Skin Tone Annotations by Experts, Crowds, and an Algorithm
Mar 27, 2024 / BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
Aug 23, 2023 / Augmenting medical image classifiers with synthetic data from latent diffusion models
May 26, 2025 / MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
Jul 14, 2021 / Do Humans Trust Advice More if it Comes from AI? An Analysis of Human-AI Interactions
Dec 14, 2025 / Explainable AI as a Double-Edged Sword in Dermatology: The Impact on Clinicians versus The Public