Date / Name
Jun 20, 2023 / Evaluating the Zero-shot Robustness of Instruction-tuned Language Models
Feb 5, 2024 / Evaluating the Factuality of Zero-shot Summarizers Across Varied Domains
Oct 22, 2022 / PHEE: A Dataset for Pharmacovigilance Event Extraction from Text
Apr 5, 2019 / An Analysis of Attention over Clinical Notes for Predictive Tasks
May 19, 2019 / Predicting Annotation Difficulty to Improve Task Routing and Model Performance for Biomedical Information Extraction
Oct 22, 2020 / Unsupervised Data Augmentation with Naive Augmentation and without Unlabeled Data
Feb 26, 2019 / Attention is not Explanation
Oct 7, 2020 / Understanding Clinical Trial Reports: Extracting Medical Entities and Their Relations
Jun 17, 2021 / Biomedical Interpretable Entity Representations
Apr 13, 2021 / On the Impact of Random Seeds on the Fairness of Clinical Classifiers
Sep 5, 2016 / Crowdsourcing Information Extraction for Biomedical Systematic Reviews
Jan 29, 2024 / InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification
Jun 28, 2024 / Detection and Measurement of Syntactic Templates in Generated Text
Oct 30, 2024 / Don't Pay Attention, PLANT It: Pretraining Attention via Learning-to-Rank
Sep 16, 2025 / Do Natural Language Descriptions of Model Activations Convey Privileged Information?
Sep 25, 2025 / Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in Language Models
Mar 1, 2024 / Standardizing the Measurement of Text Diversity: A Tool and a Comparative Analysis of Scores
Oct 31, 2025 / Can SAEs reveal and mitigate racial biases of LLMs in healthcare?
Jan 17, 2026 / Faithfulness vs. Safety: Evaluating LLM Behavior Under Counterfactual Medical Evidence
May 23, 2023 / Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations