Showing 1–20 of 22 results
/ Date/ Name
May 19, 2022Who Goes First? Influences of Human-AI Workflow on Decision Making in Clinical ImagingJun 21, 2021maars: Tidy Inference under the 'Models as Approximations' Framework in RMar 30, 2020Fairness Evaluation in Presence of Biased Noisy LabelsJun 11, 2024A Framework for Efficient Model Evaluation through Stratification, Sampling, and EstimationMay 11, 2021On the Validity of Arrest as a Proxy for Offense: Race and the Likelihood of Arrest for Violent CrimesSep 3, 2021The Impact of Algorithmic Risk Assessments on Human Predictions and its Analysis via Crowdsourcing StudiesApr 6, 2024Multicalibration for Confidence Scoring in LLMsOct 7, 2024Precise Model Benchmarking with Only a Few ObservationsJul 29, 2025Persona-Augmented Benchmarking: Evaluating LLMs Across Diverse Writing StylesOct 11, 2023Estimating the Likelihood of Arrest from Police Records in Presence of Unreported CrimesFeb 4, 2020TRAP: A Predictive Framework for Trail Running Assessment of PerformanceJun 1, 2023Confidence Intervals for Error Rates in 1:1 Matching Tasks: Critical Statistical Analysis and RecommendationsFeb 19, 2020A Case for Humans-in-the-Loop: Decisions in the Presence of Erroneous Algorithmic ScoresMay 19, 2022Homophily and Incentive Effects in Use of AlgorithmsNov 15, 2020Uncertainty as a Form of Transparency: Measuring, Communicating, and Using UncertaintyMar 22, 2022Racial Disparities in the Enforcement of Marijuana Violations in the USMay 12, 2023The Progression of Disparities within the Criminal Justice System: Differential Enforcement and Risk Assessment InstrumentsApr 6, 2026Justified or Just Convincing? Error Verifiability as a Dimension of LLM QualityDec 5, 2024Improving LLM Group Fairness on Tabular Data via In-Context LearningAug 8, 2025Play Favorites: A Statistical Method to Measure Self-Bias in LLM-as-a-Judge