Showing 1–15 of 15 results
/ Date/ Name
Sep 27, 2025From Conversation to Query Execution: Benchmarking User and Tool Interactions for EHR Database AgentsMar 23, 2024TrustSQL: Benchmarking Text-to-SQL Reliability with Penalty-Based ScoringJan 16, 2023EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health RecordsMay 4, 2024Overview of the EHRSQL 2024 Shared Task on Reliable Text-to-SQL Modeling on Electronic Health RecordsNov 13, 2025SCARE: A Benchmark for SQL Correction and Question Answerability Classification for Reliable EHR Question AnsweringMay 12, 2021Improving Lexically Constrained Neural Machine Translation with Source-Conditioned Masked Span PredictionSep 12, 2025FHIR-AgentBench: Benchmarking LLM Agents for Realistic Interoperable EHR Question AnsweringJun 21, 2023ECG-QA: A Comprehensive Question Answering Dataset Combined With ElectrocardiogramMay 23, 2024EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health RecordsApr 29, 2024Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQLMar 6, 2020Diverse and Admissible Trajectory Forecasting through Multimodal Context UnderstandingDec 1, 2021Exploration into Translation-Equivariant Image QuantizationJun 24, 2024EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health RecordsOct 28, 2023EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray ImagesSep 1, 2023Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes