Showing 1–20 of 21 results
/ Date/ Name
Sep 23, 2022Multiple-Choice Question Generation: Towards an Automated Assessment FrameworkApr 16, 2024Question Difficulty Ranking for Multiple-Choice Reading ComprehensionSep 24, 2024Finetuning LLMs for Comparative Assessment TasksJul 3, 2023Analyzing Multiple-Choice Reading and Listening Comprehension TestsSep 22, 2023Is it Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language ModelsNov 13, 2022World Knowledge in Multiple Choice Reading ComprehensionFeb 10, 2023Tackling Bias in the Dice Similarity Coefficient: Introducing nDSC for White Matter Lesion SegmentationNov 8, 2023Assessing Distractors in Multiple-Choice TestsJul 9, 2021An Initial Investigation of Non-Native Spoken Question-AnsweringMay 20, 2024Question-Based Retrieval using Atomic Units for Enterprise RAGNov 9, 2022Novel structural-scale uncertainty measures and error retention curves: application to multiple sclerosisSep 29, 2025Probing the Limits of Stylistic Alignment in Vision-Language ModelsFeb 1, 2024An Information-Theoretic Approach to Analyze NLP Classification TasksMay 9, 2024Efficient LLM Comparative Assessment: a Product of Experts Framework for Pairwise ComparisonsOct 13, 2025Embedding the Teacher: Distilling vLLM Preferences for Scalable Image RetrievalFeb 13, 2025ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal ModelsNov 15, 2023Structural-Based Uncertainty in Deep Learning Across Anatomical Scales: Analysis in White Matter Lesion SegmentationJun 30, 2022Shifts 2.0: Extending The Dataset of Real Distributional ShiftsJun 22, 2023Analysis of the Cambridge Multiple-Choice Questions Reading Dataset with a Focus on Candidate Response DistributionJun 8, 2023CUED at ProbSum 2023: Hierarchical Ensemble of Summarization Models