Showing 1–20 of 22 results
/ Date/ Name
Mar 20, 2026CoverageBench: Evaluating Information Coverage across Tasks and DomainsFeb 4, 2026Uncertainty Quantification in LLM Agents: Foundations, Emerging Challenges, and OpportunitiesJul 7, 2025Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic CapabilitiesJan 24, 2025Humanity's Last ExamOct 8, 2024Multimodal Situational SafetySep 17, 2024Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language ModelsMay 29, 2024CHAOS VIII: Far-Ultraviolet Spectra of M101 and The Impact of Wolf-Rayet StarsMay 21, 2024Towards Responsible Development of Generative AI for Education: An Evaluation-Driven ApproachMar 18, 2024Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under CompressionFeb 28, 2024Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition DatasetMay 22, 2023"According to ...": Prompting Language Models Improves Quoting from Pre-Training DataDec 20, 2022When Do Decompositions Help for Machine Reading?Jun 2, 2022CLASSY II: A technical Overview of the COS Legacy Archive Spectroscopic SurveYMar 10, 2022Tau Neutrinos in the Next Decade: from GeV to EeVJul 22, 2020Teleportation Systems Towards a Quantum InternetAug 22, 2019Efficient Task-Specific Data Valuation for Nearest Neighbor AlgorithmsMar 29, 2019High-Energy Photon and Particle Effects onExoplanet Atmospheres and HabitabilityMar 11, 2019UV Diagnostics of Galaxies from the Peak of Star-Formation to the Epoch of ReionizationMar 17, 2018The Kepler Light Curves of AGN: A Detailed AnalysisFeb 19, 2016The Need for Laboratory Work to Aid in The Understanding of Exoplanetary Atmospheres