Showing 441–460 of 3,402 results
/ Date/ Name
Oct 28, 2025Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and CulturesOct 27, 2025Multi-Agent Evolve: LLM Self-Improve through Co-evolutionOct 27, 2025First detection of ultra-high energy emission from gamma-ray binary LS I +61 303Oct 26, 2025MMPersuade: A Dataset and Evaluation Framework for Multimodal PersuasionOct 26, 2025IGGT: Instance-Grounded Geometry Transformer for Semantic 3D ReconstructionOct 24, 2025The Universal Landscape of Human ReasoningOct 24, 2025Constraints on ultraheavy dark matter from the CDEX-10 experiment at the China Jinping Underground LaboratoryOct 24, 2025When Models Outthink Their Safety: Unveiling and Mitigating Self-Jailbreak in Large Reasoning ModelsOct 23, 2025Collective Communication for 100k+ GPUsOct 22, 2025Point-contact Andreev reflection spectroscopy of layered superconductors with device-integrated diamond anvil cellsOct 22, 2025Joint neutrino oscillation analysis from the T2K and NOvA experimentsOct 22, 2025Quantum computation of molecular geometry via many-body nuclear spin echoesOct 22, 2025Understanding the Implicit Biases of Design Choices for Time Series Foundation ModelsOct 21, 2025Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMsOct 21, 2025MoMaGen: Generating Demonstrations under Soft and Hard Constraints for Multi-Step Bimanual Mobile ManipulationOct 20, 2025What Makes AI Research Replicable? Executable Knowledge Graphs as Scientific Knowledge RepresentationsOct 20, 2025ZSPAPrune: Zero-Shot Prompt-Aware Token Pruning for Vision-Language ModelsOct 18, 2025Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic RewardsOct 18, 2025Automated Composition of Agents: A Knapsack Approach for Agentic Component SelectionOct 17, 2025Aria Gen 2 Pilot Dataset