/ Date/ Name

Computation and Language

cs.CL

/ Date/ Name

/ Date/ Name

Showing 361–380 of 1,726 results

/ Date/ Name

Nov 11, 2025Automatic Paper Reviewing with Heterogeneous Graph Reasoning over LLM-Simulated Reviewer-Author Debates Nov 10, 2025NiuTrans.LMT: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs Nov 9, 2025How Well Do LLMs Understand Drug Mechanisms? A Knowledge + Reasoning Evaluation Dataset Nov 5, 2025PLLuM: A Family of Polish Large Language Models Nov 4, 2025LTD-Bench: Evaluating Large Language Models by Letting Them Draw Nov 1, 2025PADBen: A Comprehensive Benchmark for Evaluating AI Text Detectors Against Paraphrase Attacks Oct 31, 2025Atlas-Alignment: Making Interpretability Transferable Across Language Models Oct 31, 2025Identifying the Periodicity of Information in Natural Language Oct 30, 2025Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark Oct 30, 2025ORBIT -- Open Recommendation Benchmark for Reproducible Research with Hidden Tests Oct 29, 2025NeuronMLP: Efficient LLM Inference via Singular Value Decomposition Compression and Tiling on AWS Trainium Oct 29, 2025The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution Oct 28, 2025Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Oct 28, 2025GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research Oct 27, 2025Your LLM Agents are Temporally Blind: The Misalignment Between Tool Use Decisions and Human Time Perception Oct 27, 2025PTPP-Aware Adaptation Scaling Laws: Predicting Domain-Adaptation Performance at Unseen Pre-Training Budgets Oct 26, 2025MMPersuade: A Dataset and Evaluation Framework for Multimodal Persuasion Oct 24, 2025The Universal Landscape of Human Reasoning Oct 24, 2025When Models Outthink Their Safety: Unveiling and Mitigating Self-Jailbreak in Large Reasoning Models Oct 23, 2025Why Did Apple Fall: Evaluating Curiosity in Large Language Models

← Previous Next →