/ Date/ Name

Computation and Language

cs.CL

/ Date/ Name

/ Date/ Name

Showing 561–580 of 1,726 results

/ Date/ Name

Apr 12, 2025REMEMBER: Retrieval-based Explainable Multimodal Evidence-guided Modeling for Brain Evaluation and Reasoning in Zero- and Few-shot Neurodegenerative Diagnosis Apr 12, 2025BioChemInsight: An Online Platform for Automated Extraction of Chemical Structures and Activity Data from Patents Apr 11, 2025Evaluation and Incident Prevention in an Enterprise AI Assistant Apr 10, 2025Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning Apr 10, 2025Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Apr 10, 2025Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation Apr 9, 2025R2E-Gym: Procedural Environments and Hybrid Verifiers for Scaling Open-Weights SWE Agents Apr 9, 2025Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Apr 8, 2025Generative Framework for Personalized Persuasion: Inferring Causal, Counterfactual, and Latent Knowledge Apr 4, 2025YaleNLP @ PerAnsSumm 2025: Multi-Perspective Integration via Mixture-of-Agents for Enhanced Healthcare QA Summarization Apr 4, 2025AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset Apr 2, 2025On the Role of Feedback in Test-Time Scaling of Agentic AI Workflows Apr 2, 2025DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning Apr 1, 2025WikiVideo: Article Generation from Multiple Videos Mar 29, 2025Efficient Adaptation For Remote Sensing Visual Grounding Mar 29, 2025FindTheFlaws: Annotated Errors for Detecting Flawed Reasoning and Scalable Oversight Research Mar 25, 2025Gemma 3 Technical Report Mar 24, 2025Language Model Uncertainty Quantification with Attention Chain Mar 23, 2025STShield: Single-Token Sentinel for Real-Time Jailbreak Detection in Large Language Models Mar 21, 2025Summarization Metrics for Spanish and Basque: Do Automatic Scores and LLM-Judges Correlate with Humans?

← Previous Next →