Showing 521–540 of 1,726 results
| Date | Name |
| --- | --- |
| May 23, 2025 | Chart-to-Experience: Benchmarking Multimodal LLMs for Predicting Experiential Impact of Charts |
| May 22, 2025 | UNCLE: Benchmarking Uncertainty Expressions in Long-Form Generation |
| May 22, 2025 | Walk&Retrieve: Simple Yet Effective Zero-shot Retrieval-Augmented Generation via Knowledge Graph Walks |
| May 22, 2025 | SAE-SSV: Supervised Steering in Sparse Representation Spaces for Reliable Control of Language Models |
| May 21, 2025 | Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval |
| May 21, 2025 | Comparative Evaluation of Prompting and Fine-Tuning for Applying Large Language Models to Grid-Structured Geospatial Data |
| May 21, 2025 | Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought |
| May 21, 2025 | Language Specific Knowledge: Do Models Know Better in X than in English? |
| May 20, 2025 | Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training |
| May 20, 2025 | KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation |
| May 20, 2025 | Scale-invariant Attention |
| May 20, 2025 | FMSD-TTS: Few-shot Multi-Speaker Multi-Dialect Text-to-Speech Synthesis for Ü-Tsang, Amdo and Kham Speech Dataset Generation |
| May 20, 2025 | GemMaroc: Unlocking Darija Proficiency in LLMs with Minimal Data |
| May 20, 2025 | Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models |
| May 19, 2025 | RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning |
| May 19, 2025 | MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix |
| May 18, 2025 | Logic Jailbreak: Efficiently Unlocking LLM Safety Restrictions Through Formal Logical Expression |
| May 16, 2025 | Stepwise Guided Policy Optimization: Coloring your Incorrect Reasoning in GRPO |
| May 15, 2025 | GeoGrid-Bench: Can Foundation Models Understand Multimodal Gridded Geo-Spatial Data? |
| May 15, 2025 | WorldPM: Scaling Human Preference Modeling |