Showing 681–700 of 1,726 results
/ Date/ Name
Nov 1, 2024Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLMOct 31, 2024Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language UseOct 29, 2024DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language ModelsOct 28, 2024MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW CompressionOct 28, 2024LongReward: Improving Long-context Large Language Models with AI FeedbackOct 28, 2024Causal Interventions on Causal Paths: Mapping GPT-2's Reasoning From Syntax to SemanticsOct 28, 2024Transferable Post-training via Inverse Value LearningOct 27, 2024Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans WorseOct 27, 2024Guiding Through Complexity: What Makes Good Supervision for Hard Math Reasoning Tasks?Oct 26, 2024Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual SummarizationOct 26, 2024Vulnerability of LLMs to Vertically Aligned Text ManipulationsOct 25, 2024GPT-4o System CardOct 24, 2024Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction DataOct 24, 2024Decoding on Graphs: Faithful and Sound Reasoning on Knowledge Graphs through Generation of Well-Formed ChainsOct 23, 2024Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning AttacksOct 21, 2024R2Gen-Mamba: A Selective State Space Model for Radiology Report GenerationOct 21, 2024CausalGraph2LLM: Evaluating LLMs for Causal QueriesOct 21, 2024Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions FollowingOct 18, 2024Diverging Preferences: When do Annotators Disagree and do Models Know?Oct 17, 2024MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficient Mobile Task Automation