Showing 641–660 of 1,726 results
Date          Name
Jan 14, 2025  MiniMax-01: Scaling Foundation Models with Lightning Attention
Jan 7, 2025   KG-TRICK: Unifying Textual and Relational Information Completion of Knowledge for Multilingual Knowledge Graphs
Jan 6, 2025   Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation
Jan 5, 2025   Multi-LLM Collaborative Caption Generation in Scientific Documents
Jan 5, 2025   ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
Jan 2, 2025   CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
Dec 31, 2024  MAIN-RAG: Multi-Agent Filtering Retrieval-Augmented Generation
Dec 30, 2024  Exploring and Controlling Diversity in LLM-Agent Conversation
Dec 30, 2024  ACL-rlg: A Dataset for Reading List Generation
Dec 27, 2024  TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data
Dec 27, 2024  DeepSeek-V3 Technical Report
Dec 25, 2024  AdaEAGLE: Optimizing Speculative Decoding via Explicit Modeling of Adaptive Draft Structures
Dec 23, 2024  A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression
Dec 23, 2024  Diving into Self-Evolving Training for Multimodal Reasoning
Dec 23, 2024  Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding
Dec 19, 2024  PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children
Dec 17, 2024  FactEHR: A Dataset for Evaluating Factuality in Clinical Notes Using LLMs
Dec 16, 2024  Transparent and Coherent Procedural Mistake Detection
Dec 5, 2024   Reducing Tool Hallucination via Reliability Alignment
Dec 4, 2024   RedStone: Curating General, Code, Math, and QA Data for Large Language Models