Showing 661–680 of 1,726 results
/ Date/ Name
Dec 3, 2024Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining DatasetDec 3, 2024Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine TranslationDec 2, 2024Mastering Board Games by External and Internal Planning with Language ModelsDec 2, 2024Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual TranslationNov 29, 2024INCLUDE: Evaluating Multilingual Language Understanding with Regional KnowledgeNov 27, 2024VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction FormatNov 26, 2024Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training TokensNov 25, 2024Self-Generated Critiques Boost Reward Modeling for Language ModelsNov 25, 2024What can LLM tell us about cities?Nov 22, 2024Tulu 3: Pushing Frontiers in Open Language Model Post-TrainingNov 20, 2024Disentangling Memory and Reasoning Ability in Large Language ModelsNov 15, 2024Measuring Non-Adversarial Reproduction of Training Data in Large Language ModelsNov 11, 2024HierTOD: A Task-Oriented Dialogue System Driven by Hierarchical GoalsNov 11, 2024Building a Taiwanese Mandarin Spoken Language Model: A First AttemptNov 9, 2024Target-driven Attack for Large Language ModelsNov 6, 2024Evaluation data contamination in LLMs: how do we measure it and (when) does it matter?Nov 6, 2024Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way ForwardNov 5, 2024Self-Compositional Data Augmentation for Scientific Keyphrase GenerationNov 4, 2024Sparsing Law: Towards Large Language Models with Greater Activation SparsityNov 4, 2024QCG-Rerank: Chunks Graph Rerank with Query Expansion in Retrieval-Augmented LLMs for Tourism Domain