Showing 1–20 of 21 results
/ Date/ Name
Dec 8, 2024KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language ModelsJun 1, 2024A Survey on Large Language Models for Code GenerationApr 2, 2026FourierMoE: Fourier Mixture-of-Experts Adaptation of Large Language ModelsMar 6, 2026ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement LearningMay 18, 2022AdaMCT: Adaptive Mixture of CNN-Transformer for Sequential RecommendationDec 13, 2021Improving Sequential Recommendations via Bidirectional Temporal Data Augmentation with Pre-trainingAug 5, 2022Enhancing the Robustness via Adversarial Learning and Joint Spatial-Temporal Embeddings in Traffic ForecastingSep 15, 2020Cascaded Semantic and Positional Self-Attention Network for Document ClassificationAug 24, 2024LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMsFeb 17, 2026TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language ModelsApr 22, 2026WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement LearningMar 3, 2022Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed DataApr 20, 2026CodePivot: Bootstrapping Multilingual Transpilation in LLMs via Reinforcement Learning without Parallel CorporaJun 26, 2024A Survey on Mixture of Experts in Large Language ModelsFeb 9, 2026LLaDA2.1: Speeding Up Text Diffusion via Token EditingApr 29, 2025OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System VerificationMay 18, 2023Feature-Balanced Loss for Long-Tailed Visual RecognitionApr 2, 2024HyperCLOVA X Technical ReportApr 7, 2024Shortcut-connected Expert Parallelism for Accelerating Mixture-of-ExpertsOct 9, 2025BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution