Date          Name
Sep 18, 2020  Unsupervised Parallel Corpus Mining on Web Data
Sep 16, 2019  Bridging the domain gap in cross-lingual document classification
Jun 15, 2018  Stochastic WaveNet: A Generative Latent Variable Model for Sequential Data
Apr 15, 2017  RACE: Large-scale ReAding Comprehension Dataset From Examinations
Oct 31, 2017  Learning Depthwise Separable Graph Convolution from Data Manifold
Feb 4, 2019   Re-examination of the Role of Latent Variables in Sequence Modeling
Mar 21, 2017  Modeling Long- and Short-Term Temporal Patterns with Deep Neural Networks
Feb 2, 2026   Kimi K2.5: Visual Agentic Intelligence
Apr 24, 2020  Correlation-aware Unsupervised Change-point Detection via Graph Neural Networks
Nov 17, 2023  A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest
Feb 24, 2025  Muon is Scalable for LLM Training
Jun 5, 2020   Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
Nov 9, 2017   Large-scale Cloze Test Dataset Created by Teachers
Apr 10, 2025  Kimi-VL Technical Report
Jan 22, 2025  Kimi k1.5: Scaling Reinforcement Learning with LLMs
Mar 16, 2026  Attention Residuals
Feb 18, 2025  MoBA: Mixture of Block Attention for Long-Context LLMs
Oct 30, 2025  Kimi Linear: An Expressive, Efficient Attention Architecture
Apr 25, 2025  Kimi-Audio Technical Report
Jul 28, 2025  Kimi K2: Open Agentic Intelligence