Date / Name
Feb 24, 2020 / GRET: Global Representation Enhanced Transformer
Aug 21, 2019 / Improving Neural Machine Translation with Pre-trained Representation
Aug 5, 2017 / Neural Machine Translation with Word Predictions
Dec 4, 2019 / Acquiring Knowledge from Pre-trained Model to Neural Machine Translation
Mar 20, 2023 / Towards Reliable Neural Machine Translation with Consistency-Aware Meta-Learning
Jul 8, 2019 / Correct-and-Memorize: Learning to Translate from Interactive Revisions
Apr 29, 2020 / Multiscale Collaborative Deep Models for Neural Machine Translation
May 27, 2025 / XBOUND: Exploring Capability Boundaries of Device-Control Agents at the State Level
Aug 2, 2025 / LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points
Jul 11, 2023 / Secrets of RLHF in Large Language Models Part I: PPO
Jul 17, 2019 / Learning Representation Mapping for Relation Detection in Knowledge Base Question Answering
Apr 5, 2020 / AR: Auto-Repair the Synthetic Data for Neural Machine Translation
Sep 3, 2025 / OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation
Sep 1, 2025 / LongCat-Flash Technical Report
May 15, 2025 / Two Minds Better Than One: Collaborative Reward Modeling for LLM Alignment
Feb 2, 2025 / FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training
Oct 9, 2020 / Uncertainty-Aware Semantic Augmentation for Neural Machine Translation
Feb 8, 2025 / FRAME: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy
Aug 2, 2025 / Large-Scale Diverse Synthesis for Mid-Training
Dec 30, 2025 / Efficient Context Scaling with LongCat ZigZag Attention