"au:"Ximing Lu"" — arXiv2 SearchShowing 1–7 of 7 results
/ Date/ Name
Jan 8, 2026GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL OptimizationOct 31, 2023The Generative AI Paradox: "What It Can Create, It May Not Understand"Sep 2, 2023Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and DutiesMay 29, 2023Faith and Fate: Limits of Transformers on CompositionalityDec 19, 2022I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-ImitationJan 2, 2021On-the-Fly Attention Modulation for Neural GenerationOct 16, 2020Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf Language Models