"au:"Limao Xiong"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Limao Xiong"" — arXiv2 Search

Showing 1–9 of 9 results

/ Date/ Name

May 21, 2023A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition Feb 2, 2024StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback Mar 18, 2024EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models Apr 9, 2022MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective Sep 14, 2023The Rise and Potential of Large Language Model Based Agents: A Survey Oct 30, 2024Multi-Programming Language Sandbox for LLMs Jul 11, 2023Secrets of RLHF in Large Language Models Part I: PPO Oct 13, 2024RMB: Comprehensively Benchmarking Reward Models in LLM Alignment May 1, 2024MetaRM: Shifted Distributions Alignment via Meta-Learning