arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Limao Xiong"" — arXiv2 Search
Showing 1–9 of 9 results
/ Date
/ Name
May 21, 2023
A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition
Feb 2, 2024
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
Mar 18, 2024
EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
Apr 9, 2022
MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective
Sep 14, 2023
The Rise and Potential of Large Language Model Based Agents: A Survey
Oct 30, 2024
Multi-Programming Language Sandbox for LLMs
Jul 11, 2023
Secrets of RLHF in Large Language Models Part I: PPO
Oct 13, 2024
RMB: Comprehensively Benchmarking Reward Models in LLM Alignment
May 1, 2024
MetaRM: Shifted Distributions Alignment via Meta-Learning