arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Melanie Kambadur"" — arXiv2 Search
Showing 1–4 of 4 results
/ Date
/ Name
Apr 22, 2026
Parallel-SFT: Improving Zero-Shot Cross-Programming-Language Transfer for Code RL
Nov 25, 2024
Self-Generated Critiques Boost Reward Modeling for Language Models
Sep 30, 2024
Law of the Weakest Link: Cross Capabilities of Large Language Models
Jul 31, 2024
The Llama 3 Herd of Models