arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Prithviraj Ammanabrolu"" — arXiv2 Search
Showing 41–47 of 47 results
/ Date
/ Name
Oct 17, 2023
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
May 24, 2023
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
May 22, 2025
Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning
Jan 30, 2026
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
Apr 21, 2025
In-context Ranking Preference Optimization
May 27, 2023
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
Oct 1, 2025
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning
← Previous