"au:"Prithviraj Ammanabrolu"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Prithviraj Ammanabrolu"" — arXiv2 Search

Showing 41–47 of 47 results

/ Date/ Name

Oct 17, 2023Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging May 24, 2023Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning May 22, 2025Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning Jan 30, 2026Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Apr 21, 2025In-context Ranking Preference Optimization May 27, 2023SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks Oct 1, 2025A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning