"au:"Jianing Qi"" — arXiv2 Search
Showing 1–4 of 4 results
Jun 10, 2025
Learning to Reason Across Parallel Samples for LLM Reasoning
Oct 2, 2025
Policy Gradient Guidance Enables Test Time Control
Mar 21, 2025
Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models
Oct 10, 2024
VerifierQ: Enhancing LLM Test Time Compute with Q-Learning-based Verifiers