arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Jinxian Qu"" — arXiv2 Search
Showing 1–2 of 2 results
/ Date
/ Name
Aug 23, 2025
Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling
Sep 18, 2024
MeTHanol: Modularized Thinking Language Models with Intermediate Layer Thinking, Decoding and Bootstrapping Reasoning