arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Guohua Tang"" — arXiv2 Search
Showing 1–7 of 7 results
/ Date
/ Name
Oct 13, 2023
xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark
May 30, 2024
TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models
Sep 20, 2024
Aligning Language Models Using Follow-up Likelihood as Reward Signal
Feb 9, 2026
Dialogue Model Optimization via Agent Game and Adaptive Tree-based GRPO
Sep 5, 2025
What-If Analysis of Large Language Models: Explore the Game World Using Proactive Thinking
Aug 29, 2025
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
Jul 9, 2024
SoftDedup: an Efficient Data Reweighting Method for Speeding Up Language Model Pre-training