au:"Jiri Navratil" - arXiv2 Search
Showing 1–4 of 4 results
May 28, 2025
Revisiting Group Relative Policy Optimization: Insights into On-Policy and Off-Policy Training
Jun 9, 2024
Distributional Preference Alignment of LLMs via Optimal Transport
Feb 6, 2024
Learning Granger Causality from Instance-wise Self-attentive Hawkes Processes
Oct 11, 2023
Risk Aware Benchmarking of Large Language Models