"au:"Zehui Chen"" — arXiv2 Search
Showing 1–3 of 3 results
Jan 5, 2025
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
Mar 26, 2024
InternLM2 Technical Report
Dec 21, 2023
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step