THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Builder
THUDM
THUDM β’ individual
Stars
3,458
Using upstream star count
Forks
257
Using upstream fork count
Open Issues
0
Activity Score
0/100
0 commits in 30d
Created
Jul 28, 2023
Project creation date
<p align="center"> <a href="https://docs.google.com/spreadsheets/d/e/2PACX-1vRR3Wl7wsCgHpwUw1_eUXW_fptAPLL3FkhnW_rua0O1Ji_GIVrpTjY5LaKAhwO-WeARjnY_KNw0SYNJ/pubhtml" target="_blank">π Leaderboard (new)</a> | <a href="https://twitter.com/thukeg" target="_blank">π¦ Twitter</a> | <a href="mailto:agentbench@googlegroups.com">βοΈ Google Group</a> | <a href="https://arxiv.org/abs/2308.03688" target="_blank">π Paper </a> </p>
Unmapped
category
Deployment Context
Modalities
Skill Areas
tag
Updated 3 months ago
7 Days
0
30 Days
0
90 Days
0
pgvector cosine similarity Β· $0
Loadingβ¦