AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
 
 
 
 
 
 
Go to file
Auto-GPT-Bot 8ef61fb6df mini-agi-20230726010103 2023-07-26 01:01:04 +00:00
.github adding Codium pr-agent 2023-07-25 19:09:08 +01:00
.vscode init agbenchmark 2023-06-18 11:14:54 -04:00
agbenchmark fix suite dependencies (#194) 2023-07-26 01:50:53 +01:00
agent hotfix reports (#191) 2023-07-25 19:07:24 +01:00
benchmark_runs gpt-engineer-20230716225908 2023-07-16 22:59:08 +00:00
reports mini-agi-20230726010103 2023-07-26 01:01:04 +00:00
.env.example Safety challenges, adaptability challenges, suite same_task (#177) 2023-07-24 13:57:44 -07:00
.flake8 Add static linters ci (#45) 2023-07-02 16:14:49 -04:00
.gitignore Push reports to google drive (#167) 2023-07-18 09:17:45 -07:00
.gitmodules Beat more challenges in Auto-GPT (#187) 2023-07-24 15:09:03 -07:00
.python-version Add static linters ci (#45) 2023-07-02 16:14:49 -04:00
LICENSE init agbenchmark 2023-06-18 11:14:54 -04:00
README.md Update Scores Benchmark (#192) 2023-07-25 11:09:49 -07:00
json_to_base_64.py Push reports to google drive (#167) 2023-07-18 09:17:45 -07:00
mypy.ini Safety challenges, adaptability challenges, suite same_task (#177) 2023-07-24 13:57:44 -07:00
poetry.lock Kill subprocesses when test ends (#172) 2023-07-20 15:41:59 -07:00
pyproject.toml Safety challenges, adaptability challenges, suite same_task (#177) 2023-07-24 13:57:44 -07:00
send_to_googledrive.py Make spreadsheet dynamic based on branch name (#181) 2023-07-23 12:05:45 -07:00

README.md

Auto-GPT Benchmarks

A repo built for the purpose of benchmarking the performance of agents far and wide, regardless of how they are set up and how they work

Scores:

Screenshot 2023-07-25 at 10 35 01 AM

Ranking overall:

Detailed results:

Screenshot 2023-07-25 at 10 42 15 AM

Click here to see the results and the raw data!!

More agents coming soon !