AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
 
 
 
 
 
 
Latest commit: Auto-GPT-Bot, a107723456, "Add combined charts - 20230826083824", 2023-08-26 08:38:24 +00:00

| Name | Last commit | Date |
| --- | --- | --- |
| .github | Update Turbo (#324) | 2023-08-23 14:39:20 -07:00 |
| .vscode | Adding Auto-GPT-Turbo (#322) | 2023-08-19 11:32:38 -07:00 |
| agbenchmark | Fix for TestWrite6Files and TestWrite5FilesWithArray (#328) | 2023-08-24 09:14:03 -04:00 |
| agent | Delete Auto-GPT-Turbo (#327) | 2023-08-23 17:32:08 -07:00 |
| backend | init backend, fix frontend module (#307) | 2023-08-15 14:14:35 +01:00 |
| frontend@7e468e488a | init backend, fix frontend module (#307) | 2023-08-15 14:14:35 +01:00 |
| notebooks | working bar and radar charts (#221) | 2023-07-31 12:22:38 +01:00 |
| reports | Add combined charts - 20230826083824 | 2023-08-26 08:38:24 +00:00 |
| .env.example | Update .env.example (#298) | 2023-08-12 19:52:15 -07:00 |
| .flake8 | Cleanup skill tree (#287) | 2023-08-10 16:29:58 -07:00 |
| .gitignore | Implement the 'explore' mode (#284) | 2023-08-09 17:59:48 -07:00 |
| .gitmodules | Update Turbo (#324) | 2023-08-23 14:39:20 -07:00 |
| .pre-commit-config.yaml | AUTO-25: Add the ability to run multiple categories and to skip categories (#270) | 2023-08-07 12:29:00 +01:00 |
| .python-version | Add static linters ci (#45) | 2023-07-02 16:14:49 -04:00 |
| LICENSE | init agbenchmark | 2023-06-18 11:14:54 -04:00 |
| README.md | Fix BeeBot link (#224) | 2023-07-31 12:02:31 -07:00 |
| json_to_base_64.py | Push reports to google drive (#167) | 2023-07-18 09:17:45 -07:00 |
| mypy.ini | Add all agent protocol tests (#260) | 2023-08-06 09:52:46 -07:00 |
| poetry.lock | Move pytest-asyncio to main dependency group | 2023-08-11 12:52:35 -07:00 |
| pyproject.toml | Update pyproject.toml (#320) | 2023-08-16 17:11:22 -07:00 |
| run.sh | init backend, fix frontend module (#307) | 2023-08-15 14:14:35 +01:00 |
| send_to_googledrive.py | Fix linter 2 (#319) | 2023-08-16 16:56:02 -07:00 |

README.md

Auto-GPT Benchmarks

A repo for benchmarking the performance of agents of all kinds, regardless of how they are set up or how they work.

Scores:

[Image: screenshot of scores, 2023-07-25 10:35:01 AM]

Ranking overall:

Detailed results:

[Image: screenshot of detailed results, 2023-07-25 10:42:15 AM]

Click here to see the results and the raw data!

More agents coming soon!