AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Auto-GPT-Bot 2bb57b9800 BabyAGI-20230810082003 2023-08-10 08:20:04 +00:00
.github Update beebot (#281) 2023-08-09 19:21:22 +01:00
.vscode init agbenchmark 2023-06-18 11:14:54 -04:00
agbenchmark Implement the 'explore' mode (#284) 2023-08-09 17:59:48 -07:00
agent Update beebot (#281) 2023-08-09 19:21:22 +01:00
notebooks working bar and radar charts (#221) 2023-07-31 12:22:38 +01:00
reports BabyAGI-20230810082003 2023-08-10 08:20:04 +00:00
.env.example Advanced LLM Evaluation Implementation (#205) 2023-07-29 10:26:19 +01:00
.flake8 Use beebot autopackai (#203) 2023-07-27 12:21:43 -07:00
.gitignore Implement the 'explore' mode (#284) 2023-08-09 17:59:48 -07:00
.gitmodules updating challenges repo name 2023-08-09 21:02:33 +01:00
.pre-commit-config.yaml AUTO-25: Add the ability to run multiple categories and to skip categories (#270) 2023-08-07 12:29:00 +01:00
.python-version Add static linters ci (#45) 2023-07-02 16:14:49 -04:00
LICENSE init agbenchmark 2023-06-18 11:14:54 -04:00
README.md Fix BeeBot link (#224) 2023-07-31 12:02:31 -07:00
json_to_base_64.py Push reports to google drive (#167) 2023-07-18 09:17:45 -07:00
mypy.ini Add all agent protocol tests (#260) 2023-08-06 09:52:46 -07:00
poetry.lock Remove baserun because api key issue (#282) 2023-08-09 11:24:54 -07:00
pyproject.toml Remove baserun because api key issue (#282) 2023-08-09 11:24:54 -07:00
send_to_googledrive.py Add Test Suite to gdrive (#248) 2023-08-02 15:21:20 -07:00

README.md

Auto-GPT Benchmarks

A repo for benchmarking the performance of agents, regardless of how they are set up or how they work.

Scores:

[Screenshot of scores, 2023-07-25 10:35:01 AM]

Ranking overall:

Detailed results:

[Screenshot of detailed results, 2023-07-25 10:42:15 AM]

Click here to see the results and the raw data.

More agents coming soon!