AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
 
 
 
 
 
 
Go to file
Auto-GPT-Bot 9a86050609 mini-agi-20230801130027 2023-08-01 13:00:27 +00:00
.github fix graphs, processing, workflow 2023-08-01 13:44:32 +01:00
.vscode init agbenchmark 2023-06-18 11:14:54 -04:00
agbenchmark fix graphs, processing, workflow 2023-08-01 13:44:32 +01:00
agent Fix costs helicone (#226) 2023-07-31 16:13:06 -07:00
notebooks working bar and radar charts (#221) 2023-07-31 12:22:38 +01:00
reports mini-agi-20230801130027 2023-08-01 13:00:27 +00:00
.env.example Advanced LLM Evaluation Implementation (#205) 2023-07-29 10:26:19 +01:00
.flake8 Use beebot autopackai (#203) 2023-07-27 12:21:43 -07:00
.gitignore Push reports to google drive (#167) 2023-07-18 09:17:45 -07:00
.gitmodules Refactoring for TDD (#222) 2023-07-31 21:59:47 +01:00
.python-version Add static linters ci (#45) 2023-07-02 16:14:49 -04:00
LICENSE init agbenchmark 2023-06-18 11:14:54 -04:00
README.md Fix BeeBot link (#224) 2023-07-31 12:02:31 -07:00
json_to_base_64.py Push reports to google drive (#167) 2023-07-18 09:17:45 -07:00
mypy.ini Feature: Visualize Test Results (#211) 2023-07-30 23:51:17 +01:00
poetry.lock working bar and radar charts (#221) 2023-07-31 12:22:38 +01:00
pyproject.toml working bar and radar charts (#221) 2023-07-31 12:22:38 +01:00
send_to_googledrive.py Fix send to gdrive and tracking the wrong challenge name (#225) 2023-07-31 12:35:37 -07:00

README.md

Auto-GPT Benchmarks

A repo built for the purpose of benchmarking the performance of agents far and wide, regardless of how they are set up and how they work

Scores:

Screenshot 2023-07-25 at 10 35 01 AM

Ranking overall:

Detailed results:

Screenshot 2023-07-25 at 10 42 15 AM

Click here to see the results and the raw data!!

More agents coming soon !