AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
 
 
 
 
 
 
Go to file
Auto-GPT-Bot 7827abc6f4 BabyAGI-20230727184826 2023-07-27 18:48:27 +00:00
.github added new script to fix dynamic headers (#202) 2023-07-27 14:35:31 +01:00
.vscode init agbenchmark 2023-06-18 11:14:54 -04:00
agbenchmark Delete reports (#201) 2023-07-27 11:42:24 -07:00
agent Add dynamic headers using environment variables (#200) 2023-07-26 21:26:03 -07:00
benchmark_runs gpt-engineer-20230716225908 2023-07-16 22:59:08 +00:00
reports BabyAGI-20230727184826 2023-07-27 18:48:27 +00:00
.env.example Add llm eval (#197) 2023-07-26 14:00:24 -07:00
.flake8 Add static linters ci (#45) 2023-07-02 16:14:49 -04:00
.gitignore Push reports to google drive (#167) 2023-07-18 09:17:45 -07:00
.gitmodules Add llm eval (#197) 2023-07-26 14:00:24 -07:00
.python-version Add static linters ci (#45) 2023-07-02 16:14:49 -04:00
LICENSE init agbenchmark 2023-06-18 11:14:54 -04:00
README.md Update Scores Benchmark (#192) 2023-07-25 11:09:49 -07:00
get_data_from_helicone.py Delete reports (#201) 2023-07-27 11:42:24 -07:00
json_to_base_64.py Push reports to google drive (#167) 2023-07-18 09:17:45 -07:00
mypy.ini report # bug, adding submodule challenges (#193) 2023-07-26 13:53:10 +01:00
poetry.lock Add dynamic headers using environment variables (#200) 2023-07-26 21:26:03 -07:00
pyproject.toml Add helicone dynamic headers (#199) 2023-07-26 16:03:13 -07:00
send_to_googledrive.py Add helicone dynamic headers (#199) 2023-07-26 16:03:13 -07:00

README.md

Auto-GPT Benchmarks

A repo built for the purpose of benchmarking the performance of agents far and wide, regardless of how they are set up and how they work

Scores:

Screenshot 2023-07-25 at 10 35 01 AM

Ranking overall:

Detailed results:

Screenshot 2023-07-25 at 10 42 15 AM

Click here to see the results and the raw data!!

More agents coming soon !