AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

ai artificial-intelligence autonomous-agents gpt-4 openai python

Go to file

merwanehamadi 14e6d4968e Integrate with baserun (#274 )		2023-08-08 14:04:43 -07:00
.github	Add web app creation challenge (#272 )	2023-08-08 13:08:51 -07:00
.vscode	init agbenchmark	2023-06-18 11:14:54 -04:00
agbenchmark	Integrate with baserun (#274 )	2023-08-08 14:04:43 -07:00
agent	Integrate with baserun (#274 )	2023-08-08 14:04:43 -07:00
notebooks	working bar and radar charts (#221 )	2023-07-31 12:22:38 +01:00
reports	mini-agi-20230808084703	2023-08-08 08:47:03 +00:00
.env.example	Advanced LLM Evaluation Implementation (#205 )	2023-07-29 10:26:19 +01:00
.flake8	Use beebot autopackai (#203 )	2023-07-27 12:21:43 -07:00
.gitignore	Push reports to google drive (#167 )	2023-07-18 09:17:45 -07:00
.gitmodules	Add polygpt (#255 )	2023-08-05 09:59:24 -07:00
.pre-commit-config.yaml	AUTO-25: Add the ability to run multiple categories and to skip categories (#270 )	2023-08-07 12:29:00 +01:00
.python-version	Add static linters ci (#45 )	2023-07-02 16:14:49 -04:00
LICENSE	init agbenchmark	2023-06-18 11:14:54 -04:00
README.md	Fix BeeBot link (#224 )	2023-07-31 12:02:31 -07:00
json_to_base_64.py	Push reports to google drive (#167 )	2023-07-18 09:17:45 -07:00
mypy.ini	Add all agent protocol tests (#260 )	2023-08-06 09:52:46 -07:00
poetry.lock	Integrate with baserun (#274 )	2023-08-08 14:04:43 -07:00
pyproject.toml	Integrate with baserun (#274 )	2023-08-08 14:04:43 -07:00
send_to_googledrive.py	Add Test Suite to gdrive (#248 )	2023-08-02 15:21:20 -07:00

README.md

Auto-GPT Benchmarks

A repo built for the purpose of benchmarking the performance of agents far and wide, regardless of how they are set up and how they work

Scores:

Ranking overall:

Detailed results:

Click here to see the results and the raw data!!

More agents coming soon !