Commit Graph

32 Commits (4943022ff32c361f9a8bdccef4a0acc2a70e821e)

Author SHA1 Message Date
Gabe 100d4f0d07
Fix BeeBot link (#224) 2023-07-31 12:02:31 -07:00
merwanehamadi 2aa88fd163
Update Scores Benchmark (#192) 2023-07-25 11:09:49 -07:00
merwanehamadi dab4e90e15
Update Auto-GPT score (#106)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-15 09:53:56 -07:00
merwanehamadi 8be2a0b2e1
Display results per category (#104) 2023-07-14 18:45:24 -07:00
merwanehamadi 66fc7ccb31
Display smol-developer-results (#103) 2023-07-14 18:26:17 -07:00
merwanehamadi 7de965ab3f
Show Auto-GPT results (#102) 2023-07-14 18:04:35 -07:00
merwanehamadi 281cb0ef37
Start showing benchmark results (#100) 2023-07-14 17:56:56 -04:00
Silen Naihin e25f610344
local runs, home_path config, submodule miniagi (#50) 2023-07-04 10:23:00 -07:00
merwanehamadi 7f098d5fb6
Explain how to benchmark new agents (#49) 2023-07-04 12:13:29 -04:00
Silen Naihin 0c81585a53
Update README.md (#41) 2023-06-27 22:17:42 -04:00
Silen Naihin 76ee994d2c read mes, remove port and host from config, etc 2023-06-27 19:19:14 -04:00
Silen Naihin f933717d8b mini-agi, simple challenge creation, --mock flag 2023-06-27 18:17:54 -04:00
Silen Naihin 2f28a66591 more elegant marking & dependency solution 2023-06-27 13:26:28 -04:00
Silen Naihin b6562f3420
Update README.md 2023-06-23 09:31:21 -04:00
Silen Naihin 15c5469bb1
Add automatic regression markers (#38) 2023-06-22 08:18:22 -04:00
Silen Naihin b7deb984f7
start click, fixtures, types, challenge creation, mock run -stable (#37) 2023-06-21 11:43:18 -04:00
Silen Naihin 1eb278f3cc
Update README.md 2023-06-19 09:53:30 -04:00
Silen Naihin 51f2295971 init agbenchmark 2023-06-18 11:14:54 -04:00
Douglas Schonholtz dfb73204bf
Update readme to suggest people check out challenges 2023-05-05 16:33:39 -04:00
Douglas Schonholtz 04722e7fc5
EvalNames with dates for the eval run filename and compatibility with 0.3.0 (#26)
* EvalNames with dates and the eval run

* Ignore .idea files, update readme to use 3.10, updates for 0.3.0
2023-05-03 10:14:44 -04:00
Douglas Schonholtz b8c7c05dd5
windows docs make workspace if not there (#25)
* windows docs make workspace if not there

* small fixes
2023-04-22 19:17:28 -04:00
Douglas Schonholtz 011ed2f2b9
Update README.md (#17)
remove -m
2023-04-20 15:47:15 -04:00
Douglas Schonholtz 625d6e72ec
Remove the submodule, reference OpenAI directly rather than running it on the command line, fix logging (#16)
* Removed submodule, refactor, docker on pip, async docker logging, running our own tool on CLI rather than OpenAIs
2023-04-20 15:41:29 -04:00
Douglas Schonholtz f00ced6612
Update README.md 2023-04-18 11:59:42 -04:00
Douglas Schonholtz 486c7e3a5e
Update README.md
Adding set up info
2023-04-18 11:10:24 -04:00
Douglas Schonholtz dad4804b4e
Update README.md 2023-04-18 10:29:05 -04:00
Douglas Schonholtz 2fbb03dc6c
Update README.md 2023-04-18 10:27:47 -04:00
Ambuj Pawar 3b0091c231
Typo in README.md 2023-04-18 09:25:25 +02:00
douglas 59ff485253 Prompt engineering fixes 2023-04-17 18:14:09 -04:00
douglas 7212c3876d Cleanup 2023-04-17 17:34:45 -04:00
douglas 89081d942c First commit for AutoGPT Benchmarks 2023-04-17 17:22:31 -04:00
Toran Bruce Richards 0b899eb4cf
Initial commit 2023-04-06 13:59:45 +01:00