Silen Naihin | 4fa9f72083 | adding dependencies on other challenges | 2023-06-24 12:24:17 -04:00
Silen Naihin | 66c9e68b04 | file creation from within file before server :) | 2023-06-24 12:15:53 -04:00
Silen Naihin | a5073ab577 | basic challenges, more ChallengeData structure | 2023-06-24 09:42:36 -04:00
Silen Naihin | b6562f3420 | Update README.md | 2023-06-23 09:31:21 -04:00
Silen Naihin | ffd1d15a0e | MockManager, mock_func in data.json (#39) | 2023-06-23 07:53:57 -04:00
Silen Naihin | 15c5469bb1 | Add automatic regression markers (#38) | 2023-06-22 08:18:22 -04:00
Silen Naihin | e5974ca3ea | Delete file_to_check.txt | 2023-06-21 11:44:59 -04:00
Silen Naihin | b7deb984f7 | start click, fixtures, types, challenge creation, mock run -stable (#37) | 2023-06-21 11:43:18 -04:00
Silen Naihin | 04536e92a5 | Merge pull request #34 from Significant-Gravitas/dsl | 2023-06-20 18:32:58 -04:00
Silen Naihin | 1eb278f3cc | Update README.md | 2023-06-19 09:53:30 -04:00
scarletpan | f37981c388 | init first challenge template | 2023-06-19 12:39:34 +00:00
Silen Naihin | 51f2295971 | init agbenchmark | 2023-06-18 11:14:54 -04:00
Douglas Schonholtz | dfb73204bf | Update readme to suggest people check out challenges | 2023-05-05 16:33:39 -04:00
Douglas Schonholtz | 04722e7fc5 | EvalNames with dates for the eval run filename and compatibility with 0.3.0 (#26) | 2023-05-03 10:14:44 -04:00
    * EvalNames with dates and the eval run
    * Ignore .idea files, update readme to use 3.10, updates for 0.3.0
Douglas Schonholtz | b8c7c05dd5 | windows docs make workspace if not there (#25) | 2023-04-22 19:17:28 -04:00
    * windows docs make workspace if not there
    * small fixes
Media | ef5c4f8a11 | Graphs for evals (#20) | 2023-04-20 19:04:34 -04:00
    * Update README.md
    * Jupyter Notebook for evaluating eval results
    ---------
    Co-authored-by: Douglas Schonholtz <15002691+dschonholtz@users.noreply.github.com>
Douglas Schonholtz | 011ed2f2b9 | Update README.md (#17) | 2023-04-20 15:47:15 -04:00
    remove -m
Douglas Schonholtz | 625d6e72ec | Remove the submodule, reference OpenAI directly rather than running it on the command line, fix logging (#16) | 2023-04-20 15:41:29 -04:00
    * Removed submodule, refactor, docker on pip, async docker logging, running our own tool on CLI rather than OpenAIs
Douglas Schonholtz | f00ced6612 | Update README.md | 2023-04-18 11:59:42 -04:00
Douglas Schonholtz | 486c7e3a5e | Update README.md | 2023-04-18 11:10:24 -04:00
    Adding set up info
Douglas Schonholtz | dad4804b4e | Update README.md | 2023-04-18 10:29:05 -04:00
Douglas Schonholtz | 2fbb03dc6c | Update README.md | 2023-04-18 10:27:47 -04:00
Douglas Schonholtz | 63c8e4da84 | Merge pull request #2 from ambujpawar/typo_in_readme | 2023-04-18 09:18:14 -04:00
    Typo in README.md
Ambuj Pawar | 3b0091c231 | Typo in README.md | 2023-04-18 09:25:25 +02:00
Douglas Schonholtz | 22d997d088 | Merge pull request #1 from dschonholtz/master | 2023-04-17 19:07:49 -04:00
    First commit for AutoGPT Benchmarks
douglas | 59ff485253 | Prompt engineering fixes | 2023-04-17 18:14:09 -04:00
douglas | 7212c3876d | Cleanup | 2023-04-17 17:34:45 -04:00
douglas | 89081d942c | First commit for AutoGPT Benchmarks | 2023-04-17 17:22:31 -04:00
Toran Bruce Richards | 0b899eb4cf | Initial commit | 2023-04-06 13:59:45 +01:00