Commit Graph

75 Commits (e4671145fc13ee5055b2205f4c026e41ccfbaeca)

Author SHA1 Message Date
merwanehamadi 62c52643b4
Remove build a nuke challenge (#316)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-16 15:58:17 -07:00
merwanehamadi 82ed4a136a
Remove submodule (#314)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-16 14:57:52 -07:00
Silen Naihin 8bc3710e23
init backend, fix frontend module (#307) 2023-08-15 14:14:35 +01:00
Silen Naihin c59e5fb7d8
new frontend connections (#306) 2023-08-15 13:16:07 +01:00
merwanehamadi 1129e6b426
Add safety challenge (#300)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-13 10:15:58 -07:00
merwanehamadi 8bf2f3fe5d
Fix all tests skipped (#296)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-12 17:35:55 -07:00
merwanehamadi 6dc713059c
Remember goal loss (#291) 2023-08-11 18:44:18 -07:00
merwanehamadi 1560892c58
Sync skill tree to a versioned website (#289)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-11 17:28:53 -07:00
Erik Peterson 79be5cd70f Update challenges submodule 2023-08-11 14:45:36 -07:00
Silen Naihin 1a61c66898 mock flag, workspace io fixes, mark fixes 2023-08-11 13:22:21 +01:00
merwanehamadi 47c6062092
Cleanup skill tree (#287) 2023-08-10 16:29:58 -07:00
Rob fb67c3aaf1 add updated challenges 2023-08-10 21:45:58 +02:00
Rob a2380a7bdd feat: ethereum price challenge 2023-08-10 21:09:04 +02:00
merwanehamadi 7d60ce5f44
See the task when clicking in the skill tree (#279) 2023-08-09 09:37:17 -07:00
merwanehamadi 305f3a6138
Add web app creation challenge (#272) 2023-08-08 13:08:51 -07:00
merwanehamadi db48e7849b
Add product advisor tests (#267) 2023-08-06 20:59:53 -07:00
merwanehamadi f157f46a07
Fix test write file (#266) 2023-08-06 18:44:42 -07:00
Silen Naihin 3c20191156 updating challenges commit sha 2023-08-06 23:02:35 +01:00
Silen Naihin 19848f362d
remove pytest-depends, rerouting functions (#250) 2023-08-06 22:35:22 +01:00
merwanehamadi 5232522e47
Remove space challenges (#262) 2023-08-06 10:10:58 -07:00
merwanehamadi 53ec3337f3
Add all agent protocol tests (#260)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-06 09:52:46 -07:00
merwanehamadi 530eb61f25
Add agent protocol interface test (#259)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-05 18:00:05 -07:00
merwanehamadi fb13a83d15
Add more coding challenge (#254)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-05 09:51:53 -07:00
merwanehamadi 6309bc9c3d
Update submodule (#219) 2023-07-30 20:03:53 -07:00
merwanehamadi c4554225bd
Update submodules (#212)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-29 10:18:35 -07:00
Silen Naihin f07e7b60d4
Advanced LLM Evaluation Implementation (#205)
Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co>
2023-07-29 10:26:19 +01:00
merwanehamadi 80bd0c4260
Fix tests not being run (#207)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-27 20:50:53 -07:00
merwanehamadi 5df710fd35
Add helicone dynamic headers (#199)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-26 16:03:13 -07:00
Silen Naihin 66d1fec07e attempting more logs 2023-07-26 23:36:45 +01:00
merwanehamadi 01b118e590
Add llm eval (#197)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-26 14:00:24 -07:00
Silen Naihin 80506e9a3b
report # bug, adding submodule challenges (#193) 2023-07-26 13:53:10 +01:00
merwanehamadi a1e02f243c
Add safety suite (#196)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-25 20:13:01 -07:00
Silen Naihin b82277515f
hotfix reports (#191) 2023-07-25 19:07:24 +01:00
Silen Naihin d9b3d7da37
Safety challenges, adaptability challenges, suite same_task (#177) 2023-07-24 13:57:44 -07:00
Erik Peterson 5a3b4f3d1d
Kill subprocesses when test ends (#172)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
Co-authored-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-20 15:41:59 -07:00
Silen Naihin 12c5d54583
Fixing memory challenges, naming, testing mini-agi, smooth retrieval scaling (#166) 2023-07-17 19:41:58 -07:00
Silen Naihin 9f3a2d4f05
Dynamic cutoff and other quality of life (#101) 2023-07-15 22:10:20 -04:00
merwanehamadi 5886d75059
Add three sum challenge (#108)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-07-15 19:52:42 -04:00
merwanehamadi 7bc7d9213d
Replace hidden files with custom python (#99)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-14 14:39:47 -07:00
merwanehamadi a9702e4629
Add basic code generation challenge (#98) 2023-07-14 13:27:48 -04:00
Silen Naihin 8d0c5179ed
fixing backslashes, adding basic metrics (#89) 2023-07-12 01:37:59 -04:00
merwanehamadi 0799be7e28
Fix tests ci (#82) 2023-07-10 21:54:25 -07:00
Silen Naihin 8df82909b2
Added --test, consolidate files, reports working (#83) 2023-07-10 19:25:19 -07:00
merwanehamadi 437e066a66
Add "Simple web server" challenge (#74)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-07-10 20:46:03 -04:00
merwanehamadi 30ba51593f
Add Helicone (#81) 2023-07-10 12:19:12 -04:00
Silen Naihin b8830f8625
Adding search interface challenge and cleaning repo (#80) 2023-07-09 18:33:08 -07:00
Silen Naihin 3d43117554
Just json, no test files (#77) 2023-07-09 17:27:21 -07:00
Silen Naihin 69bd41f741
Quality of life improvements & fixes (#75) 2023-07-08 18:43:38 -07:00
merwanehamadi 487f99f8f2
Use artifacts out insted of python code (#72) 2023-07-07 15:49:37 -07:00
merwanehamadi f0f7d2be90
Fix memory challenge 2 (#71) 2023-07-07 15:38:50 -07:00