Commit Graph

127 Commits (757baba3ff61f354359720667e136e40a54ae7f0)

Author SHA1 Message Date
merwanehamadi 757baba3ff
Remove cache true on pr (#111)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-15 18:09:29 -07:00
merwanehamadi 02dce41937
Fix ci (#110) 2023-07-15 18:00:37 -07:00
merwanehamadi 5886d75059
Add three sum challenge (#108)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-07-15 19:52:42 -04:00
Erik Peterson cbd2e49d97
Clean up workspace between each test (#109) 2023-07-15 16:23:49 -07:00
merwanehamadi dab4e90e15
Update Auto-GPT score (#106)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-15 09:53:56 -07:00
merwanehamadi bb65473416
Update Auto-GPT to current version of master (#105) 2023-07-15 08:57:28 -07:00
merwanehamadi 8be2a0b2e1
Display results per category (#104) 2023-07-14 18:45:24 -07:00
merwanehamadi 66fc7ccb31
Display smol-developer-results (#103) 2023-07-14 18:26:17 -07:00
merwanehamadi 7de965ab3f
Show Auto-GPT results (#102) 2023-07-14 18:04:35 -07:00
merwanehamadi 281cb0ef37
Start showing benchmark results (#100) 2023-07-14 17:56:56 -04:00
merwanehamadi 7bc7d9213d
Replace hidden files with custom python (#99)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-14 14:39:47 -07:00
merwanehamadi a9702e4629
Add basic code generation challenge (#98) 2023-07-14 13:27:48 -04:00
merwanehamadi 3a9dfa4c59
Update submodules and upload artifacts (#97)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-13 20:47:55 -07:00
merwanehamadi 78df4915cf
Remove dependencies if a specific test is asked by the user (#95)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-12 14:35:12 -07:00
merwanehamadi 48ac1c91cd
Remove dependencies cache (#94)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-12 14:30:06 -07:00
merwanehamadi e0b16cf4ac
Fix Smol developer and gpt engineer (#93)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-12 10:54:50 -07:00
Silen Naihin 8d0c5179ed
fixing backslashes, adding basic metrics (#89) 2023-07-12 01:37:59 -04:00
merwanehamadi e292ffebaf
Enable cache (#92) 2023-07-11 21:37:49 -07:00
merwanehamadi 504634b4a6
Add custom properties to Helicone (#91) 2023-07-11 20:50:56 -07:00
merwanehamadi b3c506cd94
Fix Auto-GPT looping forever (#87) 2023-07-11 20:02:29 -04:00
merwanehamadi 4ecb70c5e3
Fix Auto-GPT integration by adding python module as entrypoint (#86)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-07-11 15:11:24 -04:00
merwanehamadi 22295350a6
All Agents log to helicone automatically (#85)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
Co-authored-by: Justin <justintorre75@gmail.com>
2023-07-11 09:57:53 -07:00
merwanehamadi 0799be7e28
Fix tests ci (#82) 2023-07-10 21:54:25 -07:00
Silen Naihin 8df82909b2
Added --test, consolidate files, reports working (#83) 2023-07-10 19:25:19 -07:00
merwanehamadi 437e066a66
Add "Simple web server" challenge (#74)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-07-10 20:46:03 -04:00
merwanehamadi 30ba51593f
Add Helicone (#81) 2023-07-10 12:19:12 -04:00
Silen Naihin b8830f8625
Adding search interface challenge and cleaning repo (#80) 2023-07-09 18:33:08 -07:00
merwanehamadi 0fa5286ad0
Combine all agents into one ci.yml (#79)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-09 18:06:26 -07:00
Silen Naihin 3d43117554
Just json, no test files (#77) 2023-07-09 17:27:21 -07:00
merwanehamadi 573130549f
Add gpt engineer to ci (#78) 2023-07-09 13:31:31 -07:00
merwanehamadi d89264998d
Fix debug code challenge (#76)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-07-08 21:46:37 -04:00
Silen Naihin 69bd41f741
Quality of life improvements & fixes (#75) 2023-07-08 18:43:38 -07:00
Silen Naihin db86ccdcb4 removing agentgpt 2023-07-08 13:02:47 -04:00
Silen Naihin 2d05c3ec56 reverting accidental previous changes 2023-07-08 12:50:39 -04:00
Silen Naihin a35569a77b submodule integration 2023-07-08 12:47:48 -04:00
Silen Naihin 082a876612
fixing the incorrect addition of superagi (#73) 2023-07-08 05:04:06 -04:00
Silen Naihin e56b112aab
i/o workspace, adding superagi (#60) 2023-07-08 03:27:31 -04:00
merwanehamadi 487f99f8f2
Use artifacts out insted of python code (#72) 2023-07-07 15:49:37 -07:00
merwanehamadi f0f7d2be90
Fix memory challenge 2 (#71) 2023-07-07 15:38:50 -07:00
merwanehamadi e34c83ca1c
Add .txt to memory challenges (#70) 2023-07-07 15:34:57 -07:00
Erik Peterson 3defe044bd
Print out all of stdout on each process poll. (#69) 2023-07-07 15:02:08 -07:00
Silen Naihin 4562bc6caf
Update data.json remove text 2023-07-07 17:54:09 -04:00
merwanehamadi e61523e59e
Get rid of get file path by using the data.json convention to store the challenge information (#67)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-07 13:58:17 -07:00
merwanehamadi 6ef32a9b1f
Add "Debug code without guidance" challenge (#66)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-07 13:55:59 -07:00
merwanehamadi 9ede17891b
Add 'Debug simple typo with guidance' challenge (#65)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-07 13:50:53 -07:00
Silen Naihin bfd0d5c826
Fix home_path, local mini-agi run works (#64)
Co-authored-by: merwanehamadi <merwanehamadi@gmail.com>
2023-07-06 18:00:45 -07:00
merwanehamadi 0b4ae5ea78
Add 'remember phrases with noise' challenge (#63) 2023-07-06 17:19:12 -04:00
merwanehamadi 82d8f67f6a
Add 'remember ids with noise' challenge (#61) 2023-07-06 01:34:51 -04:00
Silen Naihin c76062b092
Added caching based on file key (#62)
Co-authored-by: merwanehamadi <merwanehamadi@gmail.com>
2023-07-05 21:38:01 -07:00
merwanehamadi 5b19340f8e
Add 'Remember multiple ids' memory challenge (#59) 2023-07-06 00:35:15 -04:00