merwanehamadi
|
afb59a0778
|
Support agent protocol (#337)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-30 19:44:39 -07:00 |
Silen Naihin
|
59655a8d96
|
adding backend and a basic ui (#309)
|
2023-08-27 03:18:30 -04:00 |
merwanehamadi
|
760b60b249
|
Remove colons in timestamp (#315)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-16 15:53:06 -07:00 |
merwanehamadi
|
82ed4a136a
|
Remove submodule (#314)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-16 14:57:52 -07:00 |
Luke
|
281d8486df
|
Fixing paths that were preventing artifacts from being copied to workspace (#311)
Co-authored-by: Luke <2609441+lc0rp@user.noreply.github.com>
Co-authored-by: merwanehamadi <merwanehamadi@gmail.com>
|
2023-08-16 08:59:04 -07:00 |
Swifty
|
16053a3137
|
Enhanced Test Report Directory Naming and Handling (#312)
|
2023-08-16 08:45:46 -07:00 |
Silen Naihin
|
8bc3710e23
|
init backend, fix frontend module (#307)
|
2023-08-15 14:14:35 +01:00 |
Silen Naihin
|
c59e5fb7d8
|
new frontend connections (#306)
|
2023-08-15 13:16:07 +01:00 |
Silen Naihin
|
a6b229f4cd
|
Merge branch 'master' of https://github.com/Significant-Gravitas/Auto-GPT-Benchmarks
|
2023-08-14 21:57:12 +01:00 |
Silen Naihin
|
0d7fbba134
|
graph data json
|
2023-08-14 21:57:09 +01:00 |
merwanehamadi
|
d27d17e51b
|
Fix linter (#302)
|
2023-08-13 10:34:45 -07:00 |
merwanehamadi
|
0da8a2bd99
|
Fix agent protocol test (#301)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-13 10:27:54 -07:00 |
merwanehamadi
|
1129e6b426
|
Add safety challenge (#300)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-13 10:15:58 -07:00 |
merwanehamadi
|
d8d7fa662b
|
Use index.html instead of dependencies.html (#293)
|
2023-08-11 20:32:23 -07:00 |
merwanehamadi
|
1560892c58
|
Sync skill tree to a versioned website (#289)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-11 17:28:53 -07:00 |
Silen Naihin
|
1a61c66898
|
mock flag, workspace io fixes, mark fixes
|
2023-08-11 13:22:21 +01:00 |
Jakub Novák
|
c2269397f1
|
Use agent protocol (#278)
Signed-off-by: Jakub Novak <jakub@e2b.dev>
|
2023-08-11 09:04:08 +02:00 |
merwanehamadi
|
1b20e45ec1
|
Implement the 'explore' mode (#284)
|
2023-08-09 17:59:48 -07:00 |
merwanehamadi
|
e3f1e2184f
|
Release 0.0.4 (#280)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-09 10:04:57 -07:00 |
merwanehamadi
|
7d60ce5f44
|
See the task when clicking in the skill tree (#279)
|
2023-08-09 09:37:17 -07:00 |
merwanehamadi
|
305f3a6138
|
Add web app creation challenge (#272)
|
2023-08-08 13:08:51 -07:00 |
Swifty
|
e0a72b86c1
|
AUTO-25: Add the ability to run multiple categories and to skip categories (#270)
|
2023-08-07 12:29:00 +01:00 |
merwanehamadi
|
db48e7849b
|
Add product advisor tests (#267)
|
2023-08-06 20:59:53 -07:00 |
Silen Naihin
|
710ad448fe
|
making sure show_graph is optional
|
2023-08-06 22:43:42 +01:00 |
Silen Naihin
|
19848f362d
|
remove pytest-depends, rerouting functions (#250)
|
2023-08-06 22:35:22 +01:00 |
merwanehamadi
|
aa37109707
|
Remove graphql logs (#264)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-06 12:22:49 -07:00 |
merwanehamadi
|
530eb61f25
|
Add agent protocol interface test (#259)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-05 18:00:05 -07:00 |
merwanehamadi
|
fb13a83d15
|
Add more coding challenge (#254)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-05 09:51:53 -07:00 |
merwanehamadi
|
20c87fbc26
|
Fix typing (#247)
|
2023-08-02 15:08:07 -07:00 |
merwanehamadi
|
59f015ab93
|
fix-linter (#246)
|
2023-08-02 14:49:03 -07:00 |
merwanehamadi
|
8fa67ea466
|
Correct agent and benchmark commit sha (#245)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-02 14:44:14 -07:00 |
merwanehamadi
|
f41533ce62
|
Fix reports and add commit sha (#233)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-01 17:54:23 -07:00 |
merwanehamadi
|
eeb68858d7
|
Only run mini-agi on tests (#232)
|
2023-08-01 16:50:41 -07:00 |
Silen Naihin
|
6f3fd2a578
|
fix graphs, processing, workflow
|
2023-08-01 13:44:32 +01:00 |
merwanehamadi
|
ce24857a74
|
Return none as fallback Helicone (#228)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-31 20:18:15 -07:00 |
merwanehamadi
|
46dce97c4e
|
Fix reports (#227)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-31 19:39:49 -07:00 |
merwanehamadi
|
a2dc4693a3
|
Fix costs helicone (#226)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-31 16:13:06 -07:00 |
Silen Naihin
|
f9fea473f5
|
Refactoring for TDD (#222)
|
2023-07-31 21:59:47 +01:00 |
merwanehamadi
|
719f894520
|
Fix send to gdrive and tracking the wrong challenge name (#225)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-31 12:35:37 -07:00 |
Justin Torre
|
3a32adbce5
|
Fix f-string get_data_from_helicone.py (#223)
|
2023-07-31 09:06:04 -07:00 |
Silen Naihin
|
9d75712bae
|
ci ofr auth
|
2023-07-31 14:02:46 +01:00 |
Silen Naihin
|
f8de706a15
|
removing data that didnt work
|
2023-07-31 13:41:45 +01:00 |
Silen Naihin
|
2ec306e850
|
linter fixes
|
2023-07-31 13:28:01 +01:00 |
Silen Naihin
|
db49e8de15
|
helicone push 2
|
2023-07-31 13:26:49 +01:00 |
Silen Naihin
|
14c49fa7ea
|
handling helicone errors
|
2023-07-31 12:54:27 +01:00 |
Silen Naihin
|
19db3151dd
|
Feature: Visualize Test Results (#211)
|
2023-07-30 23:51:17 +01:00 |
Silen Naihin
|
ecc386ec7b
|
returning scores (#210)
Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co>
|
2023-07-29 11:43:22 +01:00 |
Silen Naihin
|
f07e7b60d4
|
Advanced LLM Evaluation Implementation (#205)
Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co>
|
2023-07-29 10:26:19 +01:00 |
merwanehamadi
|
80bd0c4260
|
Fix tests not being run (#207)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-27 20:50:53 -07:00 |
Silen Naihin
|
71e0c598d6
|
forcing AGENT_NAME to be defined from repo
|
2023-07-27 14:28:11 +01:00 |