Commit Graph

53 Commits (44436fe1a3e665280bd9ae388f4f3d4933eb397d)

Author SHA1 Message Date
merwanehamadi afb59a0778
Support agent protocol (#337)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-30 19:44:39 -07:00
Silen Naihin 59655a8d96
adding backend and a basic ui (#309) 2023-08-27 03:18:30 -04:00
merwanehamadi 760b60b249
Remove colons in timestamp (#315)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-16 15:53:06 -07:00
merwanehamadi 82ed4a136a
Remove submodule (#314)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-16 14:57:52 -07:00
Luke 281d8486df
Fixing paths that were preventing artifacts from being copied to workspace (#311)
Co-authored-by: Luke <2609441+lc0rp@user.noreply.github.com>
Co-authored-by: merwanehamadi <merwanehamadi@gmail.com>
2023-08-16 08:59:04 -07:00
Swifty 16053a3137
Enhanced Test Report Directory Naming and Handling (#312) 2023-08-16 08:45:46 -07:00
Silen Naihin 8bc3710e23
init backend, fix frontend module (#307) 2023-08-15 14:14:35 +01:00
Silen Naihin c59e5fb7d8
new frontend connections (#306) 2023-08-15 13:16:07 +01:00
Silen Naihin a6b229f4cd Merge branch 'master' of https://github.com/Significant-Gravitas/Auto-GPT-Benchmarks 2023-08-14 21:57:12 +01:00
Silen Naihin 0d7fbba134 graph data json 2023-08-14 21:57:09 +01:00
merwanehamadi d27d17e51b
Fix linter (#302) 2023-08-13 10:34:45 -07:00
merwanehamadi 0da8a2bd99
Fix agent protocol test (#301)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-13 10:27:54 -07:00
merwanehamadi 1129e6b426
Add safety challenge (#300)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-13 10:15:58 -07:00
merwanehamadi d8d7fa662b
Use index.html instead of dependencies.html (#293) 2023-08-11 20:32:23 -07:00
merwanehamadi 1560892c58
Sync skill tree to a versioned website (#289)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-11 17:28:53 -07:00
Silen Naihin 1a61c66898 mock flag, workspace io fixes, mark fixes 2023-08-11 13:22:21 +01:00
Jakub Novák c2269397f1
Use agent protocol (#278)
Signed-off-by: Jakub Novak <jakub@e2b.dev>
2023-08-11 09:04:08 +02:00
merwanehamadi 1b20e45ec1
Implement the 'explore' mode (#284) 2023-08-09 17:59:48 -07:00
merwanehamadi e3f1e2184f
Release 0.0.4 (#280)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-09 10:04:57 -07:00
merwanehamadi 7d60ce5f44
See the task when clicking in the skill tree (#279) 2023-08-09 09:37:17 -07:00
merwanehamadi 305f3a6138
Add web app creation challenge (#272) 2023-08-08 13:08:51 -07:00
Swifty e0a72b86c1
AUTO-25: Add the ability to run multiple categories and to skip categories (#270) 2023-08-07 12:29:00 +01:00
merwanehamadi db48e7849b
Add product advisor tests (#267) 2023-08-06 20:59:53 -07:00
Silen Naihin 710ad448fe making sure show_graph is optional 2023-08-06 22:43:42 +01:00
Silen Naihin 19848f362d
remove pytest-depends, rerouting functions (#250) 2023-08-06 22:35:22 +01:00
merwanehamadi aa37109707
Remove graphql logs (#264)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-06 12:22:49 -07:00
merwanehamadi 530eb61f25
Add agent protocol interface test (#259)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-05 18:00:05 -07:00
merwanehamadi fb13a83d15
Add more coding challenge (#254)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-05 09:51:53 -07:00
merwanehamadi 20c87fbc26
Fix typing (#247) 2023-08-02 15:08:07 -07:00
merwanehamadi 59f015ab93
fix-linter (#246) 2023-08-02 14:49:03 -07:00
merwanehamadi 8fa67ea466
Correct agent and benchmark commit sha (#245)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-02 14:44:14 -07:00
merwanehamadi f41533ce62
Fix reports and add commit sha (#233)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-01 17:54:23 -07:00
merwanehamadi eeb68858d7
Only run mini-agi on tests (#232) 2023-08-01 16:50:41 -07:00
Silen Naihin 6f3fd2a578 fix graphs, processing, workflow 2023-08-01 13:44:32 +01:00
merwanehamadi ce24857a74
Return none as fallback Helicone (#228)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-31 20:18:15 -07:00
merwanehamadi 46dce97c4e
Fix reports (#227)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-31 19:39:49 -07:00
merwanehamadi a2dc4693a3
Fix costs helicone (#226)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-31 16:13:06 -07:00
Silen Naihin f9fea473f5
Refactoring for TDD (#222) 2023-07-31 21:59:47 +01:00
merwanehamadi 719f894520
Fix send to gdrive and tracking the wrong challenge name (#225)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-31 12:35:37 -07:00
Justin Torre 3a32adbce5
Fix f-string get_data_from_helicone.py (#223) 2023-07-31 09:06:04 -07:00
Silen Naihin 9d75712bae ci ofr auth 2023-07-31 14:02:46 +01:00
Silen Naihin f8de706a15 removing data that didnt work 2023-07-31 13:41:45 +01:00
Silen Naihin 2ec306e850 linter fixes 2023-07-31 13:28:01 +01:00
Silen Naihin db49e8de15 helicone push 2 2023-07-31 13:26:49 +01:00
Silen Naihin 14c49fa7ea handling helicone errors 2023-07-31 12:54:27 +01:00
Silen Naihin 19db3151dd
Feature: Visualize Test Results (#211) 2023-07-30 23:51:17 +01:00
Silen Naihin ecc386ec7b
returning scores (#210)
Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co>
2023-07-29 11:43:22 +01:00
Silen Naihin f07e7b60d4
Advanced LLM Evaluation Implementation (#205)
Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co>
2023-07-29 10:26:19 +01:00
merwanehamadi 80bd0c4260
Fix tests not being run (#207)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-27 20:50:53 -07:00
Silen Naihin 71e0c598d6 forcing AGENT_NAME to be defined from repo 2023-07-27 14:28:11 +01:00