merwanehamadi
|
2428cf3596
|
No need to push skill tree twice (#292)
|
2023-08-11 19:59:11 -07:00 |
merwanehamadi
|
af7a456e00
|
If regression tests empty continue (#290)
|
2023-08-11 18:30:47 -07:00 |
merwanehamadi
|
1560892c58
|
Sync skill tree to a versioned website (#289)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-11 17:28:53 -07:00 |
Erik Peterson
|
25a90a72f7
|
Update beebot (#288)
|
2023-08-11 11:14:10 -07:00 |
Erik Peterson
|
f7ea78b83d
|
Update beebot (#281)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
|
2023-08-09 19:21:22 +01:00 |
Media
|
2a46abead9
|
PolyGPT Benchmarks and Submodule Update (#273)
Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co>
Co-authored-by: nerfZael <bogunovij@gmail.com>
|
2023-08-09 11:04:02 -07:00 |
merwanehamadi
|
e3f1e2184f
|
Release 0.0.4 (#280)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-09 10:04:57 -07:00 |
merwanehamadi
|
2a894f60b1
|
Integrate baserun (#275)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-08 15:08:25 -07:00 |
merwanehamadi
|
305f3a6138
|
Add web app creation challenge (#272)
|
2023-08-08 13:08:51 -07:00 |
merwanehamadi
|
e615dda22c
|
Update pr template (#268)
|
2023-08-06 21:17:51 -07:00 |
merwanehamadi
|
530eb61f25
|
Add agent protocol interface test (#259)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-05 18:00:05 -07:00 |
merwanehamadi
|
13d2dcbf5e
|
Add agent protocol (#258)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-05 10:43:18 -07:00 |
merwanehamadi
|
5db931c094
|
Add polygpt to ci (#256)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-05 10:03:18 -07:00 |
merwanehamadi
|
7955947ef6
|
Update Auto-GPT and allow 1 specific agent to be run (#241)
|
2023-08-02 10:55:07 -07:00 |
merwanehamadi
|
f815fb8af3
|
Remove mock reports (#237)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-01 20:07:48 -07:00 |
merwanehamadi
|
f41533ce62
|
Fix reports and add commit sha (#233)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-08-01 17:54:23 -07:00 |
merwanehamadi
|
eeb68858d7
|
Only run mini-agi on tests (#232)
|
2023-08-01 16:50:41 -07:00 |
Silen Naihin
|
3992f0865b
|
comitting changes
|
2023-08-01 20:49:20 +01:00 |
merwanehamadi
|
bab6fccdec
|
Reverse skip based on agent (#231)
|
2023-08-01 10:29:02 -07:00 |
merwanehamadi
|
6d3b07b188
|
Only run mini-agi on push and PR (#230)
|
2023-08-01 10:22:38 -07:00 |
Silen Naihin
|
f4225f63bf
|
linter and handling errs
|
2023-08-01 17:55:00 +01:00 |
Silen Naihin
|
6f3fd2a578
|
fix graphs, processing, workflow
|
2023-08-01 13:44:32 +01:00 |
Silen Naihin
|
f9fea473f5
|
Refactoring for TDD (#222)
|
2023-07-31 21:59:47 +01:00 |
Silen Naihin
|
9d75712bae
|
ci ofr auth
|
2023-07-31 14:02:46 +01:00 |
Silen Naihin
|
f8de706a15
|
removing data that didnt work
|
2023-07-31 13:41:45 +01:00 |
Silen Naihin
|
19db3151dd
|
Feature: Visualize Test Results (#211)
|
2023-07-30 23:51:17 +01:00 |
merwanehamadi
|
a6c3730ac8
|
Add timeout that allows teardown (#216)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-29 20:02:41 -07:00 |
merwanehamadi
|
52b8d1af07
|
Add timeout to agbenchmark (#215)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-29 18:36:04 -07:00 |
Silen Naihin
|
f07e7b60d4
|
Advanced LLM Evaluation Implementation (#205)
Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co>
|
2023-07-29 10:26:19 +01:00 |
merwanehamadi
|
86f73dab68
|
Retry push until successful (#208)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-27 21:08:31 -07:00 |
merwanehamadi
|
88feef0f2a
|
Benchmark all tests (#204)
|
2023-07-27 12:53:48 -07:00 |
merwanehamadi
|
6098b70408
|
Use beebot autopackai (#203)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-27 12:21:43 -07:00 |
Justin Torre
|
9fc50c25ae
|
added new script to fix dynamic headers (#202)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
|
2023-07-27 14:35:31 +01:00 |
Silen Naihin
|
71e0c598d6
|
forcing AGENT_NAME to be defined from repo
|
2023-07-27 14:28:11 +01:00 |
merwanehamadi
|
eb57b15380
|
Add dynamic headers using environment variables (#200)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-26 21:26:03 -07:00 |
Silen Naihin
|
fe4bdd8f97
|
fixing previous
|
2023-07-26 23:38:25 +01:00 |
Silen Naihin
|
66d1fec07e
|
attempting more logs
|
2023-07-26 23:36:45 +01:00 |
Silen Naihin
|
10c1803caa
|
ci update (#198)
|
2023-07-26 23:02:38 +01:00 |
Silen Naihin
|
b778af156b
|
verbose
|
2023-07-26 14:07:38 +01:00 |
Silen Naihin
|
6d806a7096
|
poetry install -vvv in ci
|
2023-07-26 14:04:55 +01:00 |
Silen Naihin
|
80506e9a3b
|
report # bug, adding submodule challenges (#193)
|
2023-07-26 13:53:10 +01:00 |
merwanehamadi
|
a1e02f243c
|
Add safety suite (#196)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-25 20:13:01 -07:00 |
Silen Naihin
|
bf863f7be2
|
adding Codium pr-agent
|
2023-07-25 19:09:08 +01:00 |
merwanehamadi
|
787c7c0b3a
|
Add api keys (#190)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-24 20:11:48 -07:00 |
merwanehamadi
|
33f9ff86ee
|
Fix helicone MITM (#189)
|
2023-07-24 18:02:37 -07:00 |
merwanehamadi
|
d385cc4941
|
Uninstall agbenchmark then reinstall (#188)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-07-24 16:48:45 -07:00 |
Silen Naihin
|
d9b3d7da37
|
Safety challenges, adaptability challenges, suite same_task (#177)
|
2023-07-24 13:57:44 -07:00 |
merwanehamadi
|
549d046dc2
|
Always send to google drive (#185)
|
2023-07-23 14:00:57 -07:00 |
merwanehamadi
|
fb8e051ec1
|
Update permission package (#183)
|
2023-07-23 12:32:23 -07:00 |
merwanehamadi
|
6713a3729f
|
Update Helicone mitm to pin to a specific version (#182)
Co-authored-by: Justin Torre <justintorre75@gmail.com>
|
2023-07-23 12:24:12 -07:00 |