Silen Naihin | 1a61c66898 | mock flag, workspace io fixes, mark fixes | 2023-08-11 13:22:21 +01:00
Silen Naihin | f07e7b60d4 | Advanced LLM Evaluation Implementation (#205); Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co> | 2023-07-29 10:26:19 +01:00
merwanehamadi | 01b118e590 | Add llm eval (#197); Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com> | 2023-07-26 14:00:24 -07:00
Silen Naihin | d9b3d7da37 | Safety challenges, adaptability challenges, suite same_task (#177) | 2023-07-24 13:57:44 -07:00
Silen Naihin | ce4cefe7e7 | Dynamic home path for runs (#119) | 2023-07-16 18:24:06 -07:00
Silen Naihin | 2987d71264 | moving run agent to tests & agnostic run working | 2023-06-30 10:50:54 -04:00
Silen Naihin | 76ee994d2c | read mes, remove port and host from config, etc | 2023-06-27 19:19:14 -04:00
Silen Naihin | f933717d8b | mini-agi, simple challenge creation, --mock flag | 2023-06-27 18:17:54 -04:00