8 Commits (85be563c43d85a8fab4eb9b5b4d676ae49bbe754)

Author SHA1 Message Date
Silen Naihin 1a61c66898 mock flag, workspace io fixes, mark fixes 2023-08-11 13:22:21 +01:00
Silen Naihin f07e7b60d4 Advanced LLM Evaluation Implementation (#205) (Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co>) 2023-07-29 10:26:19 +01:00
merwanehamadi 01b118e590 Add llm eval (#197) (Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>) 2023-07-26 14:00:24 -07:00
Silen Naihin d9b3d7da37 Safety challenges, adaptability challenges, suite same_task (#177) 2023-07-24 13:57:44 -07:00
Silen Naihin ce4cefe7e7 Dynamic home path for runs (#119) 2023-07-16 18:24:06 -07:00
Silen Naihin 2987d71264 moving run agent to tests & agnostic run working 2023-06-30 10:50:54 -04:00
Silen Naihin 76ee994d2c read mes, remove port and host from config, etc 2023-06-27 19:19:14 -04:00
Silen Naihin f933717d8b mini-agi, simple challenge creation, --mock flag 2023-06-27 18:17:54 -04:00