AutoGPT/benchmark/agbenchmark/challenges/verticals/code/5_tic_tac_toe
Albert Örwall 4ef912d734
fix(benchmark/challenges): Improve spec and eval of TicTacToe challenge
* In challenge specification, specify `subprocess.PIPE` for `stdin` and `stderr` for completeness
* Additional tweak: let Pytest load only the current file when running the test file as a script

Co-authored-by: Reinier van der Leer <pwuts@agpt.co>
2024-02-20 11:52:59 +01:00
..
artifacts_out Benchmark changes 2023-09-12 12:13:39 -07:00
custom_python fix(benchmark/challenges): Improve spec and eval of TicTacToe challenge 2024-02-20 11:52:59 +01:00
data.json fix(benchmark/challenges): Improve spec and eval of TicTacToe challenge 2024-02-20 11:52:59 +01:00