mirror of
https://github.com/OpenHands/OpenHands.git
synced 2025-12-26 13:52:43 +08:00
* add initial version of swebench-docker eval * update the branch of git repo * add poetry run * download dev set too and pre-load f2p and p2p * update eval infer script * increase timeout * add poetry run * install swebench from our fork * update script * update loc * support single instance debug * replace \r\n from model patch * replace eval docker from namespace xingyaoww * update script to auto detect swe-bench format jsonl * support eval infer on single instance id * change log output dir to logs * update summarise result script * update README * update readme * tweak branch * Update evaluation/swe_bench/scripts/eval/prep_eval.sh Co-authored-by: Graham Neubig <neubig@gmail.com> --------- Co-authored-by: Graham Neubig <neubig@gmail.com>
8 lines
225 B
Bash
Executable File
8 lines
225 B
Bash
Executable File
#!/bin/bash
|
|
|
|
mkdir evaluation/swe_bench/eval_workspace
|
|
pushd evaluation/swe_bench/eval_workspace
|
|
git clone https://github.com/OpenDevin/SWE-bench-docker.git
|
|
cd SWE-bench-docker
|
|
scripts/pull_docker_images.sh docker/ xingyaoww
|