OpenHands/download_test_data.py at 5ff96111f0e9b7228de3d1a96808dbbc7d12114e - OpenHands - Gitea

github/OpenHands

mirror of https://github.com/OpenHands/OpenHands.git synced 2025-12-26 05:48:36 +08:00

Xingyao Wang 5ff96111f0

A starting point for SWE-Bench Evaluation with docker (#60 )

* a starting point for SWE-Bench evaluation with docker

* fix the swe-bench uid issue

* typo fixed

* fix conda missing issue

* move files based on new PR

* Update doc and gitignore using devin prediction file from #81

* fix typo

* add a sentence

* fix typo in path

* fix path

---------

Co-authored-by: Binyuan Hui <binyuan.hby@alibaba-inc.com>

2024-03-22 12:43:49 +08:00

7 lines

209 B

Python

Raw Blame History

 from datasets import load_dataset
 import pandas as pd
 dataset = load_dataset("princeton-nlp/SWE-bench")
 test = dataset["test"].to_pandas()
 test.to_json("data/processed/swe-bench-test.json", orient="records")