OpenHands/eval at 65622976150caa733756eebd0e2042f5f53c093d - OpenHands - Gitea

github/OpenHands

mirror of https://github.com/OpenHands/OpenHands.git synced 2025-12-26 05:48:36 +08:00

History

Mateusz Kwiatkowski 6562297615

Replace shebang with /usr/bin/env bash for improved portability (#6876 )

Co-authored-by: Xingyao Wang <xingyao@all-hands.dev>

2025-02-24 18:07:28 +00:00

..

combine_final_completions.py

feat(eval): misc SWE-Bench improvement - use different resources for different instances (#6313 )

2025-01-17 02:48:41 +08:00

compare_outputs.py

fix: revert #5506 for SWE-Bench performance regression (#6491 )

2025-01-28 22:52:57 +08:00

convert_oh_folder_to_swebench_submission.sh

Replace shebang with /usr/bin/env bash for improved portability (#6876 )

2025-02-24 18:07:28 +00:00

convert_oh_output_to_md.py

fix: revert #5506 for SWE-Bench performance regression (#6491 )

2025-01-28 22:52:57 +08:00

convert_oh_output_to_swe_json.py

feat(eval): misc SWE-Bench improvement - use different resources for different instances (#6313 )

2025-01-17 02:48:41 +08:00

download_gold_patch.py

Fix issue #5222 : [Refactor]: Refactor the evaluation directory (#5223 )

2024-11-25 08:35:52 -05:00

summarize_outputs.py

Fix issue #5748 : Rename "Ran a Jupyter Command" to "Ran a Python Command" in UI (#5749 )

2024-12-26 23:30:19 +08:00

update_output_with_eval.py

fix: revert #5506 for SWE-Bench performance regression (#6491 )

2025-01-28 22:52:57 +08:00

verify_costs.py

Verify costs script (#5469 )

2024-12-10 14:20:53 +01:00