Xingyao Wang | 4507a25b85 | Evaluation: redirect sessions to repo-local .eval_sessions via helper; apply across entrypoints; add tests (#10540); Co-authored-by: openhands <openhands@all-hands.dev> | 2025-08-22 13:34:02 +00:00 |
Engel Nyst | 91d3d1d20a | Fix: expose aggregated LLM metrics in State for evaluation scripts (#10537); Co-authored-by: openhands <openhands@all-hands.dev> | 2025-08-21 17:43:09 +02:00 |
Robert Brennan | 205f0234e8 | Rename Conversation to ServerConversation and AppConfig to OpenHandsConfig (#8754); Co-authored-by: openhands <openhands@all-hands.dev> | 2025-05-28 21:48:34 +02:00 |
Graham Neubig | 689d3c9046 | Update pre-commit hook versions to most recent versions (#8343); Co-authored-by: openhands <openhands@all-hands.dev> | 2025-05-08 03:59:13 +00:00 |
Engel Nyst | 395c1ea9e3 | [Refactor] split runtime initialization (create, connect, init) in cli scripts (#7036) | 2025-03-03 00:19:25 +01:00 |
Engel Nyst | 4f98bce6df | Add selected_repo to command line (#6949) | 2025-02-26 20:42:59 +01:00 |
Mateusz Kwiatkowski | 6562297615 | Replace shebang with /usr/bin/env bash for improved portability (#6876); Co-authored-by: Xingyao Wang <xingyao@all-hands.dev> | 2025-02-24 18:07:28 +00:00 |
Xingyao Wang | 1a7003a705 | Add sysbox support to remote runtime for eval; Add memory monitor, stress tests to help debug memory issue (#6684); Co-authored-by: openhands <openhands@all-hands.dev>; Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>; Co-authored-by: Graham Neubig <neubig@gmail.com> | 2025-02-18 20:02:28 +00:00 |
tofarr | bbfdc62139 | Fix for issue where retries continue on a closed runtime (#6564); Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> | 2025-02-03 08:44:09 -07:00 |
Aditya Bharat Soni | aebb583779 | Support for VisualWebArena evaluation in OpenHands (#4773); Co-authored-by: Xingyao Wang <xingyao@all-hands.dev>; Co-authored-by: openhands <openhands@all-hands.dev>; Co-authored-by: Graham Neubig <neubig@gmail.com> | 2025-01-23 20:18:30 +00:00 |