Xingyao Wang | c2f46200c0 | chore(lint): Apply comprehensive linting and formatting fixes (#10287); Co-authored-by: openhands <openhands@all-hands.dev> | 2025-08-13 21:13:19 +02:00
Graham Neubig | 689d3c9046 | Update pre-commit hook versions to most recent versions (#8343); Co-authored-by: openhands <openhands@all-hands.dev> | 2025-05-08 03:59:13 +00:00
Boxuan Li | 34bf6a6402 | [Evaluation] Fix run_infer.py path in TAC (#7683) | 2025-04-03 04:34:02 +00:00
Mateusz Kwiatkowski | 6562297615 | Replace shebang with /usr/bin/env bash for improved portability (#6876); Co-authored-by: Xingyao Wang <xingyao@all-hands.dev> | 2025-02-24 18:07:28 +00:00
Boxuan Li | 4443417c75 | A few fixes for TAC evaluation harness (#6586) | 2025-02-14 21:01:57 -08:00
Boxuan Li | ef12bc5381 | Evaluation harness: Add agent config option (#6662) | 2025-02-13 15:05:03 -05:00
Boxuan Li | 62402cd617 | The-Agent-Company evaluation harness: Support splits (#6577) | 2025-02-02 13:12:01 +08:00
Boxuan Li | 6a4442e590 | [Evaluation] Add summarise_results script for TheAgentCompany benchmark (#5811) | 2024-12-27 20:33:41 -08:00
Boxuan Li | b1719bb3db | Add TheAgentCompany evaluation harness (#5731) | 2024-12-22 14:12:30 -05:00